Dear all,
(apologies for multiple postings)

We are recruiting a post-doc at IRIT (France) in the context of the DACE-DL 
project (DAta-CEntric AI-driven Data Linking).
The position is scheduled to start in early 2022 and lasts 24 months.

Thank you for circulating this offer in your networks.

Regards, 
Cassia Trojahn and Olivier Teste

-----------------------------------------------------------------
** Post-doctoral position at IRIT: Data Linking **

* Context: ANR project DACE-DL (DAta-CEntric AI-driven Data Linking)  *
Data linking is the scientific challenge of automatically establishing typed 
links between the entities of two or more structured datasets. A variety of 
complex data linking systems exist, evaluated on public benchmarks. While they 
have allowed for the generation of vast amounts of linked data in the context 
of various dedicated projects, generic data linking systems often have limited 
applicability in many real-world scenarios, where data are highly heterogeneous 
and domain-specific. DACE-DL targets a paradigm shift in the data linking field 
with a data-centric bottom-up methodology relying on machine learning and 
representation learning models. We hypothesize that there exists a finite 
number of identifiable and generalisable linking problem types (LPTs) that we 
need to categorize and analyse to provide better linking results.

* Topic: Data collection, consolidation, and modularization of data linking systems *

This research is organized into two main tasks. The first task consists of (1) 
carrying out an in-depth analysis of the quality of the existing data linking 
datasets, identifying erroneous statements and providing a high-quality set of 
datasets by correcting those statements; and (2) generating additional links 
using existing high-precision linking systems on the chosen datasets. Data 
quality metrics such as accuracy, consistency and conciseness will be 
considered. 
The aim of the second task is threefold: (1) to provide an inventory of 
publicly available and functional linking tools that are able to deal with a 
large spectrum of data linking problems; (2) to propose a theoretical approach 
for the modularization of these tools into atomic modules that are easy to 
combine in order to build more complex solutions in a linking ecosystem; (3) to 
make the produced modules available to the data linking community. To perform 
the modularization at scale, we plan to use unsupervised ML algorithms, 
enhanced by a human-in-the-loop approach. The objective is to provide a set of 
correspondences between the modules and the LPTs. 

Start date: January 2022, for a duration of 24 months

 * Work environment and Salary  *

Location: Institut de Recherche en Informatique de Toulouse (IRIT), 
Université Toulouse - Jean Jaurès, Maison de la Recherche, 5 allées Antonio 
Machado, 31058 Toulouse.
Salary: between 2200€ and 2700€ gross per month, depending on qualifications 
and situation.

* How to apply *

Applicants are required to have a PhD in Computer Science and a strong 
background in semantic web technologies, ontology matching and data linking. 
Fluency in written and spoken English is also required. A good publication 
record and strong programming skills will be a plus. Applications will be 
accepted until the position is filled. Applicants should send a full CV 
including a complete list of publications, a cover letter indicating their 
research interests, achievements to date and vision for the future, as well as 
either support letters or the names of two people who have worked with them.

Contact: Cassia Trojahn (cassia.troj...@irit.fr) and Olivier Teste 
(olivier.te...@irit.fr)
