Hi everyone, 

(apologies for cross-postings) 

Tiphaine Viard and Maria Boritchev are offering a master 2 internship 
at Télécom Paris. Feel free to contact us if you have any questions 
about the offer or the project, the details on the offer are below: 

Advisors: Maria Boritchev, Tiphaine Viard 
Duration: 5-6 months, starting from February 1st (negociable) 
Location: Télécom Paris, 19 Pl. Marguerite Perey, 91120 Palaiseau 
Gratification: Approximately 600 euros per month (more or less 20 euros per 
month, the precise amount 
will depend on changes in the French labor code in 2026) 
Requirements: Applicants must be enrolled in a Master’s 2 program at the time 
of application and through 
the duration of the internship. We are looking for applications from students 
with solid skills (and 
ideally experience) in Natural Language Processing, Machine Learning and Deep 
Learning, Computa- 
tional Social Sciences. Knowledge of English is necessary 

Context: 
The recent years have seen a surge of initiatives with the goal of defining 
what “ethical” artificial intelligence 
would or should entail, resulting in the publication of various charters and 
manifestos discussing AI ethics; 
these documents originate from academia, AI industry companies, non-profits, 
regulatory institutions, and 
the civil society. The contents of such documents vary wildly, from short, 
vague position statements to 
verbatims of democratic debates or impact assessment studies. As such, they are 
a marker of the social world 
of artificial intelligence, outlining the tenets of different actors, the 
consensus and dissensus on important 
goals, and so on [Gornet et al., 2024]. We have assembled a corpus of charters 
and manifestos of Ethics 
of AI, in English, written by different actors of the current AI landscape. 
This corpus is called MapAIE: 
[ https://mapaie.telecom-paris.fr/ | https://mapaie.telecom-paris.fr/ ] . We 
are conducting research on data from MapAIE both from a 
sociological and linguistic perspectives: 
• Sociologically, who are the groups of people who write about Ethics of AI? 
• Linguistically, what type of vocabulary or semantic constructions do people 
use to write about Ethics 
of AI? 
• Socio-linguistically, is there a difference in linguistic usage between 
different groups of people who write 
about Ethics of AI? 
To conduct these investigations, we would like to go further than traditional 
tools: we intend to develop 
graph-based natural language processing and computational sociology approaches 
making better use of mod- 
ern NLP methods to explore our data. In particular, we could to exploit word 
sense induction approaches 
to automatically extract different linguistic usages. 

Objectives: 
The goal of this internship is to investigate MapAIE by using and developing 
graph-based natural language 
processing and computational sociology approaches. 
The internship will proceed in three steps: 
1. Conduct a state of the art exploration on existing graph-based natural 
language processing and com- 
putational sociology techniques, starting from Abstract Meaning Representations 
(AMR, 
[ https://github.com/amrisi/amr-guidelines/blob/master/amr.md | 
https://github.com/amrisi/amr-guidelines/blob/master/amr.md ] ) and Cortext 
(https://www.cortext.net/). 
2. Re-implement existing techniques identified in (1), in particular [Eyal et 
al., 2022], and analyse the 
obtained results sociologically and linguistically in view of the research 
questions of the project. 
3. Propose new research questions and new graph-based data exploration 
approaches relevant to MapAIE. 

Application: 
Deadline: October 25th, 2025. 
Application: To apply for this position, please send an email with your CV and 
a few words explaining 
your interest in this project to Maria Boritchev and Tiphaine Viard. 

References 
[Becker, 1976] Becker, H. S. (1976). Art worlds and social types. American 
behavioral scientist, 19(6):703– 
718. 
[Cefa¨ı, 2016] Cefa¨ı, D. (2016). Publics, probl`emes publics, ar`enes 
publiques.... Questions de communication, 
30(2):25–64. 
[Eyal et al., 2022] Eyal, M., Sadde, S., Taub-Tabib, H., and Goldberg, Y. 
(2022). Large scale substitution- 
based word sense induction. In Proceedings of the 60th Annual Meeting of the 
Association for Computa- 
tional Linguistics (Volume 1: Long Papers), pages 4738–4752. 
[Gornet et al., 2024] Gornet, M., Delarue, S., Boritchev, M., and Viard, T. 
(2024). Mapping ai ethics: a 
mesoscale analysis of its charters and manifestos. In Proceedings of the 2024 
FAccT conference on Fairness, 
Accountability and Transparency in Machine Learning. 
[Roth and Hellsten, 2023] Roth, C. and Hellsten, I. (2023). Socio-semantic 
configuration of an online con- 
versation space: The case of twitter users discussing the# ipcc reports. Social 
Networks, 75:186–196. 
_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]

Reply via email to