Bio

I’m an Assistant Professor at the Department of Cognitive Science and Artificial Intelligence of Tilburg University. I work on algorithmic monitoring and auditing as part of the Tilburg Algorithm Observatory, and am interested in the (harmful) effects of intelligent systems on our lives; systems that uncover our personal information, monitor and change our behavior, subtly restrict our exposure to information, and treat us unfairly.

I defended my dissertation “User-centered Security in Natural Language Processing” in January 2023, supervised by Grzegorz ChrupałaEric Postma, and Walter Daelemans.

I'm a member of the faculty council (DCA.I.) and the Data Science and Society program committee.

Expertise

I have a multidisciplinary background in humanities and computer science. My primary area of expertise is identifying and attempting to subvert (harmful) inferences made through Machine Learning (ML). I have mainly worked on adversarial attacks on Deep Learning algorithms trained on language data (Natural Language Processing or NLP), with a focus on privacy and security. My work critically analyzes the current, and more distant impact such algorithms have on society. I'm a strong advocate of a user-centered, open-source approach to ML, and the automation of society in general.

Within NLP, I have worked on various topics such as (adversarial) stylometry (or author profiling), cyberbullying/toxicity detection, data augmentation through lexical substitution, language generation, machine translation, and more generally scientific development of reproducible research pipelines.

Teaching

I’m currently the course coordinator for both Data Processing (Python course) in context of our Data Science master, and Language & AI (NLP for Data Science course) for our joint Data Science bachelor with TU/e (JADS). Previously, I taught Text Mining and Spatiotemporal Data Analysis (both one semester), and Data Mining (five years). I focus on innovating the courses I am involved in, primarily by connecting theory to practical use cases. I believe this makes the lectures more fun, and easier to conceptualize the utility of the material. It also provides a soft introduction to applications students might see in their future careers. A recent example is my EDUiLAB project to familiarize Data Processing students with code versioning, repositories, and build servers using GitHub.

Courses

Top publications

  1. User-Centered Security in Natural Language Processing

    Emmery, C. (2023). User-Centered Security in Natural Language Processing. [Doctoral Thesis, Tilburg University]. Ridderprint.
  2. Adversarial Stylometry in the Wild - Transferable Lexical Substitutio…

    Emmery, C., Kádár, Á., & Chrupała, G. (2021). Adversarial Stylometry in the Wild: Transferable Lexical Substitution Attacks on Author Profiling. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers (pp. 2388-2402). Association for Computational Linguistics.
  3. Cyberbullying Classifiers are Sensitive to Model-Agnostic Perturbatio…

    Emmery, C., Kádár, A., Chrupała, G., & Daelemans, W. M. P. (2022). Cyberbullying Classifiers are Sensitive to Model-Agnostic Perturbations. Paper presented at Language Resources and Evaluation Conference, Marseille, France. https://arxiv.org/abs/2201.06384
  4. Style Obfuscation by Invariance

    Emmery, C., Manjavacas, E., & Chrupala, G. (2018). Style Obfuscation by Invariance. In COLING 2018 Association for Computational Linguistics. https://arxiv.org/abs/1805.07143
  5. Current Limitations in Cyberbullying Detection - on Evaluation Criter…

    Emmery, C., Verhoeven, B., De Pauw, G., Jacobs, G., Van Hee, C., Lefever, E., Desmet, B., Hoste, V., & Daelemans, W. (2020). Current Limitations in Cyberbullying Detection: on Evaluation Criteria, Reproducibility, and Data Scarcity. Language Resources and Evaluation, 55, 597-633.

Find an expert or expertise