My goal is to push the boundaries of natural language processing research by means of building machine learning models and open access tools/datasets for world’s languages. In particular, my research interests include (but not limited to) semantics, procedural language; and abstraction and reasoning capabilities of large language models.
Currently, I am an Asst. Prof at Koç University Computer Science and Engineering Department, where I am also affiliated with KUIS AI. Previously, I was a postdoctoral researcher at UKP in Technical University of Darmstadt, working with Prof. Iryna Gurevych. Before that, I got my PhD degree from Computer Engineering Department in Istanbul Technical University, where I was supervised by Prof. Eşref Adalı and was a member of İTÜ NLP group. During my PhD studies, I’ve visited Institute for Language, Cognition and Computation (ILCC), where I’ve worked with Prof. Mark Steedman.
News
2023
- September 2023: Website for GGLab is live here
- September 2023: Two papers accepted to IJCNLP-AACL 2023!
- July 2023: Metric-based learning paper accepted to INLG23!
- March 2023: Giving a talk entitled “Living in the world of large language models: Successes and Failures” as part of KOLT Webinar
- February 2023: Two papers accepted to EACL 2023!
2022
- July 2022: I gave a lecture entitled “Towards Fair NLP Models: An Overview of Recent Bias Detection and Mitigation Strategies” at Text Mining and Natural Language Processing for Computational Social Sciences Summer School. All materials can be found here.
- July 2022: Our article “On the rate of convergence of a classifier based on a Transformer encoder” is accepted to IEEE Transactions on Information Theory!
- June 2022: I will serve as an Area Chair for the “Low-resourced and less studied languages” track in COLING 2022.
- June 2022: I am excited to welcome pre-doctoral students/interns within the scope of Fatima Fellowship and Koç University Summer Research Program (KUSRP)!!
- April 2022: My application to the “Tübitak 2236-Co-Funded Brain Circulation Scheme2 (CoCirculation2)” fellowship has also been approved! (Due to receiving 2232b fellowship, I had to reject it.)
- March 2022: I have received a very special 3-year fellowship from The Scientific and Technological Research Council of Turkey: “Tübitak 2232 B-International Fellowship for Outstanding Researchers”! I will be hiring 2 Masters, 2 PhD students and one postdoctoral researcher within the scope of the project entitled: “Automatic Learning of Procedural Language from Natural Language Instructions for Intelligent Assistance”. See this page for more details.
- February 2022: I have joined Koç University Computer Science and Engineering Department as an Asst. Prof, where I will be collaborating closely with KUIS AI.
- October 2021: We have organized the first multilingual representation workshop at EMNLP 2021 with Duygu Ataman, Alexandra Birch, Alexis Conneau, Orhan Firat and Sebastian Ruder. The second edition will be co-located at EMNLP 2022, stay tuned!
Recent Work
2023
- Haritz Puerto, Gözde Gül Şahin, Iryna Gurevych. MetaQA: Combining Expert Agents for Multi-Skill Question Answering. In Proceedings of EACL 2023[pdf]
- Jan-Christoph Klie, Ji-Ung Lee, Kevin Stowe, Gözde Gül Şahin, Nafise Sadat Moosavi, Luke Bates, Dominic Petrak, Richard Eckart de Castilho and Iryna Gurevych. Lessons Learned from a Citizen Science Project for Natural Language Processing. In Proceedings of EACL 2023[pdf]
2022
- Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, Jonas Pfeiffer, Nils Reimers, Gözde Gül Şahin, Iryna Gurevych. UKP-SQUARE: An Online Platform for Question Answering Research. In Proceedings of ACL 2022, Demo Track.[pdf]
- Gözde Gül Şahin. To Augment or Not to Augment? A Comparative Study on Text Augmentation Techniques for Low-Resource NLP. (Computational Linguistics Journal, Vol 48, March 2022)[pdf]
- Iryna Gurevych, Michael Kohler, Gözde Gül Şahin. On the rate of convergence of a classifier based on a Transformer encoder. (IEEE Transactions on Information Theory, 2022, doi: 10.1109/TIT.2022.3191747)[pdf]
2021
- Haritz Puerto, Gözde Gül Şahin, Iryna Gurevych. MetaQA: Combining Expert Agents for Multi-Skill Question Answering. (Under review)[pdf]
2020
- Gözde Gül Şahin, Yova Kementchedjhieva, Phillip Rust, Iryna Gurevych. PuzzLing Machines: A Challenge on Learning From Small Data. In Proceedings of ACL 2020.[pdf][website]
- Gözde Gül Şahin, Iryna Gurevych. Two Birds with One Stone: Investigating Invertible Neural Networks for Inverse Problems in Morphology. In Proceedings of AAAI 2020. [pdf]
- Gözde Gül Şahin, Clara Vania, Ilia Kuznetsov, Iryna Gurevych. LINSPECTOR: Multilingual Probing Tasks for Word Representations. (Computational Linguistics Journal, June 2020 & presented in ACL 2020.) [pdf][code]
2019
- Max Eichler, Gözde Gül Şahin, Iryna Gurevych. LINSPECTOR WEB: A Multilingual Probing Suite for Word Representations. In Proceedings of EMNLP 2019: System Demonstrations. [pdf][website][cite][code]
- Steffen Eger, Gözde Gül Şahin, Andreas Rücklé, Ji-Ung Lee, Claudia Schulz, Mohsen Mesgar, Krishnkant Swarnkar, Edwin Simpson, Iryna Gurevych. Text Processing Like Humans Do: Visually Attacking and Shielding NLP Systems. In Proceedings of NAACL-HLT 2019. [pdf][video][cite][code]
2018
- Gözde Gül Şahin, Mark Steedman. Data Augmentation via Dependency Tree Morphing for Low-Resource Languages. In Proceedings of EMNLP 2018. [pdf][cite][code]
- Gözde Gül Şahin, Mark Steedman. Character-Level Models versus Morphology in Semantic Role Labeling. In Proceedings of ACL 2018. [pdf][cite][code]
- Gözde Gül Şahin, Eşref Adalı. Annotation of semantic roles for the Turkish Proposition Bank. Language Resources and Evaluation volume 52, pages 673–706 (2018). [website][pdf]
- Gözde Gül Şahin, Erdem Emekligil, Seçil Arslan, Onur Ağın, Gülşen Eryiğit. Relation extraction via one-shot dependency parsing on intersentential,higher-order, and nested relations. Turkish Journal of Electrical Engineering & Computer Sciences, 26, 830-843 (2018).[pdf][cite]
- Gözde Gül Şahin. Building of Turkish PropBank and Semantic Role Labeling of Turkish. PhD thesis, Istanbul Technical University, January 2018 [pdf]
For older publications, check my google scholar page
Students/Mentees
Bachelor Students
- Max Eichler - now PhD student at UKP/TU Darmstadt
- Marvin Kaster - now Masters student at UKP/TU Darmstadt
Predoctoral Research Interns
- Haritz Puerto - now PhD student at UKP/TU Darmstadt