Welcome to Yiping’s Homepage!
[NEWS] I’m relocating to Santiago de Chile to join Topsort as a senior data scientist to work on auction and AI-based monetization technology for the e-commerce ecosystem. Thus, I’ll leave NLP research and stop maintaining this site.
[NEWS] Our paper Disentangling Hate Across Target Identities was accepted to NAACL 2025 main conference.
In academia, I worked as a visiting postdoc at Universitat Pompeu Fabra from Oct 2022 till March 2025, supervised by Prof. Leo Wanner. My research focus was the evaluation and biases in hate speech detection.
I worked as a Staff Research Scientist/Senior R&D Advisor at Knorex from 2015 till 2025. My research topics include weakly-supervised learning methods, text mining, and natural language generation. Before joining Knorex, I worked as a researcher at Baidu-I2R Research Centre under Dr. Su Jian, mainly on information extraction projects.
I received Bachelor of Computer Science (1st Class Honours) from the National University of Singapore (supervised by Associate Prof. Min-Yen Kan) and M.S & Ph.D from Chulalongkorn University (working with Asst. Prof. Dittaya Wanvarie; received summa cum laude for Ph.D. thesis).
If you want to reach out to me, please drop me an email or write to me on LinkedIn.
Research Interests
My research focus is detecting hate speech across domains (Jin et al. 2023), functionalities (Jin et al. 2024), target identities (Jin et al. 2024), and ultimately individual persons. Hate speech detection is a highly subjective task, where a universalism approach may harm the vulnerable groups we want to project. I’m thrilled to investigate how we can model individuals’ or groups’ perspectives to the task.
I’m broadly interested in the applications of natural language processing. The focus of my PhD thesis is weakly-supervised text classification (aka. dataless classification) where we induce classifiers without any manually labeled document (Jin et al. 2017, Charoenphakdee et al. 2019, Jin et al. 2020, Jin et al. 2021a, Jin et al. 2021b).
I’m also keen on natural language generation, especially controlling the topic, style, and content of generated texts without additional supervision (Jin and Le, 2016, Jin et al. 2021, Jin et al. 2022).