I am a Member of Technical Staff at Microsoft AI SuperIntelligence. I earned my Ph.D. in Computer Science from Johns Hopkins University in 2024, where I was co-advised by Philipp Koehn and Kenton Murray and worked on foundational training of large language models, machine translation, and multilinguality.
My recent research explores next-generation AI agents, focusing on the complex reasoning and automated engineering of language models, with the aim of building highly capable agents for technical domains. Please find an up-to-date list of all my publications on my Google Scholar profile.
I also had the good fortune to intern at Microsoft, Meta (Facebook) AI Research, and Amazon Alexa AI.
Ph.D. in Computer Science, 2024
Johns Hopkins University
M.S. in Computer Science, 2020
Johns Hopkins University
B.E. in Information Engineering, 2018
East China University of Science and Technology