Tuo An

MARS Lab. Multimodal embodied AI and Robotics System Lab, Nanyang Technological University

life_image_2.jpg

I am a first-year PhD student at Nanyang Technological University, supervised by Prof. Jianfei Yang. My research is supported by NTU graduate research scholarship. Before that, I received my Bachelor’s degree from the School of Artificial Intelligence, Nanjing University.

My research interests focus on Multimodal Perception and Embodied Intelligence, with an emphasis on developing robust and efficient learning methods for agents operating in the physical world. Specifically, I am interested in the following aspects:

Multimodal Perception: Explore how vision, language, and heterogeneous sensor data (e.g., IoT signals) can be jointly leveraged to enhance an agent’s perception and understanding of real-world environments.

Efficient Learning: Explore how Vision-Language-Action models can adapt to new tasks under limited data through efficient learning.

Generalization and Continual Learning: Explore generalization and continual learning in VLA models, especially in addressing catastrophic forgetting of previously acquired skills and world knowledge inherited from vision–language models.

news

Oct 02, 2025 IoT-LLM is accepted at Patterns, Cell Press, as a Cover Paper. :tada: :tada: :tada:
Aug 11, 2025 I am glad to join the MARS Lab as a PhD student in August 2025. Fighting for the future of AI! :rocket: :smile:
Nov 01, 2024 Awarded National Scholarship (Ranked 3/102)
Aug 07, 2023 Awarded Best Performance Award during the NJU Natural Language Processing(NLP) Summer Camp.

selected publications

  1. Under Review
    optimaztion.png
    Optimization-Guided Diffusion for Interactive Scene Generation
    Shihao Li, Naisheng Ye, Tianyu Li, and 7 more authors
    In arXiv preprint, 2025
  2. Cell Press
    iot-llm.png
    Iot-llm: Enhancing real-world iot task reasoning with large language models
    Tuo An, Yunjiao Zhou, Han Zou, and 1 more author
    Patterns, Cell Press (Cover Paper), 2025
  3. EMNLP
    EFUF.png
    EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models
    Shangyu Xing, Fei Zhao, Zhen Wu, and 5 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

education

NTU Logo

Nanyang Technological University (NTU), Singapore

2025 - Present

Ph.D in Mechanical and Aerospace Engineering, Advisor: Prof. Jianfei Yang

NJU Logo

Nanjing University (NJU), China

2021 - 2025

B.Eng. in Artificial Intelligence, Overall GPA 91.2/100

selected honors & awards

  • National Scholarship (the highest honor for Chinese students), 2024
  • Outstanding Student of Nanjing University, 2023
  • School of Artificial Intelligence Scholarship, 2022