![]() |
|
Intro
- Starting the next phase, 2025 - Now.
- Between 2019 and 2025, I served as a Director of Robotics Research at ByteDance, where I established and led a great robotics research team, spearheading the development of cutting-edge robotic technologies and systems. I received my Ph.D. from Tsinghua University in 2019, advised by Fuchun Sun. I visited the University of Pennsylvania, working with Jianbo Shi. My research lies in the field of robot learning and computer vision, with a particular emphasis on devising scalable, AI powered algorithms and systems that enable robots to perceive and act in the real world.
- We are actively seeking full-time researchers and engineers specializing in robotics, with a focus on robot foundation models and systems. If you are interested in these positions, please drop me an email.
Selected Projects (15k+ Citations)
-
GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation
Robotics Research Team, ByteDance Research
Tech report, 2024
[Project][Paper][Video][News][GR-1][GR-MG][Visual-Force Learning][Findings]
-
Vision-Language Foundation Models as Effective Robot Imitators
Xinghang Li, Minghuan Liu, Hanbo Zhang, Cunjun Yu, Jie Xu, Hongtao Wu, Chilam Cheang, Ya Jing, Weinan Zhang, Huaping Liu, Hang Li, Tao Kong
Providing an Open-Source Robotics Learning Framework Based on VLM that Enables the Learning of a Wide Variety of Robot Skills.
ICLR, 2024
[Project][Paper][Code][News][RoboVLMs][VLM study]
-
Navigating to Objects in Unseen Environments by Distance Prediction
Minzhao Zhu, Binglei Zhao, Tao Kong
Our base method to win the Habitat ObjectNav Challenge 2022.
IROS, 2022 (Oral)
[Paper][News][Active Perception]
-
iBOT: Image BERT Pre-Training with Online Tokenizer
Jinghao Zhou, Chen Wei, Huiyu Wang, Wei Shen, Cihang Xie, Alan Yuille and Tao Kong
Among the Most Influential ICLR Papers in Google Scholar Metrics 2023
ICLR, 2022
[Paper][Code][News][dBOT][TWIST]
-
SOLO: Segmenting Objects by Locations
Xinlong Wang, Tao Kong, Chunhua Shen, Yuning Jiang and Lei Li
Among the Most Influential ECCV Papers in Google Scholar Metrics 2022/2023
ECCV, 2020
[Project][Paper][SOLOv2][Final][Code]
-
FoveaBox: Beyond Anchor-based Object Detector
Tao Kong, Fuchun Sun, Huaping Liu, Yuning Jiang, Lei Li, Jianbo Shi
ESI Highly Cited Paper (Top 1%).
Among the Most Influential TIP Papers in Google Scholar Metrics 2023
TIP, 2020
[Project][Paper][Code][Loss][HyperNet]
Honors & Awards
- IROS 2024 New Generation Star Program
- ICRA 2024 Co-manipulation Workshop Best Paper Finalists
- WAIC-Yunfan Award 2024
- CAA First Prize of the Natural Science Award 2023
- Habitat ObjectNav Challenge Winner Award 2022
- CAAI Excellent Doctoral Dissertation Nomination Award, 2020
- IROS Robotic Grasping and Manipulation Competition Winner Award 2016
- The CCF Outstanding Undergraduate Award, 2013
- University Young Science Award, 2013
- National Scholarship, 2012/2013
Last update: June, 2025