ByteDance Releases GR-2 Robot AI Large Model
ByteDance's research team has recently released GR-2 (Generative Robot 2.0), its second-generation large model for robots. Its key idea is a “robot infancy” learning stage that imitates how humans grow and learn, allowing the model to pick up complex tasks and giving it strong generalization and multitask capabilities.
Like many other AI models, GR-2 is trained in two stages: pre-training and fine-tuning.
During the pre-training phase, GR-2 ‘watched’ 38 million internet video clips from various public datasets, amounting to more than 50 billion tokens and covering a wide range of everyday scenes such as home, outdoor, and office environments. This allows GR-2 to generalize across a wide range of robot tasks and environments during subsequent policy learning.
In the fine-tuning stage, the team fine-tuned the model on robot trajectories for both video generation and action prediction. The resulting model demonstrates strong multitask capability, achieving an average success rate of 97.7% across more than 100 tasks.
In addition, GR-2 demonstrates excellent generalization ability in novel and previously unseen scenarios, including new backgrounds, environments, objects, and tasks.