A new breed of artificial intelligence is emerging, capable of tasks it was never explicitly taught. This technology, known as zero-shot learning, is pushing the boundaries of AI capabilities and raising questions about the future of machine intelligence.
OpenAI’s GPT-4, for example, is a language model that showcases zero-shot learning abilities. Without any specific legal training, GPT-4 scored in the 90th percentile on the Uniform Bar Exam. The model also exhibited zero-shot translation capabilities, accurately translating between language pairs it had never seen before, such as Slovenian to Swahili.
Zero-shot learning allows AI systems to perform tasks or recognize objects without prior specific training. Traditional machine learning models require extensive datasets for each new task, but zero-shot learning algorithms apply existing knowledge to novel situations, mimicking human-like inference.
The roots of zero-shot learning trace back to a 2009 paper by Carnegie Mellon University researchers titled “Zero-shot Learning with Semantic Output Codes.” Since then, the field has advanced rapidly, with breakthroughs occurring in the past five years.
The impact of zero-shot learning extends beyond language processing. Researchers published a study in Nature Biomedical Engineering demonstrating CheXzero, an AI model that detects various diseases from chest X-rays using zero-shot learning. The model successfully identified conditions on which it had never been trained, such as pneumothorax (collapsed lung), by leveraging its understanding of related medical concepts and image features.
Google DeepMind unveiled Gato, a generalist AI agent that showcases zero-shot learning across diverse domains. In one demonstration, Gato played a new Atari game it had never seen before, applying strategies learned from other games to achieve a high score within minutes of exposure to the new game.
Zero-shot learning is also making waves in drug discovery. Researchers at MIT used a zero-shot learning model to predict the antimicrobial properties of molecules it had never encountered. The model identified a novel antibiotic compound effective against drug-resistant bacteria despite never being trained on that specific class of antibiotics.
In computer vision, Facebook AI (now part of Meta) developed Contrastive Language-Image Pre-training(CLIP), a zero-shot learning system. CLIP can classify images it has never seen before based solely on textual descriptions. For example, CLIP accurately categorized them using only their textual descriptions when presented with pictures of rare animals, not in its training data.
Microsoft Research’s recent work on zero-shot learning in computer vision demonstrates models identifying objects in images without specific training. Its system, VL-GPT, can generate detailed captions for pictures of complex scenarios it has never encountered before, such as describing the actions in a novel sport or the components of an unfamiliar technological device.
Robotics stands to benefit from zero-shot learning. MIT’s Computer Science and Artificial Intelligence Laboratory showcased robots manipulating previously unseen objects. In one experiment, a robot successfully grasped and used 20 different tools it had never encountered before, extrapolating from its understanding of tool use to determine how to manipulate each new item.
Zero-shot learning represents a significant shift in AI development. Its ability to generalize knowledge to new tasks could lead to more adaptable and efficient AI systems. Researchers at DeepMind predict that zero-shot learning could be vital to developing artificial general intelligence (AGI), machines with human-like cognitive abilities across a wide range of tasks.
As zero-shot learning advances, its impact on various industries becomes increasingly apparent. From healthcare to finance, robotics to language processing, this technology is poised to reshape how we approach complex problems and interact with AI systems. In the coming years, we will likely see a surge in zero-shot learning applications, accompanied by ongoing debates about their implications for society, privacy and the future of work.
For all PYMNTS AI coverage, subscribe to the daily AI Newsletter.