Apple researchers have released Pico-Banana-400K, a comprehensive dataset of 400,000 curated images that's been specifically designed to improve how AI systems edit photos based on text prompts. The massive dataset
Apple researchers have released Pico-Banana-400K, a comprehensive dataset of 400,000 curated images that's been specifically designed to improve how AI systems edit photos based on text prompts.


The massive dataset aims to address what Apple describes as a gap in current AI image editing training. While systems like GPT-4o can make impressive edits, the researchers say progress has been limited by inadequate training data built from real photographs. Apple's new dataset aims to improve the situation.

Pico-Banana-400K features images organized into 35 different edit types across eight categories, from basic adjustments like color changes to complex transformations such as converting people into Pixar-style characters or LEGO figures. Each image went through Apple's AI-powered quality control system, with Google's Gemini-2.5-Pro being used to evaluate the results based on instruction compliance and technical quality.

The dataset also includes three specialized subsets: 258,000 single-edit examples for basic training, 56,000 preference pairs comparing successful and failed edits, and 72,000 multi-turn sequences showing how images evolve through multiple consecutive edits.

Apple built the dataset using Google's Gemini-2.5-Flash-Image (aka Nano-Banana) editing model, which was released just a few months ago. However, Apple's research revealed its limitations. While global style changes succeeded 93% of the time, precise tasks like relocating objects or editing text seriously struggled, with success rates below 60%.


Despite the limitations, researchers say their aim with Pico-Banana-400K is to establish "a robust foundation for training and benchmarking the next generation of text-guided image editing models." The complete dataset is freely available for non-commercial research use on GitHub, so developers can use it to train more capable image editing AI.
This article, "Apple's New AI Dataset Aims to Improve Photo Editing Models" first appeared on MacRumors.com

Discuss this article in our forums

original link


You may also be interested in this

Apple’s Vision Pro mixed …

After years of speculation, Apple CEO Tim Cook hailed the arrival of the sleek goggles -- dubbed ``Vision Pro'' -- at the company's annual developers conference.

How to make a TikTok soun…

It's possible to create text tones or ringtones for your iPhone from your favorite TikToks — and for free. Here's how to do it.TikTok is a hub for burgeoning creators

iOS 19 might transform yo…

An iPhone running iOS 19 connected will be able to display applications running in windows on an external screen. (via Cult of Mac - Apple news, rumors, reviews and how-tos)

iPhone 15 Pro models will…

In a letter to its shareholders, Apple supplier Cirrus Logic has ostensibly confirmed that the iPhone 15 will not have the much-rumored solid-state buttons.Solid-state buttons were expected for iPhone 15

Android can’t compete: Ap…

Apple’s commitment to user privacy is the focus of a new in-depth interview featuring Katie Skinner, Apple’s User Privacy Engineering Manager, and Sandy Parakilas, Apple’s Privacy Product Marketing Lead. The

iPhone 16 Pro Max assembl…

Luxshare could see significant growth through 2023 and 2024 as Apple helps it build production lines in India while also offering it iPhone 16 Pro Max production.Luxshare getting help from

Apple’s Vision Pro mixed …

After years of speculation, Apple CEO Tim Cook hailed the arrival of the sleek goggles -- dubbed ``Vision Pro'' -- at the company's annual developers conference.

Apple Stops Signing iOS 1…

Apple today stopped signing the iOS 17.2.1 update, preventing iPhone users from downgrading to that version of iOS going forward. iOS 17.2.1 is no longer being signed following the January
X

A whimsical homage to the days in black and white, celebrating the magic of Mac OS. Dress up your blog with retro, chunky-grade pixellated graphics to evoke some serious computer nostalgia. Supports a custom menu, custom header image, custom background, two footer widget areas, and a full-width page template. I updated Stuart Brown's 2011 masterpiece to meet the needs of the times, made it responsive , got dark mode, custom search widget and more.You can download it from tigaman.com, where you can also find more useful code snippets and plugins to get even more out of wordpress.