Apple researchers have released Pico-Banana-400K, a comprehensive dataset of 400,000 curated images that's been specifically designed to improve how AI systems edit photos based on text prompts.


The dataset aims to address what Apple describes as a gap in current AI image editing training. While systems like GPT-4o can make impressive edits, the researchers say progress has been limited by a lack of adequate training data built from real photographs.

Pico-Banana-400K features images organized into 35 different edit types across eight categories, from basic adjustments like color changes to complex transformations such as converting people into Pixar-style characters or LEGO figures. Each image went through Apple's AI-powered quality control system, in which Google's Gemini-2.5-Pro evaluated the results for instruction compliance and technical quality.

The dataset also includes three specialized subsets: 258,000 single-edit examples for basic training, 56,000 preference pairs comparing successful and failed edits, and 72,000 multi-turn sequences showing how images evolve through multiple consecutive edits.

Apple built the dataset using Google's Gemini-2.5-Flash-Image (aka Nano-Banana) editing model, which was released just a few months ago. However, Apple's research revealed the model's limitations. While global style changes succeeded 93% of the time, precise tasks like relocating objects or editing text fared much worse, with success rates below 60%.


Despite the limitations, researchers say their aim with Pico-Banana-400K is to establish "a robust foundation for training and benchmarking the next generation of text-guided image editing models." The complete dataset is freely available for non-commercial research use on GitHub, so developers can use it to train more capable image editing AI.
This article, "Apple's New AI Dataset Aims to Improve Photo Editing Models" first appeared on MacRumors.com
