Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors

Apple Introduces Open Source Multimodal LLM, Ferret

- Advertisement -

The multimodal LLM can use parts of images as queries using the GRIT Dataset consists around 1.1Mn examples.

Apple Inc. in collaboration with Columbia University’s AI researchers has quietly introduced an open-source multimodal large language model named “Ferret.” This model, unveiled on GitHub in October, gained significant attention from the AI research community, despite no official announcement.

Ferret is trained on 8 A100 GPUs with 80GB memory. The dataset used in the project is governed by the CC BY NC 4.0 licence, which permits non-commercial use only. The key contributions of the project include the Ferret model, GRIT dataset and Ferret-Bench.

- Advertisement -

The Ferret model combines a hybrid region representation with a spatial-aware visual sampler to enable fine-grained and open-vocabulary referring and grounding within a multimodal large language model (MLLM). This capability enhances the model’s ability to understand and respond to complex queries that involve both text and images.

The project introduces the GRIT Dataset, which consists of approximately 1.1 million examples. This dataset is designed to support large-scale, hierarchical, and robust instruction tuning for grounding and referring tasks. It serves as a valuable resource for training and evaluating AI models in tasks related to understanding and responding to instructions.

Ferret-Bench is a multimodal evaluation benchmark created as part of the project. It is designed to assess the performance of AI models across various dimensions, including Referring/Grounding, Semantics, Knowledge, and Reasoning. This benchmark provides a comprehensive testing ground for evaluating the capabilities of models like Ferret in real-world scenarios.

Ferret is described as a model that can use parts of images as queries, making it a powerful multimodal AI system. Its working involves examination of a specific region of an image. It then identifies elements within that region that could be relevant to a query and draws bounding boxes around these elements. Then it uses the identified elements as part of a query to provide responses in a traditional language model manner.

This means if a user highlights an image of an animal within a larger image and asks what the animal is, Ferret identifies the species of the creature and can use context from other elements in the image to provide further information or context.

The release of Ferret is seen as significant because it represents an unexpected level of openness from Apple, a company known for its secrecy. This open-source approach contrasts with Apple’s traditional practices.

One reason for this openness may be Apple’s need to compete in the AI industry, where it faces challenges from rivals like Microsoft and Google. Apple’s infrastructure is not optimised for serving large language models (LLMs) at scale, which puts it at a disadvantage. To address this, Apple must choose between partnering with cloud hyperscalers for AI or sharing its work with the open-source community, a strategy similar to what Meta Platforms Inc. (formerly Facebook) has adopted.

Ferret’s release demonstrates Apple’s willingness to collaborate and contribute to the AI research community, reflecting a shift in its approach to AI development.

- Advertisement -

Most Popular Articles

Industry's Buzz

ED Raids Gensol Engineering Offices; Documents And Devices Seized

0
As the BluSmart scandal unfolds, ED seizes records from Gensol offices; shares plunge 47% in 13 days amid SEBI and FEMA scrutiny. Jaggi brothers...

India May Permit Up To 26% Chinese Stake In Select Electronics JVs

0
In some specific electronics JV, India may greenlight up to 26% of the Chinese stake, and a tech boost is reportedly expected as firms...

India, France Sign Deal For 26 Rafale-Marine Jets For Indian Navy

0
India inks landmark deal with France for 26 Rafale-Marine jets, boosting naval air power, local defence production, and jobs under Aatmanirbhar Bharat by 2030. India...
Sasken

Sasken Technologies Posts Q4 FY25 Results, Reports Consecutive Growth

0
Posting its fifth straight quarter of growth, Sasken Technologies commends major global deals and profits, signalling strong momentum in digital transformation and semiconductor services. Sasken...

Solar Drive Innovation In Large-Scale Solar Systems

0
In a major advancement for the renewable energy sector, Semikron Danfoss’ latest power module, featuring ROHM’s 2kV SiC MOSFETs, has been integrated into SMA...

Learn From Leaders

Dr Venkatesh Vadde, Co-founder and CEO, Sensio Enterprises

“We Are One Of The Very Few—Perhaps Two Or Three—Companies Globally That Are Actually...

0
A decade ago, smartwatches were unknown. But now, the still nascent wearable market is talking about smart rings for health monitoring! What does Bengaluru’s...
Agalya Kondappan, Managing Director, Glonix Electronics Private Limited

“By Procuring Components In One Lot And Fabricating The Boards At Once, Clients Can...

0
Calling themselves a comprehensive solution provider, how is a company ensuring component authenticity, managing pricing, fabricating and assembling, then offering cost-effective bulk solutions? Agalya...
Kiran M S, Founder and Managing Director, Indus Technologies

“We Collaborate Directly With Customers Due To Ongoing Market Volatility To Create More Realistic,...

0
Can volatile supply chains be tackled without traditional forecasting tools? With 15 years in the industry, Kiran M S of Indus Technologies tells EFY’s...
Avesh Memon, Founder and CEO, Rilox EV Private Limited

“There Should Be Additional PLI Schemes For SMEs”- Avesh Memon, Rilox EV

0
As electric logistics gain momentum, key roadblocks remain. Avesh Memon of Rilox EV breaks down how limited charging infrastructure, high EV costs, and battery...

“We Aim To Make Every Garage In India EV-Ready” – Shubham Mishra of BatteryOK...

0
How Battery diagnostics in electric vehicles can be upgraded leveraging artificial intelligence? Shubham Mishra of BatteryOK Technologies, shares these insights with EFY’s Aryaman Raghuvanshi...

Startups

Dr Venkatesh Vadde, Co-founder and CEO, Sensio Enterprises

“We Are One Of The Very Few—Perhaps Two Or Three—Companies Globally That Are Actually...

0
A decade ago, smartwatches were unknown. But now, the still nascent wearable market is talking about smart rings for health monitoring! What does Bengaluru’s...

Inside BluSmart’s Stunning Fall After SEBI Crackdown Turns Green Ride Red

0
Once India’s EV poster child, BluSmart has halted services after SEBI exposed massive fund misuse, unravelling a cautionary tale of ambition, misgovernance, and lost...

IG Drones Reports 330% Revenue Growth In FY25, Eyes ₹1B Target

0
Soaring with 330% revenue growth in FY25, startup IG Drones eyes ₹1 billion in FY26, backed by booming demand, new drones, and global expansion...

Plugzmart’s Indigenous EV Fast Charger Gets ARAI Approval

0
Powering heavy-duty vehicles in 20 minutes while backing India’s push for tech self-reliance, Plugzmart’s fully homegrown 240kW EV fast charger earns ARAI nod. Plugzmart,...

Ather Energy Mulls $50 Million IPO Cut

0
Due to market volatility, Ather Energy may trim its $400 million IPO by $50 million but plans to proceed with the offering in the...