Model Souping
Model Souping was jointly proposed in July 2022 by a research team from the University of Washington, Google, and other universities and institutions. The research was published in the paper "Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time", which was accepted at ICML 2022.
Model Souping refers to averaging the weights of multiple independently fine-tuned models to improve accuracy and robustness. The method simply averages the weights of models produced by a hyperparameter sweep, requiring no additional training and adding no computational cost at inference time. When fine-tuning large pre-trained models such as CLIP, ALIGN, and a ViT-G pre-trained on JFT, Model Souping significantly improves on the best single model found by the hyperparameter sweep on ImageNet; the resulting ViT-G model achieved 90.94% accuracy on ImageNet, a new state of the art at the time. The method also extends to various image classification and natural language processing tasks, improving both out-of-distribution generalization and zero-shot performance on new downstream tasks.
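The idea can be sketched in a few lines. Below is a minimal illustration in plain Python, not the authors' implementation: each model's weights are represented as a dict mapping parameter names to flat lists of floats, and `evaluate` stands in for a hypothetical callback that scores a set of weights on held-out validation data. `uniform_soup` averages all models; `greedy_soup` (the variant the paper found strongest) adds models one at a time, best validation score first, keeping each only if it does not hurt the soup's score.

```python
def uniform_soup(state_dicts):
    """Average corresponding parameters across all models ("uniform soup")."""
    return {
        name: [sum(vals) / len(state_dicts)
               for vals in zip(*(sd[name] for sd in state_dicts))]
        for name in state_dicts[0]
    }

def greedy_soup(state_dicts, evaluate):
    """Greedily grow the soup, keeping a model only if it helps.

    `state_dicts` must be sorted by held-out validation score, best first;
    `evaluate` is a hypothetical scoring function (higher is better).
    """
    soup = [state_dicts[0]]
    best_score = evaluate(uniform_soup(soup))
    for sd in state_dicts[1:]:
        candidate = soup + [sd]
        score = evaluate(uniform_soup(candidate))
        if score >= best_score:  # keep the model only if the soup improves
            soup, best_score = candidate, score
    return uniform_soup(soup)
```

Because the averaging happens once, offline, the final soup is a single model: serving it costs exactly as much as serving any one of the fine-tuned models.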