PolyU Innovates AI Training with Cost-Effective Model Fusion

The Hong Kong Polytechnic University (PolyU) has unveiled advances in generative AI (GenAI) research, introducing a collaborative training approach called Co-GenAI. The approach aims to decentralize AI training, cutting costs and making high-level AI research accessible to more institutions worldwide by lowering the resources that training large AI models has traditionally required.

Training foundation models has traditionally been prohibitively expensive, often requiring millions of hours of graphics processing unit (GPU) time, which has limited advanced AI research to a select few organizations. The PolyU Academy for Artificial Intelligence (PAAI) team has identified three major barriers: the high computational cost of model training, the siloing of data due to privacy and copyright issues, and the static nature of existing models, which inhibits rapid iteration. To address these challenges, the team has developed a framework for ultra-low-resource training and decentralized model fusion.

Revolutionizing AI Training

PolyU is the first academic institution to open-source an end-to-end FP8 low-bit training solution covering both continual pre-training (CPT) and post-training. Compared with BF16 precision, the approach trains more than 20% faster and lowers peak memory usage by more than 10%, reducing training overhead without compromising performance.
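To make these percentages concrete, the following sketch applies them to a hypothetical BF16 baseline. The baseline figures (1,000 GPU hours, 80 GB peak memory) are invented for illustration and do not come from PolyU. The sketch also assumes that "20% faster" means 1.2 times the throughput, and it treats the reported "over 20%" and "more than 10%" as exactly 20% and 10%, so real savings would be somewhat larger.

```python
def fp8_estimate(bf16_gpu_hours, bf16_peak_gb, speedup=0.20, memory_saving=0.10):
    """Estimate FP8 training cost from a BF16 baseline.

    speedup: fractional throughput gain (0.20 means 1.2x throughput); an
             assumed reading of "20% faster".
    memory_saving: fractional reduction in peak memory.
    """
    fp8_hours = bf16_gpu_hours / (1.0 + speedup)
    fp8_peak_gb = bf16_peak_gb * (1.0 - memory_saving)
    return fp8_hours, fp8_peak_gb


# Hypothetical baseline, not a figure reported by PolyU.
hours, peak_gb = fp8_estimate(1000.0, 80.0)
print(f"FP8 estimate: {hours:.0f} GPU hours, {peak_gb:.0f} GB peak")
```

Under these assumptions the run drops to roughly 833 GPU hours and 72 GB of peak memory.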

The new framework integrates multiple training methods, including supervised fine-tuning (SFT) and reinforcement learning (RL), to achieve BF16 quality while minimizing training time and memory footprint. The team is now exploring even lower-cost FP4 precision training, with promising initial results documented in its research. Models trained with these pipelines have outperformed peer models in medical diagnosis and reasoning, showing their potential in critical fields.
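Why matching BF16 quality at 8 bits is not trivial can be seen by rounding a few values to an FP8 grid. The sketch below simulates one common FP8 format, E4M3 (4 exponent bits, 3 mantissa bits, largest finite value 448). The source does not say which FP8 format PolyU's pipeline uses, so this choice is an assumption, and the function name `round_to_e4m3` is hypothetical.

```python
import math

E4M3_MAX = 448.0     # largest finite E4M3 value
E4M3_MIN_EXP = -6    # exponent of the smallest normal number
MANTISSA_BITS = 3


def round_to_e4m3(x):
    """Round a float to the nearest E4M3-representable value, saturating at the max."""
    if x == 0.0 or math.isnan(x):
        return x
    sign = -1.0 if x < 0 else 1.0
    a = min(abs(x), E4M3_MAX)
    # frexp gives a = m * 2**e with 0.5 <= m < 1, so floor(log2(a)) == e - 1.
    _, e = math.frexp(a)
    exp = max(e - 1, E4M3_MIN_EXP)        # subnormals share the minimum exponent
    step = 2.0 ** (exp - MANTISSA_BITS)   # spacing between neighbours at this scale
    q = round(a / step) * step            # Python's round() rounds halves to even
    return sign * min(q, E4M3_MAX)


for v in (0.1, 3.14159, 1000.0):
    print(v, "->", round_to_e4m3(v))
```

With only three mantissa bits, 0.1 becomes 0.1015625 and 3.14159 becomes 3.25, while anything above 448 saturates. Low-bit training recipes have to keep errors of this size from accumulating.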

PolyU's InfiFusion approach to model fusion is another significant result. Using only hundreds of GPU hours, the team merged four state-of-the-art models that would typically require 1 to 2 million GPU hours to train from scratch. This avoids substantial financial investment and yields fused models that surpass the original models' benchmark results.
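The article does not describe how InfiFusion combines models, so the sketch below does not reproduce it. It shows the simplest form of model merging instead: a weighted average of parameters from models that share one architecture. The point is to illustrate why merging can cost far less than training, since it is a single pass over existing weights. The function `merge_weighted` and the toy parameter names are hypothetical.

```python
def merge_weighted(models, weights):
    """Merge same-architecture models by weighted parameter averaging.

    models: list of dicts mapping parameter name -> list of floats.
    weights: one non-negative mixing coefficient per model.
    """
    if not models or len(models) != len(weights):
        raise ValueError("need one weight per model")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    names = set(models[0])
    if any(set(m) != names for m in models):
        raise ValueError("all models must have the same parameter names")

    merged = {}
    for name in models[0]:
        size = len(models[0][name])
        if any(len(m[name]) != size for m in models):
            raise ValueError(f"shape mismatch for {name}")
        # Normalised weighted sum, element by element.
        merged[name] = [
            sum(w * m[name][i] for m, w in zip(models, weights)) / total
            for i in range(size)
        ]
    return merged


# Toy example: two "models" with a single two-element parameter each.
model_a = {"layer.weight": [1.0, 2.0]}
model_b = {"layer.weight": [3.0, 6.0]}
merged = merge_weighted([model_a, model_b], [1.0, 1.0])
print(merged)
```

Published merging methods are typically more selective than a plain average; this baseline is included only to show the shape of the operation.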

Prof. YANG Hongxia, Executive Director of PAAI, remarked, “Ultra-low-resource foundation model training, combined with efficient model fusion, enables academic researchers worldwide to advance GenAI research through collaborative innovation.” The team’s work has been validated through rigorous mathematical derivation, leading to the introduction of the “Model Merging Scaling Law,” which suggests a new path toward achieving artificial general intelligence (AGI).

Collaborative Applications and Future Directions

PolyU’s PAAI is also collaborating with institutions such as Huashan Hospital, affiliated with Fudan University, and the Sun Yat-sen University Cancer Center to improve medical foundation models and cancer AI models. These collaborations aim to integrate high-quality, domain-specific data to support personalized treatment options and AI-driven radiotherapy solutions.

Additionally, PAAI has launched a cutting-edge agentic AI application designed to assist in academic research. This tool serves as a graduate-level academic paper writer that supports a multimodal patent-search engine, streamlining the research and manuscript drafting process.

Prof. Christopher CHAO, Senior Vice President of Research and Innovation at PolyU, stated, “AI is a key driver in accelerating the development of new quality productive forces. The newly established PAAI is dedicated to expediting AI integration across key sectors and developing domain-specific models for diverse industries.”

The project, led by Prof. YANG, is supported by various funding initiatives, including the Theme-based Research Scheme 2025/26 under the Research Grants Council and the Artificial Intelligence Subsidy Scheme under Cyberport. This initiative marks a significant advancement for Hong Kong in the realm of global AI innovation, promoting the democratization and industrial implementation of AI technologies.
