Tencent’s ‘training-free’ AI model improvement technique sparks debate

Paper argues that large language models can improve through experience on the job without needing to change their parameters Researchers at ...

Paper argues that large language models can improve through experience on the job without needing to change their parameters

Researchers at Tencent Holdings have proposed a new "lightweight" technique to get AI models to improve by using "experience" without retraining, sparking a debate about whether that could be the key to more cost-effective continual learning.

The paper titled "Training-Free Group Relative Policy Optimisation", published last week on open-access repository arXiv, argued that large language models (LLMs) can improve through on-the-job experience, without needing to change their parameters.

Current training methods for making LLMs more useful in real-life tasks are reliant on techniques such as reinforcement learning, in which the model's parameters - the variables encoding its "intelligence" - are adjusted through algorithms such as Group Relative Policy Optimisation (GRPO).

Do you have questions about the biggest topics and trends from around the world? Get the answers with SCMP Knowledge, our new platform of curated content with explainers, FAQs, analyses and infographics brought to you by our award-winning team.

Under GRPO, the model makes multiple attempts at a task, then adjusts its parameters based on the scores of those attempts. However, this process can be slow and computationally costly.

Instead, the researchers from Tencent's AI research lab suggested that LLMs could simply log the rules and heuristics from this GRPO process in an "experience library" and deploy them when faced with new tasks.

The paper provided examples of the kinds of heuristics the model comes up with itself, such as: "When solving geometry problems with intersections, validate solutions lie within bounded regions or segments, not on extensions, to avoid extraneous answers."

When the model next encounters a geometry problem, it will read this as part of its context and adjust its response accordingly. The model therefore becomes more capable as the experience library gets updated, rather than the model parameters.

The researchers applied this "training-free" GRPO algorithm to DeepSeek's V3.1-Terminus model, a 671 billion parameter model, and found that it beat Alibaba Cloud's Qwen2.5-32B-Instruct, a 32 billion parameter model fine-tuned using more conventional methods, on mathematical reasoning and web search tasks.

The result was achieved more efficiently, they claimed, with only 100 additional training examples needed to improve DeepSeek-V3.1-Terminus at a cost of around US$18, versus US$10,000 and 17,000 training examples for Qwen2.5-32B-Instruct.

Still, AI researchers online raised doubts about the research findings, pointing out that the experiment on models of different parameter sizes was not conducive to assessment of the relative benefits of the training-free GRPO technique.

The researchers noted this in the paper themselves, after finding that training-free GRPO as applied to Qwen2.5-32B-Instruct yielded worse performance than baseline scores.

"This may suggest that the effectiveness of our method is dependent on the underlying model's reasoning and tool-use capabilities ... indicating that model capability is a prerequisite for effective experience-based optimisation," they wrote.

While LLMs have driven much of the generative AI boom in recent years, their limitations in real-world domains have spurred researchers globally to focus on improving their self-improvement and continual learning capabilities.

In China, limited access to advanced US semiconductors has made cost-efficiency an additional priority, as companies look to scale up capabilities without incurring heavy computational costs.

More Articles from SCMP

Chinese airlines oppose US flight plan, Nets ditch China’s Zeng: SCMP daily highlights

US and China flex muscles as narrative war over trade tensions heats up

Hong Kong’s Metropol dim sum restaurant may be dead, but its neon sign will live on

Hong Kong’s respite agencies must be held accountable for refusing services

This article originally appeared on the South China Morning Post (www.scmp.com), the leading news media reporting on China and Asia.

Copyright (c) 2025. South China Morning Post Publishers Ltd. All rights reserved.

COMMENTS

Name

accessibility,1,activists,2,actors and actresses,5,adventures,1,advertising,1,africa,32,aging,3,agriculture,8,ai chatbots,2,air travel,5,airline industry,4,alternative energy,1,alzheimer's disease,1,ambassador,1,ancient egypt,1,ancient history,2,angelina jolie,1,animal behavior,1,animals,3,antitrust activities,1,antitrust law,1,apple,2,apple products and services,1,archaeologists,1,archaeology,2,armed forces,6,armenia,1,arrests,2,arsenal fc,1,art,3,artificial intelligence,24,artwork,3,asia,5,astronomy,1,astrophysics,1,asylum seekers,2,athletes,4,audits,1,austria,1,automation,6,automotive industry,3,autos,5,aviation,8,aviation accidents,6,babies,3,bags,1,banking,2,batteries,1,battery electric vehicles,6,bears,1,beauty,1,biology,2,biotech & biomedical,1,books,5,border crossings,1,borders,1,box office grosses,1,brain cancer,1,brain health,1,breaking news,2,breast cancer,1,breast cancer awareness,1,british monarchy,2,british royal family,3,broadband,1,broadcast media,2,broadcasting,3,buildings,3,business,171,business funding,1,cadillac,1,cancer,2,cancer patients,1,car companies,4,car design,1,car models,1,cars,11,catholic churches,1,catholicism,1,celebrities,32,celebrity and music,8,celebrity gossip,21,central banks,1,challenges,1,character and personality,1,charities,1,charity,1,chatgpt,1,children,9,children and families,9,chronic conditions and diseases,1,churches,1,cities and towns,2,civil aviation,3,climate,2,climate change,4,climatology,1,cloud computing,1,cloud services,2,coca cola,1,colleges and universities,8,commerce,36,community,22,compassion,2,complaints,1,computer security,1,computers,2,concerts,4,conservation,2,construction,4,consumer electronics,5,contests and competitions,2,controversies,55,cooking,1,corruption,5,cosmology,1,cost of living,2,counterfeiting,1,couples,2,courts,20,crabs,1,cricket,3,cricket players,3,crime,81,crimes,22,criminal cases,18,criminal justice,10,criminal law,3,criminal prosecution,3,crops,2,cryptocurrency,1,culinary arts,1,culture,27,customs,1,cybercrime,3,cybersecurity,2,data centers,1,data science and analytics,1,dating and relationships,1,dating apps,1,deep learning,1,dementia,1,democracy,2,diet and nutrition,3,dining,1,dining out,1,diplomacy and diplomats,7,disability,2,disaster management,7,disasters,34,diversity,1,divorce,1,dogs,1,donald trump,3,donald trump trial,1,driving,5,drugs,4,e commerce,1,earthquakes,1,ecology,2,economic inequality,2,economic policy,10,economics,32,economy of china,3,education,43,education reform,1,educational systems,16,educators,2,egypt,1,egypt history,1,egyptology,1,election commission of india,3,elections,3,electric batteries,1,electric cars,5,electric power,7,electronics,2,emergencies,18,emergency management,8,emergency services,1,employees,1,employment,2,employment law,1,empowerment,1,energy sector,6,energy sustainability,1,engineering,5,entertainment,29,entertainment industry,4,entrepreneurship,2,environment,1,environmental disasters,6,environmental friendliness,1,environmental health,3,environmental pollution,2,environmentalism,12,equities,1,europe,6,european football,1,european union,2,event tickets,1,events and festivals,5,exercise,1,exhibitions,4,extortion,1,factory reset,1,faith,1,faith and religion,1,farmers,1,farming,4,fashion & style,5,fashion and style,4,fashion brands,1,fashion design,1,fashion industry,1,fashion trends,2,female empowerment,3,festivals,1,film festivals,2,finance news,7,financial analysis,1,financial crime,2,financial literacy,1,financial markets,5,financial services,5,fintech,3,fire,1,fish,2,fisheries,2,fitness,2,flights,2,flooding,8,flying,4,food and beverage industry,1,food and drink,8,food banks,1,food culture,3,food safety,2,food service industry,1,foodies,1,football clubs,9,football players,3,foreign policy,10,foreign relations of iran,1,foreign relations of pakistan,1,formula 1,6,french,4,funding,1,funerals,1,future of cryptocurrencies,1,gadgets,2,gambling,1,gaming,1,gaza israel conflict,7,german,4,global warming,3,gold,1,google chrome,1,google products and services,1,government,237,government of pakistan,1,government regulations,6,governors,2,grief,4,handbags and purses,1,hate crimes,1,health,35,health & fitness,3,health and exercise,4,health and healthcare economics,3,health benefits,1,health risks,1,healthcare and medicine,39,healthy eating,1,healthy living,2,healthy workouts,1,heat,1,heating,1,heritage,4,higher education,11,history,3,home and property,10,hong kong,2,horror,1,hospitals,3,human rights,22,human trafficking,1,humanitarian aid,6,humanitarianism,6,hurricanes,2,hydrogen,1,identity verification,1,illness,11,immigrants,1,immigration,6,immigration policy,4,incident,36,india elections,1,indian national news,16,infectious diseases,6,inflation,4,information security,5,infrared light,1,infrastructure,10,innovation,22,insects,1,insurance,1,international cricket,2,international economics,2,international relations,21,international trade,3,internet data centers,1,internet security,3,investing,18,investing business news,18,investing company news,23,investing economy,3,investing market news,4,investing technology,4,investing news,7,investors,5,iphone,2,iran,1,iran and the middle east,1,iran nuclear deal,1,islam,1,israel and the gaza strip,7,israel and the middle east,5,jeep,1,jeffrey epstein,1,job,1,jobs and careers,1,johnson & johnson,1,journalism,23,journalists,2,judiciaries,11,kardashians,1,kenya,1,kidnapping,2,kim jong un,1,korean,5,labor unions,2,laws and regulations,21,layoffs,1,leadership,6,licenses,1,licensing,1,life insurance,1,lifestyle,5,literacy,3,lithium,1,live music,2,liverpool f.c.,1,living,1,loans,1,local and municipal government,3,local businesses,1,local news,57,love,2,luxury cars,1,machine learning,10,macroeconomics policy,2,manchester united,1,manufacturing,10,marriage,5,maternity,2,mathematics,1,meals,1,medical conditions and diseases,6,medicine and healthcare,30,meghan markle,2,mental health,3,mental illness,1,mentors and mentoring,1,metals,2,meteorology,2,microorganisms,1,migrants,5,military,30,military history and wars,1,military technology,3,mineralogy,1,mining,3,mining industry,1,misinformation,1,missing persons,1,mobile phones,4,mobile technology,7,modern warfare,2,money,17,morocco,1,motor bikes,7,motor fuel,1,motorcycle racing,2,motorcycle riding,7,motorsports,17,mountains,2,movies,6,multilingualism,2,murder cases,1,music,10,music and lyrics,4,music industry,4,music recording,1,musicians,5,nasa,1,national security,7,nato,1,natural disasters,5,nature,1,nature conservation,2,naval forces,1,nba,1,network security,1,news,808,news media,68,nigeria,9,north korea,1,nuclear energy,1,nuclear power plants,1,nuclear program of iran,1,nuclear reactors,1,nutrition,1,obituaries,6,off roading,1,oil,5,oil and fuel prices,1,oil and gas industry,2,oil refineries,1,olympic sports,1,oncology,1,outages,2,packing tips,1,pakistan,3,palestine,3,parasitology,1,parking lots,1,parkinson's disease,1,partnership,1,passports and international travel,1,performing arts,4,pests and diseases,1,petroleum,3,pfizer,1,philanthropy,2,pilots,1,planetary science,1,planning,3,police and law enforcement,20,police reports,20,political asylum,1,political corruption,2,political debates,8,political parties,5,political polling,1,political science,2,politics,376,politics and government,243,politics and law,194,politics of japan,1,politics of saudi arabia,1,politics of south korea,2,pop culture,2,pop music,1,popular culture,11,porsche,1,power outages,1,pregnancy,2,premier league,1,prince harry duke of sussex,1,product recalls,1,product reviews,1,protests,7,psychology,1,psychology of everyday life,1,public education,5,public health,13,public health and safety,5,public policy,27,public transportation,2,racing,15,racing drivers,3,racism,1,railroads,2,railway systems,3,rain,2,rainfall,3,rats,1,real estate,9,real estate market,5,reality television,1,recycling,1,red bull,1,red bull racing,2,refrigeration,1,refugees,4,regulation,2,relationships,7,religion,8,renewable energy,3,retail,2,robotics,1,rodents,1,roman catholic church,3,romania,2,rumors,1,russia ukraine conflict,5,russian,3,russian politics,3,safety,11,samsung,1,samsung products,1,saudi arabia,1,scandals,6,school children,2,school principals,1,school teachers,1,schools,13,science,18,science education,2,scientific research,3,scotland,1,seafood,1,search and rescue,2,securities,1,security,9,seismology,1,senior citizens,3,severe weather,2,shopping,1,shortages,3,singapore,3,skills,2,smartphones,2,smuggling,1,soccer,10,soccer player transfers,2,social issues,6,social justice,1,social media,3,society,8,sodas,1,software and applications,1,solar energy,1,solar panels,1,soldiers,1,south africa,1,space exploration,1,space travel,2,special education,1,spirituality,1,sports,41,sports betting,1,startups and entrepreneurship,1,state of palestine,2,stem education,1,stocks,6,storms,1,streaming,1,style,2,subaru,1,sustainability,4,suvs,1,sweeteners,1,syria,2,taiwan,1,taliban,1,tattoos,2,taylor swift,1,teachers,1,teaching,7,tech companies,3,technology,137,technology companies,10,technology industry,19,technology trends,21,telecommunications,2,television,1,temperature,1,tennis,1,tesla,1,tesla autopilot,1,the big bang theory,1,theft,1,tourist attractions,1,tourists,1,tournaments,3,track and field,2,traditions,1,tragedies,22,train travel,1,trains,1,transportation,7,trauma,2,travel,8,travel destinations,1,tropical storms,1,trump administration,1,tuna,1,tv,1,u.s. china relations,1,uber,1,ufo sightings,1,uganda,2,ukraine,7,ukraine politics,7,unemployment,1,united nations,8,unmanned aerial vehicles and drones,1,urban and regional planning,2,urology,1,vaccines,3,vandalism,1,video games,1,viral diseases,1,viruses,2,visual arts,1,volcanic eruptions,2,volcanoes,2,volcanology,2,volodymyr zelenskyi,2,walmart,1,warfare,4,waste management,1,water crises,1,water management,4,water quality,1,water supply,1,water treatment,1,weapons,7,weather forecasts,1,web browsers,1,weddings,2,wellness,2,whistleblowers,1,wildlife,2,wildlife conservation,1,wind energy,1,wind turbines,1,women,11,women's rights,1,workers,8,workforce,1,working out,1,world,16,youth,3,
ltr
item
Union Hotel: Tencent’s ‘training-free’ AI model improvement technique sparks debate
Tencent’s ‘training-free’ AI model improvement technique sparks debate
https://img-s-msn-com.akamaized.net/tenant/amp/entityid/AA1OCCmO.jpg
Union Hotel
https://www.unionhotel.us/2025/10/tencents-training-free-ai-model.html
https://www.unionhotel.us/
https://www.unionhotel.us/
https://www.unionhotel.us/2025/10/tencents-training-free-ai-model.html
true
676919279331320286
UTF-8
Loaded All Posts Not found any posts VIEW ALL Readmore Reply Cancel reply Delete By Home PAGES POSTS View All RECOMMENDED FOR YOU LABEL ARCHIVE SEARCH ALL POSTS Not found any post match with your request Back Home Sunday Monday Tuesday Wednesday Thursday Friday Saturday Sun Mon Tue Wed Thu Fri Sat January February March April May June July August September October November December Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec just now 1 minute ago $$1$$ minutes ago 1 hour ago $$1$$ hours ago Yesterday $$1$$ days ago $$1$$ weeks ago more than 5 weeks ago Followers Follow THIS PREMIUM CONTENT IS LOCKED STEP 1: Share. STEP 2: Click the link you shared to unlock Copy All Code Select All Code All codes were copied to your clipboard Can not copy the codes / texts, please press [CTRL]+[C] (or CMD+C with Mac) to copy