Mia @aLanguageModel
I ❤️🔥 data
Joined August 2012
Tweets 268
Followers 40
Following 1K
Likes 2K
@dwarkesh_sp Today’s models still suck in certain types of brainstorming such as: 1. Ideating names for things (model-generated names are slop) 2. Providing comprehensive lists of things (“list restaurants I might reserve tonight” -> model only lists a few options, none desirable)
MAI-Thinking-1: A powerful reasoning model developed from scratch that is competitive with models of similar size on STEM reasoning and coding tasks. Our pre-training focused on a simple scaling emphasizing data-driven iterative improvements to our architecture and data. Our reinforcement learning (RL) framework is optimized for sustained log-linear climbs over many thousands of steps. We are openly sharing all technical details and learnings to build a transparent and science-driven approach to further development in AI. Read More: msft.it/6011vj86J
Looking good 👀 Historically MiniMax models have worked well in OpenHands, looking forward to giving this one a whirl!
Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas - MiniMax Sparse Attention scales context to 1M - Natively Multimodal from Step Zero API: platform.minimax.io Token Plan: platform.minimax.io/subscribe/toke… 🚀New! MiniMax Code: code.minimax.io Weights & Tech Report in ~10 Days
AI agents are advancing research-level math. 🚀 I’m thrilled to share @GoogleDeepMind’s AlphaProof Nexus - an agentic framework for formal proof search powered by Gemini. When applied to a set of open formal math problems, our agent autonomously solved: ✅ 9 open Erdős problems (including two open for 56 years!) ✅ 44 Online Encyclopedia of Integer Sequences (OEIS) problems ✅ A 15-year-old open problem in algebraic geometry ✅ A 7-year-old open question in min-max optimization We are collaborating with mathematicians across disciplines - from combinatorics and graph theory to quantum optics. Ultimately, these results show the massive potential of even simple agentic loops powered by Gemini. Read the paper here: arxiv.org/abs/2605.22763…
AI has now solved a major open problem -- one of the best known Erdos problems called the unit distance problem, one of Erdos's favourite questions and one that many mathematicians had tried. openai.com/index/model-di…
@nrehiew_ Is there more or less forgetting vs RL, if one constructs SFT dataset like this (for 0/1 rewards): Annotators label data sampled from the initial model’s policy, and then we only keep the high-reward samples. Will that be closer to original distribution than RL distribution?
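The dataset construction described in the question above can be sketched roughly as follows. This is a minimal illustration, not anyone's actual pipeline: `build_filtered_sft_dataset`, `toy_policy`, and `toy_reward` are hypothetical names, and the "policy" is a toy stand-in that sometimes gets an addition wrong. It only shows the sample-then-filter step for 0/1 rewards and says nothing about the forgetting question itself.

```python
import random
from typing import Callable, List, Tuple

def build_filtered_sft_dataset(
    prompts: List[str],
    sample_fn: Callable[[str, random.Random], str],
    reward_fn: Callable[[str, str], int],
    samples_per_prompt: int = 4,
    seed: int = 0,
) -> List[Tuple[str, str]]:
    """Sample completions from the initial policy and keep only reward-1 samples."""
    rng = random.Random(seed)
    dataset = []
    for prompt in prompts:
        for _ in range(samples_per_prompt):
            completion = sample_fn(prompt, rng)
            # 0/1 reward: keep the sample only if it is labeled correct.
            if reward_fn(prompt, completion) == 1:
                dataset.append((prompt, completion))
    return dataset

# Hypothetical toy policy: answers "a+b" prompts, sometimes off by one.
def toy_policy(prompt: str, rng: random.Random) -> str:
    a, b = map(int, prompt.split("+"))
    return str(a + b + rng.choice([0, 0, 1]))

# Hypothetical toy annotator: reward 1 only for the exact sum.
def toy_reward(prompt: str, completion: str) -> int:
    a, b = map(int, prompt.split("+"))
    return int(completion == str(a + b))

data = build_filtered_sft_dataset(["1+2", "3+4"], toy_policy, toy_reward)
```

Every kept pair comes from the initial policy's own samples, which is the property the question is asking about.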
Jensen is one of the smartest and most far-seeing folks in the world. "If an AI scientist warns people that AI is going to permeate across radiology and radiologists are going to get wiped out, it might seem helpful but it's hurtful. If we convince everybody not to be radiologists and we now need radiologists, that actually is hurtful to society. "It is hurtful to convince all the young college graduates not to study software engineering because we are going to need more software engineers than ever. That's hurtful." "Scaring people with nonsensical things, which are not going to happen, that this is an existential threat, there's a 20% chance that it is existential, that's ridiculous. "That it's going to wipe out 50% of college level jobs. "That it is going to completely destroy democracy. "These kinds of comments are not helpful. They are made by...CEOs. And you become a CEO, maybe you adopt a God complex and somehow you know everything." Brutal. And right.
Big Update🤩: #paperclip now includes full papers from all of arXiv, PubMed Central and 150 million abstracts!🖇️ You can give your LLM all that knowledge in one line—all optimally indexed for AI agents. Much more thorough and ~100x faster than web search, and free.
Students are learning to build with Codex, and building to learn. Here’s what @UCBerkeley students built at the Codex Creator Challenge with @joinHandshake.
New work with @AlecRad and @DavidDuvenaud: Have you ever dreamed of talking to someone from the past? Introducing talkie, a 13B model trained only on pre-1931 text. Vintage models should help us to understand how LMs generalize (e.g., can we teach talkie to code?). Thread:
It is liberating being able to talk about what you work on.
Researchers' brilliant ideas often get lost in the sea of endless SOTA claims on weak baselines. At Marin we battle-test ideas in an open arena, where anyone's idea can be promoted to the next hero run. One that recently rose up was @Jianlin_S MoE Quantile Balancing, used in our last 1e22 and ongoing 130B run. Animated visuals of how QB performed are available in the OpenAthena blog. openathena.ai/blog/quantile-…
In my doctorate, I proved the Erdős Primitive Set Conjecture, showing that the primes themselves are maximal among all primitive sets. This problem will always be in my heart: I worked on it for 4 years (even when my mentors recommended against it!) and loved every minute of it. [Primitive sets are a vast generalization of the prime numbers: A set S is called primitive if no number in S divides another.] Now Erdős#1196 is an asymptotic version of Erdős' conjecture, for primitive sets of "large" numbers. It was posed in 1966 by the Hungarian legends Paul Erdős, András Sárközy, and Endre Szemerédi. I'd been working on it for many years, and consulted/badgered many experts about it, including my mentors Carl Pomerance and James Maynard. The proof produced by GPT5.4 Pro was quite surprising, since it rejected the "gambit" that was implicit in all works on the subject since Erdős' original 1935 paper. The idea to pass from analysis to probability was so natural & tempting from a human-conceptual point of view, that it obscured a technical possibility to retain (efficient, yet counter-intuitive) analytic terminology throughout, by use of the von Mangoldt function \Lambda(n). The closest analogy I would give would be that the main openings in chess were well-studied, but AI discovers a new opening line that had been overlooked based on human aesthetics and convention. In fact, the von Mangoldt function itself is celebrated for its connection to primes and the Riemann zeta function--but its piecewise definition appears to be odd and unmotivated to students seeing it for the first time. By the same token, in Erdős#1196, the von Mangoldt weights seem odd and unmotivated but turn out to cleverly encode a fundamental identity \sum_{q|n}\Lambda(q) = \log n, which is equivalent to unique factorization of n into primes. This is the exact trick that breaks the analytic issues arising in the "usual opening".
Moreover, Terry Tao has long suspected that the applications of probability to number theory are unnecessarily complicated and this "trick" might actually clarify the general theory, which would have a broader impact than solving a single conjecture.
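The identity \sum_{q|n}\Lambda(q) = \log n quoted above can be checked numerically with a short sketch. The function names `von_mangoldt` and `divisor_sum` are illustrative choices, and the code uses the standard definition \Lambda(n) = \log p when n is a power p^k of a prime p (k >= 1) and 0 otherwise. It is a sanity check of the identity only, not a reconstruction of the proof discussed in the post.

```python
import math

def von_mangoldt(n: int) -> float:
    """Lambda(n) = log p if n = p^k for a prime p and k >= 1, else 0."""
    if n < 2:
        return 0.0
    # Smallest prime factor of n (n itself when n is prime).
    p = next((d for d in range(2, math.isqrt(n) + 1) if n % d == 0), n)
    m = n
    while m % p == 0:
        m //= p
    # n is a prime power exactly when dividing out p leaves 1.
    return math.log(p) if m == 1 else 0.0

def divisor_sum(n: int) -> float:
    """Sum of Lambda(q) over all divisors q of n."""
    return sum(von_mangoldt(q) for q in range(1, n + 1) if n % q == 0)

# For n = 12 the contributing divisors are 2, 3, 4: log 2 + log 3 + log 2 = log 12.
for n in (12, 97, 360):
    assert math.isclose(divisor_sum(n), math.log(n))
```

Each prime power p^j dividing n contributes one \log p, so a prime appearing with exponent k in n contributes k \log p in total, which is how the sum encodes unique factorization.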
This is one of the coolest such examples! See comments from Lichtman below, who proved the related primitive set conjecture arxiv.org/abs/2202.02384
We’ve been thinking a lot about scaling laws, wondering if there is a more effective way to scale FLOPs without increasing parameters. Turns out the answer is YES – by looping blocks of layers during training. We find that predictable scaling laws exist for layer looping, allowing us to use looping to achieve the quality of a Transformer twice the size. Our scaling laws suggest that for a fixed parameter budget, data and looping should be increased in tandem! 🧵👇
Pleased to share our engineering practices for medium-sized LLMs in multi-turn agentic search, where we boosted Qwen3 8B and Qwen3 A3B from 1-2 turn search and 10% accuracy on Browsecomp-Plus to 15+ and 20+ turns with 30% accuracy. The devils are in the details; we hope our practices in stable RL training and data processing can help the community! Link: agate-slipper-ef0.notion.site/Cut-the-Bill-K… Chinese version: zhuanlan.zhihu.com/p/198709298638…
This and a few more tricks are covered in today's @character_ai blog post: blog.character.ai/squinch/.
🧵 I compared the performance of @boundaryML BAML and @DSPyOSS for a variety of structured outputs, and the results are interesting: different datasets, models and schema formats result in wildly different outcomes, some of them unexpected. There's no universal winner (which means that prompt optimization matters, more than ever, because it's *very* hard to discover the right prompts as a human). The benchmarks compare BAML's and DSPy's performance (with similar user instructions and description annotations). I also use `BAMLAdapter` in DSPy (which implements BAML's schema formatting in a custom DSPy adapter). Why use a custom adapter in DSPy? Because it has benefits, especially when dealing with nested data, as the experiments show. 👇🏽 1/7
kookkai x @XKookkai66374
1 Followers 83 Following
Min Kopkop @MinKopkop76693
0 Followers 145 Following
michael @michael67640879
143 Followers 708 Following
Sayed Aqil Aledruce A... @AledruceSayed
152 Followers 4K Following
mouk ganas @zulkifligenji
21 Followers 799 Following
Vidya Chummun @ChummunVid
66 Followers 285 Following
Seasarslirn @Seasarslirn3qL
133 Followers 3K Following
Stina @StinatRYa6h8
53 Followers 912 Following
BenWhitman @DrBenWhitman
297 Followers 3K Following Crafting tools to measure & improve AI performance for product people, prompt engineers and devs working with LLMs
Nick @Nick3143644518
2K Followers 7K Following
Morinkashi @Meimoshitate
53 Followers 2K Following Hi, I'm a young guy who draws from time to time and mostly likes playing video games and sleeping xd
Talfan @talfanevans
1K Followers 1K Following Cofounder @cursive_ai It's denoising - all the way down. Prev. research at @Deepmind. 🏴
Andrea 🤌🏾 Ranie... @4ndr3aR
976 Followers 1K Following Deep learning researcher @ CNR-IMATI. If you have a problem, if no one else can help, if your model doesn't learn...
Elizabeth Boutelle @ElizabethBoute7
281 Followers 454 Following Big time GOD FAN, MY CHILDREN, AND THE WASHINGTON REDSKINS. HTTR No mean words or foul language but will get THE BRAT POST now and then. Just Saying
CryptoPhoenix @teddydemask36
80 Followers 585 Following Investing in crypto founders since 17 @BlockOGCapital , running a closed community of all the crypto founders in India.
C-Dub @Caleb_W32
248 Followers 311 Following If you got a problem with Canada Gooses you’ve got a problem with me and I suggest you let that one marinate IG: @Caleb_white1998
Your Daily AI Dose @YourDailyAIDose
123 Followers 1K Following 🌐 | Latest AI news & breakthroughs 📆 | AIDailyDose 👀👉🌐What's Next ?
BusinessIntelligence @bimedotcom
16K Followers 6K Following Alex: Consultant, VC. I share and discuss #AI #GenerativeAI #Ethics #BI #Fintech #HealthTech #Blockchain #Metaverse #DataPrivacy #FreeSpeech #Geopolitics
Data Society TW @DataSocietyTW
35K Followers 35K Following Our society generate even more Data. We are a Data Society. This is a Social Channel on #BigData #Analytics #BI #DigitalTransformation.
Dr. Theophano Mitsa ... @theomitsa
10K Followers 8K Following Data scientist, Ph.D., Christian, book author, 11 patents. I love the truth. #Medicare4all #DataScience #DeepLearning, #machinelearning, #python
Akridata Inc. @akridata
3K Followers 279 Following AI Platform for Visual Data #AI #ComputerVision #Manufacturing #Transformers #humaninspection #qualitycontrol
SwissCognitive, AI Ve... @SwissCognitive
145K Followers 99K Following We are committed to unleashing the power of AI in the business world. With our AI research, advisory, and ventures, we bring a blend of expertise to the Table.
Great Expectations @expectgreatdata
4K Followers 1K Following We help data teams have confidence in their data, no matter what. GX Cloud, our end-to-end SaaS data quality platform, is powered by the open source GX Core.
SabrePC @sabrepc
7K Followers 2K Following SabrePC is a global provider of #HPC, Audio & Visual, and Enterprise hardware & technology. #AI #ML #DeepLearning #MachineLearning #AV #ComputerVision
Andrei Gheorghiu @Andrei_Teaches
154 Followers 556 Following trainer / thinker / speaker / doer / giver.
🐧 FOSS and #Linux ... @FOSS_Linux
32K Followers 3K Following We tweet about Free #OpenSource Software and #Linux https://t.co/aTZrANKch6 on 🐧 A project by @OrganicSoMe
Knut Jägersberg @JagersbergKnut
7K Followers 6K Following Content Strategy & AI https://t.co/xnBUK02hWS https://t.co/XFrvARyX4j
Chris Mauck @cmauck10
137 Followers 528 Following Data Scientist @ Cleanlab, Car Enthusiast, and Food Connoisseur
Matthew Barnett @MatthewJBar
9K Followers 391 Following Co-founder of @MechanizeWork Married to @natalia__coelho email: matthew at mechanize dot work
Rudolf Laine @LRudL_
3K Followers 421 Following What I'm doing: https://t.co/7tVMLt1OwN For my writing, see: https://t.co/GwKY6jk3Tl
Workshop Labs @WorkshopLabs
3K Followers 1 Following Workshop Labs is an AI research company with a mission to make people irreplaceable.
Luke Drago @luke_drago_
4K Followers 810 Following making people irreplaceable @thinkymachines. prev @workshoplabs co-founder. opinions my own.
Diyi Yang @Diyi_Yang
22K Followers 2K Following Assistant Professor @Stanford CS @StanfordNLP @StanfordAILab Part time @humansand LLMs for Humans
Larry Dial @classiclarryd
2K Followers 41 Following Technical Staff at Open Athena, working on Marin
Noah Ziems @NoahZiems
4K Followers 2K Following Applied Research @PrimeIntellect. Prev @MIT_CSAIL under @lateinteraction, PhD @NotreDame
toucan @distributionat
6K Followers 896 Following toucan beaks are models of lightweight strength • prev @AnthropicAI @scale_AI
Vals AI @ValsAI
11K Followers 257 Following Public LLM Evaluation // https://t.co/FjWabQY2jk @8vc @BloombergBeta @pearvc
Rohun Saxena @RohunSaxena
413 Followers 579 Following pretraining @MicrosoftAI prev: pretraining Gemini ⚡ @GoogleDeepMind, @LuminousAI, @StanfordAILab
Hanxiao Liu @Hanxiao_6
2K Followers 138 Following @Microsoft AI, ex-Inflection, Google Brain, DeepMind We are hiring!
Ryan Bahlous-Boldi @RyanBoldi
3K Followers 741 Following PhD @MIT_CSAIL | Continual RL, Open-Endedness, Evolution
Neal Wu @neal_wu
17K Followers 398 Following @thinkymachines, prev new stealth co, @cognition, @tryramp, @GoogleBrain, competitive programming
Steven Dillmann @StevenDillmann
551 Followers 1K Following Stanford PhD working on #AI4Science and maintaining Terminal-Bench Science @StanfordAILab 🧬🤖🪐
Maksym Andriushchenko @maksym_andr
6K Followers 954 Following Principal investigator @ELLISInst_Tue & @MPI_IS, mentor @MATSprogram, PhD from @EPFL, past works: AgentHarm, HalluHard, Claudini, PostTrainBench, InferenceBench
Ofir Press @OfirPress
18K Followers 8K Following I push the AI frontier by building tough benchmarks with amazing people. SWE-bench, SWE-agent, SciCode, AlgoTune. Postdoc @Princeton. PhD @nlpnoah @UW.
Soren Larson @hypersoren
5K Followers 2K Following applied cybernetics // maths music theory econ @harvard @swarthmore
Mimee // smart casual... @MimeeXu
886 Followers 375 Following what good can I do with my life if I only know math and computers. Doctorated on ML x security/privacy. Helpful honest and harmless not for profit
Anmol Gulati @anmol01gulati
4K Followers 1K Following Co-founder @coreautoai Prev: TL Agents Research @ Gemini, Google Deepmind. Cofounder @AdeptAILabs. Google Brain.
Mirendil @MirendilAI
193 Followers 0 Following
Harsh Mehta @HarshMeh1a
6K Followers 461 Following @MirendilAI, Past: AI R&D @AnthropicAI, @GoogleDeepmind, Gemini
Ethan Dyer @ethansdyer
1K Followers 137 Following
Anas Mahmoud @nas_mahmoud_
145 Followers 179 Following Post-training Research @ScaleAILabs | Prev Research @mila_quebec | Research Intern @Meta FAIR | PhD @UofT
MohammadHossein Rezae... @mhrezaeics
238 Followers 922 Following Post-training Research @ScaleAILabs | Ex Research Intern @StanfordNLP | CS @UArizona
jpark @jparkjmc
9K Followers 1K Following ceo @hillclimbai / ex pro valorant @misfitsgg research eng @googledeepmind
Sohail Prasad @sohailprasad
3K Followers 882 Following Founder & CEO @ Destiny ($DXYZ | https://t.co/JXar7RbKcb) Founder, fmr CEO @ Forge ($FRGE | https://t.co/jD5HNkvGj9) Seed Investor in 150+ startups, 10+ unicorns YC S12. Thiel Fellow. 30u30.
Mahesh Sathiamoorthy @madiator
15K Followers 1K Following RL Environment Curation. Data Curation (OpenThoughts). Post-training. CEO @bespokelabsai. Ex-GoogleDeepMind.
Guohao Li 🐫 @guohao_li
14K Followers 4K Following Founder @Eigent_AI / @CamelAIOrg. Scaling RL Environments for Agents. Prev Oxford, KAUST, ETHz, Intel, Kumo.
Tom Brown @NotTomBrown
27K Followers 429 Following Co-founder and Chief Compute Officer @AnthropicAI
Xiangyi Li @xdotli
5K Followers 1K Following your friendly neighborhood eval guy, creator of SkillsBench ClawsBench chat about data, evals, rl environments, skills https://t.co/Jl1qzLItZn
BenchFlow @benchflow_ai
624 Followers 28 Following data and environment lab best place to chat about benchmarks: https://t.co/jIOqi4jAFf
Dougal Maclaurin @DougalMaclaurin
604 Followers 249 Following
Reiner Pope @reinerpope
19K Followers 459 Following CEO and founder, @MatXComputing, developing high throughput chips tailored for LLMs
James Bradbury @jekbradbury
17K Followers 9K Following Compute at @AnthropicAI! Previously JAX, TPUs, and LLMs at Google, MetaMind/@SFResearch, @Stanford Linguistics, @Caixin.
Conor Bronsdon @ConorBronsdon
4K Followers 491 Following Working @Modular 🎙Host @Chain_OfThought Pod The views expressed on this account are my own and have not been reviewed or approved by my employer.
Fred Sala @fredsala
2K Followers 830 Following Assistant Professor @WisconsinCS. Chief scientist @SnorkelAI. Working on machine learning & information theory.
Kevin Kwok @kevinakwok
35K Followers 5K Following
Alexander Terenin @avt_im
8K Followers 2K Following Something new: soon · decision-making, machine learning, artificial intelligence · anti-ideological · Assistant Research Professor @Cornell, prev @CambridgeMLG
AI Security Institute @AISecurityInst
16K Followers 30 Following We conduct scientific research to understand AI’s most serious risks and develop and test mitigations.
Zhengyang Qi @qi_zhengyang
805 Followers 5K Following Research @SnorkelAI | Previously: @Scale_AI Multiturn RL, reward modeling, interactive environments, artificial social intelligence
Daytona @daytonaio
10K Followers 136 Following Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code.