Hiranmay Darshane @hdarshane
research intern https://t.co/Uyagq8Ilgm. deep learning and large language models. football (banter) fan. 18. hiranmay.com Mumbai, India. Joined October 2019
Tweets 5K
Followers 681
Following 1K
Likes 32K
I'm having fun 🤗
Must be wild to do industry internships now. This sounds so unfun
tbh I too am kinda convinced the research work I do isn't meaningfully outside the scope of abilities autoresearch etc. will have soon enough
tbh I suspect most ML research jobs are gone in ~4 years bc of autoresearch
the last premium paying jobs will be for adaptable, high agency ops glue guys who can
- get in the trenches
- discover and prioritize problems
- use global context to rapidly implement a scalable fix
it's actually extremely beautiful work -- there may be so many implications hidden in sight as we keep on chewing on and digesting the fact that language modeling is solved
extremely tasteful work!!
8/ With that, we reframed multimodal generation as structured text/code generation. Diffusion just renders pixels. Planning, logic, reasoning all live in the LLM — so training looks like normal LLM training, and inherits all benefits of it: data + model scaling, reasoning, RL,
obv joking a bit here, just in case someone thinks I'm stereotyping an entire city :)
x.com/vkhosla/status… "sorry not sorry" tell me this isn't a Dilli trait :)
IMO all these Vinod Khosla anecdotes are downstream of a single fact i.e. he is from Dilli
truth nuke
7/ Looking beyond this paper: scaling compute against a fixed, limited pool of data will need new primitives. Searching over a population of models is a different problem than standard gradient descent training and we've barely scratched the surface. We hope q0 pushes people toward crazy ideas in multi-epoch training and scaling compute in general!!
yay
1/ Now that we're running out of data, how do you optimally scale multi-epoch pretraining to hundreds of epochs? Our first paper from Q! q0 trains a population of models, instead of a single model that saturates fast, reaching a dramatically lower loss at *every* epoch budget. w/
@soldni regularization is BACK i suppose. dropout 0.15 is quite large and i don't think anyone else uses dropout in the big 26. also rather high std for init these days but you can't go wrong with a good old 0.02. also why depth scale output proj when you have sandwich norm??
🔥
This paper empirically ~verifies the section of my first Zipfian grokking blog post where I hypothesize about how capacity competition dynamics extrapolate from the grokking to language pretraining case. Cool work from the authors! :)
q: "why don't Sora-like models learn compositional physics understanding or do ICL like how language models learn compositional semantics?" a: every attempt to date heavily leaks information from the future. some even bake it into the bottleneck design without realizing (!!!)
is this not regulated by SEC?
Rule changes for the SpaceX $SPCX IPO: Index providers waived the profitability requirement and cut the seasoning window from 90 days to 5. This forces over $30 trillion in passive 401k and retirement money to buy SpaceX at IPO valuations. Bloomberg Intelligence estimates S&P
The following animation conveys the intuition: when a 1-neuron model tries to learn two tasks, the frequent task updates suppress the infrequent task updates. The 2-neuron model can dedicate a neuron to the infrequent task once the frequent one is fully learned.
a quick way to force oneself into thinking about a thing is maintaining a list of words about that thing and just staring at it something something required circuits activate from high cosine similarity
I was thinking about it again recently, Google Allo was really ahead on the idea of chatting with Google Assistant or @'ing in conversations to build out this Agent/AI UX we have now
Rohan @rohanagrwl
287 Followers 2K Following Building UPI Infra at 86400. Prev built @stockalpha_ & EIR @join_ef
Sree @itzSreez
10 Followers 448 Following https://t.co/Q7kWxjeapt | happy and moisturized | Playing poker
Kish Flix @FlixKish
104 Followers 3K Following A network security expert with physics and computer science background. Love outdoor activities, movies and wine.
Onʇɹᴉpǝɹ @CDLXXXIII
167 Followers 2K Following
Gee Cee @GeeCee802
22 Followers 527 Following Center Right politically.. (key word “center”)…. ashamed by Trump… pro USA 🇺🇸 from forever…. but know we need allies/partners…. Let’s all be smart, not dumb!!
Aman @beingamanFF
411 Followers 910 Following GSOC ’26 @ GNU Octave | DS @HiLabs_inc | alum @iitroorkee | Explaining SOTA DL & systems | building, lifting, running
Bishwas @bishmdl76
63 Followers 70 Following i like training ai models | research @ https://t.co/25Ona6XwZL, cs phd
//TODO: fix later ... @enjoyingthewind
799 Followers 8K Following
Muzaffer Kal @🏡 ... @MuzafferKal_
2K Followers 7K Following Chips: ASIC, FPGA. CV/ML. Duck pictures by the lake. Some bread making.
serdarml @cs_serdar
114 Followers 595 Following Aspiring AI researcher, undergrad student @TU_Muenchen
Abhinav Singh @imabhi0703
184 Followers 2K Following Avid Learner | Pupil @ Codeforces | You are on your own | Coz I'm a young man after all
📗 @__the__human__
26 Followers 3K Following
Cam Howe @camhowe1729
116 Followers 2K Following a humble rebel interested in basement agi, higher order thinking and positive sum games.
truthixify @truthixifi
461 Followers 497 Following prev fellow @onlydust_com & @pldevguild | 0.5x hackathon winner
Alchemist ☢☢ 🇮... @LLawliet126
399 Followers 871 Following Atheist. Futurist/Transhumanist. Astronomy/Aerospace enthusiast. Defense enthusiast. Anime-Manga enjoyer. e/acc. Autocratic. priv @lawlietlight126
Abhipray Chavan @abhipray_chavan
7 Followers 779 Following
Chinmay @ChinmayKak
4K Followers 1K Following 21. gradient ascender. RL and Agents @MSFTResearch . love @teamIvLabs. dms open!
Dimitris Papailiopoul... @DimitrisPapail
28K Followers 1K Following Researcher @MSFTResearch, AI Frontiers | Prof @UWMadison (on leave) | babas of Inez Lily.
Akshay @akshayvegesna
431 Followers 172 Following Working on generalization at Q Labs. https://t.co/ExPhN2Kb4X Previously perception @nuro, math @caltech
calvin @calvingenuity
21 Followers 140 Following performative nomad. indulging in the science of next word speakers and more.
Josh Harkins-Finn @JHarkFinn
2 Followers 8K Following
nrRNjkitRHmMP @RNjkit72037
0 Followers 4K Following I'm interested in category theory and machine learning.
Neev Parikh @neev_parikh
946 Followers 2K Following are you ready for the intelligence explosion anon? ML research at @METR_Evals. prev @Stripe opinions my own.
Stéphane Deny @StphTphsn1
4K Followers 7K Following Neuroscience & ML Researcher. Posting about various topics on here. I retweet papers to increase their visibility (I do not read all), tag me for a retweet.
Jasper Gilley @0xjasper
1K Followers 621 Following representation learning, interpretability, aesthetics | the greatest art is yet to be created
Andrew Lampinen @AndrewLampinen
12K Followers 2K Following Interested in cognition and artificial intelligence. MTS at @AnthropicAI. Previously @DeepMind, cognitive science @StanfordPsych. Tweets are mine.
Amit Ranjan @amitranjan
32K Followers 4K Following Tinkering @ JauntLabs | Prev: CoFounded @SlideShare, Architect @DigiLocker_Ind #IndiaStack #DPI | Angel Investor
Melisa @MelisaSeah
5K Followers 975 Following creative experimentation — shaping storytelling @reve and exploring the future of AI & creativity
Shmuel Berman @ShmuelBerman
84 Followers 155 Following PVL Lab @ Princeton | Memory and Perception | Anthropic Fellow | https://t.co/jdfRoBjvfJ
Lisan al Gaib @scaling01
46K Followers 1K Following lead them to paradise LisanBench: https://t.co/vorVk7NMCy Impressum & Datenschutz: https://t.co/lFLgiu8EAU
Max Weinbach @mweinbach
293K Followers 8K Following Analyst @creativestrat | Analyst and Market Research Firm | Typo ignorer Email: [email protected]
✈️ Flight Leader ... @HQuarterma43504
1K Followers 501 Following High performance computing. AWACS Thunderbird ally.
Dhruv Batra @DhruvBatra_
21K Followers 728 Following Co-founder & Chief Scientist @yutori_ai. Prev: Senior Director leading FAIR Embodied AI @MetaAI and Professor @GeorgiaTech.
John Schulman @johnschulman2
75K Followers 2K Following Recently started @thinkymachines. Interested in reinforcement learning, alignment, birds, jazz music
Xiangdong Zhang @aHapBean
256 Followers 53 Following AI PhD student at @sjtu1896. I’m currently exploring llm pre-training. REDstar Intern at @xiaohongshu Dots (formerly Hi Lab). [email protected]
Brad Gerstner @altcap
206K Followers 1K Following Founder - Altimeter, Invest America | Trump Accounts, Center for Heart Attack Prevention. One precious life. Views here personal. @investamerica24 @bg2pod
Larry Dial @classiclarryd
2K Followers 42 Following Technical Staff at Open Athena, working on Marin
Tantum Collins @tantumscollins
870 Followers 64 Following CEO of @inherent_labs, previously @GoogleDeepMind
Jukan @jukan05
140K Followers 319 Following Tech otakus save the world | Not Investment Advice | DYODD
Jason Dean @_Jason_Dean_
8K Followers 5K Following “What must it be like to live in this world, seeing it just the way it is, and think that it will never change, never get any better?”
The OpenAI Foundation @FoundationOAI
7K Followers 0 Following OpenAI was founded in 2015 as a nonprofit; its mission is to ensure artificial general intelligence benefits all of humanity.
Taylor Sorensen @ma_tay_
2K Followers 605 Following make LLMs good for people! | PhD from @uwnlp, prev @humansand @stanfordnlp @byuacme, intern @GoogleDeepMind, @allen_ai | LLMs + alignment, pluralism, diversity
Amarillo Slim @Amarillo_Slim1
12K Followers 659 Following
pranav @pranav_so
484 Followers 979 Following 21 | use this as a note taking app | econ @ashokauniv | now https://t.co/JYoxUalFH8 via @vidhi_india + https://t.co/YZf9VQHQNn | increasing economic growth & improving dev outcomes
Laurie Whitwell @lauriewhitwell
484K Followers 999 Following Journalist for @TheAthleticFC, covering Manchester United. Instagram: lauriewhitwell
David Bessis @davidbessis
19K Followers 460 Following Rogue mathematician. "The product of mathematics is clarity and understanding." — Bill Thurston https://t.co/l95RHuWz2S
Albin Sheqiri @albinsheqiri
32K Followers 351 Following Assistant Coach @cercleofficial • UEFA PRO (2026-2028)
Nicholas Joseph @nickevanjoseph
8K Followers 51 Following Pretraining @AnthropicAI, formerly safety @OpenAI
Jiaxin Wen @jiaxinwen22
6K Followers 192 Following research @berkeley_ai @anthropicai. prev @tsinghua_univ.
Colossus @colossusmag
45K Followers 136 Following Subscribe: https://t.co/Zu7Sv2Efxd. Listen: @InvestLikeBest, @FoundersPodcast, @BizBreakdowns, @joyscompounding.
Swapan Dasgupta @swapan55
1.1M Followers 1K Following MLA from Rashbehari (Kolkata). Ideologically conservative & nationalist. Padma Bhushan (2015). Former MP (Rajya Sabha). Member BJP National Executive.
Maharashtra Progress ... @abhirammodak
13K Followers 157 Following Solution Arch, BFSI Specialist, Music Composer & Hobbytual Theoretical Physicist,dev/infra/industry/jobs tracker for Pune&MH- Retired, now Freelancer
Elon Litman @elon_lit
5K Followers 288 Following AI researcher @Stanford. hypernetworks, energy-based models, genomics. Everettian. information geometer.
Quinn Barry @quinnmbarry
796 Followers 1K Following thinking. ex: @maplefinance. @avarilabs. @stanford.
Tom Brown @NotTomBrown
27K Followers 427 Following Co-founder and Chief Compute Officer @AnthropicAI
Mao Shichigan @weAllGonnaDye
4K Followers 2K Following shitposting about Arsenal and my deteriorating mental health. Professional Bhanda Ghasser. I Probably tweet every minute, sorry for the tl spam.
Renny @rennyzucker
12K Followers 2K Following 28. A little long here, some short there. Head of trading, fintwit sarcasm desk.
🪓 @crazyxedi
1K Followers 859 Following
Dune Quotes @DuneQuoteBot
50K Followers 0 Following Unofficial bot that spits out a random quote from Frank Herbert's Dune books, even though spit is a terrible waste of water. By @thatjasonweiser