Modular @Modular

Building AI’s unified compute layer. We are hiring → https://t.co/cPTAes0HMt 🚀 modular.com Joined January 2022

Tweets

1K
Followers

23K
Following

2
Likes

722

Modular @Modular

2 days ago

Working through our GPU Puzzles? Don't sleep on our companion YouTube series that walks through puzzles 1 through 5. Follow along, pause, and rewind to make sure you grok the solution: youtube.com/watch?v=-VsP4k…

0 2 14 1K 5

View Details

Modular @Modular

3 days ago

At @AMD AI DevDay, @clattner_llvm showed that AMD MI355X paired with Modular platform delivers equivalent image gen performance to Blackwell at 5.5x lower total cost. Watch Chris' luminary talk: youtube.com/watch?v=FjFC__… Thanks again to @AIatAMD for a great event!

1 2 21 2K 5

View Details

Modular @Modular

6 days ago

We were happy to sponsor #MLSys 2026. Across the talks, posters, and keynotes, three themes defined the current state of inference serving: 1. Agentic engineering 2. KV Cache optimization 3. Heterogenous hardware Our read on each: modular.com/blog/three-tre…

1 2 29 2K 8

View Details

Modular @Modular

7 days ago

In the latest Modular Tech Talk, Mojo Compiler Engineer Billy Zhu presents Mojo's attribute-based expression system and how it enables Mojo's powerful type-safe meta-programming: youtu.be/4DKInnobCjY

1 4 26 2K 5

View Details

Modular @Modular

a week ago

One meeting: BLAS routines in Mojo, raylib Mojo bindings, consumer hardware support, MAX vs. vLLM, and where clauses in parametric structs. The community brought presentations, demos, and questions. Catch up with the full recording of yesterday's community meeting: youtube.com/watch?v=BasUPN…

0 4 17 2K 2

View Details

Modular @Modular

a week ago

Haven't had time to explore Mojo 1.0 beta yet? @InfoWorld published a piece on Mojo 1.0 that will get you up to speed on language basics, metaprogramming, Python interop, GPU support, and more: infoworld.com/article/417315…

1 6 39 2K 9

View Details

Modular @Modular

a week ago

Tomorrow's community meeting is a deep dive into 3 community Mojo projects: a Mojo implementation of BLAS routines, Raylib v6 Mojo bindings, and replacing OpenCL with a Mojo kernel within Darktable, an open source photo editing program. See what the community is building: luma.com/may-modular

0 1 16 1K 0

View Details

Modular @Modular

2 weeks ago

Want to bring a Modular meetup to your city? We're sponsoring events globally: forms.gle/tQuNpH2ahkZbST…

0 0 5 777 0

View Details

Modular @Modular

2 weeks ago

Seoul showed up! Packed room, sharp questions, a special message from @clattner_llvm, and an intro to Mojo 🔥 and MAX. Our first developer meetup in Korea. Thank you to SqueezeBits for making this happen, and to everyone who came out.

2 4 25 6K 3

View Details

Modular @Modular

2 weeks ago

Traditional load balancers were built for stateless services, but LLM inference backends aren't stateless. Part 2 of our inference routing series covers the data layer that queries live backend state across hundreds of pods at microsecond latency: modular.com/blog/why-llm-i…

0 3 22 1K 8

View Details

Modular @Modular

2 weeks ago

Agentic engineering is changing what one developer can ship with Mojo in a few weeks. @ehsanmok set out to build a pastebin service in pure Mojo 🔥 using our recently released first Mojo 1.0 beta. Using AI coding agents and Mojo's agent skills, he built 10 libraries from scratch: a full networking stack, SQLite bindings, high-performance JSON, reflection-driven serde, fuzz testing tools, and more. The app is live at mobin.fly.dev. Read about his stress test of Mojo 1.0 beta: modular.com/blog/how-i-bui…

1 2 35 3K 8

View Details

Modular @Modular

2 weeks ago

AI agents in healthcare face tight constraints: latency can't exceed 800ms per turn, the first turn processes 10k tokens of context, and safety models analyze the conversation in parallel. Using our MAX framework, @hippocraticai keeps patient conversations instant (sub-second TTFT), hits aggressive performance targets without sacrificing model accuracy, and runs across accelerators as new hardware comes to market. A look at how regulated enterprises like Hippocratic AI use MAX in production for real-time patient conversations: modular.com/blog/hippocrat…

0 6 28 5K 5

View Details

Modular @Modular

3 weeks ago

The MAX-LLM book just made it even easier to build an LLM from scratch. The new notebook format lets you run the GPT-2 components interactively, inspect real tensor shapes, and generate text from pretrained weights. Prefer to browse first? The pre-rendered version shows all outputs without running a cell: github.com/modular/max-ll…

0 8 66 3K 33

View Details

Ant Ling @AntLingAGI

3 weeks ago

🚀 Ring-2.6-1T is now open source. A trillion-scale flagship thinking model built for real-world complex tasks: Agent workflows, coding & engineering, long-horizon tasks, complex reasoning, research, and enterprise automation. It is designed to move beyond “answering” toward execution: understanding context, planning steps, calling tools, and staying stable across long task chains. Highlights： - Advanced agentic workflow support. - Reasoning effort levels: high for agentic tasks, xhigh for complex reasoning. - Scalable asynchronous RL via the IcePop algorithm, enabling stable, trillion-scale training for long-horizon agentic RL.

53 105 690 4.4M 213

View Details

Modular @Modular

3 weeks ago

Two can't-miss events coming up in Seoul: 👉 Modular's Judy Heflin presents an intro to Mojo & MAX at the Efficient AI Offline Meetup on May 16th at Dream Plus Gangnam: event-us.kr/squeezebits/ev… 👉 Our inaugural Modular Developer Meetup in Seoul on May 19th at Belgium Jazz Cafe: luma.com/modular-seoul Come for the talks from Modular & SqueezeBits. Stay for the NVIDIA RTX 5080 + AMD Radeon RX 9070 raffles 👀

1 2 11 1K 1

View Details

Modular @Modular

3 weeks ago

Mojo has minimal boilerplate, a strict type system, and compile-time validation of code, all things that make it well-suited for use with AI coding agents. We're taking this up a level by publishing a set of Mojo agent skills that make translating code to Mojo a breeze. Full writeup + CUDA kernel ➡️ Mojo translation demo: modular.com/blog/translati…

2 8 65 3K 19

View Details

Modular @Modular

3 weeks ago

Craft your own at inkwell.modular.com and tag us when you share - we'll send you swag!

0 0 4 493 1

View Details

Modular @Modular

3 weeks ago

Bolt's big dark: your best friend Bolt is a small silver robot with one wobbly antenna and a tiny light on his chest that blinks when he's nervous. He says the dark feels too big and too quiet and he doesn't know what's in it. Bedtime is in 10 minutes, and it's up to you to reassure him that everything will be okay: inkwell.modular.com/shared/bolt-s-…

1 0 4 610 0

View Details

Modular @Modular

3 weeks ago

What would you build with lightning fast image generation? Inkwell is @iamtimdavis' answer: a dynamic storybook-building app that uses @bfl_ml's FLUX2 and @googlegemma 4 to write and illustrate in real time. Powered by Modular Cloud. We sat down with Tim to talk through how Inkwell works under the hood: youtube.com/watch?v=F1X5bm… The short version: LLM tokens stream directly into the image prompt before the story finishes generating. First pixel under 500ms. Built on Mojo kernels and MAX serving infra.