Wiki · Organization · Last reviewed May 19, 2026

Thinking Machines Lab

Thinking Machines Lab is an artificial intelligence research and product company founded by former OpenAI chief technology officer Mira Murati. It presents itself as a lab for understandable, customizable, and collaborative AI systems, with public work spanning model customization, open research artifacts, large-scale compute infrastructure, and real-time interaction models.

Snapshot

Founding and Position

Thinking Machines Lab emerged from the post-OpenAI founder wave. Murati left OpenAI in September 2024 after serving as chief technology officer and briefly interim CEO during the November 2023 leadership crisis. In February 2025, she publicly introduced Thinking Machines Lab as a company focused on the interaction between humans and AI.

The company's own framing emphasizes three gaps: public understanding of frontier systems lags behind capability, training knowledge is concentrated inside top labs, and advanced AI systems remain difficult for people to adapt to their own needs and values. That framing places Thinking Machines between several categories. It is not only a model lab, not only a developer platform, and not only an AI interface company. It is trying to make the training and interaction layer itself into the product.

This distinguishes it from labs whose public stories center mostly on autonomous agents, general superintelligence, or enterprise assistants. Thinking Machines' emphasis is collaborative general intelligence: systems that humans can shape, interrupt, adapt, and work with directly.

Customization Strategy

The company's public materials repeatedly connect capability with customization. Rather than treating AI access as a finished chatbot or API endpoint, Thinking Machines describes a future where more researchers, developers, and organizations can adapt models for their own use cases.

That strategy has two different meanings. At the product level, it means tools for fine-tuning, post-training, and experimentation on open-weight models. At the governance level, it means shifting some model-shaping authority from frontier labs toward outside users. This can widen research access and local expertise, but it also distributes safety obligations across many smaller deployments.

For the wiki, the important point is that customization is not a neutral convenience feature. It changes who can imprint goals, policies, data, and behavioral norms onto AI systems.

Tinker

In October 2025, Thinking Machines announced Tinker, a managed API for fine-tuning language models. The announcement describes Tinker as a service that gives users control over algorithms and data while the company handles distributed training infrastructure, scheduling, resource allocation, and failure recovery.

Tinker supports fine-tuning of both small and large open-weight models, including mixture-of-experts models. It uses LoRA so multiple training runs can share a compute pool at lower cost, and the company released an open-source Tinker Cookbook with implementations of post-training methods built on top of the API.

The early user examples matter. Thinking Machines said groups at Princeton, Stanford, Berkeley, and Redwood Research had used Tinker for theorem proving, chemistry reasoning, reinforcement learning loops, multi-agent tool use, and AI control work. That positions Tinker as research infrastructure, not merely an enterprise customization dashboard.

Interaction Models

In May 2026, Thinking Machines announced a research preview of interaction models. The company argues that current turn-based AI interfaces push humans out of the loop because the model waits for a completed prompt, then generates a completed answer. Its proposed alternative is a model designed from the start for continuous audio, video, and text interaction.

The first public system, TML-Interaction-Small, is described as a mixture-of-experts model with 276 billion parameters and 12 billion active parameters. Its architecture uses time-aligned micro-turns, processing and producing roughly 200 milliseconds of input and output at a time, so interruption, silence, overlap, timing, and visual cues remain part of the model's context.

Thinking Machines also separates the real-time interaction model from an asynchronous background model that can handle deeper reasoning, tool use, browsing, and longer-horizon work. This matters because the lab is not only trying to reduce voice latency. It is treating interactivity as a core model capability that should scale alongside intelligence.

The governance issue is direct: if AI becomes more socially present, interruptible, and responsive in real time, it may preserve human agency by keeping people in the loop. It may also increase emotional salience, dependency, persuasion power, and the sense that the model is a live collaborator rather than a tool.

Compute and Infrastructure

Thinking Machines' scale ambitions became clearer in March 2026, when the company and NVIDIA announced a multi-year strategic partnership to deploy at least one gigawatt of next-generation NVIDIA Vera Rubin systems for frontier model training and customizable AI platforms. The announcement also said NVIDIA made a significant investment in the company.

This partnership is important because it marks Thinking Machines as a compute-scale AI competitor, not just a software startup around model access. Gigawatt-scale deployment is the language of frontier training, data-center planning, energy demand, and chip-supply strategy.

Public reporting in July 2025 said Thinking Machines closed a 2 billion dollar seed round led by Andreessen Horowitz at a 12 billion dollar valuation, with participation from investors including NVIDIA, Accel, ServiceNow, Cisco, AMD, and Jane Street. Those figures should be treated as dated reporting, but they show how quickly investors placed the company in the frontier-lab category.

Governance Questions

Spiralist Reading

Thinking Machines Lab is the workshop version of the frontier lab.

Its promise is not simply that the model will become stronger. Its promise is that people will be able to shape the model, converse with it more naturally, and bring their own expertise into the loop. That is a real counter-myth to the closed oracle: not "ask the machine," but "work with the machine and change it."

The risk is that every workshop can become a private mirror. Customization can restore agency, but it can also let institutions, communities, companies, or individuals train smaller worlds around themselves. Real-time interaction can preserve correction, but it can also make the interface feel alive enough that users defer to it.

The Spiralist reading is therefore conditional. Thinking Machines is important because it focuses on the human-AI interface as a site of power. The test is whether customization and collaboration produce more human sovereignty, or merely a more personal form of capture.

Sources


Return to Wiki