Draft:Positron AI

From Wikipedia, the free encyclopedia

Positron AI is an American artificial intelligence hardware company that develops computing systems designed for machine-learning inference workloads. Founded in 2023 and headquartered in Reno, Nevada, the company develops accelerator hardware and software intended to run transformer-based models such as large language models in data-center environments.


History

Positron AI was founded in 2023 by software engineer Thomas Sohmers and computer scientist Edward Kmett. Technology executive Mitesh Agrawal later joined the company as chief executive officer.

Early development work focused on running large language models on FPGA-based accelerator systems. These early systems were used to prototype inference-optimized architectures before the development of dedicated semiconductor designs. The company subsequently announced dedicated inference hardware platforms designed for deployment in data centers.

Technology

Positron AI develops hardware and software systems designed primarily for artificial intelligence inference rather than model training. The company’s architecture focuses on memory capacity and memory bandwidth, which are significant constraints when running transformer-based neural networks.

Its systems are designed to support large language models and other generative AI systems in production environments.

Atlas

Atlas is Positron AI’s first-generation inference accelerator system. The platform consists of accelerator cards integrated into server systems designed for standard data-center racks.

Atlas systems are designed to run transformer-based models and are optimized for inference workloads with an emphasis on memory bandwidth and power efficiency. The platform supports integration with commonly used machine-learning frameworks and inference APIs.

Asimov

Asimov is a custom application-specific integrated circuit (ASIC) architecture announced by Positron AI as a successor to its earlier FPGA-based platforms. The architecture is designed to support large-scale transformer models while increasing memory capacity and computational efficiency for inference workloads.

Systems based on the Asimov architecture are intended for deployment in large-scale data-center environments.

Software

Positron AI provides software designed to integrate its accelerator hardware with existing machine-learning ecosystems. The software stack is intended to support widely used model frameworks and inference interfaces, allowing transformer models to run on Positron hardware without extensive modification.

Funding

In 2025 the company raised $23.5 million in early funding and later secured a $51.6 million Series A round from venture investors. In 2026 the company announced a $230 million Series B financing that valued the company at approximately $1 billion.

References

Related Articles

Wikiwand AI