Mojo Language: Python's Syntax, C's Speed?


Mojo made a splash in May 2023, promising to be a superset of Python with C/C++ performance. The benchmarks were eye-popping. Let’s look at what’s actually delivered.

The Pitch

Mojo claims:

- Python-compatible syntax, aiming to become a full superset
- C/C++-class performance through ahead-of-time compilation
- Systems-programming features: strong typing, memory ownership, first-class SIMD

Hello Mojo

fn main():
    print("Hello, Mojo!")

Looks like Python. But fn instead of def signals something different.

Key Differences

Strong Typing

# Python-like (dynamic)
def dynamic_add(a, b):
    return a + b

# Mojo (typed)
fn typed_add(a: Int, b: Int) -> Int:
    return a + b
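The contrast matters because CPython treats type hints as metadata only: an annotated `def` still goes through the same dynamic dispatch as a bare one, whereas Mojo's `fn` compiles against the declared types. A quick sketch of the CPython side:

```python
def dynamic_add(a, b):
    return a + b

def hinted_add(a: int, b: int) -> int:
    # CPython stores the hints but never checks or compiles against them
    return a + b

# The hints are introspectable...
print(hinted_add.__annotations__)

# ...but not enforced: strings pass straight through
print(hinted_add("no", "check"))  # "nocheck"
```

In Mojo, the equivalent call with `String` arguments to an `Int`-typed `fn` would be rejected at compile time.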

Value Semantics

from math import sqrt

struct Point:
    var x: Float32
    var y: Float32
    
    fn __init__(inout self, x: Float32, y: Float32):
        self.x = x
        self.y = y
    
    fn distance(self, other: Point) -> Float32:
        let dx = self.x - other.x
        let dy = self.y - other.y
        return sqrt(dx*dx + dy*dy)

struct uses value semantics (copied, not referenced).
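Python objects, by contrast, have reference semantics: assignment aliases, and mutation through one name is visible through the others. A minimal Python illustration of the difference Mojo's `struct` avoids:

```python
import copy

class Point:
    def __init__(self, x, y):
        self.x, self.y = x, y

a = Point(1.0, 2.0)
b = a                # reference semantics: b aliases a
b.x = 99.0
assert a.x == 99.0   # mutating b changed a too

c = copy.copy(a)     # an explicit copy restores value-like behavior
c.x = 0.0
assert a.x == 99.0   # a is untouched
```

A Mojo `struct` behaves like the `copy.copy` case by default: passing or assigning one gives you an independent value.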

Memory Ownership

fn process(owned text: String):
    # We own text, can mutate it
    pass

fn read_only(borrowed text: String):
    # We borrow text, can't mutate
    pass

fn mutable(inout text: String):
    # We can mutate the original
    text += " modified"

Like Rust, but with Python ergonomics.

Why It’s Fast

MLIR Backend

Python code → CPython interpreter → slow

Mojo code → MLIR → LLVM → native binary → fast

MLIR (Multi-Level Intermediate Representation) enables aggressive optimization.

Zero-Cost Abstractions

# This compiles to extremely efficient machine code
fn mandelbrot_kernel[
    T: DType
](c: ComplexSIMD[T, simd_width]) -> SIMD[T, simd_width]:
    var z = c
    var iters = SIMD[T, simd_width](0)
    
    for _ in range(max_iters):
        if all(z.squared_norm() > 4):
            break
        z = z * z + c
        iters = iters + (z.squared_norm() <= 4).select(1, 0)
    
    return iters

SIMD (Single Instruction Multiple Data) operations are first-class.
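The masked-update pattern in that kernel has a direct NumPy analogue, which may make the logic easier to follow: a boolean mask plays the role of the SIMD select, and every "lane" (array element) advances in lockstep. A sketch (my own illustration, not Modular's code):

```python
import numpy as np

def mandelbrot_iters(c, max_iters=50):
    """Count escape iterations for an array of complex points at once.

    Mirrors the Mojo kernel's structure: all lanes step together,
    and a mask selects which lanes still accumulate iterations.
    """
    z = np.zeros_like(c)
    iters = np.zeros(c.shape, dtype=np.int64)
    for _ in range(max_iters):
        mask = np.abs(z) <= 2.0           # lanes still inside the set
        if not mask.any():                # all lanes escaped: early exit
            break
        z = np.where(mask, z * z + c, z)  # update only unescaped lanes
        iters += mask                     # the select(1, 0) analogue
    return iters

c = np.array([0.0 + 0.0j, 2.0 + 2.0j])
print(mandelbrot_iters(c))  # lane 0 never escapes, lane 1 escapes immediately
```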

No GIL

Python’s Global Interpreter Lock prevents true parallelism:

# Python - GIL limits parallelism
from concurrent.futures import ThreadPoolExecutor
# Threads don't run simultaneously for CPU-bound work

# Mojo - true parallelism
fn parallel_work():
    @parameter
    fn worker(i: Int):
        # Actually runs in parallel
        compute(i)
    
    parallelize[worker](num_workers)
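To get true CPU parallelism in today's Python you have to sidestep the GIL with separate processes, each running its own interpreter. A sketch of the standard workaround that Mojo's `parallelize` makes unnecessary:

```python
# CPython workaround: processes, not threads, for CPU-bound parallelism.
# Each worker process has its own interpreter and its own GIL, at the
# cost of serializing arguments and results between processes.
from concurrent.futures import ProcessPoolExecutor

def compute(i):
    return sum(j * j for j in range(i * 1000))

if __name__ == "__main__":
    with ProcessPoolExecutor() as pool:
        results = list(pool.map(compute, range(8)))
    print(results[:3])
```

The process-pool route works, but pickling overhead and per-process memory make it a poor fit for fine-grained parallelism, which is exactly where Mojo's shared-memory model helps.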

The Benchmarks

Matrix Multiplication

Python NumPy:     ~1x (baseline)
Pure Python:      ~60,000x slower
Mojo:             ~5x faster than NumPy
Mojo (optimized): ~35,000x faster than pure Python
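You can reproduce the pure-Python-vs-NumPy end of this spread yourself; the exact ratio depends on your machine and matrix size, but even a small naive triple loop is orders of magnitude behind `@`. A hedged sketch:

```python
import time
import numpy as np

def matmul_python(A, B):
    """Naive triple-loop matrix multiply over lists of lists."""
    n, k, m = len(A), len(B), len(B[0])
    C = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            s = 0.0
            for p in range(k):
                s += A[i][p] * B[p][j]
            C[i][j] = s
    return C

n = 64
A = np.random.rand(n, n)
B = np.random.rand(n, n)

t0 = time.perf_counter()
C_py = matmul_python(A.tolist(), B.tolist())
t_py = time.perf_counter() - t0

t0 = time.perf_counter()
C_np = A @ B
t_np = time.perf_counter() - t0

assert np.allclose(C_py, C_np)
print(f"pure Python: {t_py:.4f}s  NumPy: {t_np:.6f}s  ratio ~{t_py / t_np:.0f}x")
```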

Mandelbrot

Python:  1x
Mojo:    35,000x faster

These are real numbers, but context matters.

The Reality

What’s Working (2023)

- The core language and MLIR-based compiler, with impressive numeric performance
- First-class SIMD and parallelism primitives
- A hosted Playground for trying the language out

What’s Missing (2023)

- Full Python superset status: classes and much of Python's dynamism aren't supported yet
- An open-source compiler (closed source at launch)
- A mature package ecosystem and broad platform support

The 35,000x Caveat

The comparison is against pure Python, not NumPy:

# Pure Python (slow)
def mandelbrot_python():
    for i in range(size):
        for j in range(size):
            pass  # pixel-by-pixel computation

# NumPy (fast)
import numpy as np
# Vectorized operations

Most Python ML code already uses NumPy/PyTorch, not pure Python loops.

Real Use Cases

Custom Kernels

# Write performance-critical code in Mojo
fn custom_attention[T: DType](
    q: Tensor[T], 
    k: Tensor[T], 
    v: Tensor[T]
) -> Tensor[T]:
    # Optimized attention implementation
    ...

Call from Python, get C speed.
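For reference, here is the computation such a kernel implements, written as a plain NumPy sketch of scaled dot-product attention (my own illustration; the Mojo version would fuse and vectorize these steps rather than materialize each intermediate):

```python
import numpy as np

def attention_reference(q, k, v):
    """Scaled dot-product attention: softmax(q @ k.T / sqrt(d)) @ v."""
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)
    scores -= scores.max(axis=-1, keepdims=True)   # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # rows sum to 1
    return weights @ v

q = np.random.rand(4, 8)
k = np.random.rand(4, 8)
v = np.random.rand(4, 8)
out = attention_reference(q, k, v)
print(out.shape)  # (4, 8)
```

The appeal of Mojo here is writing this once, in one language, instead of prototyping in NumPy and rewriting the hot path in C++ or CUDA.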

Replacing C Extensions

Instead of:

Python → Cython → C → compile → Python extension

Just:

Python → Mojo → Python extension

AI Model Inference

# Deploy models with minimal overhead
fn inference(input: Tensor) -> Tensor:
    # Runs as fast as optimized C++
    return model.forward(input)

Compared to Alternatives

Language     Python Compat  Speed    Ecosystem  Learning Curve
Mojo         High           Fastest  Growing    Medium
Cython       High           Fast     Mature     Medium
Numba        Limited        Fast     Mature     Low
Rust+PyO3    Interface      Fast     Mature     High
Julia        Import         Fast     Growing    Medium

Should You Care?

Yes, If

- You write performance-critical kernels or AI infrastructure
- You're hitting the limits of NumPy, Numba, or Cython
- You want to track where AI systems tooling may be headed

Not Yet, If

- You depend on the broader Python ecosystem day to day
- You need a stable, open, production-ready toolchain
- Your hot loops are already vectorized in NumPy or PyTorch

My Take

Mojo is genuinely interesting. The technical choices are sound:

- MLIR as the compiler backbone
- Value semantics and an ownership model for memory safety
- SIMD and parallelism as first-class language features

But it’s early. The “35,000x faster” headlines require context. For most developers, the ecosystem isn’t there yet.

Watch this space. Check back in 2024.


Mojo: Promising, but patience required.
