@RosannaInvests: 🧠 Inference is the next phase ...

19 views Jun 22, 2026

🧠 Inference is the next phase of AI, and inference is bottlenecked by memory, not compute.

Every agent, every model serving real users, needs fast memory sitting next to the GPU. The market is starting to price the GPUs. It has not yet priced the memory layer underneath them.

$PENG (Penguin Solutions) sits exactly there. 🐧 Thread. 🧵

The thesis in one line.

As AI shifts from training to inference, the constraint moves from raw compute to memory bandwidth and capacity. You cannot serve a model fast if the data cannot reach the processor fast.

$PENG builds the memory architecture for that exact problem → CXL memory, KV cache servers, and a photonic memory appliance in development.

This is the inference-memory chokepoint. And it is profitable today. 💸

@RosannaInvests: 🧠 Inference is the next phase ...

Actions

What You Can Do