I’m a nanophotonics PhD student, and I think photonic chips can solve the KV cache scanning bottleneck. Block-sparse methods like Quest/RocketKV reduce blocks fetched, but still scan all N block signatures from HBM every decode step. That scan is O(N) — at 1M context on H100, it’s ~8.5μs per query.