Performance Modeling Engineer
Etched
San JoseFull-time8d ago
About the role
About Etched
Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Backed by hundreds of millions from top-tier investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history.
Key responsibilities
- Develop comprehensive performance models and projections for Sohu's transformer-specific architecture across varying workloads and configurations
- Profile and analyze deep learning workloads on Sohu to identify micro-architectural bottlenecks and influence optimization opportunities
- Drive hardware/software co-optimization by identifying where architectural features can unlock performance improvements
- R