Skip to content

Performance Modeling Engineer

Etched

San JoseFull-time8d ago

About the role

About Etched Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Backed by hundreds of millions from top-tier investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history. Key responsibilities - Develop comprehensive performance models and projections for Sohu's transformer-specific architecture across varying workloads and configurations - Profile and analyze deep learning workloads on Sohu to identify micro-architectural bottlenecks and influence optimization opportunities - Drive hardware/software co-optimization by identifying where architectural features can unlock performance improvements - R

More at Etched