Ethan Justice

Software engineer. Interested in how things work.

Finishing a CS degree at Michigan, starting at Google YouTube in June. This is where I put things I’ve built, things I’m thinking about, and things I learned later than I should have.

  • Q1 2026

    quarterly · Apr 1, 2026

    Finishing Michigan, landed Google, and learned that technical work needs the right context.

  • Why I Rebuilt This Site

    notes · Apr 4, 2026

    I rebuilt this site because the old version was a dressed up resume and this one is trying to be more.

  • Software Engineer

    Google YouTube · Incoming · Jun 2026

    Backend infrastructure and LLM orchestration for YouTube Ask. C++ and Go.

    • c++
    • go
    • distributed systems
    • infrastructure
  • Machine Learning Researcher

    University of Michigan · Nov 2025 – Mar 2026

    Focused on KV-cache management in LLM inference systems, working within the vLLM and LMCache codebases. The goal was improving prefill speed by implementing bidirectional loading with overlapping IO and compute. A100s and H100s, profiled with Nvidia Nsight.

    • python
    • vllm
    • cuda
    • llm inference
  • AI Engineering Intern

    PersistOS · Aug – Nov 2025

    One of the first engineers at an agentic memory platform. Built the multimodal pipeline, a semantic caching layer that cut API response time by 99%, and the LLM-as-a-Judge testing suite to verify memory usage.

    • python
    • redis
    • gcp
    • llm
  • Heterogeneous Multi Agent Debate

    EECS 498 · ML Research · Aug – Dec 2025

    Capstone research project on Heterogeneous Multi Agent Debate for LLM inference. Confidence based gating to avoid redundant debate, cutting total FLOPs by 40%. Identified syntactic determinism as a class of failure modes that makes the approach unsuitable for syntax heavy fields, but reduces hallucinations in factual retrieval.

    • python
    • pytorch
    • vllm
    • research
all work →

I started coding in middle school writing control software for a robotics team. Went 30 miles south to the University of Michigan for CS.

Graduated in three years. Interned at UWM and Little Caesars doing backend work, then joined PersistOS as one of the first engineers. Spent my last semester at Michigan doing ML inference research on KV-cache management in vLLM and LMCache.

Starting at Google YouTube in June. I don't know what comes after that.

  • University of Michigan

    B.S.E. Computer Science · Dean’s List, University Honors

    Aug 2023 – May 2026