Elton Pinto

PhD student at Georgia Tech
Resume

Research

I currently work on LLM inference systems and solver-based job scheduling. On the inference side, I'm a project lead on Vajra, studying hierarchical prefix caching and tensor exchange mechanisms. More broadly, I enjoy solving problems at the intersection of systems and programming languages.

Last summer I interned at Uber's Programming Systems Group, where I developed a parallel version of the RTA callgraph construction algorithm for the Go programming language.

During my master's, I worked with Daan Leijen on prototyping an implementation of Perceus memory management for OCaml.

During my undergrad I was part of TINKER lab where I was advised by Prof. Thomas Conte and Dr. Jeff Young. While there, I wrote a space-efficient implementation of the quantum verification of matrix products (QVMP) algorithm.

Fellowships

  • Sutter Hill Ventures Codepoint Fellowship (2023-2024)
  • NSF GRFP (declined to pursue SHV fellowship)

Publications