Projects

Check out what I've been working on.

Local Activity Recorder for Computer-Use Modeling

A lightweight recorder that logs everyday computer use (keyboard, screen, mouse events) as structured traces for computer-use agent development.

See demo
LLM Evaluation Uncertainty

Conformal prediction for LLM-as-judge systems to quantify evaluation uncertainty. Includes LLM-as-judge evaluation using Sonnet 4, multiple conformal prediction strategies, and interactive visualization for uncertainty quantification.

Lightweight eBPF Runtime for SmallSat Operations

Research on novel runtime systems for multi-tenant satellite operations using eBPF. Addresses the challenges of managing hundreds of commodity SmallSats under intermittent communication. Published at ACM HotNets '25.