Denis Sudakov

DENIS SUDAKOV[ru/en]

Building infrastructure for AI agents, language model post-training,
and local AI systems.
Moscow, Russia

Email · GitHub · CV

~~~

ABOUT

I build practical AI systems, with a focus on core infrastructure for
agents and post-training workflows.

Most of my work focuses on reliable, composable systems for AI agents
and local-first agentic infrastructure. I believe the current ecosystem
lacks foundational tooling for building useful, extensible agents that
are easy to share, modify, and integrate. My goal is to design and
implement these missing primitives as open, composable systems.

Currently studying geology at Lomonosov Moscow State University while
spending most of my time working on AI systems and ML research.

~~~

PROJECTS

 - su-agent (WIP)
   Local agent runtime and backend for running extensible AI agents on
   a user's machine. Exposes a system API for multiple client apps,
   skill/tool packages, session and memory management. Built around
   modularity, multi-client support, and local execution.

 - su-memory (WIP)
   Extension for su-agent (also compatible with Pi coding agent).
   File-based persistent memory and skill incubator. No vector search,
   embeddings, or RAG — just files.

 - su-subagent (WIP)
   Extension for su-agent (also compatible with Pi coding agent).
   Minimal sub-agent delegation via spawned agent processes. No shared
   context, no nested conversations, no framework.

 - su-ASR
   OpenAI-compatible speech recognition server. Real-time streaming,
   keyword biasing, multilingual support, low-latency inference.

 - Madiz
   Radiation monitoring system on a stratospheric satellite platform
   (RPi 4 + custom sensors). Handled hardware integration, onboard
   processing, compression pipeline, and control logic.

Early-stage

 - su-code
   Coding agent multiplexer on top of su-agent, Codex-style interface.
   Originally forked from t3-code.

 - su-claw
   Personal extensible agent with memory and skill management.
   Inspired by Hermes-style agent architectures.

Research

 - GSPO-PyTorch
   Implementation of Group Sequence Policy Optimization — an RL
   algorithm for LMs addressing GRPO limitations via sequence-level
   importance sampling.

 - aiijc-701w
   RL fine-tuning of Qwen models for mathematical reasoning. Dataset
   construction, benchmarking, GRPO-based training pipelines.

~~~

WRITING

 - Coming soon — research notes and system design write-ups

~~~

INTERESTS

 - LLM post-training for agentic environments
 - Local AI infrastructure
 - AI developer tooling

~~~

TEACHING

 - Instructor, International Summer School
   MSU University Gymnasium (2026)
   Teaching AI, machine learning, and modern agent systems to
   high school students.

~~~

EDUCATION

 - Lomonosov Moscow State University
   BSc Geology (2025–2029)

Independent coursework

 - Stanford CS336 — Language Modeling from Scratch
 - Stanford CS229 — Machine Learning
 - Stanford CS230 — Deep Learning
 - Harvard CS50AI
 - Harvard CS50
 - Harvard CS50 Web