Research

A project portfolio of active investigations and implementation experiments.

Agent Evaluation Workbench

Benchmarking multi-step tool-use reliability with reproducible test suites.

Next.jsTypeScriptPostgreSQLOpenAI API

Comparing context window, vector retrieval, and hybrid memory pipelines.

PythonFAISSFastAPIRedis

Tracing task latency and failure modes across agent orchestration layers.

OpenTelemetryNode.jsGrafanaDocker

Designing a bilingual publishing workflow for research notes and technical essays.

Next.jsMDXnext-intlTailwindCSS