# Recent runs across all tiersfastkernels history# History for a specific operatorfastkernels history --op rms_norm# Best-ever speedup per operatorfastkernels history --best# Show more resultsfastkernels history --limit 50
Show Sample output: fastkernels history --op rms_norm
====================================================================== TRACKING HISTORY: rms_norm====================================================================== TIMESTAMP RUN NAME SPEEDUP PASS ERR_RATIO RUN ID ────────────────────────────────────────────────────────────────── 2026-03-16 17:03 agent_L2_Llama-3.1-8B-Inst -- -- -- 797bf661 2026-03-16 16:55 agent_L1_Llama-3.1-8B-Inst -- gen=FAIL -- 4f879e0a 2026-03-16 16:52 agent_L1_Llama-3.1-8B-Inst -- gen=OK -- df8ab5e3======================================================================
Show Sample output: fastkernels history --best
====================================================================== BEST SPEEDUP PER OPERATOR (from kernel benchmarks)====================================================================== OPERATOR BEST SPEEDUP DATE RUN ID ────────────────────────────────────────────────────────────────── rms_norm 1.64x 2026-03-16 16:44 a1b2c3d4 rotary_emb 1.22x 2026-03-15 11:30 d3e4f5a6 silu_and_mul 1.45x 2026-03-14 09:55 b7c8d9e0======================================================================
Experiment list (left sidebar) — select the fastkernels experiment to see all tracked runs.
Runs table — each row is a tracked run. Columns show run name, start time, duration, and logged metrics. Click column headers to sort (e.g., sort by e2e_speedup to find your fastest runs).
Search and filter — use the search bar above the runs table with MLflow filter syntax:
Parameters — model, level, TP degree, LLM model, seed, and other run configuration
Metrics — per-operator generation success (gen_rms_norm_success), unit test results (utest_rms_norm_success, utest_rms_norm_max_diff), and end-to-end results (e2e_speedup, e2e_token_match_rate)
Artifacts — browse the kernels/ folder to view and download the exact source code of every generated kernel; failed generations store error traces under errors/
From any run’s artifact browser, click a kernel file (e.g., kernels/rms_norm.py) to preview it. Use the download button to save it locally.You can also download artifacts programmatically:
import mlflowmlflow.set_tracking_uri("file://path/to/fastkernels/mlruns")client = mlflow.tracking.MlflowClient()# Download a specific kernel from a runclient.download_artifacts("<run_id>", "kernels/rms_norm.py", "/tmp/")# List all artifacts for a runfor artifact in client.list_artifacts("<run_id>", "kernels"): print(artifact.path)
Any kernel optimization script or custom agent can use the tracking API directly. This is the same API that fastkernels agent, fastkernels kernels, fastkernels eval, and fastkernels e2e use internally.
Tracking is always-on when mlflow is installed. To disable:
pip uninstall mlflow
The system degrades gracefully — a single warning is printed on the first tracking call, and all subsequent calls are silent no-ops. No code changes needed.