Auraison Sprint Plan
Generated: 2026-03-09 | Updated: 2026-03-14 Planning horizon: Sprints 8--14 (14 weeks, 2026-03-10 to 2026-06-15) Assignee: Pantelis Monogioudis (availability 0.6) Sprint capacity: 2 weeks x 5 days x 0.6 availability x 0.8 buffer = 4.8 days/sprint ~ 4 effective days
Summary
| Metric | Value |
|---|---|
| Open issues | 46 (including 7 epics) |
| In progress | 5 |
| Ready (unblocked) | 24 |
| Blocked | 8 |
| Closed (all time) | 30 |
| Commits (last 8 weeks) | 82 |
| Observed velocity | ~10 commits/week |
| Estimated total effort | 78 days |
| Sprints needed | 7 (Sprints 8--14) |
Epics
| Epic ID | Jira | Title | Status |
|---|---|---|---|
| auraison-ejd | AURA-176 | Data Plane: lakehouse package - COCO-Caption Experiment #0 | Closed |
| auraison-dji | AURA-356 | Backend wiring - Postgres, Redis, connect all agents to API | Open |
| auraison-l5h | AURA-357 | Infra / Ray + Docker GPU - deploy cluster, job service | Open |
| auraison-3hy | AURA-382 | Frontend - jobs, clusters, experiments UI pages | Open |
| auraison-rj7 | AURA-342 | AR4 Physical-AI application (LeRobot + GR00T stack) | Open |
| auraison-ncq | AURA-366 | Multi-agent information theory | Open |
| auraison-rks | AURA-376 | Compass component management for Auraison | Open |
Estimation
Individual issue estimates
| ID | Type | Title | Est (days) | Status |
|---|---|---|---|---|
| Data Plane (Epic: AURA-176) | Closed | |||
| auraison-rmo | feature | LakehouseCatalog with experiment/run schema | 5 | Closed |
| auraison-l1c | feature | sync_from_hf (HF Hub -> MinIO Parquet ingestion) | 5 | Closed |
| auraison-5va | feature | LakehouseQuery with typed Arrow/pandas results | 5 | Closed |
| auraison-q50 | feature | visualize() routing to Rerun or W&B | 5 | Closed |
| auraison-cqu | feature | streaming egress via HF IterableDataset | 5 | Closed |
| auraison-cxv | feature | Pydantic-AI agent with query/sample/quality_check tools | 5 | Closed |
| auraison-4w9 | task | full unit test suite - all modules green | 3 | Closed |
| auraison-08l | feature | COCO-Caption demo script (Experiment #0) | 5 | Closed |
| auraison-rpj | feature | COCO-Caption demo notebook | 5 | Closed |
| auraison-1m3 | task | final integration test - full suite + end-to-end | 3 | Closed |
| Data Plane subtotal | 46 | |||
| Data Plane - New work (post-Sprint 8) | ||||
| auraison-amc | feature | Onboard VisDrone dataset + Rerun video notebook | 3 | In progress |
| auraison-lzs | feature | Artifacts table + R2 public storage for Rerun recordings | 3 | In progress |
| auraison-gca | task | Complete visdrone migration using mc cp | 1 | Open |
| auraison-1b9 | task | Blog: Rerun + Cloudflare R2 public viewer tutorial | 1 | In progress |
| Backend Wiring (Epic: AURA-356) | ||||
| auraison-dji.1 | feature | Wire Postgres job store | 5 | Open |
| auraison-dji.2 | feature | Wire Redis for async job dispatch | 5 | Open |
| auraison-dji.3 | feature | Connect NotebookAgent to POST /api/v1/jobs | 5 | Open |
| auraison-dji.4 | feature | Job status polling + session resume | 5 | Open |
| auraison-dji.5 | feature | eaia copyback webhook endpoint | 5 | Open |
| auraison-dji.6 | feature | Connect ClusterAgent to GET /api/v1/clusters | 5 | Open |
| auraison-dji.7 | feature | Connect WandBAgent to GET /api/v1/experiments | 5 | Open |
| auraison-dji.8 | feature | Connect LakehouseAgent to GET /api/v1/lakehouse | 5 | Open |
| Backend subtotal | 40 | |||
| Infra / Ray (Epic: AURA-357) | ||||
| auraison-l5h.1 | task | Deploy Ray head node on control server | 3 | Closed |
| auraison-l5h.2 | task | Join GPU server as Ray worker | 3 | Open |
| auraison-l5h.3 | feature | FastAPI job service with Ray actor queue | 5 | Open (blocked) |
| auraison-l5h.4 | task | Sample training Docker image + submitter | 3 | Open (blocked) |
| auraison-oqk | feature | Scaffold Terraform IaC for Cloudflare + move K8s manifests | 3 | In progress |
| Infra subtotal | 17 | |||
| Frontend (Epic: AURA-382) | ||||
| auraison-3hy.1 | feature | Jobs page - list + submit form | 5 | Open |
| auraison-3hy.2 | feature | Clusters page - KubeRay health | 5 | Open |
| auraison-3hy.3 | feature | Experiments page - W&B run browser | 5 | Open |
| auraison-3hy.4 | feature | Live job status updates | 5 | Open |
| Frontend subtotal | 20 | |||
| AR4 Physical-AI (Epic: AURA-342) | ||||
| auraison-azf | task | AR4 URDF + Gazebo Harmonic simulation with gz_ros2_control | 5 | Open |
| auraison-dgd | task | MoveIt2 configuration for AR4 motion planning | 3 | Open (blocked) |
| auraison-b7v | task | Policy Server: model-agnostic VLA inference endpoint | 5 | Open (blocked) |
| auraison-4b0 | task | Episode logging and LeRobot dataset builder | 3 | Open (blocked) |
| auraison-dkb | task | Skill library: manipulation primitives for AR4 | 3 | Open (blocked) |
| auraison-drp | task | GR00T N1 integration via Policy Server backend swap | 3 | Open (blocked) |
| auraison-efq | task | AR4 hardware bridge: Teensy 4.1 serial interface | 3 | Open |
| auraison-3dh | task | Synthetic data pipeline via GR00T-Dreams and Cosmos | 3 | Open (blocked) |
| AR4 subtotal | 28 | |||
| Multi-Agent Info Theory (Epic: AURA-366) | ||||
| auraison-ncq.1 | task | MIMO symbolic communication | 3 | Open |
| auraison-ncq.2 | task | Rate-distortion interpretation of context windows and RAG | 3 | Open |
| auraison-ncq.3 | task | Python MIMO hallucination simulator | 3 | Open |
| auraison-ncq.4 | task | Soft symbols and semantic distortion in embedding space | 3 | Open |
| auraison-ncq.5 | task | Turbo codes, belief propagation, and chain-of-thought | 3 | Open |
| auraison-ncq.6 | task | Transformers as soft-symbol encoders in MIMO framework | 3 | Open (blocked) |
| auraison-wh0 | feature | MAC eval framework: information-theoretic LLM evaluation | 5 | Open |
| Info theory subtotal | 23 | |||
| Compass (Epic: AURA-376) | ||||
| auraison-ehv | task | Define Compass components from four-plane architecture | 3 | Open |
| auraison-0t9 | task | Map AURA Jira issues to Compass components | 2 | Open (blocked) |
| auraison-afb | task | Link GitHub repositories to Compass components | 2 | Open (blocked) |
| auraison-0r7 | task | Transfer Compass component ownership to Pantelis | 1 | In progress |
| auraison-zgv | task | Create Claude Code Jira service account | 1 | Open |
| Compass subtotal | 9 | |||
| Standalone tasks | ||||
| auraison-eco | task | Update design doc: control - user plane comms | 3 | Open |
| auraison-4vq | task | Review building-generative-ai-services -> ADR | 3 | Open |
| auraison-fgb | feature | Introduce Cognetivy as agent workflow state management | 3 | Open |
| auraison-rcd | feature | Software architect agent with Compass component sync | 3 | Open |
| auraison-zja | task | Audit notebook management skills for overlaps | 2 | Open |
| Standalone total | 14 |
Dependency DAG and Critical Path
Data Plane Critical Path (completed)
rmo --+
l1c --+---> cxv ---> 4w9 ---> 08l ---> rpj ---> 1m3 (ALL CLOSED)
cqu --+ q50 --+
^
5va
Infra Critical Path
l5h.1 (closed) --+
+---> l5h.3 ---> l5h.4
l5h.2 -----------+
oqk (IaC, in progress)
Critical path: l5h.2 -> l5h.3 -> l5h.4 = 3 + 5 + 3 = 11 days (~ 3 sprints)
AR4 Critical Path
azf ---> dgd ---> dkb
|
+---> b7v ---> 4b0 ---> drp
|
+---> 3dh
Critical path: azf -> b7v -> 4b0 -> drp = 5 + 5 + 3 + 3 = 16 days (~ 4 sprints)
Backend Wiring
No internal dependencies -- all 8 features are independent and ready. Can be parallelized with other work.
Frontend (P2)
No internal dependencies -- all 4 features are independent. Scheduled after P1 backend wiring.
Sprint Allocation
Sprint 8 (2026-03-10 to 2026-03-21) - Capacity: 4 days
Theme: Data Plane core (completed) + VisDrone + IaC
| ID | Title | Est | Priority | Status |
|---|---|---|---|---|
| auraison-amc | Onboard VisDrone dataset + Rerun notebook | 3 | P2 | In progress |
| auraison-lzs | Artifacts table + R2 public storage | 3 | P2 | In progress |
| auraison-oqk | Scaffold Terraform IaC + move K8s manifests | 3 | P1 | In progress |
| auraison-1b9 | Blog: Rerun + Cloudflare R2 tutorial | 1 | P2 | In progress |
| auraison-gca | Complete visdrone migration (mc cp) | 1 | P1 | Open |
Data Plane epic (AURA-176) closed during Sprint 8. Current Sprint 8 work is new data-plane features (VisDrone, artifacts) and IaC foundation.
Sprint 9 (2026-03-24 to 2026-04-04) - Capacity: 4 days
Theme: Infra GPU worker + Backend wiring starts
| ID | Title | Est | Priority |
|---|---|---|---|
| auraison-l5h.2 | Join GPU server as Ray worker | 3 | P1 |
| auraison-dji.1 | Wire Postgres job store | 5 | P1 |
| Total | 8 |
Sprint 10 (2026-04-07 to 2026-04-18) - Capacity: 4 days
Theme: Ray job service + Backend wiring continues
| ID | Title | Est | Priority |
|---|---|---|---|
| auraison-l5h.3 | FastAPI job service with Ray actor queue | 5 | P1 |
| auraison-dji.2 | Wire Redis for async job dispatch | 5 | P1 |
| Total | 10 |
Sprint 11 (2026-04-21 to 2026-05-02) - Capacity: 4 days
Theme: Job submitter + Agent wiring
| ID | Title | Est | Priority |
|---|---|---|---|
| auraison-l5h.4 | Sample training Docker image + submitter | 3 | P1 |
| auraison-dji.3 | Connect NotebookAgent to POST /api/v1/jobs | 5 | P1 |
| auraison-dji.4 | Job status polling + session resume | 5 | P1 |
| Total | 13 |
Sprint 12 (2026-05-05 to 2026-05-16) - Capacity: 4 days
Theme: Agent wiring completion + AR4 kickoff
| ID | Title | Est | Priority |
|---|---|---|---|
| auraison-dji.5 | eaia copyback webhook endpoint | 5 | P1 |
| auraison-dji.6 | Connect ClusterAgent | 5 | P1 |
| auraison-azf | AR4 URDF + Gazebo Harmonic simulation | 5 | P1 |
| Total | 15 |
Sprint 13 (2026-05-19 to 2026-05-30) - Capacity: 4 days
Theme: Final agent wiring + AR4 continues
| ID | Title | Est | Priority |
|---|---|---|---|
| auraison-dji.7 | Connect WandBAgent | 5 | P1 |
| auraison-dji.8 | Connect LakehouseAgent | 5 | P1 |
| auraison-eco | Update design doc: control - user plane comms | 3 | P1 |
| Total | 13 |
Sprint 14 (2026-06-02 to 2026-06-13) - Capacity: 4 days
Theme: Frontend MVP + AR4 motion planning
| ID | Title | Est | Priority |
|---|---|---|---|
| auraison-3hy.1 | Jobs page - list + submit form | 5 | P2 |
| auraison-dgd | MoveIt2 configuration for AR4 | 3 | P2 |
| Total | 8 |
Beyond Sprint 14 (backlog for Sprint 15+)
Remaining P2 (frontend):
- auraison-3hy.2: Clusters page (5 days)
- auraison-3hy.3: Experiments page (5 days)
- auraison-3hy.4: Live job status (5 days)
Remaining P2 (AR4 AURA-342):
- auraison-b7v: Policy Server VLA inference (5 days, blocked by azf)
- auraison-4b0: Episode logging + LeRobot dataset (3 days, blocked by b7v)
- auraison-dkb: Skill library: manipulation primitives (3 days, blocked by b7v, dgd)
- auraison-efq: AR4 hardware bridge Teensy 4.1 (3 days)
P2 (multi-agent info theory AURA-366):
- auraison-ncq.1 through ncq.6 + wh0 (23 days total)
P2 (Compass AURA-376):
- auraison-ehv: Define Compass components (3 days)
- auraison-0t9: Map Jira issues to Compass (2 days, blocked by ehv)
- auraison-afb: Link GitHub repos to Compass (2 days, blocked by ehv)
P2 (standalone):
- auraison-4vq: Review patterns -> ADR (3 days)
- auraison-fgb: Cognetivy agent workflow state management (3 days)
- auraison-rcd: Software architect agent with Compass sync (3 days)
- auraison-zja: Audit notebook skills (2 days)
P3 (AR4):
- auraison-drp: GR00T N1 integration (3 days, blocked by 4b0, b7v)
- auraison-3dh: Synthetic data pipeline (3 days, blocked by 4b0)
Milestones
| Milestone | Target Sprint | Key Deliverable | Jira |
|---|---|---|---|
| M0: Data Plane Core | Sprint 8 | LakehouseCatalog + sync + query + streaming + demo | AURA-176 (closed) |
| M1: IaC Foundation | Sprint 8 | Terraform scaffolding, R2 artifacts pipeline | |
| M2: Infra Ready | Sprint 10 | Ray cluster deployed, job service accepting submissions | AURA-357 |
| M3: Backend Wired | Sprint 13 | All agents connected to API routes, Postgres + Redis live | AURA-356 |
| M4: AR4 Sim Ready | Sprint 12 | URDF + Gazebo sim running | AURA-342 |
| M5: Frontend MVP | Sprint 15 (est.) | Jobs, Clusters, Experiments pages functional | AURA-382 |
Risks
- Heavy sprints (10+ days estimated vs 4-day capacity): Sprints 10-13 are overcommitted. This assumes some tasks complete faster than estimated and some parallel work across different codebases (data-plane vs infra vs backend).
- Single assignee: No parallelism across people. Availability of 0.6 is already factored in.
- Infra dependencies: Ray deployment (l5h.2) depends on hardware availability and network configuration.
- AR4 critical path is 16 days: Any slip on the azf/b7v/4b0 chain delays robotics milestones.
- IaC remediation: Existing manual Cloudflare config must be imported into Terraform before further provisioning.
Notes
- Sprint numbering continues from AURA Sprint 7 (last completed).
- Epic issues (ejd, dji, l5h, 3hy, rj7, ncq, rks) are not individually scheduled; they close when all children close.
- All estimates include 20% epic-level buffer but no per-issue contingency.
- Data Plane epic (AURA-176) fully closed as of Sprint 8.