/ docs / ROADMAP-CYCLE-3.md
ROADMAP-CYCLE-3.md
  1  # Radicle Development - Cycle 3: Operational Excellence
  2  
  3  **Start Date**: November 12, 2025
  4  **Duration**: 6 weeks (Nov 12 - Dec 23, 2025)
  5  **Focus**: Dogfooding infrastructure & establishing operational practices
  6  **Status**: 🚀 **STARTING**
  7  
  8  ---
  9  
 10  ## 🎯 Cycle Overview
 11  
 12  **Objective**: Achieve 100% utilization of our own infrastructure through systematic dogfooding.
 13  
 14  **Current State**: Elite infrastructure built (DORA 4.0/4.0, Security 100/100) but only ~30% actually used
 15  **Target State**: 100% feature utilization with established operational habits
 16  
 17  **Why This Cycle?**
 18  - Validate infrastructure through real use
 19  - Find and fix issues before team adoption
 20  - Build sustainable operational practices
 21  - Demonstrate value of all features
 22  - Prepare for team onboarding
 23  
 24  ---
 25  
 26  ## 📊 Cycle Success Metrics
 27  
 28  ### Primary Metrics
 29  - **Feature Utilization**: 30% → 100%
 30  - **DORA Score**: Maintain Elite 4.0/4.0
 31  - **Security Score**: Maintain 100/100
 32  - **Infrastructure Health**: >99.5% uptime
 33  
 34  ### Operational Metrics
 35  - **Patch Workflow Usage**: 0% → 100% of changes
 36  - **CI Trigger Rate**: 0% → 100% of patches
 37  - **Dashboard Checks**: 0 → 5+ days/week
 38  - **Security Reviews**: 0% → 100% weekly
 39  - **Packages Published**: 0 → 3+ packages
 40  - **DR Tests Completed**: 0 → 3 scenarios
 41  - **Documentation Usage**: Ad-hoc → Systematic
 42  
 43  ---
 44  
 45  ## 🗓️ Phase Breakdown
 46  
 47  ### Week 1-2: Foundation & Quick Wins
 48  
 49  #### Phase 14: Establish Daily Rituals (Nov 12-18)
 50  **Goal**: Build muscle memory for core operational practices
 51  
 52  **Tasks**:
 53  - [ ] Set up daily dashboard check (5 min/day)
 54    - Open Grafana every morning
 55    - Review DORA, Security, Infrastructure, CI/CD dashboards
 56    - Document any issues found
 57  - [ ] Install shellcheck and fix all warnings
 58    - `brew install shellcheck`
 59    - Run on all scripts: `find scripts -name "*.sh" -exec shellcheck {} \;`
 60    - Fix all warnings (currently ~10-15)
 61    - Update pre-commit hook to require clean shellcheck
 62  - [ ] First weekly security review
 63    - Review last 7 days of security scan reports
 64    - Document findings in `docs/security/weekly-reviews/2025-11-18.md`
 65    - Address any issues found
 66  - [ ] Configure CI for Radicle repository
 67    - Run: `./scripts/ci-cd/configure-ci-project.sh --repo rad:z2s159BoUPWefbmtu6s5DV5vvxymy`
 68    - Create `.radicle-ci.yml` configuration
 69    - Test with dummy patch
 70  
 71  **Success Criteria**:
 72  - ✅ Grafana checked 5+ days this week
 73  - ✅ All shellcheck warnings fixed
 74  - ✅ First security review completed
 75  - ✅ CI configured and tested
 76  
 77  **Time Estimate**: 6-8 hours
 78  
 79  ---
 80  
 81  #### Phase 15: Package Registry Activation (Nov 19-25)
 82  **Goal**: Actually use the package registries we built
 83  
 84  **Tasks**:
 85  - [ ] Create and publish npm package
 86    - Package: `@auxo/radicle-tools`
 87    - Include: workflow scripts, monitoring tools
 88    - Create package.json with bin entries
 89    - Publish to Verdaccio: `npm publish --registry http://localhost:4873`
 90    - Test installation: `npm install -g @auxo/radicle-tools --registry http://localhost:4873`
 91  - [ ] Create and publish Python package
 92    - Package: `radicle-monitoring`
 93    - Include: monitoring scripts, metrics collectors
 94    - Create setup.py and pyproject.toml
 95    - Build: `python3 -m build`
 96    - Publish to devpi: `twine upload --repository-url http://localhost:3141/root/pypi/ dist/*`
 97    - Test installation: `pip install radicle-monitoring --index-url http://localhost:3141/root/pypi/+simple/`
 98  - [ ] Create and publish Docker image
 99    - Image: `localhost:5000/radicle-ci:2.0`
100    - Include: CI environment with all tools
101    - Create Dockerfile
102    - Build: `docker build -t localhost:5000/radicle-ci:2.0 .`
103    - Push: `docker push localhost:5000/radicle-ci:2.0`
104    - Test: Use in CI pipeline
105  
106  **Success Criteria**:
107  - ✅ 3 packages published (npm, PyPI, Docker)
108  - ✅ All packages installable from local registries
109  - ✅ At least one package actually used in workflow
110  
111  **Time Estimate**: 8-10 hours
112  
113  ---
114  
115  #### Phase 15.5: Package Registry Optimization (Nov 12)
116  **Goal**: Streamline package publishing workflow based on learnings
117  
118  **Tasks**:
119  - [ ] Create registry setup automation
120    - Script: `scripts/registry/setup-auth.sh`
121    - Auto-configure Verdaccio, devpi, Docker auth
122    - Verify connectivity and permissions
123  - [ ] Build package publishing helpers
124    - Script: `scripts/packages/publish-all.sh [version]`
125    - Script: `scripts/packages/version-bump.sh [major|minor|patch]`
126    - One-command publishing to all 3 registries
127  - [ ] Add shellcheck management tools
128    - Script: `scripts/quality/shellcheck-report.sh`
129    - Script: `scripts/quality/shellcheck-tracker.sh`
130    - Track progress toward zero warnings
131  - [ ] Create publishing documentation
132    - Guide: `docs/guides/publishing-packages.md`
133    - Troubleshooting: `docs/troubleshooting/registry-issues.md`
134    - Common issues and solutions
135  - [ ] Enhance pre-commit hooks
136    - Validate package.json syntax
137    - Validate pyproject.toml syntax
138    - Add Dockerfile linting (hadolint)
139  
140  **Success Criteria**:
141  - ✅ Single command publishes to all 3 registries
142  - ✅ Registry auth automated
143  - ✅ Shellcheck progress trackable
144  - ✅ Publishing guide complete
145  
146  **Time Estimate**: 2-3 hours
147  
148  ---
149  
150  ### Week 3-4: Operational Maturity
151  
152  #### Phase 16: Disaster Recovery Validation (Nov 26 - Dec 2)
153  **Goal**: Prove backups work through actual restore testing
154  
155  **Tasks**:
156  - [ ] Test Scenario 1: Registry Data Loss
157    - Stop all registries: `~/radicle-registry/registry-manager.sh stop`
158    - Backup current state: `cp -r ~/radicle-registry ~/radicle-registry.backup`
159    - Delete registry data: `rm -rf ~/radicle-registry/verdaccio/storage ~/radicle-registry/devpi/data`
160    - Restore from backup: `~/radicle-registry/restore-registries.sh --full`
161    - Verify packages accessible
162    - Document time taken and issues
163    - Create: `docs/operations/dr-tests/2025-11-26-registry-loss.md`
164  - [ ] Test Scenario 2: Radicle Node Data Loss
165    - Backup current state: `cp -r ~/.radicle ~/.radicle.backup`
166    - Stop node: `rad node stop`
167    - Delete storage: `rm -rf ~/.radicle/storage`
168    - Restore from backup
169    - Verify repos accessible
170    - Document results
171  - [ ] Test Scenario 3: Complete System Recovery
172    - Use VM or spare machine
173    - Fresh macOS install
174    - Restore everything from backup
175    - Document full procedure
176    - Time the complete recovery
177    - Update DR documentation with findings
178  - [ ] Update DR documentation
179    - Incorporate learnings from tests
180    - Update RTOs based on actual times
181    - Add troubleshooting for issues found
182    - Create runbook improvements
183  
184  **Success Criteria**:
185  - ✅ All 3 DR scenarios tested successfully
186  - ✅ RTO targets met: Registry <1h, Node <30min, Full <4h
187  - ✅ Documentation updated with real test results
188  - ✅ At least one issue found and fixed
189  
190  **Time Estimate**: 10-12 hours
191  
192  ---
193  
194  #### Phase 17: Workflow Transformation (Dec 3-9)
195  **Goal**: Switch to patch-based workflow for all changes
196  
197  **Tasks**:
198  - [ ] Document patch workflow in practice
199    - Create guide: `docs/guides/patch-workflow-daily-use.md`
200    - Include real examples from this cycle
201    - Add troubleshooting section
202  - [ ] Establish patch workflow rule
203    - All changes must go through patches (no exceptions)
204    - Even small fixes use patch workflow
205    - Self-review before merge
206  - [ ] Create 5+ patches this week
207    - Use for: bug fixes, documentation updates, feature additions
208    - Test CI on each patch
209    - Document CI results
210    - Track metrics: creation time, review time, merge time
211  - [ ] Review and merge patches properly
212    - Use `./scripts/workflow/review-patch.sh`
213    - Check CI results before merge
214    - Use `./scripts/workflow/merge-patch.sh` (not direct merge)
215  - [ ] Track patch workflow metrics
216    - Lead time per patch
217    - CI success rate
218    - Review turnaround time
219    - Add to weekly metrics review
220  
221  **Success Criteria**:
222  - ✅ 5+ patches created and merged
223  - ✅ 100% of changes go through patches
224  - ✅ CI triggered on every patch
225  - ✅ DORA metrics reflect patch workflow
226  
227  **Time Estimate**: 6-8 hours (spread throughout week)
228  
229  ---
230  
231  ### Week 5-6: Sustainability & Habits
232  
233  #### Phase 18: Metrics & Reporting (Dec 10-16)
234  **Goal**: Establish data-driven operational practices
235  
236  **Tasks**:
237  - [ ] Set up weekly metrics review meeting
238    - Schedule: Friday 2 PM, 30 minutes
239    - Create agenda template: `docs/operations/weekly-metrics-template.md`
240    - Prepare dashboard snapshot tool
241  - [ ] Create first 3 weekly reports
242    - Week 1 (Dec 13): Baseline
243    - Week 2 (Dec 20): Progress
244    - Week 3 (Dec 27): Full cycle review
245    - Include: DORA, security, CI, infrastructure health
246    - Document in: `docs/operations/weekly-reports/YYYY-MM-DD.md`
247  - [ ] Build metrics dashboard
248    - Custom Grafana dashboard for dogfooding metrics
249    - Track: feature utilization, patch workflow, reviews, DR tests
250    - Add annotations for major events
251  - [ ] Create improvement actions
252    - Based on metrics, identify improvements
253    - Create patches for improvements
254    - Track impact of changes
255  
256  **Success Criteria**:
257  - ✅ 3 weekly reviews completed
258  - ✅ Custom dogfooding dashboard created
259  - ✅ At least 2 improvement actions identified and implemented
260  - ✅ Metrics show progress toward 100% utilization
261  
262  **Time Estimate**: 6-8 hours
263  
264  ---
265  
266  #### Phase 19: Documentation & Knowledge Transfer (Dec 17-23)
267  **Goal**: Document everything learned for team onboarding
268  
269  **Tasks**:
270  - [ ] Create operational runbooks
271    - Daily operations: `docs/operations/runbooks/daily-ops.md`
272    - Weekly maintenance: `docs/operations/runbooks/weekly-maintenance.md`
273    - Monthly procedures: `docs/operations/runbooks/monthly-procedures.md`
274    - Incident response: `docs/operations/runbooks/incident-response.md`
275  - [ ] Update all documentation
276    - Add real usage examples from this cycle
277    - Fix any errors discovered through use
278    - Add troubleshooting sections
279    - Cross-link related documents
280  - [ ] Create onboarding guide
281    - For new team members
282    - Based on actual dogfooding experience
283    - Include common pitfalls and solutions
284    - Estimated time: 1 day to full productivity
285  - [ ] Document lessons learned
286    - What worked well
287    - What didn't work
288    - Unexpected benefits
289    - Areas for improvement
290    - Create: `docs/CYCLE-3-RETROSPECTIVE.md`
291  
292  **Success Criteria**:
293  - ✅ 4 runbooks created
294  - ✅ 10+ documentation improvements made
295  - ✅ Onboarding guide completed and tested
296  - ✅ Retrospective documented
297  
298  **Time Estimate**: 8-10 hours
299  
300  ---
301  
302  ## 📋 Daily & Weekly Rituals
303  
304  ### Daily Rituals (5 minutes)
305  **Morning (9:00 AM)**:
306  ```bash
307  # 1. Open Grafana dashboards
308  open http://localhost:3000
309  
310  # 2. Check overnight activity
311  ./scripts/monitoring/health-check.sh
312  
313  # 3. Review notifications
314  tail -50 ~/radicle-ci/logs/notification-server.log
315  ```
316  
317  **Evening (5:00 PM)**:
318  ```bash
319  # 1. Review day's work
320  git log --since="1 day ago" --oneline
321  
322  # 2. Check pending patches
323  ./scripts/workflow/list-patches.sh --open
324  
325  # 3. Commit/patch any work in progress
326  ```
327  
328  ### Weekly Rituals
329  
330  **Monday (9:00 AM) - 15 minutes**:
331  ```bash
332  # Security review
333  cat ~/radicle-ci/security-reports/$(ls -t ~/radicle-ci/security-reports/ | head -1)
334  # Document findings
335  vim docs/security/weekly-reviews/$(date +%Y-%m-%d).md
336  ```
337  
338  **Friday (2:00 PM) - 30 minutes**:
339  ```bash
340  # Metrics review meeting
341  ./scripts/monitoring/dora-metrics.sh --days 7
342  ./scripts/monitoring/security-metrics.sh --days 7
343  ./scripts/monitoring/ci-metrics.sh --since 7-days-ago
344  
345  # Document and plan improvements
346  vim docs/operations/weekly-reports/$(date +%Y-%m-%d).md
347  ```
348  
349  ---
350  
351  ## 🎯 Milestone Tracking
352  
353  ### Week 1-2 Milestones
354  - [ ] Shellcheck installed and all warnings fixed
355  - [ ] CI configured and running on patches
356  - [ ] 3 packages published to registries
357  - [ ] Daily dashboard check habit established (10/14 days)
358  
359  ### Week 3-4 Milestones
360  - [ ] All 3 DR scenarios tested successfully
361  - [ ] Patch workflow adopted for all changes
362  - [ ] 5+ patches created and merged
363  - [ ] First weekly metrics review completed
364  
365  ### Week 5-6 Milestones
366  - [ ] 3 weekly metrics reviews done
367  - [ ] Custom dogfooding dashboard created
368  - [ ] 4 runbooks completed
369  - [ ] Cycle retrospective documented
370  
371  ### End of Cycle Success Criteria
372  - [ ] **Feature Utilization**: 100% achieved
373  - [ ] **DORA Score**: Elite 4.0/4.0 maintained
374  - [ ] **Security Score**: 100/100 maintained
375  - [ ] **Patches**: 15+ created and merged
376  - [ ] **Packages**: 3+ published and used
377  - [ ] **DR Tests**: All 3 scenarios passed
378  - [ ] **Dashboards**: Checked 25+ days out of 42
379  - [ ] **Documentation**: 20+ improvements made
380  - [ ] **Habits**: Daily/weekly rituals established
381  
382  ---
383  
384  ## 📊 Cycle Dashboard
385  
386  ### Feature Utilization Tracker
387  
388  | Feature | Pre-Cycle | Target | Current | Status |
389  |---------|-----------|--------|---------|--------|
390  | **Patch Workflow** | 0% | 100% | _tracking_ | 🟡 In Progress |
391  | **CI/CD** | 0% | 100% | _tracking_ | 🟡 In Progress |
392  | **Security Reviews** | 0% | 100% | _tracking_ | 🟡 In Progress |
393  | **Dashboard Checks** | 0% | 80%+ | _tracking_ | 🟡 In Progress |
394  | **Package Registries** | 0% | 3+ packages | _tracking_ | 🟡 In Progress |
395  | **DR Testing** | Never | 3 scenarios | _tracking_ | 🟡 In Progress |
396  | **Documentation** | Ad-hoc | Systematic | _tracking_ | 🟡 In Progress |
397  | **Metrics Reviews** | None | Weekly | _tracking_ | 🟡 In Progress |
398  | **Pre-commit Hooks** | Passive | Active | _tracking_ | 🟡 In Progress |
399  | **Notifications** | Unused | Integrated | _tracking_ | 🟡 In Progress |
400  
401  ### Phase Completion Tracker
402  
403  | Phase | Status | Start Date | Complete Date | Time Spent |
404  |-------|--------|------------|---------------|------------|
405  | Phase 14: Daily Rituals | 🟡 Not Started | Nov 12 | - | - |
406  | Phase 15: Package Registries | 🟡 Not Started | Nov 19 | - | - |
407  | Phase 16: DR Validation | 🟡 Not Started | Nov 26 | - | - |
408  | Phase 17: Workflow Transform | 🟡 Not Started | Dec 3 | - | - |
409  | Phase 18: Metrics & Reporting | 🟡 Not Started | Dec 10 | - | - |
410  | Phase 19: Documentation | 🟡 Not Started | Dec 17 | - | - |
411  
412  ---
413  
414  ## 🚀 Getting Started Checklist
415  
416  ### Today (Day 1 - Nov 12)
417  - [ ] Read full roadmap
418  - [ ] Review `docs/DOGFOODING-PLAN.md`
419  - [ ] Open Grafana dashboards for first time
420  - [ ] Check latest security scan
421  - [ ] Run health check: `./scripts/monitoring/health-check.sh`
422  
423  ### This Week (Week 1)
424  - [ ] Install shellcheck: `brew install shellcheck`
425  - [ ] Configure CI for repository
426  - [ ] Start daily dashboard check habit
427  - [ ] Plan first security review (Monday)
428  - [ ] Review all shellcheck warnings
429  
430  ### This Month (Weeks 1-4)
431  - [ ] Complete Phases 14-17
432  - [ ] Publish 3 packages
433  - [ ] Test all DR scenarios
434  - [ ] Create 10+ patches
435  - [ ] Establish daily/weekly rituals
436  
437  ---
438  
439  ## 🔄 Iteration & Adaptation
440  
441  This roadmap is a living document. We'll adapt based on:
442  
443  **Weekly Reviews**:
444  - What's working?
445  - What's not working?
446  - Do timelines need adjustment?
447  - Are metrics improving?
448  
449  **Adaptation Triggers**:
450  - If phase takes 2x longer → reassess complexity
451  - If blocking issue → pause and resolve
452  - If new opportunities → evaluate priority
453  - If metrics not improving → investigate why
454  
455  **Communication**:
456  - Document progress in weekly reports
457  - Update phase status regularly
458  - Note blockers and resolutions
459  - Celebrate wins
460  
461  ---
462  
463  ## 📈 Expected Outcomes
464  
465  ### By End of Cycle
466  
467  **Infrastructure Usage**:
468  - All features actively used daily/weekly
469  - Proven through real operational scenarios
470  - Documented best practices established
471  - Team-ready for onboarding
472  
473  **Operational Maturity**:
474  - Daily rituals become automatic
475  - Weekly reviews drive improvements
476  - Metrics tracked and acted upon
477  - Issues caught and resolved quickly
478  
479  **Documentation Quality**:
480  - Tested through real usage
481  - Errors found and fixed
482  - Examples from actual practice
483  - Runbooks for all operations
484  
485  **Team Readiness**:
486  - Infrastructure proven in practice
487  - Clear onboarding path
488  - Documented workflows
489  - Established operational standards
490  
491  **Continuous Improvement**:
492  - Data-driven decisions
493  - Regular retrospectives
494  - Iterative refinement
495  - Sustainable practices
496  
497  ---
498  
499  ## 🎯 Success Definition
500  
501  **Cycle 3 is successful if**:
502  1. ✅ Feature utilization reaches 100%
503  2. ✅ Elite DORA and Security scores maintained
504  3. ✅ All DR scenarios tested and pass
505  4. ✅ Daily/weekly habits established and sustained
506  5. ✅ Documentation improved through use
507  6. ✅ Infrastructure ready for team adoption
508  7. ✅ Lessons learned documented for future
509  
510  **Bonus Goals**:
511  - Find and fix 3+ issues through dogfooding
512  - Reduce operational overhead through automation
513  - Create reusable operational templates
514  - Build confidence in infrastructure reliability
515  
516  ---
517  
518  ## 📚 Related Documentation
519  
520  - **Dogfooding Plan**: `docs/DOGFOODING-PLAN.md` - Detailed strategy
521  - **Previous Roadmap**: `docs/roadmap-progress-review.md` - Phases 1-13
522  - **Using Infrastructure**: `docs/using-the-infrastructure.md` - Usage guide
523  - **Weekly Reports**: `docs/operations/weekly-reports/` - Track progress
524  - **DR Tests**: `docs/operations/dr-tests/` - Test results
525  - **Security Reviews**: `docs/security/weekly-reviews/` - Security findings
526  
527  ---
528  
529  ## 📞 Questions & Support
530  
531  If stuck or blocked:
532  1. Check relevant documentation first
533  2. Review troubleshooting guides
534  3. Check Grafana for metrics/alerts
535  4. Document the issue for future reference
536  
537  For tracking:
538  - Create issues/tasks in `docs/operations/tasks/`
539  - Document blockers in weekly reports
540  - Note learnings in retrospective
541  
542  ---
543  
544  **Roadmap Version**: 1.0
545  **Created**: November 12, 2025
546  **Last Updated**: November 12, 2025
547  **Status**: Active - Starting Phase 14
548  
549  **Let's build sustainable operational excellence!** 🚀