Initial

2025-07-20 03:56:21 -04:00
commit 59539f4daa
65 changed files with 6964 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,182 @@
+# The Ubiquity of Space-Time Tradeoffs: Experiments & Implementation
+
+This repository contains the experimental code, case studies, and interactive dashboard accompanying the paper "The Ubiquity of Space-Time Simulation in Modern Computing: From Theory to Practice".
+
+**Paper Repository**: [github.com/sqrtspace/sqrtspace-paper](https://github.com/sqrtspace/sqrtspace-paper)  
+**Interactive Dashboard**: Run locally with `streamlit run dashboard/app.py`  
+**Based on**: Ryan Williams' 2025 result that TIME[t] ⊆ SPACE[√(t log t)]
+
+## Overview
+
+This project demonstrates how theoretical space-time tradeoffs manifest in real-world systems through:
+- **Controlled experiments** validating the √n relationship
+- **Production system analysis** (PostgreSQL, Flash Attention, MapReduce)
+- **Interactive visualizations** exploring memory hierarchies
+- **Practical tools** for optimizing space-time tradeoffs
+
+## Key Findings
+
+- Theory predicts √n slowdown, practice shows 100-10,000× due to constant factors
+- Memory hierarchy (L1/L2/L3/RAM/Disk) dominates performance
+- Cache-friendly algorithms can be faster with less memory
+- The √n pattern appears everywhere: database buffers, ML checkpointing, distributed systems
+
+## Experiments
+
+### 1. Maze Solver (C#)
+**Location:** `experiments/maze_solver/`
+
+Demonstrates graph traversal with memory constraints:
+- BFS: O(n) memory, 1ms runtime
+- Memory-Limited DFS: O(√n) memory, 5ms runtime (5× slower)
+
+```bash
+cd experiments/maze_solver
+dotnet run
+```
+
+### 2. Checkpointed Sorting (Python)
+**Location:** `experiments/checkpointed_sorting/`
+
+Shows massive I/O penalties when reducing memory:
+- In-memory: O(n) space, 0.0001s
+- Checkpointed: O(√n) space, 0.268s (2,680× slower!)
+
+```bash
+cd experiments/checkpointed_sorting
+python checkpointed_sort.py
+```
+
+### 3. Stream Processing (Python)
+**Location:** `experiments/stream_processing/`
+
+Reveals when less memory is actually faster:
+- Full history: O(n) memory, 0.33s
+- Sliding window: O(w) memory, 0.011s (30× faster!)
+
+```bash
+cd experiments/stream_processing
+python sliding_window.py
+```
+
+## Case Studies
+
+### Database Systems (`case_studies/database_systems.md`)
+- PostgreSQL buffer pool sizing follows √(database_size)
+- Query optimizer chooses algorithms based on available memory
+- Hash joins (fast) vs nested loops (slow) show 200× performance difference
+
+### Large Language Models (`case_studies/llm_transformers.md`)
+- Flash Attention: O(n²) → O(n) memory for 10× longer contexts
+- Gradient checkpointing: √n layers stored
+- Quantization: 8× memory reduction for 2-3× slowdown
+
+### Distributed Computing (`case_studies/distributed_computing.md`)
+- MapReduce: Optimal shuffle buffer = √(data_per_node)
+- Spark: Memory fraction settings control space-time tradeoffs
+- Hierarchical aggregation naturally forms √n levels
+
+## Quick Start
+
+### Prerequisites
+- Python 3.8+ (for Python experiments)
+- .NET Core SDK (for C# maze solver)
+- 2GB free memory for experiments
+
+### Installation
+```bash
+# Clone repository
+git clone https://github.com/sqrtspace/sqrtspace-experiments.git
+cd Ubiquity
+
+# Install Python dependencies
+pip install -r requirements.txt
+
+# Run the dashboard
+streamlit run dashboard/app.py
+```
+
+### Running All Experiments
+```bash
+# Run each experiment
+cd experiments/maze_solver && dotnet run && cd ../..
+cd experiments/checkpointed_sorting && python checkpointed_sort.py && cd ../..
+cd experiments/stream_processing && python sliding_window.py && cd ../..
+```
+
+## Repository Structure
+
+```
+├── experiments/           # Core experiments demonstrating tradeoffs
+│   ├── maze_solver/      # C# graph traversal with memory limits
+│   ├── checkpointed_sorting/  # Python external sorting
+│   └── stream_processing/     # Python sliding window vs full storage
+├── case_studies/         # Analysis of production systems
+│   ├── database_systems.md
+│   ├── llm_transformers.md
+│   └── distributed_computing.md
+├── dashboard/            # Interactive Streamlit visualizations
+│   └── app.py           # 6-page interactive dashboard
+├── SUMMARY.md           # Comprehensive findings
+└── FINDINGS.md          # Experimental results analysis
+```
+
+## Interactive Dashboard
+
+The dashboard (`dashboard/app.py`) includes:
+1. **Space-Time Calculator**: Find optimal configurations
+2. **Memory Hierarchy Simulator**: Visualize cache effects
+3. **Algorithm Comparisons**: See tradeoffs in action
+4. **LLM Optimizations**: Flash Attention demonstrations
+5. **Production Examples**: Real-world case studies
+
+## Measurement Framework
+
+`experiments/measurement_framework.py` provides:
+- Continuous memory monitoring (10ms intervals)
+- Cache-aware benchmarking
+- Statistical analysis across multiple runs
+- Automated visualization generation
+
+## Extending the Work
+
+### Adding New Experiments
+1. Create folder in `experiments/`
+2. Implement space-time tradeoff variants
+3. Use `measurement_framework.py` for profiling
+4. Document findings in experiment README
+
+### Contributing Case Studies
+1. Analyze a system with space-time tradeoffs
+2. Document the √n patterns you find
+3. Add to `case_studies/` folder
+4. Submit pull request
+
+## Citation
+
+If you use this code or build upon our work:
+
+```bibtex
+@article{friedel2025ubiquity,
+  title={The Ubiquity of Space-Time Simulation in Modern Computing: From Theory to Practice},
+  author={Friedel Jr., David H.},
+  journal={arXiv preprint arXiv:25XX.XXXXX},
+  year={2025}
+}
+```
+
+## Contact
+
+**Author**: David H. Friedel Jr.  
+**Organization**: MarketAlly LLC (USA) & MarketAlly Pte. Ltd. (Singapore)  
+**Email**: dfriedel@marketally.com
+
+## License
+
+This work is licensed under CC BY 4.0. You may share and adapt the material with proper attribution.
+
+## Acknowledgments
+
+- Ryan Williams for the theoretical foundation
+- The authors of Flash Attention, PostgreSQL, and Apache Spark
+- Early-stage R&D support from MarketAlly LLC and MarketAlly Pte. Ltd.