Overview
Timepoint Pro exports simulation data as JSONL (JSON Lines) training examples. Each line is a complete prompt/completion pair with structured SNAG context: M3 knowledge provenance, M6 entity state, M7 causal history, M10 atmosphere, M11 dialog context, and M13 relationships. This format is ideal for:- Fine-tuning causal reasoning models
- Training temporal consistency models
- Multi-agent roleplay datasets
- Diffusion models conditioned on causal graphs
JSONL Format
Each line is a valid JSON object:SNAG Context Structure
SNAG (Social Network Augmented Generation) provides rich structured context:M7: Causal History
Timeline leading to current moment:M3: Knowledge Provenance
How entity acquired current knowledge:M10: Atmospheric Context
Scene atmosphere and physical environment:M6: Entity State
Current cognitive and physical state:M13: Relationship Context
Relationships with entities present:Example Training Record
From examples/sample_training_data.jsonl:Export Configuration
Enable JSONL export inOutputConfig:
Using ExportFormatFactory
Streaming Export
For large datasets, use streaming:Compression
JSONL supports gzip and bz2 compression:Model Licensing for Training Data
If you plan to fine-tune models with Pro outputs, use MIT or Apache 2.0 licensed models:| License | Models | Training Data Status |
|---|---|---|
| MIT | DeepSeek Chat, DeepSeek R1 | Fully unrestricted |
| Apache 2.0 | Mistral, Mixtral | Fully unrestricted |
| Llama | Llama 3.1, Llama 4 Scout | Restricted (cannot train non-Llama models) |
| Qwen | Qwen 2.5, QwQ 32B | Permissive |
for_training_data=True or OXEN_API_KEY is set.
Oxen.ai Integration
WhenOXEN_API_KEY is set, training data uploads automatically:
- Training JSONL
- Metadata JSON
- Entity tensors
- Causal graph
Training Data Quality
SNAG training data is uniquely rich:- Causal ancestry: Every example includes full causal chain
- Provenance tracking: Knowledge sources explicitly labeled
- Temporal consistency: States evolve coherently across time
- Counterfactuals: BRANCHING mode generates alternative paths
- Quantitative state: Emotional valence, arousal, energy, confidence
Example: Mars Mission Portal
From EXAMPLE_RUN.md:- Template:
mars_mission_portal - Training examples: 20
- Temporal mode: PORTAL (backward inference)
- Timespan: 2031 → 2026 (5 years)
- Entities: 4 crew members
- Dialog turns: 78
- Cost: $0.18
- Full causal chain from 2026 to failure in 2031
- Knowledge provenance (who learned what, when)
- Emotional arcs (Lin Zhang: valence -0.20, arousal 0.94)
- Relationship evolution (tensions between engineers and director)
See Also
- TDF Format - Timepoint Data Format
- SQLite Export - Queryable database
- Training Data & Model Licensing
- Sample Training Data

