Manage Training Workflow, 3D Visualizations, Ollama Integration and Model Artifact Clean Up
Architecture, Design and Development by Franz Ayestaran / Enhanced Pair Programming with Claude Code & OpenAI GPT
Loading workflow documentation...
Download the complete PowerPoint presentation covering the LLM creation and Ollama deployment pipeline
Upload a .txt, .json, or .jsonl training file. JSON datasets can use records like {"instruction": "...", "output": "..."}.
Continue training from a saved checkpoint instead of starting from scratch.
Recommended resets the form to the dashboard baseline. Export saves the current dashboard settings as a versioned JSON config. Load applies a previously saved config back into the form and restores the referenced training file when available.
Train Phase-1 deep learning models, Phase-2 classical ML models, and the new Phase-3 Vision Transformer and small text transformer models from one dashboard page. The sections stay stacked in order so Phase 3 appears below the existing classical workflow.
Choose a model to see its supported dataset format.
Train Random Forest, SVM, and Logistic Regression models on CSV datasets with schema preview, target-column control, optional ONNX export, telemetry, and downloadable `.pkl` artifacts.
Choose a classical model to see its supported dataset format.
Train Vision Transformer plus tiny causal LLaMA text models with image-folder or JSONL datasets, live telemetry, ONNX export, ONNX Runtime inference support, and GGUF/Ollama export for the Phase-3 text models.
Choose a Phase-3 model to see its supported dataset format.
ℹ️ About: This will import your trained model into Ollama, allowing you to run it locally using the ollama run command.
⚠️ No GGUF file? If you don't see any GGUF files below, you need to convert your trained model first.
ollama run <name>
☁️ Cloud deploy: Use the second button to upload the selected GGUF and create the model on ollama.ayestaran.dev. This requires SSH access from the dashboard host to the server.
ℹ️ About: Manage models on ollama.ayestaran.dev. Listing uses the remote tags endpoint, and deletion uses SSH access from the dashboard host.
Loading artifacts...
Interactive 3D visualizations of your model's training process, checkpoints, embeddings, layer structures, and internal LLM topology.
Checking for visualizations...
A dedicated scene for inspecting how query, key, and value paths move through a transformer block. The layout is intentionally closer to a cinematic architecture diagram than the existing topology plot, with Q/K/V weight slabs, vector stages, layer norm, attention scoring, and animated token flow.
Separate Q, K, and V stages, token score routing, residual handoff, and downstream attention aggregation in one navigable 3D scene.
Orbit, pan, and zoom the scene. Click a subsystem to pin metadata and use the scene controls to jump straight to Q, K, V, or the attention matrix.
The page uses a hand-built scene rather than Plotly so the composition can read more like a technical explainer, matching the reference style more closely.
A dedicated brain-shaped scene for travelling through your trained model, now with a semantic Embedding Galaxy mode. Switch between the structural 3D brain and a token-star universe where clusters become constellations, dense regions become nebulae, and model-specific concepts form their own colored territories.
Brain mode shows the residual spine, attention branches, and sampled learned pathways from the real checkpoint. Galaxy mode remaps sampled token embeddings into a 3D star field.
Use the built-in waypoints to jump from input cortex to mid-layer reasoning and then into the output crown. Click any node to pin its layer, tensor path, and sampled row metrics.
This atlas is sampled, not literal-all-edges. Rendering every trained connection in a browser is not feasible at LLM scale, so the scene uses the strongest learned pathways to stay explorable.
Overview of the Phase-1 ML/DL, Phase-2 classical ML, and Phase-3 transformer workflows that power this dashboard across NVIDIA CUDA and Apple Silicon MLX/MPS environments.
Primary runtime for Phase-1 CNN/MLP/RNN training, custom transformer experiments, Phase-3 Vision Transformer pipelines, and shared optimization and export flows.
Hugging Face stack for tokenization, pre-trained checkpoints, compact text transformers, and PEFT-based LoRA fine-tuning workflows.
CUDA accelerates NVIDIA training and quantized fine-tuning workloads.
Apple Silicon (MPS / MLX): Native GPU acceleration for M-series processors via Metal Performance Shaders, with MLX-LM enabling Apple-native quantized LoRA and QLoRA training and fused export flows.
Flask serves the unified Phase-1, Phase-2, and Phase-3 dashboard, training APIs, dataset upload routes, telemetry, and artifact export endpoints.
Interactive 3D visualizations for checkpoints, embeddings, layer structures, neural atlases, and interpretability scenes.
Markdown parsing for rendering documentation and workflow guides directly in the dashboard.
Array-centric processing for tabular, text, image, and geometry workflows across ML and deep-learning training.
Real-time system and process monitoring for CPU, memory, GPU, and disk usage tracking.
Progress reporting for fine-tuning, classical ML jobs, dataset conversion, and export pipelines with ETA estimates.
Checkpoint and adapter serialization for transformer fine-tuning, resume bundles, and model conversion pipelines.
GGUF export path for PyTorch and Apple-native MLX outputs, with direct Ollama import for local inference.
llama.cpp quantization supports GGUF deployment, while ONNX and ONNX Runtime cover Phase-2 exports plus optional ONNX and GGUF paths for the Phase-3 text models.
Loss, learning-rate, and gradient tracking for deep-learning runs and checkpoint inspection.
Experiment tracking and collaboration platform for machine learning projects.
Phase-2 classical ML stack for Random Forest, SVM, and Logistic Regression training, metrics, and optional skl2onnx export.
LLM Training Dashboard progressively exposes the internal logic of large language models through twelve hierarchical layers, moving from high-level semantic geometry down to neuron-level concept discovery.
Data Pipeline: CSV / JSONL / image-folder input → preview and validation → dataset builders → training loaders
Training Loop: Phase-1 deep learning loops / Phase-2 classical fit / Phase-3 transformer fine-tuning → telemetry → checkpoints and artifacts
Model Export: SafeTensors / .pkl / ONNX / GGUF → Ollama / ONNX Runtime / local artifact downloads
Dashboard: Flask API ↔ Phase-1/2/3 controls ↔ real-time updates ↔ 3D visualizations ↔ system monitoring
🚀 Built for Production-Ready LLM Training
Architecture, Design and Development by Franz Ayestaran / Enhanced Pair Programming with Claude Code & OpenAI GPT