Introduction: Bringing Order to Your ML Chaos

Welcome back, aspiring ML experimenter! In our previous chapters, you’ve mastered the basics of installing Trackio and logging simple metrics. That’s a fantastic start! However, as your machine learning journey progresses, you’ll quickly find yourself running dozens, if not hundreds, of experiments. Without a robust system to keep track of them, you’ll soon be lost in a sea of unnamed runs and forgotten configurations.

This chapter is all about bringing order to that potential chaos. We’ll dive into Trackio’s fundamental organizational concepts: Runs, Projects, and Tags. Understanding and effectively using these will transform your experiment tracking from a mere logging exercise into a powerful tool for managing, comparing, and reproducing your machine learning work. Get ready to learn how to structure your experiments like a pro!

Core Concepts: The Building Blocks of Organized Experiments

Before we jump into code, let’s clearly define what Trackio means by “Runs,” “Projects,” and “Tags.” Think of them as a hierarchical system designed to make your life easier.

What is a Run? Your Single Experiment Snapshot

At its most fundamental level, a Run in Trackio represents a single execution of your machine learning training script or evaluation process. Every time you start a script that uses trackio.init(), a new run is created.

Why it matters: Each run is a self-contained record. It captures all the metrics, parameters, and other data you log during that specific execution. This isolation is crucial for comparing different attempts at solving a problem, whether you’re tweaking hyperparameters, trying new model architectures, or experimenting with different datasets.

By default, Trackio assigns each run a unique, often random, name. This works, but giving your runs meaningful names will make them significantly easier to find later.
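One convention that keeps names consistent (this is our own naming scheme, not something Trackio requires) is to build the run name from the model, its key hyperparameters, and a timestamp so runs sort chronologically:

```python
from datetime import datetime

def make_run_name(model: str, **hparams) -> str:
    """Build a readable, sortable run name, e.g. 'resnet50-lr_0.001-20250101-120000'."""
    parts = [model] + [f"{key}_{value}" for key, value in sorted(hparams.items())]
    stamp = datetime.now().strftime("%Y%m%d-%H%M%S")
    return "-".join(parts + [stamp])

print(make_run_name("resnet50", lr=0.001))
```

You could then pass the result straight to the run_name argument used in this chapter.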

What is a Project? A Collection of Related Runs

A Project in Trackio is a collection of related runs. Imagine you’re working on a specific machine learning task, like classifying images of cats and dogs. All the experiments you run for that specific task—trying different models, hyperparameters, or data augmentations—would logically belong to the same project.

Why it matters: Projects provide a high-level organizational structure. They allow you to:

  • View all experiments related to a particular problem in one place.
  • Easily compare results across different runs within the same project.
  • Keep your dashboard clean by separating unrelated work.

If you don’t explicitly specify a project, Trackio will typically assign runs to a default project (e.g., “uncategorized” or based on your script name). However, using explicit project names is a best practice for clarity.

What are Tags? Flexible Labels for Fine-Grained Categorization

Tags are lightweight, flexible labels that you can attach to individual runs. Unlike projects, which represent a broader grouping, tags allow you to add specific, descriptive keywords to runs. A run can have multiple tags.

Why it matters: Tags are incredibly versatile for:

  • Identifying specific characteristics: “hyperparam-sweep”, “early-stopping”, “large-dataset”.
  • Marking versions: “model-v1.0”, “data-split-A”.
  • Tracking experiment phases: “prototype”, “final-tune”, “ablation-study”.
  • Filtering and searching: In the Trackio dashboard, you can easily filter runs by their tags, making it simple to find all runs that used a specific optimizer or a particular dataset version.
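Tag strings like these stay consistent if you derive them from your experiment configuration instead of typing them by hand. A minimal pure-Python helper (a convention of ours, not part of Trackio's API; the config keys are illustrative):

```python
def build_tags(config: dict) -> list[str]:
    """Derive a consistent tag list from an experiment config dict."""
    tags = [
        config["model"],
        f"{config['optimizer']}-optimizer",
        f"lr_{config['learning_rate']}",
    ]
    if "dataset_version" in config:
        tags.append(f"{config['dataset_version']}-data")
    return tags

config = {"model": "resnet50", "optimizer": "adam",
          "learning_rate": 0.001, "dataset_version": "v1.0"}
print(build_tags(config))  # → ['resnet50', 'adam-optimizer', 'lr_0.001', 'v1.0-data']
```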

How They Work Together: A Visual Analogy

Think of your entire ML workflow as a library:

  • Projects are like the main sections of the library (e.g., “Computer Vision,” “Natural Language Processing”).
  • Runs are individual books within those sections (e.g., “ResNet50 Experiment 1,” “BERT Fine-tuning Run 3”).
  • Tags are sticky notes or labels you put on the books to describe their specific content (e.g., “Adam optimizer,” “ImageNet subset,” “Learning Rate 1e-4”).

Let’s visualize this hierarchy with a simple diagram:

flowchart TD
    A[Trackio Dashboard] --> B[Project: Image Classification]
    B --> C[Run: ResNet-V1-Adam]
    B --> D[Run: ResNet-V1-SGD]
    B --> E[Run: EfficientNet-B0]
    C -->|Has Tags| C1[Tag: Adam]
    C -->|Has Tags| C2[Tag: LR-0.001]
    C -->|Has Tags| C3[Tag: v1.0-data]
    D -->|Has Tags| D1[Tag: SGD]
    D -->|Has Tags| D2[Tag: LR-0.01]
    D -->|Has Tags| D3[Tag: v1.0-data]
    E -->|Has Tags| E1[Tag: AdamW]
    E -->|Has Tags| E2[Tag: Fine-tune]
    E -->|Has Tags| E3[Tag: v2.0-data]

This diagram shows how different runs for “Image Classification” can be grouped under one project, and then further categorized by specific tags.
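In data terms, the hierarchy in the diagram is just a nested mapping: a project maps to runs, and each run carries a list of tags. A pure-Python sketch (the structure is illustrative, not how Trackio stores data internally):

```python
project = {
    "Image Classification": {
        "ResNet-V1-Adam":  ["Adam", "LR-0.001", "v1.0-data"],
        "ResNet-V1-SGD":   ["SGD", "LR-0.01", "v1.0-data"],
        "EfficientNet-B0": ["AdamW", "Fine-tune", "v2.0-data"],
    }
}

# Filtering by tag is then a simple comprehension:
v1_runs = [run for run, tags in project["Image Classification"].items()
           if "v1.0-data" in tags]
print(v1_runs)  # → ['ResNet-V1-Adam', 'ResNet-V1-SGD']
```

This is exactly the kind of tag-based filtering the dashboard performs for you.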

Step-by-Step Implementation: Organizing Your Code

Now, let’s put these concepts into practice. We’ll build a simple script that simulates a machine learning experiment and uses Trackio to organize its runs.

First, ensure you have Trackio installed (e.g., pip install trackio==0.2.1, the current release as of late 2025).

Step 1: Start with a Basic trackio.init()

Let’s begin with a very basic script, similar to what you might have used in previous chapters.

Create a file named organized_experiment.py:

import trackio
import time
import random

print("Starting a basic experiment...")

# Initialize Trackio - this creates a new run
trackio.init() 

# Simulate some training steps
for i in range(5):
    loss = 1.0 / (i + 1) + random.uniform(-0.1, 0.1)
    accuracy = 0.5 + (i * 0.1) + random.uniform(-0.05, 0.05)
    
    # Log metrics
    trackio.log({"loss": loss, "accuracy": accuracy})
    print(f"Step {i+1}: Loss = {loss:.4f}, Accuracy = {accuracy:.4f}")
    time.sleep(0.5)

trackio.finish()  # close the run and flush any buffered data
print("Experiment finished.")

Run this script: python organized_experiment.py. Trackio will print a message indicating a run has started and where to view the dashboard. Notice that Trackio assigns a default, often random, name to this run. This is fine for quick tests, but not ideal for complex projects.

Step 2: Giving Your Run a Meaningful Name

Let’s make our run more identifiable by providing a custom run_name. This is done directly in the trackio.init() call.

Modify organized_experiment.py:

import trackio
import time
import random

print("Starting a basic experiment with a custom run name...")

# Initialize Trackio with a custom run name
trackio.init(run_name="my-first-named-experiment") 

# Simulate some training steps (rest of the code is the same)
for i in range(5):
    loss = 1.0 / (i + 1) + random.uniform(-0.1, 0.1)
    accuracy = 0.5 + (i * 0.1) + random.uniform(-0.05, 0.05)
    
    # Log metrics
    trackio.log({"loss": loss, "accuracy": accuracy})
    print(f"Step {i+1}: Loss = {loss:.4f}, Accuracy = {accuracy:.4f}")
    time.sleep(0.5)

trackio.finish()  # close the run and flush any buffered data
print("Experiment finished.")

Run it again. Now, when you check your Trackio dashboard (usually by running trackio dashboard in a new terminal, or clicking the link provided by trackio.init()), you’ll see a run clearly labeled “my-first-named-experiment”. Much better for identification!

Step 3: Organizing Runs into a Project

Now, let’s group this experiment into a specific project. This is also done via the trackio.init() function using the project parameter.

Modify organized_experiment.py:

import trackio
import time
import random

print("Starting an experiment within a specific project...")

# Initialize Trackio with a custom run name AND a project name
trackio.init(project="Image-Classifier-Project", run_name="ResNet50-Baseline-Run") 

# Simulate some training steps (rest of the code is the same)
for i in range(5):
    loss = 1.0 / (i + 1) + random.uniform(-0.1, 0.1)
    accuracy = 0.5 + (i * 0.1) + random.uniform(-0.05, 0.05)
    
    # Log metrics
    trackio.log({"loss": loss, "accuracy": accuracy})
    print(f"Step {i+1}: Loss = {loss:.4f}, Accuracy = {accuracy:.4f}")
    time.sleep(0.5)

trackio.finish()  # close the run and flush any buffered data
print("Experiment finished.")

Run this script. When you open your Trackio dashboard, you’ll likely see a new project created, named “Image-Classifier-Project”, and your “ResNet50-Baseline-Run” will be nested within it. If the project already exists, the run will simply be added to it.

Step 4: Adding Tags for Further Categorization

Finally, let’s add some tags to our run to provide even more context. You can specify tags when you initialize the run, or add them dynamically during the run’s execution.

Option A: Initializing with Tags

Modify organized_experiment.py to include tags in trackio.init():

import trackio
import time
import random

print("Starting an experiment with project, run name, and initial tags...")

# Initialize Trackio with project, run name, and a list of tags
# Tags should be a list of strings
trackio.init(
    project="Image-Classifier-Project",
    run_name="ResNet50-Adam-LR_0.001",
    tags=["resnet50", "adam-optimizer", "baseline", "v1.0-data"]
) 

# Simulate some training steps (rest of the code is the same)
for i in range(5):
    loss = 1.0 / (i + 1) + random.uniform(-0.1, 0.1)
    accuracy = 0.5 + (i * 0.1) + random.uniform(-0.05, 0.05)
    
    # Log metrics
    trackio.log({"loss": loss, "accuracy": accuracy})
    print(f"Step {i+1}: Loss = {loss:.4f}, Accuracy = {accuracy:.4f}")
    time.sleep(0.5)

trackio.finish()  # close the run and flush any buffered data
print("Experiment finished.")

Run this script. Check your dashboard, and you’ll see the specified tags associated with your run. You can then use these tags to filter and find specific experiments.

Option B: Adding Tags Dynamically During a Run

Sometimes, you might want to add tags based on conditions that arise during the experiment. Trackio allows you to do this using trackio.run.tags.add().

Let’s create a new script, dynamic_tags_experiment.py:

import trackio
import time
import random

print("Starting an experiment with dynamic tags...")

# Initialize Trackio without initial tags
trackio.init(
    project="Dynamic-Tag-Project",
    run_name="Experiment-Conditional-Tag",
) 

# Access the current run object
current_run = trackio.run

# Add a base tag
current_run.tags.add("initial-setup")
print("Added tag: 'initial-setup'")

# Simulate some training steps
for i in range(5):
    loss = 1.0 / (i + 1) + random.uniform(-0.1, 0.1)
    accuracy = 0.5 + (i * 0.1) + random.uniform(-0.05, 0.05)
    
    # Log metrics
    trackio.log({"loss": loss, "accuracy": accuracy})
    print(f"Step {i+1}: Loss = {loss:.4f}, Accuracy = {accuracy:.4f}")
    time.sleep(0.5)

    # Add a tag based on a condition (e.g., if accuracy is high)
    if accuracy > 0.7:
        if "high-accuracy-achieved" not in current_run.tags: # Avoid adding duplicate tags
            current_run.tags.add("high-accuracy-achieved")
            print("Added tag: 'high-accuracy-achieved'")

trackio.finish()  # close the run and flush any buffered data
print("Experiment finished.")

Run python dynamic_tags_experiment.py. You’ll observe that the “high-accuracy-achieved” tag is added only if the condition accuracy > 0.7 is met during the run. This demonstrates the flexibility of dynamic tagging.
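The membership check before current_run.tags.add() in the script above mirrors Python’s set semantics. Assuming a run’s tag collection behaves like a set (as this chapter does), re-adding an existing tag is a harmless no-op, which you can verify with the built-in set:

```python
tags = set()
tags.add("initial-setup")
tags.add("high-accuracy-achieved")
tags.add("high-accuracy-achieved")  # adding an existing tag is a no-op
print(sorted(tags))  # → ['high-accuracy-achieved', 'initial-setup']
```

The explicit membership check in the script is still good practice, since it also avoids re-printing the "Added tag" message on every later step.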

Mini-Challenge: Hyperparameter Sweep Organization

It’s your turn to apply what you’ve learned!

Challenge: You are tasked with running a small hyperparameter sweep for a hypothetical model. Create a Python script that simulates three different runs:

  1. Run 1: Uses learning_rate = 0.01 and optimizer = 'Adam'.
  2. Run 2: Uses learning_rate = 0.001 and optimizer = 'Adam'.
  3. Run 3: Uses learning_rate = 0.01 and optimizer = 'SGD'.

All three runs should belong to the same project named “Hyperparameter-Sweep-Project”. Each run should have a descriptive run_name that includes its specific hyperparameters, and appropriate tags (e.g., “adam”, “sgd”, “lr_0.01”). Log a dummy metric like val_loss for 3 steps in each run.

Hint: You’ll need to call trackio.init() three separate times, once per simulated run. Either run the script once per configuration, or loop over the configurations in a single script, making sure each run is properly closed (for example with trackio.finish()) before the next one is initialized.

What to observe/learn: When you view your Trackio dashboard, you should see one project (“Hyperparameter-Sweep-Project”) containing three distinct runs, each with its unique name and tags. This structure will make it incredibly easy to compare the performance of different hyperparameter combinations.
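If you get stuck, here is a skeleton for the sweep. The config list and the name/tag helpers are plain Python; the Trackio calls (shown as comments so the skeleton runs without the library) follow the init/log pattern used throughout this chapter, with run_name and tags as assumed above, plus trackio.finish() to close each run before the next one starts:

```python
sweep = [
    {"learning_rate": 0.01,  "optimizer": "Adam"},
    {"learning_rate": 0.001, "optimizer": "Adam"},
    {"learning_rate": 0.01,  "optimizer": "SGD"},
]

def run_name_for(cfg: dict) -> str:
    """Descriptive name encoding the run's hyperparameters."""
    return f"{cfg['optimizer']}-lr_{cfg['learning_rate']}"

def tags_for(cfg: dict) -> list[str]:
    """Tags for filtering runs by optimizer and learning rate."""
    return [cfg["optimizer"].lower(), f"lr_{cfg['learning_rate']}"]

for cfg in sweep:
    print("Would start run:", run_name_for(cfg), "with tags", tags_for(cfg))
    # trackio.init(project="Hyperparameter-Sweep-Project",
    #              run_name=run_name_for(cfg), tags=tags_for(cfg))
    # for step in range(3):
    #     trackio.log({"val_loss": ...})  # your dummy metric here
    # trackio.finish()
```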

Common Pitfalls & Troubleshooting

Even with these clear concepts, you might run into a few snags. Here are some common pitfalls and how to address them:

  1. Runs Not Appearing in the Intended Project:

    • Mistake: Forgetting to specify the project argument in trackio.init(), or having a typo in the project name.
    • Solution: Double-check your trackio.init(project="YourProjectName") call. Ensure the project name is consistent across all runs you want to group. If a project name is misspelled, Trackio will simply create a new project with the misspelled name.
    • Tip: Project names are case-sensitive. “my-project” is different from “My-Project”.
  2. Tags Not Showing Up or Being Duplicated:

    • Mistake: Forgetting to pass tags as a list of strings to trackio.init(), or attempting to add a tag that already exists.
    • Solution:
      • For initial tags: tags=["tag1", "tag2"].
      • For dynamic tags: Ensure you’re using trackio.run.tags.add("new-tag"). Trackio’s add method is generally idempotent, meaning adding the same tag twice won’t create duplicates in the dashboard, but it’s good practice to check for existence if you’re doing conditional adds (as shown in the dynamic tags example).
    • Observation: In the Trackio dashboard, tags are usually displayed as clickable labels on each run.
  3. Confusing Run Names and Project Names:

    • Mistake: Using a very generic run_name like “experiment” across multiple runs within the same project, making it hard to distinguish them.
    • Solution: Always aim for descriptive run_names that capture the unique aspects of that specific run (e.g., “ResNet50-Adam-LR_0.001”). Project names should be broader, covering the overall goal or model family.

Summary: Your Organized ML Future

Congratulations! You’ve now mastered the essential organizational principles of Trackio. Here’s a quick recap of what we covered:

  • Runs are individual executions of your code, capturing all logged data.
  • Projects are collections of related runs, providing a high-level grouping for your experiments.
  • Tags are flexible labels that allow for fine-grained categorization and filtering of runs.
  • You learned how to specify custom run_names, projects, and tags using trackio.init() and how to add tags dynamically with trackio.run.tags.add().
  • We discussed common pitfalls and best practices for naming and tagging.

By diligently applying these concepts, you’ll ensure your machine learning experiments are always well-structured, easy to navigate, and reproducible. This will save you countless hours of confusion and significantly boost your productivity.

In the next chapter, we’ll expand our logging capabilities beyond simple scalars and explore how to log more complex data types like images, audio, and model artifacts. Get ready to make your experiments even richer!
