Week 1: Foundations

Day 4: The Basic Pipeline

Learn the full SignLLM pipeline at a high level.

Day 4 of 2818 minGoal - Learn - Example - Practice - Checkpoint

Goal

Learn the full pipeline at a high level.

Learn

  • A common pipeline is: capture video, extract frames, detect pose/keypoints, save structured data, add labels or gloss, clean the dataset, train the model, validate it, and manually inspect the outputs.
  • Each step can introduce errors. Bad capture affects pose. Bad pose affects training. Bad labels teach the model the wrong target.
  • The pipeline is not finished when a model runs. It is finished only after review shows that the output is useful, honest, and safe for the intended purpose.

Example

  • One sample may start as clip_0042.mp4. Frames are extracted, DWPose or another pose tool creates keypoints, the annotator labels the clip as THANK-YOU, and the final training record points to clip_0042.npz.
  • If the signer moves out of frame, the sample may be marked review or reject instead of going into training.

Practice

  1. Draw this flow: capture -> frames -> pose/keypoints -> gloss/labels -> NPZ -> train -> validate -> human review.
  2. Add one possible failure under each step.

Checkpoint

Before moving on

You can describe the journey from signer video to training data and reviewed model output.

Pipeline note

Pipeline note

The middle steps are where most of the labor lives. SignLLM work is data engineering, language work, and review work, not just model training.