AlignAIR Logo
Documentation
// examples

Examples Gallery

Real-world examples of AlignAIR in action — input sequences, commands, and expected outputs for different use cases.

// categories
01 / basic

Basic analysis

Single sequence processing

02 / batch

Batch processing

Large dataset analysis

03 / custom

Custom parameters

Optimized configurations

04 / advanced

Advanced

Complex workflows

// example 01

Basic heavy chain analysis

Single sequence V(D)J assignment with default parameters.

input · sequences.csv
sequence_id,sequence
seq_001,CAGGTGCAGCTGGTGGAGTCTGGG...
seq_002,GAGGTGCAGCTGGTGGAGTCTGGG...
command
python app.py run \
  --model-checkpoint=/app/pretrained_models/IGH_S5F_576 \
  --chain-type=heavy \
  --sequences=/data/input/sequences.csv \
  --save-path=/data/output/results
output · results.csv
sequence_id,v_call,d_call,j_call,productive
seq_001,IGHV1-2*01,IGHD3-3*01,IGHJ4*01,True
seq_002,IGHV1-3*01,IGHD2-2*01,IGHJ6*01,True
// processing
  • 2 sequences processed
  • Default thresholds (V:0.75, D:0.3, J:0.8)
  • Both sequences are productive
// example 02

Light chain with custom thresholds

High-stringency analysis for clean data.

// use case

High-quality light chain sequences from flow-sorted B cells. Stricter thresholds for precise allele calling.

command
python app.py run \
  --model-checkpoint=/app/pretrained_models/IGL_S5F_576 \
  --chain-type=light \
  --sequences=/data/input/light_chains.csv \
  --save-path=/data/output/light_results \
  --v-allele-threshold=0.9 \
  --j-allele-threshold=0.85 \
  --airr-format
// threshold impact
V calls — default (0.75)1,850
V calls — strict (0.9)1,650

Higher confidence, fewer ambiguous calls.

// output format

Full AIRR Schema with standardized column names for downstream analysis pipelines.

// example 03

Large dataset processing

Optimized parameters for 100K+ sequences.

command
python app.py run \
  --model-checkpoint=/app/pretrained_models/IGH_S5F_576 \
  --chain-type=heavy \
  --sequences=/data/input/large_dataset.csv \
  --save-path=/data/output/batch_results \
  --batch-size=4096 \
  --fix-orientation
// performance tips
  • Increased batch size to 4096
  • Enabled orientation fixing
  • GPU memory: 16GB+
// processing times
100K sequences45 minutes
500K sequences3.5 hours
// resource usage
12GB
GPU memory
95%
GPU utilization
// templates

Quick start templates

standard heavy chain

python app.py run \
  --model-checkpoint=/app/pretrained_models/IGH_S5F_576 \
  --chain-type=heavy \
  --sequences=/data/input/sequences.csv \
  --save-path=/data/output

high-quality light chain

python app.py run \
  --model-checkpoint=/app/pretrained_models/IGL_S5F_576 \
  --chain-type=light \
  --sequences=/data/input/light_chains.csv \
  --save-path=/data/output \
  --v-allele-threshold=0.85 \
  --j-allele-threshold=0.9 \
  --airr-format
// next

Ready to try these?

Use these examples as starting points for your own analyses. Modify parameters based on your specific data and requirements.

© 2025 AlignAIR. All rights reserved.Advancing computational biology through AI