Introduction ============ This package contains utilities which may be useful in working with AIRR-C JSON germline sets Installation ------------ The package requires python, and can be installed using pip: ``pip install receptor-germline-tools`` It requires `Biopython `_, which may need to be installed separately. Using the rich information in AIRR-C germline sets -------------------------------------------------- Quick summary: Given an AIRR-C germline set ``germlines.json``, create a FASTA-formatted IMGT-gapped set, in which the header lists paralogs (other items in the set with the same sequence) and aliases (other names by which this item has been known):: create_fasta -i germlines.json -o germlines.fasta -g -f paralogs,aliases Given an AIRR-C rearrangements file or Changeo file ``rearrangements.tsv``, create a copy that also includes columns for subgroup number, gene number and allele number of each V,D and J allele, using information from the germline set:: add_germline_annotations rearrangements.tsv germlines.json \ rearrangements_annotated.tsv \ -f subgroup_designation,gene_designation,allele_designation :ref:`list_of_fields` More detail: :ref:`create_fasta` - Create a gapped or ungapped FASTA file from an AIRR-C JSON-format germline set, optionally containing additional information in the header :ref:`annotate_rearrangements` - Annotate an AIRR-C rearrangements file with additional germline information from a AIRR-C JSON-format germline set Working with IGLabel-style sequence names ----------------------------------------- Alleles in the mouse germline sets are allocated four-letter `IgLabel-style `_ labels with 'dummy' subgroup and allele designations. These can be used as-is with many tools. :ref:`Further information` :ref:`add_germline_annotations` - Convert IgLabel-style labels in an alignment file to dummy IUIS format :ref:`convert_fasta_labels` - Create a custom auxiliary data file for IgBlast