Search for T cell receptor beta CDR3 amino acid sequences with known antigen specificity from IEDB, VdjDB and McPASDb
Source: R/searchDB.R
searchDB.Rd
Search for published T cell receptor beta CDR3 amino acid sequences with known antigen specificity in a list of data frames.
Arguments
- study_table
A tibble generated by the LymphoSeq2 functions readImmunoSeq(), productiveSeq(), searchPublished(), or topSeqs(). "junction_aa", "duplicate_frequency", and "duplicate_count" are required columns.
- dbname
A vector of database sources to search for the antigenic specificity of a T-cell or B-cell in the dataset: "all", "IEDB", "McPAS-TCR", "VdjDB".
- chain
The receptor chain type to search in the public databases: "tra", "trb", "light", "heavy".
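Because study_table must carry three specific columns, it can help to check for them before searching. The following is a minimal base R sketch, not part of LymphoSeq2: the helper name missing_required() and the toy table are hypothetical, and the junction_aa value is a placeholder, not real data.

```r
# Columns that the study_table argument documents as required.
required_cols <- c("junction_aa", "duplicate_frequency", "duplicate_count")

# Hypothetical helper (not a LymphoSeq2 function): return the names of any
# required columns that are absent from the table.
missing_required <- function(tbl) {
  setdiff(required_cols, names(tbl))
}

# Toy table that deliberately lacks duplicate_count.
toy <- data.frame(
  junction_aa = "CASSF",        # placeholder sequence, not real data
  duplicate_frequency = 0.01
)

missing_required(toy)
#> [1] "duplicate_count"
```

An empty result means all three required columns are present. Since the check relies only on names(), it should work the same on a tibble as on a data frame.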
Value
Returns the input table annotated with any recorded antigenic specificity from the public databases.
Examples
file_path <- system.file("extdata", "TCRB_sequencing",
                         package = "LymphoSeq2")
study_table <- LymphoSeq2::readImmunoSeq(path = file_path, threads = 1)
study_table <- LymphoSeq2::topSeqs(study_table, top = 100)
amino_table <- LymphoSeq2::productiveSeq(study_table = study_table,
                                         aggregate = "junction_aa")
top_seqs <- LymphoSeq2::topSeqs(productive_table = amino_table, top = 1)
LymphoSeq2::searchDB(study_table = top_seqs, dbname = "all", chain = "trb")
#> # A tibble: 10 × 26
#> repertoire_id junction_aa v_call d_call j_call v_family d_family j_family
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 TRB_CD4_949 CASSESAGST… TRBV1… TRBD2… TRBJ2… TRBV10 TRBD2 TRBJ2
#> 2 TRB_CD8_949 CASSPAGAYY… NA TRBD2… TRBJ2… NA TRBD2 TRBJ2
#> 3 TRB_CD8_CMV_369 CASSPPTGER… TRBV7… TRBD2… TRBJ2… TRBV7 TRBD2 TRBJ2
#> 4 TRB_Unsorted_0 CASSPVSNEQ… TRBV2… TRBD2… TRBJ2… TRBV28 TRBD2 TRBJ2
#> 5 TRB_Unsorted_1320 CASSPAGAYY… NA TRBD2… TRBJ2… NA TRBD2 TRBJ2
#> 6 TRB_Unsorted_1496 CASSPAGAYY… NA TRBD2… TRBJ2… NA TRBD2 TRBJ2
#> 7 TRB_Unsorted_32 CASSLQGREK… TRBV2… TRBD1… TRBJ1… TRBV27 TRBD1 TRBJ1
#> 8 TRB_Unsorted_369 CASSQDWERL… TRBV1… NA TRBJ2… TRBV14 NA TRBJ2
#> 9 TRB_Unsorted_83 CASSLQGREK… TRBV2… TRBD1… TRBJ1… TRBV27 TRBD1 TRBJ1
#> 10 TRB_Unsorted_949 CASSPAGAYY… NA TRBD2… TRBJ2… NA TRBD2 TRBJ2
#> # ℹ 18 more variables: reading_frame <chr>, duplicate_count <dbl>,
#> # duplicate_frequency <dbl>, tra_cdr3_aa <chr>, gene <chr>, epitope <chr>,
#> # pathology <chr>, antigen <chr>, tra_v_call <chr>, tra_j_call <chr>,
#> # mhc_allele <chr>, reference <chr>, score <dbl>, cell_type <chr>,
#> # source <chr>, trb_v_call <chr>, trb_j_call <chr>, Species <chr>