Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGCTATGTCTCAATTTAAGGATGTCAGTTTTCCTAGAAAGTCAATTAGAAACAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGAAAGGCTAAATTTACAAGCAAATATGGCCATCTAGCTCAGCTCATGTATGTACAAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTATAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCCCTTCTGCATATGCCAACACGAACAGAGGTTGAAGCTTATTCTTATGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACGAGCGACATTGAGAAACAAGTAAAGATAAAAGGAGAGAACACATGCCTACCCCTGGACTACATCCTTACTCTTCAACAAAAATTTGCAAATGAAGACAAGGAGTTAACTTTACTGGCATTGTGTATCTTTAATGTTGTTTTGTTTCCTAAAGTATGTGGATATGTTGAGGAACGGTGGTCAAGTTGTTTGCTAAGATAGAAATAGGAGTTGATCCCATTATTCTTGTATTGGCAGAAACTTTTCGTTCATTGAATTATTGCAGAACAAAGGAACAGGAAGATTTATAGGGTGTGCACCATTATTGTACATTTGGGTCCTTAGTCATGTGAAGTGTCCACCCGAATTCAAATGTCCAGAGATCAAATTTTCAAGTTCCTGGAATAAGCTGCGAAATCCCATTTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAGCGCTTGGGAAGCCTTTTTCTTCGAACTAAAGGTAGAAGATGTAATGTGGAGAGCCCCTTGGATGTCAACCAGGCCAATGATCTATAAATGTGGTAAATTTCAAAGTTTACCTCTTTTGGGTCCTTGGGGATGTATAGCCTATGCTCCTTTATTGGTAGTACGCCAAATTTGGGTCCGACAATTCATCCCTGCTACGCATGAGTTGAAGGATTTCGAATTTGCTTATGATAAAGGCTTCTGTAAAGATAGAATTCAGAAAATTGTGAAAGCATGGAAAATGATCACTAAAATCCAGAGTGGTCAGTTTCATGATGATACCACGGAGGCGTACAAAACATGGCATGCAAAAGAGCTAAAACCGTGCTTGTGTGACCAAACATGAAAACCAAGATAAAACTTAATGCAAAGGTGATATCAGATCAACAGACAGAACAAGCAGCACACGAAAAAGAATGTGATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGGCTGCAATTGGAGGTACAGCAAGGTTTGTTGCGCAATGTTGAACTAGAAAAAGAGTTAAACCGATTAAAGGGCAGTGTCAGCAAACAAGAACAGTTAGAAAAAGAAATTTCAGCATTAGACACAGAGGCCCGCGACCTGAACAGAAGAATGCATCGATTAAGAAGGGATAAGGAAGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAACAATCTGAGATTGCGTCACTTCATGAGTTGATGAAAGAGCTCGAAGATTGCAATAGTTTGAGGAACCAAACGATTACTGAGGTAGAAGAAAAGAATGGAACGCTATGTCGAACAATTGCGATCTACAATTAACGCTCAAGATTAGAGAATATCAACTAGGGGAGCTCATCAACGACAACAAGGGTCTAAGAGAGTCCGTTCAGTCACTTAATGTTCGCCTCGGTAAGTATCAGGATGCCACTGATAGATTAATGAAAGACTATACCTATTTAAAGGAGCAGTACAACAGATTAAGCGATGATTTTGGGTTTGCGAGACAGAACCACGCGACACTACGAAGTAAAGCGGAACATATGCTCACTCAGATTAGGAGAGTCACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACTCTCTCTAAAGTCATAGCACCTACACAGCCGAATAGCAAGAATGTGCTCAAGTTTCTG
GGAAAACTTCGTATAAGTTTAGAATATTGGGGGCAATTTTATTAACTCTTTACTTTAGATTATGTTTTATTTAGATAGTTGAAATATCTTGTAATTCAATTTGCACAAATTTTGCTGCTACATTGTTTTATCATTTTCCAATTTAATATCAAAGCCATTTTTTTTCTTCTTCTCTTCTATCACTTTTTTTTGTTTCGTTTTGTCTTTCTCTTTTCTTACACTAAAATCTCTTTTCATTAAAAATATAAACAGAATCATAAGATAGCTCGGTCACCTCGAATCCGCCGCACATACGTCACAAGATACAAGACAAGGATCATGGAAGAGCAGAGTACTGAGATGGAGAAAACAAAGAAAGATATTGAGGAGTTACGAGAAAAAATGGATGCCATTCTTGTCGCCCTGGAAAGAGGCAAAATAATACCTGATATTGCTCAGTCCAGCAATACAATGATCGACCCTCCAATCCGGCAATCAACAGAGGGTACTACTCCAAAATATCATCCATTGTACAATATTCCAGTAGAGCAGCACCCATTTCCATTTTTCAAGAATGAGCAAGTGCTGTACACAATCAACCTGGATTTTCACTACCCACAGAGGTACCTCCCAAGGTGACCATTACAGTTCCCAATTTAGATGATCCTGAAATCAGAAAAGAGCTAACGGGAGGAGAGAAAGTTTCTTCTAGTGAAAAGCTTGAAGTCTTGGAGGAAAGATTAAGGGCAGTAGAAGGAACATACGTCTTCGGAAATATAGATGCGACCAAGCTATGCTTGGTACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGTCCTAAGAACCATCTCATCATGTATTGTAGGAAGATGGCAGCATACGTCCAAAATGACAAGCTGTTAATTCACTGCTTCCAGGACAGTCTTACTGGTCCAGCATCTCGATGGTATATGCAGTTAGACAGCACTCATATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGCTCCTGACCGCTTAGACCTCCAGAGGATGGAAAGAAGAGCACAGAAAGCTTTAAAGAGTACGCCCAAAGGTGGAGGGATACTGCTGCTCAGGTGCAACCACCTTTAGCTGATAAGGAGCTGTCGACCATGTTTATTAATACTCTCAAATCTCCTTTCTATGATAAGATGATTGGGAGCCCTCTACCAATTTCTCTGACATAATGACAATTGGAGAGAGAATCGAGTACGAATTAAGCATGGAAAGATAACTGATACGGCTGGATCATCAACAGCGAAAAAGGGGGTTCCATCGAAGAAAAAAGAGGGAGAAGTTCAGATGATTGGTTTCAATTCAAGACAACCATACCCTAATGCCGGAGTGCCACAATATCACTATCCACCTCCATATGTTTACCCTCAACCCTATATCAATAATACGTCAGCCCAATATTCATCCCCTTACGTCCAAAATCCTCGTCCTACTCAAGGCTACCACCTCGGAATCAACATAACACTCCTTATGTCCAAGGACACCAAAACAATAAAGGCGTCCGTAGACAGACTCATTTTGATCCGATTCCGATGACATATACCGAACTTCTACCCCAACTCTTCCAGAATAATCAGCTGGCACCCGTACCAATAGACCCAGTAAAGCCACCTTACCCAAAGTGGTATGACCCAAATGCCCGTTGCGACTACCATGCAGGAGCAATTGGACATTCCACTGAAAACTGTACTGCACTCAAGCATAGGGTGCAAGCATTGATAAAGGCAGGATGGTTGAACTTTAAGAAAGAAAATGGTCCAGATGTCAACAACAATCCTCTGCCAAACCATCAGAATGCACAAGTAAATGCAATAGAGGTTCATGGAGCTGATTTAAGAAAGAATGCTGAGAGCATAGTGACTCCCATGGGAGAACTATTCGAAATATTATTGAATAATGGATACATTGGAGTAGAACGCCTCC
AATTAGATTTGGGTGTCAGAGCATACGATGACAGTTTGATGTGTTCTTATCACACTGGGGCAAAAGGGCATTCTATTGACCAATGTCCTCATTTTCGTCTGAAAGTCCAGGAGTTGTTAGATTCACATTTTTTAACAGTTTCTCAAAAGATGGTTCAGCTCCCTCAGTATGGGGAAGTTGATATTATAGAAGAATGCTCAAGGTGTCTCTCAAGCCAAAACCGTTAACAATTTCTTATCGCGAGAAGCCCAGTACCCCAAATTCCAAGCCAAGACCGATTACCATCCAGATTCCGACTCCCTTTGAATATAAAAGTTCAAAAGCAGTACCTTGGAACTATGAATATAAGGTAACTGTTGGTTCTGAACCCCTCCAATCCATAATATTAGTGGGATAGGAGGTTAACACGAAGCGGGAAGTGTTATACACCAGAAGACCTATTGAAACCCAAGGGTAAAGAAAAAGGAAAAGCCAAGATCACTGAAGATATCAAGGAAAAAACAGAAGAGCCCATTGCAGTGAAAAACCTGGATGTTAAACAGCCAGCATCTGAAGATGACATCCAAGAATTTTTGAAGTTGGTTAAACAAAGTGATTATAAAGTGGTTGAACAGTTAGGTCGAACCCCTGCAAAGATTTCCATACTGGCTTTGTTACTGGCTTCAGATACGCATCGTAAGACTTTGTTGGACATTTTGAATCAAACTTATGTTCCGCAGGATATTACGGTGGACAACTTGGATAACATGTTGGAAACATAA
mRNA sequence
ATGCTATGTCTCAATTTAAGGATGTCAGTTTTCCTAGAAAGTCAATTAGAAACAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGAAAGGCTAAATTTACAAGCAAATATGGCCATCTAGCTCAGCTCATGTATGTACAAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTATAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCCCTTCTGCATATGCCAACACGAACAGAGGTTGAAGCTTATTCTTATGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACGAGCGACATTGAGAAACAAGTAAAGATAAAAGGAGAGAACACATGCCTACCCCTGGACTACATCCTTACTCTTCAACAAAAATTTGCAAATGAAGACAAGGAGTTAACTTTACTGGCATTGTGTATCTTTAATGTTGTTTTGTTTCCTAAAAACAAAGGAACAGGAAGATTTATAGGGTGTGCACCATTATTGTACATTTGGGTCCTTAGTCATGTGAAGTGTCCACCCGAATTCAAATGTCCAGAGATCAAATTTTCAAGTTCCTGGAATAAGCTGCGAAATCCCATTTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAGCGCTTGGGAAGCCTTTTTCTTCGAACTAAAGGTAGAAGATGTAATGTGGAGAGCCCCTTGGATGTCAACCAGGCCAATGATCTATAAATGTGGTAAATTTCAAAGTTTACCTCTTTTGGGTCCTTGGGGATGTATAGCCTATGCTCCTTTATTGGTAGTACGCCAAATTTGGGTCCGACAATTCATCCCTGCTACGCATGAGTTGAAGGATTTCGAATTTGCTTATGATAAAGGCTTCTGTAAAGATAGAATTCAGAAAATTGTGAAAGCATGGAAAATGATCACTAAAATCCAGAGTGATCAACAGACAGAACAAGCAGCACACGAAAAAGAATGTGATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGGCTGCAATTGGAGGTACAGCAAGGTTTGTTGCGCAATGTTGAACTAGAAAAAGAGTTAAACCGATTAAAGGGCAGTGTCAGCAAACAAGAACAGTTAGAAAAAGAAATTTCAGCATTAGACACAGAGGCCCGCGACCTGAACAGAAGAATGCATCGATTAAGAAGGGATAAGGAAGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAACAATCTGAGATTGCGTCACTTCATGAGTTGATGAAAGAGCTCGAAGATTGCAATAGTTTGAGGAACCAAACGATTACTGAGGATGCCACTGATAGATTAATGAAAGACTATACCTATTTAAAGGAGCAGTACAACAGATTAAGCGATGATTTTGGGTTTGCGAGACAGAACCACGCGACACTACGAAGTAAAGCGGAACATATGCTCACTCAGATTAGGAGAGTCACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACTCTCTCTAAAGTCATAGCACCTACACAGCCGAATAGCAAGAATAATCATAAGATAGCTCGGTCACCTCGAATCCGCCGCACATACGTCACAAGATACAAGACAAGGATCATGGAAGAGCAGAGTACTGAGATGGAGAAAACAAAGAAAGATATTGAGGAGTTACGAGAAAAAATGGATGCCATTCTTGTCGCCCTGGAAAGAGGCAAAATAATACCTGATATTGCTCAGTCCAGCAATACAATGATCGACCCTCCAATCCGGCAATCAACAGAGGAGGTACCTCCCAAGGTGACCATTACAGTTCCCAATTTAGATGATCCTGAAATCAGAAAAGAGCTAACGGGAGGAGAGAAAGTTTCTTCTAGTGAAAAGCTTGAAGTCTTGGAGGAAAGATTAAGGGCAGTAGAAGGAACATACGTCTTCGGAAATATAGATGCGAC
CAAGCTATGCTTGGTACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGTCCTAAGAACCATCTCATCATGTATTGTAGGAAGATGGCAGCATACGTCCAAAATGACAAGCTGTTAATTCACTGCTTCCAGGACAGTCTTACTGGTCCAGCATCTCGATGGTATATGCAGTTAGACAGCACTCATATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGCTCCTGACCGCTTAGACCTCCAGAGGATGGAAAGAAGAGCACAGAAAGCTTTAAAGAGTACGCCCAAAGGTGGAGGGATACTGCTGCTCAGCCCAATATTCATCCCCTTACGTCCAAAATCCTCGTCCTACTCAAGGCTACCACCTCGGAATCAACATAACACTCCTTATGTCCAAGGACACCAAAACAATAAAGGCGTCCGTAGACAGACTCATTTTGATCCGATTCCGATGACATATACCGAACTTCTACCCCAACTCTTCCAGAATAATCAGCTGGCACCCGTACCAATAGACCCAGTAAAGCCACCTTACCCAAAGTGGTATGACCCAAATGCCCGTTGCGACTACCATGCAGGAGCAATTGGACATTCCACTGAAAACTGTACTGCACTCAAGCATAGGGTGCAAGCATTGATAAAGGCAGGATGGTTGAACTTTAAGAAAGAAAATGGTCCAGATGTCAACAACAATCCTCTGCCAAACCATCAGAATGCACAAGTAAATGCAATAGAGGTTCATGGAGCTGATTTAAGAAAGAATGCTGAGAGCATAGTGACTCCCATGGGAGAACTATTCGAAATATTATTGAATAATGGATACATTGGAGTAGAACGCCTCCAATTAGATTTGGGTGTCAGAGCATACGATGACAGTTTGATGTGTTCTTATCACACTGGGGCAAAAGGGCATTCTATTGACCAATGTCCTCATTTTCGTCTGAAAGTCCAGGAGTTGTTAGATTCACATTTTTTAACAGTTTCTCAAAAGATGGTTCAGCTCCCTCAGTATGGGGAAGTTGATATTATAGAAGAATGCTCAAGGAGGTTAACACGAAGCGGGAAGTGTTATACACCAGAAGACCTATTGAAACCCAAGGGTAAAGAAAAAGGAAAAGCCAAGATCACTGAAGATATCAAGGAAAAAACAGAAGAGCCCATTGCAGTGAAAAACCTGGATGTTAAACAGCCAGCATCTGAAGATGACATCCAAGAATTTTTGAAGTTGGTTAAACAAAGTGATTATAAAGTGGTTGAACAGTTAGGTCGAACCCCTGCAAAGATTTCCATACTGGCTTTGTTACTGGCTTCAGATACGCATCGTAAGACTTTGTTGGACATTTTGAATCAAACTTATGTTCCGCAGGATATTACGGTGGACAACTTGGATAACATGTTGGAAACATAA
Coding sequence (CDS)
ATGCTATGTCTCAATTTAAGGATGTCAGTTTTCCTAGAAAGTCAATTAGAAACAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGAAAGGCTAAATTTACAAGCAAATATGGCCATCTAGCTCAGCTCATGTATGTACAAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTATAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCCCTTCTGCATATGCCAACACGAACAGAGGTTGAAGCTTATTCTTATGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACGAGCGACATTGAGAAACAAGTAAAGATAAAAGGAGAGAACACATGCCTACCCCTGGACTACATCCTTACTCTTCAACAAAAATTTGCAAATGAAGACAAGGAGTTAACTTTACTGGCATTGTGTATCTTTAATGTTGTTTTGTTTCCTAAAAACAAAGGAACAGGAAGATTTATAGGGTGTGCACCATTATTGTACATTTGGGTCCTTAGTCATGTGAAGTGTCCACCCGAATTCAAATGTCCAGAGATCAAATTTTCAAGTTCCTGGAATAAGCTGCGAAATCCCATTTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAGCGCTTGGGAAGCCTTTTTCTTCGAACTAAAGGTAGAAGATGTAATGTGGAGAGCCCCTTGGATGTCAACCAGGCCAATGATCTATAAATGTGGTAAATTTCAAAGTTTACCTCTTTTGGGTCCTTGGGGATGTATAGCCTATGCTCCTTTATTGGTAGTACGCCAAATTTGGGTCCGACAATTCATCCCTGCTACGCATGAGTTGAAGGATTTCGAATTTGCTTATGATAAAGGCTTCTGTAAAGATAGAATTCAGAAAATTGTGAAAGCATGGAAAATGATCACTAAAATCCAGAGTGATCAACAGACAGAACAAGCAGCACACGAAAAAGAATGTGATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGGCTGCAATTGGAGGTACAGCAAGGTTTGTTGCGCAATGTTGAACTAGAAAAAGAGTTAAACCGATTAAAGGGCAGTGTCAGCAAACAAGAACAGTTAGAAAAAGAAATTTCAGCATTAGACACAGAGGCCCGCGACCTGAACAGAAGAATGCATCGATTAAGAAGGGATAAGGAAGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAACAATCTGAGATTGCGTCACTTCATGAGTTGATGAAAGAGCTCGAAGATTGCAATAGTTTGAGGAACCAAACGATTACTGAGGATGCCACTGATAGATTAATGAAAGACTATACCTATTTAAAGGAGCAGTACAACAGATTAAGCGATGATTTTGGGTTTGCGAGACAGAACCACGCGACACTACGAAGTAAAGCGGAACATATGCTCACTCAGATTAGGAGAGTCACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACTCTCTCTAAAGTCATAGCACCTACACAGCCGAATAGCAAGAATAATCATAAGATAGCTCGGTCACCTCGAATCCGCCGCACATACGTCACAAGATACAAGACAAGGATCATGGAAGAGCAGAGTACTGAGATGGAGAAAACAAAGAAAGATATTGAGGAGTTACGAGAAAAAATGGATGCCATTCTTGTCGCCCTGGAAAGAGGCAAAATAATACCTGATATTGCTCAGTCCAGCAATACAATGATCGACCCTCCAATCCGGCAATCAACAGAGGAGGTACCTCCCAAGGTGACCATTACAGTTCCCAATTTAGATGATCCTGAAATCAGAAAAGAGCTAACGGGAGGAGAGAAAGTTTCTTCTAGTGAAAAGCTTGAAGTCTTGGAGGAAAGATTAAGGGCAGTAGAAGGAACATACGTCTTCGGAAATATAGATGCGAC
CAAGCTATGCTTGGTACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGTCCTAAGAACCATCTCATCATGTATTGTAGGAAGATGGCAGCATACGTCCAAAATGACAAGCTGTTAATTCACTGCTTCCAGGACAGTCTTACTGGTCCAGCATCTCGATGGTATATGCAGTTAGACAGCACTCATATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGCTCCTGACCGCTTAGACCTCCAGAGGATGGAAAGAAGAGCACAGAAAGCTTTAAAGAGTACGCCCAAAGGTGGAGGGATACTGCTGCTCAGCCCAATATTCATCCCCTTACGTCCAAAATCCTCGTCCTACTCAAGGCTACCACCTCGGAATCAACATAACACTCCTTATGTCCAAGGACACCAAAACAATAAAGGCGTCCGTAGACAGACTCATTTTGATCCGATTCCGATGACATATACCGAACTTCTACCCCAACTCTTCCAGAATAATCAGCTGGCACCCGTACCAATAGACCCAGTAAAGCCACCTTACCCAAAGTGGTATGACCCAAATGCCCGTTGCGACTACCATGCAGGAGCAATTGGACATTCCACTGAAAACTGTACTGCACTCAAGCATAGGGTGCAAGCATTGATAAAGGCAGGATGGTTGAACTTTAAGAAAGAAAATGGTCCAGATGTCAACAACAATCCTCTGCCAAACCATCAGAATGCACAAGTAAATGCAATAGAGGTTCATGGAGCTGATTTAAGAAAGAATGCTGAGAGCATAGTGACTCCCATGGGAGAACTATTCGAAATATTATTGAATAATGGATACATTGGAGTAGAACGCCTCCAATTAGATTTGGGTGTCAGAGCATACGATGACAGTTTGATGTGTTCTTATCACACTGGGGCAAAAGGGCATTCTATTGACCAATGTCCTCATTTTCGTCTGAAAGTCCAGGAGTTGTTAGATTCACATTTTTTAACAGTTTCTCAAAAGATGGTTCAGCTCCCTCAGTATGGGGAAGTTGATATTATAGAAGAATGCTCAAGGAGGTTAACACGAAGCGGGAAGTGTTATACACCAGAAGACCTATTGAAACCCAAGGGTAAAGAAAAAGGAAAAGCCAAGATCACTGAAGATATCAAGGAAAAAACAGAAGAGCCCATTGCAGTGAAAAACCTGGATGTTAAACAGCCAGCATCTGAAGATGACATCCAAGAATTTTTGAAGTTGGTTAAACAAAGTGATTATAAAGTGGTTGAACAGTTAGGTCGAACCCCTGCAAAGATTTCCATACTGGCTTTGTTACTGGCTTCAGATACGCATCGTAAGACTTTGTTGGACATTTTGAATCAAACTTATGTTCCGCAGGATATTACGGTGGACAACTTGGATAACATGTTGGAAACATAA
Protein sequence
MLCLNLRMSVFLESQLETVKQAWEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTLQQKFANEDKELTLLALCIFNVVLFPKNKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSSSSPERSAWEAFFFELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIAYAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSDQQTEQAAHEKECDELRKANSSLVQENERLQLEVQQGLLRNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDKEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCNSLRNQTITEDATDRLMKDYTYLKEQYNRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNNHKIARSPRIRRTYVTRYKTRIMEEQSTEMEKTKKDIEELREKMDAILVALERGKIIPDIAQSSNTMIDPPIRQSTEEVPPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMERRAQKALKSTPKGGGILLLSPIFIPLRPKSSSYSRLPPRNQHNTPYVQGHQNNKGVRRQTHFDPIPMTYTELLPQLFQNNQLAPVPIDPVKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQNAQVNAIEVHGADLRKNAESIVTPMGELFEILLNNGYIGVERLQLDLGVRAYDDSLMCSYHTGAKGHSIDQCPHFRLKVQELLDSHFLTVSQKMVQLPQYGEVDIIEECSRRLTRSGKCYTPEDLLKPKGKEKGKAKITEDIKEKTEEPIAVKNLDVKQPASEDDIQEFLKLVKQSDYKVVEQLGRTPAKISILALLLASDTHRKTLLDILNQTYVPQDITVDNLDNMLET
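The protein sequence above should be reproducible from the coding sequence by translating it codon by codon. The following is a minimal sketch of that check, assuming the standard nuclear genetic code applies to this gene; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Drop line breaks and spaces left over from copy-and-paste.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing anything other than T, C, A, G become "X".
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGCTATGTCTCAATTTAAGGATG"))
```

Passing the full CDS (with its line break removed) and comparing the result with the listed protein sequence is a quick way to confirm that the two records agree.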
Homology
BLAST of Lag0015740 vs. NCBI nr
Match:
KAA0036933.1 (uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa])
HSP 1 Score: 703.7 bits (1815), Expect = 2.5e-198
Identity = 453/1299 (34.87%), Positives = 647/1299 (49.81%), Query Frame = 0
Query: 23 WEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEY 82
WE LT R+ F+ KYGH+A+LMY+ VNY L+A+I DPAY CFTFGS D+ PTIEEY
Sbjct: 3 WEALTPQRRFMFSKKYGHIAELMYIPVNYFALRAIINFGDPAYGCFTFGSCDLLPTIEEY 62
Query: 83 QSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTL 142
Q++L MP + Y ++ + T KR LS L + ++I+K +K KG +P DY++ +
Sbjct: 63 QAMLSMPQKEREIVYFFNPKQTTKRTLSKFLETVHATEIQKYIKAKGGEENVPFDYLIKM 122
Query: 143 QQKFANEDKELTLLALCIFNVVLFPK---------------------------------- 202
Q + +EDK LTLLALCI+ V+FPK
Sbjct: 123 TQTYIDEDKGLTLLALCIYGAVIFPKAEGYVDRKVIKLFFQMERGVNPIIPILAETFRSL 182
Query: 203 ----NKG--------TGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISE 262
NKG G+ C PLLYIW+ SH+K P EF+CP + FSS WN +RN ISE
Sbjct: 183 NYCRNKGEGKLNCCVRGKLNCCVPLLYIWIHSHIKFPAEFRCPRLDFSSPWNLMRNTISE 242
Query: 263 FVQSGWSSSSPERSAWEAFFFELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIA 322
F + W + P + AW +FF +L E+V+W+A WM + +IY+CG F S+PLLGPWG +
Sbjct: 243 FGMAVWDPTYPRKEAWLSFFAKLTSENVIWKAQWMPLKAVIYRCGDFHSVPLLGPWGGVN 302
Query: 323 YAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFCK-------DRIQKIVKAWKMITKIQS 382
Y PLLV+RQ+W++QFIP TH LK + + +G +R + I+ + + +
Sbjct: 303 YTPLLVLRQVWLKQFIPPTHNLKIKDKGHYEGVTSGYEAWQANRRKNIIDISREVVERGK 362
Query: 383 DQQTEQAAH--EKECDELRKANSSLVQENERLQLEVQQGLLRNVELEKELNRLKGSVSKQ 442
+ EQ EK EL + N L QENE+L+ E Q + L+ EL + K + Q
Sbjct: 363 ETSFEQPNQWIEKSI-ELEQKNRLLEQENEKLRKETSQWMDHATYLQNELEKTKSFLKNQ 422
Query: 443 EQLEKEISALDTEARDLNRRMHRLRRDKEVSQATLKSRNDQVLKQQSEIASLHELMKELE 502
++LE ++ LD E R +N+ ++ +K QAT+ + + +E + ++++K
Sbjct: 423 DKLETDLETLDKEMRRMNKANRSMKNEKTTLQATV-----GLHLKMAERSEEYKILKNYA 482
Query: 503 DCNSLRNQTITEDATDRLMKDYTYLKEQYNRLSDDFGFARQNHATLRSKAEHMLTQIRRV 562
D + T ++++ R+ ++Y L Y ++ D+ ++ L + + + +R V
Sbjct: 483 DFLHYQ-LTALQNSSKRITQEYESLNTDYVQMKVDYDLHTRDFQVLVERVDQTIEFLRMV 542
Query: 563 TRRADELAEDARTLSKVIAPTQPNSKNNHKIARSPRIRRTYVTRYKTRIMEEQSTEMEKT 622
++RA+ AE A K + IR Y TRYK++IMEE+ +M+K
Sbjct: 543 SKRANGFAEWA-----------------GKYSSFTPIRHPYNTRYKSQIMEEKDKDMDKM 602
Query: 623 KKDIEELREKMDAI--LVALERGKIIPDIAQSSNTMIDPPIRQSTEEVPPKVTITVPNLD 682
+++I L E++ I L+++ +GK D QSSN PI+ + + + P T +++
Sbjct: 603 RQEINNLGEQVSKILELLSMGKGKAAVDTTQSSN-----PIQDTDDPIYPP-GFTPYHIN 662
Query: 683 DPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDVILPPKFKVPE 742
P ++ T ++ S +KL+VLEERLRA+E T V+GNIDAT+LCLVP +I+P KFKVPE
Sbjct: 663 VPRLKPLNTMFLRIRSKQKLDVLEERLRAIEETDVYGNIDATQLCLVPGLIIPAKFKVPE 722
Query: 743 FEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKN 802
F KYDG++CP++HLIMYCRKMA ++ NDKLL+HCFQDSLT PASRWY+QLD+ HI WK+
Sbjct: 723 FNKYDGSTCPRSHLIMYCRKMAVHINNDKLLVHCFQDSLTDPASRWYIQLDNAHIHVWKD 782
Query: 803 LADSFLKQYKHNIDMAPDRLDLQRMERRAQKALK-------------------------- 862
LAD+FLKQYK NIDMAPDRLDLQRME+++ ++ K
Sbjct: 783 LADAFLKQYKLNIDMAPDRLDLQRMEKKSSESFKEYAQRWRDMAAEVQPPLTDKEMTSMF 842
Query: 863 -----------------------------------------STPKGGGILL--------- 922
+T + GGI
Sbjct: 843 MNTLRAPFYERMIGNASTNFSDIIVIGERIEYGIKHGRLAEATTEYGGIKKGTISKKKEG 902
Query: 923 -LSPIFIPLRPKSSSYSRL---------------------------------PPRNQHNT 982
+ I P K S L P +
Sbjct: 903 EVHAIGFPNSGKHKSIFGLRKYEQNFPSYINNVSHIPYNSYVPAHTVSETPKPVNSNSPR 962
Query: 983 PYVQGHQNNKGVRRQTHFDPIPMTYTELLPQLFQNNQLAPVPIDPVKPPYPKWYDPNARC 1042
P+VQG Q +K FDPIPMTYTELLPQL QN QLA +P+ P++PPYPKWYD NARC
Sbjct: 963 PFVQG-QGSKTNSDTWRFDPIPMTYTELLPQLIQNRQLASIPMIPIQPPYPKWYDSNARC 1022
Query: 1043 DYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNAQVNAIEVH 1102
DYHAG +GHSTENC ALK VQ+LI AGWL+FKK +VN NPLP+ +N +VN ++
Sbjct: 1023 DYHAGGVGHSTENCLALKRNVQSLINAGWLSFKKSGEKSNVNENPLPDPENPKVNVVDSL 1082
Query: 1103 GADLRKNAESIVTPMGELFEILLNNGYIGVERLQLDLGVRAYDDSLMCSYHTGAKGHSID 1154
+ IV PM +F L GY+ E L ++ ++ AK H D
Sbjct: 1083 VEKCKNEVHEIVMPMEAVFG-LFEAGYVSHEYLDPNIRYEGKNEK------RNAKEHCKD 1142
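The identity and positive percentages in each HSP summary line are the matched counts divided by the alignment length. The sketch below recomputes them from such a line; the regular expression and the function name `hsp_percentages` are assumptions made for illustration and are not part of any BLAST tool.

```python
import re

# Matches the "Identity = a/b (...), Positives = c/d (...)" summary line.
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) rounded to two decimals."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


print(hsp_percentages("Identity = 453/1299 (34.87%), Positives = 647/1299 (49.81%)"))
```

For the first match this gives 453/1299 and 647/1299, which agree with the 34.87% and 49.81% reported in the summary line.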
BLAST of Lag0015740 vs. NCBI nr
Match:
XP_022158986.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia])
HSP 1 Score: 625.5 bits (1612), Expect = 8.8e-175
Identity = 359/756 (47.49%), Positives = 431/756 (57.01%), Query Frame = 0
Query: 620 TVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDVILPP 679
TV NL + +L G+ S+EK EVLEERLRA+EGTYVFGNIDA++LCLV +++PP
Sbjct: 6 TVLNLGGLPAKTDLV-GQNAPSNEKFEVLEERLRAIEGTYVFGNIDASQLCLVSGLVIPP 65
Query: 680 KFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTH 739
KFKVPEFEKYDG+SCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSL+GPASRWYMQLDS++
Sbjct: 66 KFKVPEFEKYDGSSCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQLDSSN 125
Query: 740 ICSWKNLADSFLKQYKHNIDMAPDRLDLQRMERR-------------------------- 799
+ SWKNLADSFLKQYKHNIDMAPDRLDLQRME++
Sbjct: 126 VGSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLTDK 185
Query: 800 ------------------------------------------------------AQKALK 859
A+KA
Sbjct: 186 ELSAMFINTLKHPFYDRMIGNASTNFSDIMTIGERIEYGVRHGRITSTVDEPLAAKKASH 245
Query: 860 STPKGGGILLL------------------SPIFIP------------------------- 919
S K G + ++ +P + P
Sbjct: 246 SKKKEGEVQMVGADRHSWKQQPYSRTPRYTPYYYPTPYGYNQPFVNNATSHYSPYTFQNF 305
Query: 920 -------LRPKSSSYSRLPPRNQHNTPYVQGHQNNKGVRRQTHFDPIPMTYTELLPQLFQ 979
+P +S + P QHNT Y Q Q N+G R+QT FDPIPMTYTELLPQLFQ
Sbjct: 306 RPPASQNFQPTPASQNFQPRGQQHNTLYTQEQQTNRGARKQTQFDPIPMTYTELLPQLFQ 365
Query: 980 NNQLAPVPIDPVKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKK 1039
NNQLAPVP+DP++PPYP+WYD NARCDYHAGAIGHSTENCTALK+RVQALIKAGWLNFKK
Sbjct: 366 NNQLAPVPVDPIQPPYPRWYDTNARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKK 425
Query: 1040 ENGPDVNNNPLPNHQNAQVNAIEVHGADLRKNAESIVTPMGELFEILLNNGYIGVERLQL 1099
ENGPDV+ NPLPNHQN Q+NAIE + + I TPM ELFEILL +GY+ VE L
Sbjct: 426 ENGPDVSKNPLPNHQNVQINAIECQEIESKSKVADIRTPMVELFEILLGSGYVSVEYLCP 485
Query: 1100 DLGVRAYDDSLMCSYHTGAKGHSIDQCPHFRLKVQELLDSHFLTVS----QKMVQLPQ-- 1154
+L + YD+SL C +H GAKGHS++QC FR+KVQELLDS LTV+ +K + + +
Sbjct: 486 NLKYKGYDESLTCPFHAGAKGHSLEQCNSFRMKVQELLDSKILTVANSHQKKGINIVEDV 545
BLAST of Lag0015740 vs. NCBI nr
Match:
XP_022147189.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia])
HSP 1 Score: 593.6 bits (1529), Expect = 3.7e-165
Identity = 351/756 (46.43%), Positives = 420/756 (55.56%), Query Frame = 0
Query: 620 TVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDVILPP 679
TV NL D + K G+ S+EK EVL+ERLRA+E T VFGNIDA++LC V +++PP
Sbjct: 46 TVLNLGD-LLAKTDPVGQNAPSNEKFEVLKERLRAIERTDVFGNIDASQLCSVSGLVIPP 105
Query: 680 KFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTH 739
K KVPEFEKY+G+SCPKNHL MYCRKMAAYVQNDKLLIHCFQDSL+GPASRWYMQLDS+H
Sbjct: 106 KLKVPEFEKYNGSSCPKNHLXMYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQLDSSH 165
Query: 740 ICSWKNLADSFLKQYKHNIDMAPDRLDLQRMERR-------------------------- 799
+ SWKNLADSFLKQYKHNIDMAPDRLDLQRME++
Sbjct: 166 VGSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTKSFKEYAQRWRDTAAQVQPPLIDK 225
Query: 800 ------------------------------------------------------AQKALK 859
A+KA
Sbjct: 226 ELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGERIEYGVRHGRITSTTDEPLAAKKASH 285
Query: 860 STPKGGGILLL------------------SPIFIP------------------------L 919
S K G + ++ SP + P
Sbjct: 286 SKKKEGEVQMVGADRHSWKQQPYRRTPQYSPYYYPTPYGYNQPFVNNATSHYYPYASQNF 345
Query: 920 RPKSSSYSRLPPRN--------QHNTPYVQGHQNNKGVRRQTHFDPIPMTYTELLPQLFQ 979
RP +S +L P + QHNT Y QG QNN+G R+QT FDPIPMTYTELLPQLFQ
Sbjct: 346 RPPASQNFQLTPTSQNFQPRGQQHNTFYTQGQQNNRGARKQTQFDPIPMTYTELLPQLFQ 405
Query: 980 NNQLAPVPIDPVKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKK 1039
NNQLAPVP+DP++PPYP+WYD NARCDYHAGAI HSTENCT LK+RVQALIKAGW NFKK
Sbjct: 406 NNQLAPVPVDPIQPPYPRWYDANARCDYHAGAIXHSTENCTXLKYRVQALIKAGWXNFKK 465
Query: 1040 ENGPDVNNNPLPNHQNAQVNAIEVHGADLRKNAESIVTPMGELFEILLNNGYIGVERLQL 1099
ENG DV+ L NHQN Q+NAIE G + + I TPM ELFEILL +GYI VE L
Sbjct: 466 ENGXDVSKXXLXNHQNVQINAIECQGIESKSKVABITTPMXELFEILLGSGYISVE--YL 525
Query: 1100 DLGVRAYDDSLMCSYHTGAKGHSIDQCPHFRLKVQELLDSHFLTVS----QKMVQLPQ-- 1154
+ YD+SL C +H GAKGHS++QC FR+KVQELLDS LT + +K + +
Sbjct: 526 CPKYKGYDESLTCXFHXGAKGHSLEQCNXFRMKVQELLDSKILTXANSHXKKXTNVVEDI 585
BLAST of Lag0015740 vs. NCBI nr
Match:
EOX94372.1 (Uncharacterized protein TCM_003960 [Theobroma cacao])
HSP 1 Score: 592.0 bits (1525), Expect = 1.1e-164
Identity = 438/1348 (32.49%), Positives = 644/1348 (47.77%), Query Frame = 0
Query: 23 WEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEY 82
W+K +A F KYGH+A+L+ VQ++ +LKA+++ WDP+YRCF F +DM PTIEEY
Sbjct: 59 WDKWGATTQANFDRKYGHIARLLKVQIDEHLLKAIVQFWDPSYRCFVFNKVDMVPTIEEY 118
Query: 83 QSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTL 142
+LL + + Y Q+ +R L+ ++G I +++I+ ++ KG+N C+P ++ +
Sbjct: 119 SALLQIDLDNPDKIYWRGQKTGHRRKLAKMMG-ITSAEIDHNLRKKGDNECIPWSFLRSY 178
Query: 143 QQKFANEDKELTLLALCIFNVVLFPK---------------------------------- 202
K + ++ ++AL I+ +V+FPK
Sbjct: 179 IMKHRDTEQAQLVMALGIYGLVIFPKILGHIEVGIIDFFEQVVNKANPSPSILAKTLRSL 238
Query: 203 ----NKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSS 262
KG GRF+GCA LL IW++SH F+C KF ++ PI EF +S W
Sbjct: 239 NYCRRKGEGRFVGCAQLLSIWIVSH------FECKIDKFRKPFHLQTAPIREFCESEWPE 298
Query: 263 SSPERSAWEAFFFELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIAYAPLLVVR 322
+ + W + F +L +V WRAPWM P++YKC +PL+GPWG I+YAP++V R
Sbjct: 299 NR-TKEQWISRFRKLMSVEVTWRAPWMPHHPVLYKCENEPWVPLMGPWGAISYAPIMVRR 358
Query: 323 QIWVRQFIPATHELKDFEFAY-DKGFCKDRIQKIVKAWKMITKIQS-------------- 382
Q QF+P TH L EFAY + GF K RI++I +AWK +++
Sbjct: 359 QFGSEQFVPMTHRLNTLEFAYGEPGFLK-RIEEIAQAWKKTSRVDQGRYTDEVTTGYQMW 418
Query: 383 -DQQTEQAAHEKECDELR------KANSSLVQENERLQLEVQQG--LLRNVELEKELNRL 442
DQ+ + + KE D +R ++ L E R + E + R +L+KE ++
Sbjct: 419 HDQRVKDVVYPKE-DAIRGPVDPEPRDALLESELARKKSEAENASWKQRYEDLQKECEKM 478
Query: 443 KGSVSKQ----EQLEKEISALDTEARDLNRRMHRLRRDKEVSQATLKSRND----QVLKQ 502
K VS+Q +++E + +L+ + + R + KE L++ ND QV Q
Sbjct: 479 KREVSQQRKKVQKMEGKYESLNDKFSATTSELQREIQVKENRGNELQTHNDGLRRQVRFQ 538
Query: 503 QSEIASLHELMKELEDCNSLRNQTITEDATDRLMKDYTYLKEQYNRLSDDFGFARQNHAT 562
Q I L + +ELE + Q +Y LK+Q R+ RQ +
Sbjct: 539 QESIQILRQEYEELEGVMTTYQQ------------EYESLKQQSTRIQKWGESYRQAYTE 598
Query: 563 LRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNNHKIARSPRIRRTYVTRY 622
+ ++++ Q+R V +A +A + L I P K ++ +
Sbjct: 599 KYDQMDYLVWQMREVAYKARSMAWETDILRSQIFPV---GKQEQQLIKH--------LDE 658
Query: 623 KTRIMEEQSTE----MEKTKKDIEELREKMDAILVALERGKII---------PDIAQSSN 682
+ RIMEE+ E ME+ ++++ E KM ++++L +GK P S N
Sbjct: 659 RARIMEEEQGERMDRMERAQEEMREQLAKMMELMMSLSKGKRAIEEPAPSENPPAQDSGN 718
Query: 683 TMID--------PPIRQSTEEVPPKV-------------------------------TIT 742
D PP Q+ + V P+V I
Sbjct: 719 QRDDPSYPPGFTPPHAQTFQRVHPQVMPSIYYNAPPPLGHQPTHGQFGPYPGINPAEPIN 778
Query: 743 VPNLDDPEIRKEL-----TGGEKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDV 802
VP+LDDP+ +++L GE +K ++LEERLRA+EG FG +DAT+LCLVPDV
Sbjct: 779 VPDLDDPKEQEKLRKDSSQTGENEKDQKKYDLLEERLRAIEGVDRFGTMDATELCLVPDV 838
Query: 803 ILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQL 862
++P KFKVP+FEKYDG CP H+ MYCRKMAA +DKLLIH FQDSLTG A+R +
Sbjct: 839 LIPAKFKVPKFEKYDGTKCPMAHITMYCRKMAAQSHDDKLLIHFFQDSLTGSAARCSKKG 898
Query: 863 DSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMERRAQKALKSTPKGGGILLLSPIFI 922
+ A + Q HN + Q P G I ++
Sbjct: 899 STPKKKEGDVQAVAHDSQQAHNFNPYYPYPPYQPF----------YPHIGSITQNPYVYQ 958
Query: 923 PLRPKSSSYSRLP------PRNQHNTPYVQGHQNNKGVRRQTHFDPIPMTYTELLPQLFQ 982
P+ + + LP P N P G + K + FDPIP+ YT LLPQL +
Sbjct: 959 PIPQPTFQTNVLPQTPPPRPVASTNNP-GNGQRGPKTTLERPKFDPIPVPYTTLLPQLIE 1018
Query: 983 NNQLAPVPIDPVKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKK 1042
N LA P++P++P +PKWYDPNA CDYH G GHSTENCTALKH+VQ LIKAG LNF K
Sbjct: 1019 NRLLARTPLEPLRPSFPKWYDPNAHCDYHFGIQGHSTENCTALKHKVQVLIKAGLLNFTK 1078
Query: 1043 ENGPDVNNNPLPNHQNAQVNAI-EVHGADLRKNAESIVTPMGELFEILLNNGYIGVERLQ 1102
++ V+ NPLPNH VNAI E ++K + I TPM ++FE L + E
Sbjct: 1079 KDSSGVDGNPLPNHGRPTVNAIHEGMIRMVKKGIDEIQTPMDKVFEALSKINAVTPE--P 1138
Query: 1103 LDLGVRAYDDSLMCSYHTGAKGHSIDQCPHFRLKVQELLDSHFL---------------- 1154
+D +D + C +H GA GHSI FR K+QEL+DS +
Sbjct: 1139 IDTKELGHDLTYSCKFHMGAIGHSIQNYDGFRRKLQELMDSSVIEFYEGAEENLVGTING 1198
BLAST of Lag0015740 vs. NCBI nr
Match:
EOY09468.1 (Uncharacterized protein TCM_024883 [Theobroma cacao])
HSP 1 Score: 588.6 bits (1516), Expect = 1.2e-163
Identity = 452/1457 (31.02%), Positives = 663/1457 (45.50%), Query Frame = 0
Query: 23 WEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEY 82
W+K +A F KYGH+A+L+ VQ++ +LKA+++ WDP+YRCF F IDM
Sbjct: 59 WDKWGAITRANFDRKYGHIARLLKVQIDEHLLKAIVQFWDPSYRCFVFNKIDM------- 118
Query: 83 QSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTL 142
+ +R L+ ++G I ++++++ ++ KG+N C+P ++ +
Sbjct: 119 -------------------KTGHRRKLAKMMG-ITSAEVDQNLRKKGDNECIPWSFLRSY 178
Query: 143 QQKFANEDKELTLLALCIFNVVLFPK---------------------------------- 202
K + ++ ++AL I+ +V+FPK
Sbjct: 179 IMKQRDTEQGQLVMALGIYGLVIFPKVLGHIEVRIIDFFEQVVNKANPSPSILAETLRSL 238
Query: 203 ----NKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSS 262
KG GRF+GCA LL IW++SH F+C KF ++ PI EF +S W
Sbjct: 239 NYCRRKGEGRFVGCAQLLSIWIVSH------FECKVDKFRKPFHPQTAPIREFCESEWPE 298
Query: 263 SSPERSAWEAFFFELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIAYAPLLVVR 322
+ + W + EL +V WRAPWM P++YKCG + L+GPWG I+YAP++V R
Sbjct: 299 NR-TKEQWISRLRELMSVEVTWRAPWMPHHPVLYKCGNEPWVQLMGPWGAISYAPIMVRR 358
Query: 323 QIWVRQFIPATHELKDFEFAYDK-GFCKDRIQKIVKAWKMITKIQS-------------- 382
Q QF+P TH L EFAY++ GF K RI++I +AWK +++
Sbjct: 359 QFGSEQFVPMTHRLNTLEFAYEEPGFLK-RIEEIAQAWKKTSRVDQGRYTDEVTIGYQIW 418
Query: 383 -DQQTEQAAHEKECDELR------KANSSLVQENERLQLEVQQG--LLRNVELEKELNRL 442
DQ+ + + KE D LR ++ L E R + E + R +L+KE ++
Sbjct: 419 HDQRVKDVVYPKE-DVLRGPVDPEPRDALLESELARKKSEAENASWKQRYEDLQKECEKM 478
Query: 443 KGSVSKQ----EQLEKEISALDTEARDLNRRMHRLRRDKEVSQATLKSRND----QVLKQ 502
K VS+Q ++E + +L+ + + R + +E L++ ND QV Q
Sbjct: 479 KREVSEQRKKVRKMEGKYESLNDKFSTTTSELQREIQVRENRGNELQTHNDGLRRQVRFQ 538
Query: 503 QSEIASLHELMKELEDCNSLRNQTITEDATDRLMKDYTYLKEQYNRLSDDFGFARQNHAT 562
Q I L + +ELE + Q +Y LK+Q R+ + RQ +
Sbjct: 539 QESIQLLRQEYEELEGVMTTYQQ------------EYERLKQQSTRIQEWGESYRQAYTE 598
Query: 563 LRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNNHKIARSPRIRRTYVTRY 622
+ ++++ Q+R V +A +A L I P K ++ + Y+
Sbjct: 599 KYDQMDYLVWQMREVAYKARSMAWKTDILRSQIFPV---GKQEQQLIK-------YLDE- 658
Query: 623 KTRIMEEQSTE----MEKTKKDIEELREKMDAILVALERGKII---------PDIAQSSN 682
+ RIMEE+ E ME+ ++++ E KM ++++L +GK P S N
Sbjct: 659 RARIMEEEQRERMDRMERAQEEMREQLAKMMKLMMSLSKGKRAIEEPAPSENPPAQDSGN 718
Query: 683 TMIDPPI--------RQSTEEVPPKV-------------------------------TIT 742
DPP Q+++ V P+V I
Sbjct: 719 QREDPPYPPGFTPPHAQTSQRVHPQVMPSVYYNAPPPMGHQPTHGQFGPYLGVNPIEPIH 778
Query: 743 VPNLDDPEIRKELTGG-----EKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDV 802
VP+LDDP+ +++L E +K ++LEERLRA+EG FG +DAT+LCLVPDV
Sbjct: 779 VPDLDDPKEQEKLRKDSSQTRENEKDQKKYDLLEERLRAIEGVDRFGTMDATELCLVPDV 838
Query: 803 ILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQL 862
++P KFKVPEFEKYD CP H+ M CRKMAA +DKLLIH FQDSLTG A+RWY+QL
Sbjct: 839 LIPAKFKVPEFEKYDETKCPMAHITMNCRKMAAQSHDDKLLIHLFQDSLTGSAARWYVQL 898
Query: 863 DSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMERRAQKALKS--------------- 922
D I +WK+LA +F+ QYKH ++APDRL LQ ME++ + K
Sbjct: 899 DRNRIKTWKDLARAFIAQYKHVAELAPDRLSLQTMEKKQSENFKEYAQRWRDTAAQVQPP 958
Query: 923 -TPKGGGILLLSPIFIPLRPK---------------------------------SSSYSR 982
T K +L ++ + P + +SS
Sbjct: 959 LTDKEMTVLFINTLRAPFYERLIGNATKNFTDLVLSGEIIEGAIKSGKIEGHEVASSKKG 1018
Query: 983 LPPRNQ--------------HN----------------------TPYV------------ 1042
PR + HN PYV
Sbjct: 1019 STPRKKEGDVQAVAHDSQQAHNFNLYYPYPPYQPFYPHIGNITQNPYVYQPIPQPTFQTN 1078
Query: 1043 ------------------QGHQNNKGVRRQTHFDPIPMTYTELLPQLFQNNQLAPVPIDP 1102
G + K + FD IP+ YT LLPQL + L P++P
Sbjct: 1079 VLPQTPPPRPIASTNNPGHGQRGPKTTPERPKFDHIPVPYTTLLPQLIEKRLLTQTPLEP 1138
Query: 1103 VKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPL 1154
++PP+PKWYDPNA CDYH G GHSTENCTALKH+VQALIKAG LNF K++ V+ NPL
Sbjct: 1139 LRPPFPKWYDPNAHCDYHFGIQGHSTENCTALKHKVQALIKAGLLNFTKKDSSSVDGNPL 1198
BLAST of Lag0015740 vs. ExPASy TrEMBL
Match:
A0A5A7T1W2 (Retrotrans_gag domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold86G00060 PE=4 SV=1)
HSP 1 Score: 703.7 bits (1815), Expect = 1.2e-198
Identity = 453/1299 (34.87%), Positives = 647/1299 (49.81%), Query Frame = 0
Query: 23 WEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEY 82
WE LT R+ F+ KYGH+A+LMY+ VNY L+A+I DPAY CFTFGS D+ PTIEEY
Sbjct: 3 WEALTPQRRFMFSKKYGHIAELMYIPVNYFALRAIINFGDPAYGCFTFGSCDLLPTIEEY 62
Query: 83 QSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTL 142
Q++L MP + Y ++ + T KR LS L + ++I+K +K KG +P DY++ +
Sbjct: 63 QAMLSMPQKEREIVYFFNPKQTTKRTLSKFLETVHATEIQKYIKAKGGEENVPFDYLIKM 122
Query: 143 QQKFANEDKELTLLALCIFNVVLFPK---------------------------------- 202
Q + +EDK LTLLALCI+ V+FPK
Sbjct: 123 TQTYIDEDKGLTLLALCIYGAVIFPKAEGYVDRKVIKLFFQMERGVNPIIPILAETFRSL 182
Query: 203 ----NKG--------TGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISE 262
NKG G+ C PLLYIW+ SH+K P EF+CP + FSS WN +RN ISE
Sbjct: 183 NYCRNKGEGKLNCCVRGKLNCCVPLLYIWIHSHIKFPAEFRCPRLDFSSPWNLMRNTISE 242
Query: 263 FVQSGWSSSSPERSAWEAFFFELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIA 322
F + W + P + AW +FF +L E+V+W+A WM + +IY+CG F S+PLLGPWG +
Sbjct: 243 FGMAVWDPTYPRKEAWLSFFAKLTSENVIWKAQWMPLKAVIYRCGDFHSVPLLGPWGGVN 302
Query: 323 YAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFCK-------DRIQKIVKAWKMITKIQS 382
Y PLLV+RQ+W++QFIP TH LK + + +G +R + I+ + + +
Sbjct: 303 YTPLLVLRQVWLKQFIPPTHNLKIKDKGHYEGVTSGYEAWQANRRKNIIDISREVVERGK 362
Query: 383 DQQTEQAAH--EKECDELRKANSSLVQENERLQLEVQQGLLRNVELEKELNRLKGSVSKQ 442
+ EQ EK EL + N L QENE+L+ E Q + L+ EL + K + Q
Sbjct: 363 ETSFEQPNQWIEKSI-ELEQKNRLLEQENEKLRKETSQWMDHATYLQNELEKTKSFLKNQ 422
Query: 443 EQLEKEISALDTEARDLNRRMHRLRRDKEVSQATLKSRNDQVLKQQSEIASLHELMKELE 502
++LE ++ LD E R +N+ ++ +K QAT+ + + +E + ++++K
Sbjct: 423 DKLETDLETLDKEMRRMNKANRSMKNEKTTLQATV-----GLHLKMAERSEEYKILKNYA 482
Query: 503 DCNSLRNQTITEDATDRLMKDYTYLKEQYNRLSDDFGFARQNHATLRSKAEHMLTQIRRV 562
D + T ++++ R+ ++Y L Y ++ D+ ++ L + + + +R V
Sbjct: 483 DFLHYQ-LTALQNSSKRITQEYESLNTDYVQMKVDYDLHTRDFQVLVERVDQTIEFLRMV 542
Query: 563 TRRADELAEDARTLSKVIAPTQPNSKNNHKIARSPRIRRTYVTRYKTRIMEEQSTEMEKT 622
++RA+ AE A K + IR Y TRYK++IMEE+ +M+K
Sbjct: 543 SKRANGFAEWA-----------------GKYSSFTPIRHPYNTRYKSQIMEEKDKDMDKM 602
Query: 623 KKDIEELREKMDAI--LVALERGKIIPDIAQSSNTMIDPPIRQSTEEVPPKVTITVPNLD 682
+++I L E++ I L+++ +GK D QSSN PI+ + + + P T +++
Sbjct: 603 RQEINNLGEQVSKILELLSMGKGKAAVDTTQSSN-----PIQDTDDPIYPP-GFTPYHIN 662
Query: 683 DPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDVILPPKFKVPE 742
P ++ T ++ S +KL+VLEERLRA+E T V+GNIDAT+LCLVP +I+P KFKVPE
Sbjct: 663 VPRLKPLNTMFLRIRSKQKLDVLEERLRAIEETDVYGNIDATQLCLVPGLIIPAKFKVPE 722
Query: 743 FEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKN 802
F KYDG++CP++HLIMYCRKMA ++ NDKLL+HCFQDSLT PASRWY+QLD+ HI WK+
Sbjct: 723 FNKYDGSTCPRSHLIMYCRKMAVHINNDKLLVHCFQDSLTDPASRWYIQLDNAHIHVWKD 782
Query: 803 LADSFLKQYKHNIDMAPDRLDLQRMERRAQKALK-------------------------- 862
LAD+FLKQYK NIDMAPDRLDLQRME+++ ++ K
Sbjct: 783 LADAFLKQYKLNIDMAPDRLDLQRMEKKSSESFKEYAQRWRDMAAEVQPPLTDKEMTSMF 842
Query: 863 -----------------------------------------STPKGGGILL--------- 922
+T + GGI
Sbjct: 843 MNTLRAPFYERMIGNASTNFSDIIVIGERIEYGIKHGRLAEATTEYGGIKKGTISKKKEG 902
Query: 923 -LSPIFIPLRPKSSSYSRL---------------------------------PPRNQHNT 982
+ I P K S L P +
Sbjct: 903 EVHAIGFPNSGKHKSIFGLRKYEQNFPSYINNVSHIPYNSYVPAHTVSETPKPVNSNSPR 962
Query: 983 PYVQGHQNNKGVRRQTHFDPIPMTYTELLPQLFQNNQLAPVPIDPVKPPYPKWYDPNARC 1042
P+VQG Q +K FDPIPMTYTELLPQL QN QLA +P+ P++PPYPKWYD NARC
Sbjct: 963 PFVQG-QGSKTNSDTWRFDPIPMTYTELLPQLIQNRQLASIPMIPIQPPYPKWYDSNARC 1022
Query: 1043 DYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNAQVNAIEVH 1102
DYHAG +GHSTENC ALK VQ+LI AGWL+FKK +VN NPLP+ +N +VN ++
Sbjct: 1023 DYHAGGVGHSTENCLALKRNVQSLINAGWLSFKKSGEKSNVNENPLPDPENPKVNVVDSL 1082
Query: 1103 GADLRKNAESIVTPMGELFEILLNNGYIGVERLQLDLGVRAYDDSLMCSYHTGAKGHSID 1154
+ IV PM +F L GY+ E L ++ ++ AK H D
Sbjct: 1083 VEKCKNEVHEIVMPMEAVFG-LFEAGYVSHEYLDPNIRYEGKNEK------RNAKEHCKD 1142
BLAST of Lag0015740 vs. ExPASy TrEMBL
Match:
A0A6J1E2J7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1)
HSP 1 Score: 625.5 bits (1612), Expect = 4.3e-175
Identity = 359/756 (47.49%), Positives = 431/756 (57.01%), Query Frame = 0
Query: 620 TVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDVILPP 679
TV NL + +L G+ S+EK EVLEERLRA+EGTYVFGNIDA++LCLV +++PP
Sbjct: 6 TVLNLGGLPAKTDLV-GQNAPSNEKFEVLEERLRAIEGTYVFGNIDASQLCLVSGLVIPP 65
Query: 680 KFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTH 739
KFKVPEFEKYDG+SCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSL+GPASRWYMQLDS++
Sbjct: 66 KFKVPEFEKYDGSSCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQLDSSN 125
Query: 740 ICSWKNLADSFLKQYKHNIDMAPDRLDLQRMERR-------------------------- 799
+ SWKNLADSFLKQYKHNIDMAPDRLDLQRME++
Sbjct: 126 VGSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLTDK 185
Query: 800 ------------------------------------------------------AQKALK 859
A+KA
Sbjct: 186 ELSAMFINTLKHPFYDRMIGNASTNFSDIMTIGERIEYGVRHGRITSTVDEPLAAKKASH 245
Query: 860 STPKGGGILLL------------------SPIFIP------------------------- 919
S K G + ++ +P + P
Sbjct: 246 SKKKEGEVQMVGADRHSWKQQPYSRTPRYTPYYYPTPYGYNQPFVNNATSHYSPYTFQNF 305
Query: 920 -------LRPKSSSYSRLPPRNQHNTPYVQGHQNNKGVRRQTHFDPIPMTYTELLPQLFQ 979
+P +S + P QHNT Y Q Q N+G R+QT FDPIPMTYTELLPQLFQ
Sbjct: 306 RPPASQNFQPTPASQNFQPRGQQHNTLYTQEQQTNRGARKQTQFDPIPMTYTELLPQLFQ 365
Query: 980 NNQLAPVPIDPVKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKK 1039
NNQLAPVP+DP++PPYP+WYD NARCDYHAGAIGHSTENCTALK+RVQALIKAGWLNFKK
Sbjct: 366 NNQLAPVPVDPIQPPYPRWYDTNARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKK 425
Query: 1040 ENGPDVNNNPLPNHQNAQVNAIEVHGADLRKNAESIVTPMGELFEILLNNGYIGVERLQL 1099
ENGPDV+ NPLPNHQN Q+NAIE + + I TPM ELFEILL +GY+ VE L
Sbjct: 426 ENGPDVSKNPLPNHQNVQINAIECQEIESKSKVADIRTPMVELFEILLGSGYVSVEYLCP 485
Query: 1100 DLGVRAYDDSLMCSYHTGAKGHSIDQCPHFRLKVQELLDSHFLTVS----QKMVQLPQ-- 1154
+L + YD+SL C +H GAKGHS++QC FR+KVQELLDS LTV+ +K + + +
Sbjct: 486 NLKYKGYDESLTCPFHAGAKGHSLEQCNSFRMKVQELLDSKILTVANSHQKKGINIVEDV 545
BLAST of Lag0015740 vs. ExPASy TrEMBL
Match:
A0A6J1D099 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1)
HSP 1 Score: 593.6 bits (1529), Expect = 1.8e-165
Identity = 351/756 (46.43%), Positives = 420/756 (55.56%), Query Frame = 0
Query: 620 TVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDVILPP 679
TV NL D + K G+ S+EK EVL+ERLRA+E T VFGNIDA++LC V +++PP
Sbjct: 46 TVLNLGD-LLAKTDPVGQNAPSNEKFEVLKERLRAIERTDVFGNIDASQLCSVSGLVIPP 105
Query: 680 KFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTH 739
K KVPEFEKY+G+SCPKNHL MYCRKMAAYVQNDKLLIHCFQDSL+GPASRWYMQLDS+H
Sbjct: 106 KLKVPEFEKYNGSSCPKNHLXMYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQLDSSH 165
Query: 740 ICSWKNLADSFLKQYKHNIDMAPDRLDLQRMERR-------------------------- 799
+ SWKNLADSFLKQYKHNIDMAPDRLDLQRME++
Sbjct: 166 VGSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTKSFKEYAQRWRDTAAQVQPPLIDK 225
Query: 800 ------------------------------------------------------AQKALK 859
A+KA
Sbjct: 226 ELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGERIEYGVRHGRITSTTDEPLAAKKASH 285
Query: 860 STPKGGGILLL------------------SPIFIP------------------------L 919
S K G + ++ SP + P
Sbjct: 286 SKKKEGEVQMVGADRHSWKQQPYRRTPQYSPYYYPTPYGYNQPFVNNATSHYYPYASQNF 345
Query: 920 RPKSSSYSRLPPRN--------QHNTPYVQGHQNNKGVRRQTHFDPIPMTYTELLPQLFQ 979
RP +S +L P + QHNT Y QG QNN+G R+QT FDPIPMTYTELLPQLFQ
Sbjct: 346 RPPASQNFQLTPTSQNFQPRGQQHNTFYTQGQQNNRGARKQTQFDPIPMTYTELLPQLFQ 405
Query: 980 NNQLAPVPIDPVKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKK 1039
NNQLAPVP+DP++PPYP+WYD NARCDYHAGAI HSTENCT LK+RVQALIKAGW NFKK
Sbjct: 406 NNQLAPVPVDPIQPPYPRWYDANARCDYHAGAIXHSTENCTXLKYRVQALIKAGWXNFKK 465
Query: 1040 ENGPDVNNNPLPNHQNAQVNAIEVHGADLRKNAESIVTPMGELFEILLNNGYIGVERLQL 1099
ENG DV+ L NHQN Q+NAIE G + + I TPM ELFEILL +GYI VE L
Sbjct: 466 ENGXDVSKXXLXNHQNVQINAIECQGIESKSKVADITTPMXELFEILLGSGYISVE--YL 525
Query: 1100 DLGVRAYDDSLMCSYHTGAKGHSIDQCPHFRLKVQELLDSHFLTVS----QKMVQLPQ-- 1154
+ YD+SL C +H GAKGHS++QC FR+KVQELLDS LT + +K + +
Sbjct: 526 CPKYKGYDESLTCXFHXGAKGHSLEQCNXFRMKVQELLDSKILTXANSHXKKXTNVVEDI 585
BLAST of Lag0015740 vs. ExPASy TrEMBL
Match:
A0A061DPM2 (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_003960 PE=4 SV=1)
HSP 1 Score: 592.0 bits (1525), Expect = 5.2e-165
Identity = 438/1348 (32.49%), Positives = 644/1348 (47.77%), Query Frame = 0
Query: 23 WEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEY 82
W+K +A F KYGH+A+L+ VQ++ +LKA+++ WDP+YRCF F +DM PTIEEY
Sbjct: 59 WDKWGATTQANFDRKYGHIARLLKVQIDEHLLKAIVQFWDPSYRCFVFNKVDMVPTIEEY 118
Query: 83 QSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTL 142
+LL + + Y Q+ +R L+ ++G I +++I+ ++ KG+N C+P ++ +
Sbjct: 119 SALLQIDLDNPDKIYWRGQKTGHRRKLAKMMG-ITSAEIDHNLRKKGDNECIPWSFLRSY 178
Query: 143 QQKFANEDKELTLLALCIFNVVLFPK---------------------------------- 202
K + ++ ++AL I+ +V+FPK
Sbjct: 179 IMKHRDTEQAQLVMALGIYGLVIFPKILGHIEVGIIDFFEQVVNKANPSPSILAKTLRSL 238
Query: 203 ----NKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSS 262
KG GRF+GCA LL IW++SH F+C KF ++ PI EF +S W
Sbjct: 239 NYCRRKGEGRFVGCAQLLSIWIVSH------FECKIDKFRKPFHLQTAPIREFCESEWPE 298
Query: 263 SSPERSAWEAFFFELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIAYAPLLVVR 322
+ + W + F +L +V WRAPWM P++YKC +PL+GPWG I+YAP++V R
Sbjct: 299 NR-TKEQWISRFRKLMSVEVTWRAPWMPHHPVLYKCENEPWVPLMGPWGAISYAPIMVRR 358
Query: 323 QIWVRQFIPATHELKDFEFAY-DKGFCKDRIQKIVKAWKMITKIQS-------------- 382
Q QF+P TH L EFAY + GF K RI++I +AWK +++
Sbjct: 359 QFGSEQFVPMTHRLNTLEFAYGEPGFLK-RIEEIAQAWKKTSRVDQGRYTDEVTTGYQMW 418
Query: 383 -DQQTEQAAHEKECDELR------KANSSLVQENERLQLEVQQG--LLRNVELEKELNRL 442
DQ+ + + KE D +R ++ L E R + E + R +L+KE ++
Sbjct: 419 HDQRVKDVVYPKE-DAIRGPVDPEPRDALLESELARKKSEAENASWKQRYEDLQKECEKM 478
Query: 443 KGSVSKQ----EQLEKEISALDTEARDLNRRMHRLRRDKEVSQATLKSRND----QVLKQ 502
K VS+Q +++E + +L+ + + R + KE L++ ND QV Q
Sbjct: 479 KREVSQQRKKVQKMEGKYESLNDKFSATTSELQREIQVKENRGNELQTHNDGLRRQVRFQ 538
Query: 503 QSEIASLHELMKELEDCNSLRNQTITEDATDRLMKDYTYLKEQYNRLSDDFGFARQNHAT 562
Q I L + +ELE + Q +Y LK+Q R+ RQ +
Sbjct: 539 QESIQILRQEYEELEGVMTTYQQ------------EYESLKQQSTRIQKWGESYRQAYTE 598
Query: 563 LRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNNHKIARSPRIRRTYVTRY 622
+ ++++ Q+R V +A +A + L I P K ++ +
Sbjct: 599 KYDQMDYLVWQMREVAYKARSMAWETDILRSQIFPV---GKQEQQLIKH--------LDE 658
Query: 623 KTRIMEEQSTE----MEKTKKDIEELREKMDAILVALERGKII---------PDIAQSSN 682
+ RIMEE+ E ME+ ++++ E KM ++++L +GK P S N
Sbjct: 659 RARIMEEEQGERMDRMERAQEEMREQLAKMMELMMSLSKGKRAIEEPAPSENPPAQDSGN 718
Query: 683 TMID--------PPIRQSTEEVPPKV-------------------------------TIT 742
D PP Q+ + V P+V I
Sbjct: 719 QRDDPSYPPGFTPPHAQTFQRVHPQVMPSIYYNAPPPLGHQPTHGQFGPYPGINPAEPIN 778
Query: 743 VPNLDDPEIRKEL-----TGGEKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDV 802
VP+LDDP+ +++L GE +K ++LEERLRA+EG FG +DAT+LCLVPDV
Sbjct: 779 VPDLDDPKEQEKLRKDSSQTGENEKDQKKYDLLEERLRAIEGVDRFGTMDATELCLVPDV 838
Query: 803 ILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQL 862
++P KFKVP+FEKYDG CP H+ MYCRKMAA +DKLLIH FQDSLTG A+R +
Sbjct: 839 LIPAKFKVPKFEKYDGTKCPMAHITMYCRKMAAQSHDDKLLIHFFQDSLTGSAARCSKKG 898
Query: 863 DSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMERRAQKALKSTPKGGGILLLSPIFI 922
+ A + Q HN + Q P G I ++
Sbjct: 899 STPKKKEGDVQAVAHDSQQAHNFNPYYPYPPYQPF----------YPHIGSITQNPYVYQ 958
Query: 923 PLRPKSSSYSRLP------PRNQHNTPYVQGHQNNKGVRRQTHFDPIPMTYTELLPQLFQ 982
P+ + + LP P N P G + K + FDPIP+ YT LLPQL +
Sbjct: 959 PIPQPTFQTNVLPQTPPPRPVASTNNP-GNGQRGPKTTLERPKFDPIPVPYTTLLPQLIE 1018
Query: 983 NNQLAPVPIDPVKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKK 1042
N LA P++P++P +PKWYDPNA CDYH G GHSTENCTALKH+VQ LIKAG LNF K
Sbjct: 1019 NRLLARTPLEPLRPSFPKWYDPNAHCDYHFGIQGHSTENCTALKHKVQVLIKAGLLNFTK 1078
Query: 1043 ENGPDVNNNPLPNHQNAQVNAI-EVHGADLRKNAESIVTPMGELFEILLNNGYIGVERLQ 1102
++ V+ NPLPNH VNAI E ++K + I TPM ++FE L + E
Sbjct: 1079 KDSSGVDGNPLPNHGRPTVNAIHEGMIRMVKKGIDEIQTPMDKVFEALSKINAVTPE--P 1138
Query: 1103 LDLGVRAYDDSLMCSYHTGAKGHSIDQCPHFRLKVQELLDSHFL---------------- 1154
+D +D + C +H GA GHSI FR K+QEL+DS +
Sbjct: 1139 IDTKELGHDLTYSCKFHMGAIGHSIQNYDGFRRKLQELMDSSVIEFYEGAEENLVGTING 1198
BLAST of Lag0015740 vs. ExPASy TrEMBL
Match:
A0A061EXR3 (G-patch domain-containing protein OS=Theobroma cacao OX=3641 GN=TCM_024883 PE=4 SV=1)
HSP 1 Score: 588.6 bits (1516), Expect = 5.8e-164
Identity = 452/1457 (31.02%), Positives = 663/1457 (45.50%), Query Frame = 0
Query: 23 WEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEY 82
W+K +A F KYGH+A+L+ VQ++ +LKA+++ WDP+YRCF F IDM
Sbjct: 59 WDKWGAITRANFDRKYGHIARLLKVQIDEHLLKAIVQFWDPSYRCFVFNKIDM------- 118
Query: 83 QSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYILTL 142
+ +R L+ ++G I ++++++ ++ KG+N C+P ++ +
Sbjct: 119 -------------------KTGHRRKLAKMMG-ITSAEVDQNLRKKGDNECIPWSFLRSY 178
Query: 143 QQKFANEDKELTLLALCIFNVVLFPK---------------------------------- 202
K + ++ ++AL I+ +V+FPK
Sbjct: 179 IMKQRDTEQGQLVMALGIYGLVIFPKVLGHIEVRIIDFFEQVVNKANPSPSILAETLRSL 238
Query: 203 ----NKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSS 262
KG GRF+GCA LL IW++SH F+C KF ++ PI EF +S W
Sbjct: 239 NYCRRKGEGRFVGCAQLLSIWIVSH------FECKVDKFRKPFHPQTAPIREFCESEWPE 298
Query: 263 SSPERSAWEAFFFELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIAYAPLLVVR 322
+ + W + EL +V WRAPWM P++YKCG + L+GPWG I+YAP++V R
Sbjct: 299 NR-TKEQWISRLRELMSVEVTWRAPWMPHHPVLYKCGNEPWVQLMGPWGAISYAPIMVRR 358
Query: 323 QIWVRQFIPATHELKDFEFAYDK-GFCKDRIQKIVKAWKMITKIQS-------------- 382
Q QF+P TH L EFAY++ GF K RI++I +AWK +++
Sbjct: 359 QFGSEQFVPMTHRLNTLEFAYEEPGFLK-RIEEIAQAWKKTSRVDQGRYTDEVTIGYQIW 418
Query: 383 -DQQTEQAAHEKECDELR------KANSSLVQENERLQLEVQQG--LLRNVELEKELNRL 442
DQ+ + + KE D LR ++ L E R + E + R +L+KE ++
Sbjct: 419 HDQRVKDVVYPKE-DVLRGPVDPEPRDALLESELARKKSEAENASWKQRYEDLQKECEKM 478
Query: 443 KGSVSKQ----EQLEKEISALDTEARDLNRRMHRLRRDKEVSQATLKSRND----QVLKQ 502
K VS+Q ++E + +L+ + + R + +E L++ ND QV Q
Sbjct: 479 KREVSEQRKKVRKMEGKYESLNDKFSTTTSELQREIQVRENRGNELQTHNDGLRRQVRFQ 538
Query: 503 QSEIASLHELMKELEDCNSLRNQTITEDATDRLMKDYTYLKEQYNRLSDDFGFARQNHAT 562
Q I L + +ELE + Q +Y LK+Q R+ + RQ +
Sbjct: 539 QESIQLLRQEYEELEGVMTTYQQ------------EYERLKQQSTRIQEWGESYRQAYTE 598
Query: 563 LRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQPNSKNNHKIARSPRIRRTYVTRY 622
+ ++++ Q+R V +A +A L I P K ++ + Y+
Sbjct: 599 KYDQMDYLVWQMREVAYKARSMAWKTDILRSQIFPV---GKQEQQLIK-------YLDE- 658
Query: 623 KTRIMEEQSTE----MEKTKKDIEELREKMDAILVALERGKII---------PDIAQSSN 682
+ RIMEE+ E ME+ ++++ E KM ++++L +GK P S N
Sbjct: 659 RARIMEEEQRERMDRMERAQEEMREQLAKMMKLMMSLSKGKRAIEEPAPSENPPAQDSGN 718
Query: 683 TMIDPPI--------RQSTEEVPPKV-------------------------------TIT 742
DPP Q+++ V P+V I
Sbjct: 719 QREDPPYPPGFTPPHAQTSQRVHPQVMPSVYYNAPPPMGHQPTHGQFGPYLGVNPIEPIH 778
Query: 743 VPNLDDPEIRKELTGG-----EKVSSSEKLEVLEERLRAVEGTYVFGNIDATKLCLVPDV 802
VP+LDDP+ +++L E +K ++LEERLRA+EG FG +DAT+LCLVPDV
Sbjct: 779 VPDLDDPKEQEKLRKDSSQTRENEKDQKKYDLLEERLRAIEGVDRFGTMDATELCLVPDV 838
Query: 803 ILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQL 862
++P KFKVPEFEKYD CP H+ M CRKMAA +DKLLIH FQDSLTG A+RWY+QL
Sbjct: 839 LIPAKFKVPEFEKYDETKCPMAHITMNCRKMAAQSHDDKLLIHLFQDSLTGSAARWYVQL 898
Query: 863 DSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMERRAQKALKS--------------- 922
D I +WK+LA +F+ QYKH ++APDRL LQ ME++ + K
Sbjct: 899 DRNRIKTWKDLARAFIAQYKHVAELAPDRLSLQTMEKKQSENFKEYAQRWRDTAAQVQPP 958
Query: 923 -TPKGGGILLLSPIFIPLRPK---------------------------------SSSYSR 982
T K +L ++ + P + +SS
Sbjct: 959 LTDKEMTVLFINTLRAPFYERLIGNATKNFTDLVLSGEIIEGAIKSGKIEGHEVASSKKG 1018
Query: 983 LPPRNQ--------------HN----------------------TPYV------------ 1042
PR + HN PYV
Sbjct: 1019 STPRKKEGDVQAVAHDSQQAHNFNLYYPYPPYQPFYPHIGNITQNPYVYQPIPQPTFQTN 1078
Query: 1043 ------------------QGHQNNKGVRRQTHFDPIPMTYTELLPQLFQNNQLAPVPIDP 1102
G + K + FD IP+ YT LLPQL + L P++P
Sbjct: 1079 VLPQTPPPRPIASTNNPGHGQRGPKTTPERPKFDHIPVPYTTLLPQLIEKRLLTQTPLEP 1138
Query: 1103 VKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPL 1154
++PP+PKWYDPNA CDYH G GHSTENCTALKH+VQALIKAG LNF K++ V+ NPL
Sbjct: 1139 LRPPFPKWYDPNAHCDYHFGIQGHSTENCTALKHKVQALIKAGLLNFTKKDSSSVDGNPL 1198
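The Identity and Positives percentages in the HSP headers above are the matched counts divided by the alignment length (for example, 359/756 gives 47.49%). The following is a minimal Python sketch that reads one such statistics line and recomputes both values. The names `HSP_STATS`, `parse_hsp_stats`, and `percent` are hypothetical helpers introduced here for illustration; they are not part of any BLAST tool.

```python
import re

# Matches the statistics line printed under each HSP header. The pattern
# accepts both "Positives" and the "Postives" spelling seen in some reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages from one statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, _pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


line = "Identity = 359/756 (47.49%), Positives = 431/756 (57.01%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(percent(stats["identical"], stats["length"]))  # 47.49
print(percent(stats["positive"], stats["length"]))   # 57.01
```

Comparing the recomputed values with `identity_pct` and `positives_pct` is a quick way to check that a copied report has not been altered or truncated.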
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAA0036933.1 | 2.5e-198 | 34.87 | uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]
XP_022158986.1 | 8.8e-175 | 47.49 | LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]
XP_022147189.1 | 3.7e-165 | 46.43 | LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]
EOX94372.1 | 1.1e-164 | 32.49 | Uncharacterized protein TCM_003960 [Theobroma cacao]
EOY09468.1 | 1.2e-163 | 31.02 | Uncharacterized protein TCM_024883 [Theobroma cacao]
Match Name | E-value | Identity | Description
A0A5A7T1W2 | 1.2e-198 | 34.87 | Retrotrans_gag domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 ...
A0A6J1E2J7 | 4.3e-175 | 47.49 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1
A0A6J1D099 | 1.8e-165 | 46.43 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1
A0A061DPM2 | 5.2e-165 | 32.49 | Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_003960 PE=4 SV=1
A0A061EXR3 | 5.8e-164 | 31.02 | G-patch domain-containing protein OS=Theobroma cacao OX=3641 GN=TCM_024883 PE=4 ...
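The summary rows above rank hits by E-value, but the hit with the lowest E-value is not the one with the highest identity. A minimal Python sketch, assuming the rows are available as pipe-delimited text exactly as listed, shows both orderings. `parse_hit_rows` and `TABLE` are hypothetical names used only for this illustration.

```python
def parse_hit_rows(text):
    """Parse 'name | e-value | identity | description' rows into dicts."""
    hits = []
    for raw in text.strip().splitlines():
        fields = [field.strip() for field in raw.split("|")]
        # Skip header rows and anything without the four expected columns.
        if len(fields) < 4 or fields[0] == "Match Name":
            continue
        name, evalue, identity, description = fields[:4]
        hits.append({
            "name": name,
            "evalue": float(evalue),      # e.g. "1.2e-198" parses as a float
            "identity": float(identity),  # percent identity
            "description": description,
        })
    return hits


TABLE = """
Match Name | E-value | Identity | Description
A0A5A7T1W2 | 1.2e-198 | 34.87 | Retrotrans_gag domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 ...
A0A6J1E2J7 | 4.3e-175 | 47.49 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1
A0A6J1D099 | 1.8e-165 | 46.43 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1
A0A061DPM2 | 5.2e-165 | 32.49 | Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_003960 PE=4 SV=1
A0A061EXR3 | 5.8e-164 | 31.02 | G-patch domain-containing protein OS=Theobroma cacao OX=3641 GN=TCM_024883 PE=4 ...
"""

hits = parse_hit_rows(TABLE)
by_evalue = sorted(hits, key=lambda hit: hit["evalue"])
by_identity = sorted(hits, key=lambda hit: hit["identity"], reverse=True)

print(by_evalue[0]["name"])    # A0A5A7T1W2
print(by_identity[0]["name"])  # A0A6J1E2J7
```

Only the first four columns are read, so any trailing link column left over from the web page is ignored.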