Lag0026447 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0026447
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Reverse transcriptase
Location: chr10: 36977442 .. 36981567 (+)
RNA-Seq Expression: Lag0026447
Synteny: Lag0026447
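
The Location field can be turned into a genomic span length with a short script. The following is only a sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a string like 'chr10: 36977442 .. 36981567 (+)' into its parts.

    Hypothetical helper for illustration; not part of any database API.
    """
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("chr10: 36977442 .. 36981567 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)  # prints: chr10 + 4126
```

Under that assumption the gene spans 4126 bp on the plus strand of chr10.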
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGACCAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAATCAGATGGAGAACGTGCCGCAACTCCGCAGGTTCCTGAAGGCCAGTAGCAGTGAACCCCCAGCAGAATCCGTTGCTGAGCAAAACCCACTTTTGAGCAAAATGAGCAGCGAAATAATCAGGCTGAGAATCCTATCTTGATAGCAAACGATAGGACACAGCCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAGGAGTGCCGAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAATCATGGTTAAACTCTTTTGCTCCAGGATCAATTAGGACATGGGATGAGTTAGCTGAAATTTTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTGAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTGCAGCAGTAGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGATCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTTCAAGCGCAACAAAAGGTGAACCAGTCGGGATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGCCGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTCGAGACAATGATGAAAGAATTTATGGCTCGCACATACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCGAGGCCTCAAGGGAAACTTCCTTCGGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCATTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAGTGATAATATTGTTGTTATTGAAAAAGAGTTGGAGTCTGGTCAGAGTGCTGGAGGCAGCAAAGAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATATGTGCCACCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTTACTAAAAAGAAAAGGTTAGGTGAGTTCGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTCTATTGGTGGAAAGGAATTAGGTAGAGCACTCTGTGAT
TTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACGCTCCAATTAGCTGATAGGTCTATCACATATCTAGAGGGTAAGATTGAGGATGTCTTAGTAAAAGTGGATAAATTCATATTTCCTGTTGATTTCATTATTTTAGATTATGAGGCTGATAGAGATGTCCCAATTATTCTAGGACGTCCATTTTTTGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTTTGTAATGAGGAAGTAAAATTTAATGTGTTTAAGGCCATGAAGTATCCAGACGAGATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACAGCAATTCAGGATTCGACTAATAAGCATTTGGAAAAGCATGGAGAGGTTAGCGTAGAGGATTTAGAAGTTTGTTCTTTAAAAGAAAAAATGAAAAAGAAGTGTCTAGGTGTGAGCATGTTTTTGAGTCTTTAGATTTGGATGAAAGGAAGGCTCCTCCAATTAAGTCATTCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGAGACGGTTCCCATTATTGTTGCATCAGATTTAACTATAATCCATATTAAGGCAGTGAAAACACCTTTGTATGATGACTTTTTCGATTACCTTGATTTTGGAAATTTTCCTCCTGGTTTATCAAAAGAACAAATGAAAGAATTTTTCCATGGGGTGAAGTTTTATTTATCGAATGATGCATCTATGGTTAAACAATGTGATGAAAAAGATGGGAGAGTATTCAAGGTGAATGGACAGCGTGTGAAGCATTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGGTTGGTCGATGTTTGAGAGAGCAGGACTTTTGCGGGAGCATTTCACAGAGTAAAATTTTGAGCTCCCGAAGTTTGAATTGATTTTATTTTTCGTTAATTTGATTTCGGTTAGGATAGATGTAATTTTGAATGATGTATTAGATTTTATCTTTTAAGTATTTTTATATTTTTTGTAGGTTAGATTTGATCTATTTTTTTTTATTTGATTTGATTTTCGGGTTCTTTATTTAAATTATCTTGCAATTATCTTTTAGTTAGTTAGATTTTATTTTATCTAGATTTTATTTTCGGTTATTTGAATTCTAGTTTTAATAGAGATTTTCTCTAAGTTATTCAGCTTCTTTTGAGAATAACAGGTTAATTTTGCTTTGATTGGTTTAATTTGAAAGGATTTTATTTGTCTATTTTAAAATTCAAATTCCGTATCTAAGATTCTTCAGGTTCGGCAGATTGTTGCGGCAAAGATATTGCTGGAGCAAAATATGCCGAGTTAGAAGGGTTTCCGTTGGAATTTATTATTATTACCGTTGGGTTTGACTTTGCATTCTTGTAGGTAAATGTGCGTGCATCGTCTGATGAGGCCACGTGGCGCAGAGTAATTATCCAACGGCTCCAATGATGACAGAGCATTAATTTTTACCATGGCTGTTTGGAATTTGCAATTTTACCGTTGGTAATTTTTGAATTTCAGGTTTGCATTTTTAATTGCAACGGTAATATTAGTTATGACCGGGCAAATCCTTTCGGTTAGGATTCTTGTTTGCTGCAGCCCTATTTATTCGCAATCTCTGGTAAGTACATTTCACTTCGCTTTCTTGATCAAATTCTTGTGCGTGTGTTATTTTTTCTTTCTTTGCAAAACCCTATTATACCTTCCATGGCTAAGACGAGAGCGCGAAAAGAAAGGGAGAGTGAGGAGGAAGAGATACCGGTCACACCAGAAGTTCAAAAAGGAAAGTCTAAGAAGAAGAGGACGCCAGAGGAAAAGGAAGCAAAGAGAAGAAGAAGGCAACAGAGGGCTACAGAACAGAAGGAAGTTCAGGAGGTGGCAGACGTTGTTGCCACTACTGCGGAG
GAAGGAAGTACTCAAGAACCAGCAATACAAAACCCAGATACGGTTCAAGAACAGATTGCTGAGAAAAATCAAGAAACAGAGAATGAAGGTGAGCATAACAAGGAGAAAACACCGGAGCCGGTGTAG

mRNA sequence

ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGACCAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAATCAGATGGAGAACGTGCCGCAACTCCGCAGGGATTGCACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGAGTGCCGAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAATCATGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTGAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTGCAGCAGTAGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGATCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTTCAAGCGCAACAAAAGGTGAACCAGTCGGGATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGCCGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTCGAGACAATGATGAAAGAATTTATGGCTCGCACATACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCGAGGCCTCAAGGGAAACTTCCTTCGGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCATTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAGTGATAATATTGTTGTTATTGAAAAAGAGTTGGAGTCTGGTCAGAGTGCTGGAGGCAGCAAAGAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATATGTGCCACCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGAATGGGCTACCCCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTCTATTGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACGCTCCAATTAGCTGATAGGTCTATCACATATCTAGAGGGTAAGATTGAGGATGTCTTAGTAAAAGTGGATAAATTCATATTTCCTGTTGATTTCATTATTTTAGATTATGAGGCTGATAGAGATGTCCCAATTATTCTAGGACGTCCATTTTTTGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTTTGTAATGAGGAAGTAAAATTTAATGTGTTTAAGGCCATGAAGTATCCAGACGAGATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACAGCAATTCAGGATTCGACTAATAAGCATTTGGAAAAGCATGGAGAGGCTCCTCCAATTAAGTCATTCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGAGAC
GGTTCCCATTATTGTTGCATCAGATTTAACTATAATCCATATTAAGGCAGTGAAAACACCTTTGTATGATGACTTTTTCGATTACCTTGATTTTGGAAATTTTCCTCCTGGTTTATCAAAAGAACAAATGAAAGAATTTTTCCATGGGGTGAAGTTTTATTTATCGAATGATGCATCTATGGTTAAACAATGTGATGAAAAAGATGGGAGAGTATTCAAGGTGAATGGACAGCGTGTGAAGCATTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGGTTTGCATTTTTAATTGCAACGGTAATATTAGTTATGACCGGGCAAATCCTTTCGGTTAGGATTCTTGTTTGCTGCAGCCCTATTTATTCGCAATCTCTGACGAGAGCGCGAAAAGAAAGGGAGAGTGAGGAGGAAGAGATACCGGTCACACCAGAAGTTCAAAAAGGAAAGTCTAAGAAGAAGAGGACGCCAGAGGAAAAGGAAGCAAAGAGAAGAAGAAGGCAACAGAGGGCTACAGAACAGAAGGAAGTTCAGGAGGTGGCAGACGTTGTTGCCACTACTGCGGAGGAAGGAAGTACTCAAGAACCAGCAATACAAAACCCAGATACGGTTCAAGAACAGATTGCTGAGAAAAATCAAGAAACAGAGAATGAAGGTGAGCATAACAAGGAGAAAACACCGGAGCCGGTGTAG

Coding sequence (CDS)

ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGACCAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAATCAGATGGAGAACGTGCCGCAACTCCGCAGGGATTGCACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGAGTGCCGAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAATCATGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTGAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTGCAGCAGTAGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGATCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTTCAAGCGCAACAAAAGGTGAACCAGTCGGGATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGCCGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTCGAGACAATGATGAAAGAATTTATGGCTCGCACATACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCGAGGCCTCAAGGGAAACTTCCTTCGGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCATTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAGTGATAATATTGTTGTTATTGAAAAAGAGTTGGAGTCTGGTCAGAGTGCTGGAGGCAGCAAAGAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATATGTGCCACCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGAATGGGCTACCCCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTCTATTGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACGCTCCAATTAGCTGATAGGTCTATCACATATCTAGAGGGTAAGATTGAGGATGTCTTAGTAAAAGTGGATAAATTCATATTTCCTGTTGATTTCATTATTTTAGATTATGAGGCTGATAGAGATGTCCCAATTATTCTAGGACGTCCATTTTTTGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTTTGTAATGAGGAAGTAAAATTTAATGTGTTTAAGGCCATGAAGTATCCAGACGAGATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACAGCAATTCAGGATTCGACTAATAAGCATTTGGAAAAGCATGGAGAGGCTCCTCCAATTAAGTCATTCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGAGAC
GGTTCCCATTATTGTTGCATCAGATTTAACTATAATCCATATTAAGGCAGTGAAAACACCTTTGTATGATGACTTTTTCGATTACCTTGATTTTGGAAATTTTCCTCCTGGTTTATCAAAAGAACAAATGAAAGAATTTTTCCATGGGGTGAAGTTTTATTTATCGAATGATGCATCTATGGTTAAACAATGTGATGAAAAAGATGGGAGAGTATTCAAGGTGAATGGACAGCGTGTGAAGCATTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGGTTTGCATTTTTAATTGCAACGGTAATATTAGTTATGACCGGGCAAATCCTTTCGGTTAGGATTCTTGTTTGCTGCAGCCCTATTTATTCGCAATCTCTGACGAGAGCGCGAAAAGAAAGGGAGAGTGAGGAGGAAGAGATACCGGTCACACCAGAAGTTCAAAAAGGAAAGTCTAAGAAGAAGAGGACGCCAGAGGAAAAGGAAGCAAAGAGAAGAAGAAGGCAACAGAGGGCTACAGAACAGAAGGAAGTTCAGGAGGTGGCAGACGTTGTTGCCACTACTGCGGAGGAAGGAAGTACTCAAGAACCAGCAATACAAAACCCAGATACGGTTCAAGAACAGATTGCTGAGAAAAATCAAGAAACAGAGAATGAAGGTGAGCATAACAAGGAGAAAACACCGGAGCCGGTGTAG

Protein sequence

MSDPPGVRFELDQKLKGHLGTEGGSSAEIRWRTCRNSAGIARPQIQAANFEMKPVMFQMLQTVGQFHGVPRDALRLTLFPYSLRDGANHGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDEWATPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSTNKHLEKHGEAPPIKSFLIEAPTLDLKPLPDHLKYVYLGEGETVPIIVASDLTIIHIKAVKTPLYDDFFDYLDFGNFPPGLSKEQMKEFFHGVKFYLSNDASMVKQCDEKDGRVFKVNGQRVKHYWGEEFQSKYPSLRFAFLIATVILVMTGQILSVRILVCCSPIYSQSLTRARKERESEEEEIPVTPEVQKGKSKKKRTPEEKEAKRRRRQQRATEQKEVQEVADVVATTAEEGSTQEPAIQNPDTVQEQIAEKNQETENEGEHNKEKTPEPV
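
The relationship between the coding sequence and the protein sequence above can be checked by translation. The sketch below assumes the standard nuclear genetic code (the page does not name a translation table) and uses a hypothetical `translate` helper; only the first 18 and last 12 bases of the CDS are used, so it illustrates the method rather than re-deriving the full protein.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six codons and last four codons of the CDS listed above.
cds_start = "ATGAGCGATCCGCCTGGG"
cds_end = "GAGCCGGTGTAG"

print(translate(cds_start))  # prints: MSDPPG
print(translate(cds_end))    # prints: EPV*
```

The two fragments agree with the start (MSDPPG) and the end (EPV, followed by the stop codon) of the protein sequence shown above.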
Homology
BLAST of Lag0026447 vs. NCBI nr
Match: XP_022929949.1 (uncharacterized protein LOC111436411 [Cucurbita moschata])

HSP 1 Score: 522.3 bits (1344), Expect = 8.2e-144
Identity = 318/681 (46.70%), Positives = 400/681 (58.74%), Query Frame = 0

Query: 40  IARPQIQAANFEMKPVMFQMLQTVGQFHGVP----------------------------- 99
           I RP+IQ   FE+KPVMFQMLQT+GQFHG+P                             
Sbjct: 89  IIRPEIQGTTFELKPVMFQMLQTIGQFHGLPLEDPHLHLKSFLGVSDSFRFHSDSFRFQG 148

Query: 100 --RDALRLTLFPYSLRDGANH--------------------------------------G 159
             +D +RL+LFPY LRDGA                                         
Sbjct: 149 VDKDMIRLSLFPYLLRDGAKSWLNTLAPGTIDSWNSLAENFLIKYFPPTRNARFKNEIVT 208

Query: 160 FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAK 219
           F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN VT+ +VDASA GA+L+K
Sbjct: 209 FQQFEDETLSEACERFKEMLRKCPHHGLPHCIQMETFYNGLNIVTKQVVDASANGAILSK 268

Query: 220 TFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVIS 279
           T++EAYEILERI+ N+CQW+DVR    +K + VLEVD +S+I A LA + N L+N+ +  
Sbjct: 269 TYNEAYEILERIASNNCQWADVRSNPGRKTRGVLEVDALSSINAQLASVTNILQNLALGQ 328

Query: 280 HQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSN 339
                A +  AA +NQ   E+CVYCGE+H ++ CPSNPAS+F+VGNQ      +NNP+SN
Sbjct: 329 DSMIKAPVHTAAAINQTAAESCVYCGEEHTFDQCPSNPASIFYVGNQASQGNLKNNPFSN 388

Query: 340 FYNPGWRNHPNFSWGGQG-NNVQAQQKV---------NQSGFAKAQVMPQQNKPALPQQN 399
            YNPGWRNHPNFSW GQ   N Q   K          NQ  ++  QV  Q       Q  
Sbjct: 389 TYNPGWRNHPNFSWKGQSLYNQQMPPKANYPSGFRLQNQLAYSSQQVNTQGKGTTQAQYT 448

Query: 400 SGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREG 459
           S  S+E+++KE+MA+  A IQS QAS+R LE+Q+G   N  +     +  +DT+  +R  
Sbjct: 449 SETSIESLIKEYMAKNDAVIQSQQASLRNLEVQIGGEKNAEQGDSHSQETADTQ--QRNE 508

Query: 460 KEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASG----- 519
           +  V+    +    +EE  K Q   S+              Q     KE A         
Sbjct: 509 EAAVQKEHSKDYAEVEEQPKMQTTASSEQESRTYTPSPPFPQRIKRKKEEAHFEKFMDIL 568

Query: 520 -----SVPDVE-----PPYVP--PPPYVPPLPFPQRQKPKNQDEWAT-------PKAKDP 579
                ++P VE     P YV       +    F + +     +E +         K KDP
Sbjct: 569 KEIHINIPLVEALKQMPNYVKFLKDVLINRRKFEEFKVVSLNEECSAILKNKIPLKEKDP 628

Query: 580 GSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEG 610
           GSFTIPVSIGGKELGRALCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITY EG
Sbjct: 629 GSFTIPVSIGGKELGRALCDLGANINLMPLSIYKKLGIGEARPTTVTLQLADRSITYPEG 688
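
The percentages reported in each HSP summary line are simply the listed fractions over the alignment length. A minimal sketch of that arithmetic follows; the function name `hsp_percentages` is hypothetical and the input line is copied from the first hit above.

```python
import re

def hsp_percentages(summary_line):
    """Recompute percentages from 'Name = matches/length' pairs in an HSP line."""
    result = {}
    for name, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        result[name] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 318/681 (46.70%), Positives = 400/681 (58.74%), Query Frame = 0"
stats = hsp_percentages(line)
print(stats)  # prints: {'Identity': 46.7, 'Positives': 58.74}
```

Both values match the percentages printed in the summary line, so identity and positives are each expressed relative to the 681 aligned columns (gaps included).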

BLAST of Lag0026447 vs. NCBI nr
Match: XP_030505184.1 (uncharacterized protein LOC115720166 [Cannabis sativa])

HSP 1 Score: 484.2 bits (1245), Expect = 2.5e-132
Identity = 302/709 (42.60%), Positives = 394/709 (55.57%), Query Frame = 0

Query: 39  GIARPQIQAANFEMKPVMFQMLQTVGQF------------------------HGVPRDAL 98
           GI RP+IQA  FE+KPVMFQMLQTVGQF                         GV  +  
Sbjct: 42  GIVRPEIQAPQFELKPVMFQMLQTVGQFSEMPTEDPHLHLRSFLEMSDSFKIQGVSEEVR 101

Query: 99  RLTLFPYSLRDGANH--------------------------------------GFRQLED 158
           RL LFP+SLRD A                                         F QLED
Sbjct: 102 RLKLFPFSLRDRARSWLNTLSPDSVTNWNDFAEKFLRKYFPPTRNAKFRSEIMSFHQLED 161

Query: 159 ETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAY 218
           E+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  +Q ++DASA GA+L+K+++EA+
Sbjct: 162 ESASDAWERFKELLRKCPHHGIPHCIQMETFYNGLNATSQMVLDASANGAILSKSYNEAF 221

Query: 219 EILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA 278
           EILE I+ N+ QWS+ R   ++KV  VLEVD ++ +   +A + N LKN+++ + +    
Sbjct: 222 EILETIASNNYQWSNTRAPGSRKVAGVLEVDAITALTTQMASMTNVLKNLSIGNSKN--- 281

Query: 279 MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ----RNNPYSNFYNPGWRNH 338
           ++PAA + Q  D +CV+C E H +E CPSNP SV ++GNQ     N  +SN YN  W+NH
Sbjct: 282 IQPAAAI-QSDDVSCVFCREGHAFEKCPSNPESVCYMGNQNFNRNNGAFSNSYNQAWKNH 341

Query: 339 PNFSWGGQG--NNVQAQQKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTY 398
           PN SWG +    +    ++    GF++     Q   P   Q +  +SLE++M+++MA+  
Sbjct: 342 PNLSWGSRSKLKHFDQGRQAYPPGFSQ-----QLRHPQHAQNSQPSSLESLMRDYMAKND 401

Query: 399 AAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEE 458
           A IQS  A +R LELQ+G LANELKARPQG LPSDTE+PRR+GKEQ K++ LRSGK L+ 
Sbjct: 402 AVIQSQAAFLRNLELQLGHLANELKARPQGSLPSDTENPRRDGKEQCKSIHLRSGKHLKN 461

Query: 459 SRKTQDLNSNSDNIVVIEK-ELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPF 518
           S +    +    +I + EK   ++ Q    ++    A+G   + +          PPLPF
Sbjct: 462 SEEEIKGSGEPTSIQIDEKLSKKTAQEIADTRRVDTATGQQSNSQQSAPVCSSSKPPLPF 521

Query: 519 PQRQKPKNQD-------------------------------------------------- 578
           PQR + + QD                                                  
Sbjct: 522 PQRFQKQQQDGQFKKFLDVLKQLHINIPLVEALEQMPNYVKFLKDILTKKRRLGEFESRL 581

Query: 579 ---------EWATPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEA 619
                        PK KDPGSFTIP+SIGG                      R LGIGEA
Sbjct: 582 TEGQKAMLKNKIPPKLKDPGSFTIPISIGG----------------------RDLGIGEA 641

BLAST of Lag0026447 vs. NCBI nr
Match: XP_024042858.1 (uncharacterized protein LOC112099671 [Citrus clementina])

HSP 1 Score: 469.9 bits (1208), Expect = 4.8e-128
Identity = 303/716 (42.32%), Positives = 397/716 (55.45%), Query Frame = 0

Query: 56  MFQMLQTVGQFHGVPRDALRLTLFPYSLRDGAN--------------------------- 115
           +F  L    +  G  +DALRL LFPYSLRD A                            
Sbjct: 51  LFLELSDTFKTTGATQDALRLRLFPYSLRDRARAWLNSLPYDSITTWNELADKFLMKYFP 110

Query: 116 -----------HGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQG 175
                        F QLEDE+  + WERFKELLR+CPHHG+P CIQ+ETFYNGLN  T+ 
Sbjct: 111 PTKNAKLQNEITSFHQLEDESLYKTWERFKELLRRCPHHGIPCCIQLETFYNGLNPSTRL 170

Query: 176 MVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLA 235
           MVDASA  ALL K++ EAYEILERI+ N+ QW   R T  ++   V  +D ++T+ A + 
Sbjct: 171 MVDASANRALLFKSYTEAYEILERIANNNYQWPSTRQTAARRAARVHNIDAITTLSAQVT 230

Query: 236 MIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN-- 295
            + N +K +T             A V Q+ + +C+YCGE+H ++ CP N ASV +VGN  
Sbjct: 231 SLTNMVKAMT----------SAPAAVKQIAELSCMYCGEEHVFDNCPGNLASVNYVGNFN 290

Query: 296 --QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKVNQSGFAKAQVMPQQNKPALPQQN 355
              +NNPY N YN GW+ HPNFSW  Q  N       N++               L QQN
Sbjct: 291 RQPQNNPYLNTYNSGWKQHPNFSWSNQNQNASTLSGQNRN----------TQPSGLHQQN 350

Query: 356 SG---------NSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPS 415
            G          SLET++KE++A+  A +QS   S+R LE Q+GQLA  + +R QG LPS
Sbjct: 351 QGQKHTRHDLLTSLETLIKEYIAKNEAIVQSQVVSLRNLENQIGQLATTMSSRSQGSLPS 410

Query: 416 DTEHPRREGKEQVKAVTLRSGKP----LEESRKTQDLNSNSDNIVVIEKELESGQSAGGS 475
           +T+ PRREG E  K + LRSGK     ++ ++K  +LNS+       E   +       S
Sbjct: 411 NTKDPRREGNEHCKVINLRSGKNVDILVDVTKKRLELNSSQ------ETPQDQSMLQQPS 470

Query: 476 KENAGASGSVPDVEPPYVP------PPPYVPPLPFPQRQKPKNQDEWATPKAKDPGSFTI 535
            ++ G SG    +     P          V      Q + PK        K KDPGSFTI
Sbjct: 471 HQDTGVSGQAITIMEGNQPINTKEEVATLVENSHMLQSKIPK--------KLKDPGSFTI 530

Query: 536 PVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDV 595
           P SIG +  GRALCDLGA+INLM LSV+++L + E RPTTVTLQLA+RS  Y E KIEDV
Sbjct: 531 PCSIGTRYNGRALCDLGANINLMQLSVFKQLRVEECRPTTVTLQLANRSHAYPEEKIEDV 590

Query: 596 LVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNV 655
           LVKVDKFIFPVDFI+LD+EAD++VPIILGRPF A  + LIDVQK ELTMR+ +++V FNV
Sbjct: 591 LVKVDKFIFPVDFIVLDFEADKEVPIILGRPFLAIEKTLIDVQKRELTMRMNDQQVTFNV 650

Query: 656 FKAMKYPDEMEDCSFIRILE------------NTIVETAI------QDSTNKHLEKHGEA 673
            +AMK  DE +DC+F+ +++            N +++ A       +D     +E  GE 
Sbjct: 651 LEAMKNLDEAQDCNFLSVVDFVVADRVNKCCSNDVIKVATFESFEEEDVVANQIEWMGER 710

BLAST of Lag0026447 vs. NCBI nr
Match: KAG8501049.1 (hypothetical protein CXB51_003148 [Gossypium anomalum])

HSP 1 Score: 468.8 bits (1205), Expect = 1.1e-127
Identity = 292/683 (42.75%), Positives = 388/683 (56.81%), Query Frame = 0

Query: 94  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFD 153
           ++DE+  EAWERFKELLRKCPHHG+PHCIQ+ETFYNGLN  T+ +VDASA GA+L+K+++
Sbjct: 1   MDDESLYEAWERFKELLRKCPHHGIPHCIQLETFYNGLNTQTRMVVDASANGAILSKSYN 60

Query: 154 EAYEILERISINSCQWSDVRGTN-KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQ 213
           EAYEI+ERI+ NS QW   R T+ ++V  + EVD  +++ + ++ I++ LKN+T      
Sbjct: 61  EAYEIIERIASNSYQWPTNRATSGRRVAGIHEVDAFTSLASQVSSISSMLKNLTTNGSNS 120

Query: 214 PPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHP 273
             A  P    NQ  + ACVYCGE H +E CPSNP S++++G   +N   N+  P     P
Sbjct: 121 FTAQPP----NQYENIACVYCGEGHVFEECPSNPESMYYIGAGGSN---NYAQPRPTQLP 180

Query: 274 NFSWGGQGNNVQAQQKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAI 333
            FS        Q Q+ V                    Q  S NSLE ++K +MA+     
Sbjct: 181 TFS-------QQVQRPV--------------------QAESSNSLENLLKAYMAK----- 240

Query: 334 QSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRK 393
             N A++R LE QVGQLA EL+ RPQG LPSDT++ R  GKE  KA+TLRS + +E +  
Sbjct: 241 --NDATLRNLENQVGQLATELRNRPQGALPSDTKNLRNPGKEHCKALTLRSRETVEPN-- 300

Query: 394 TQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQ 453
                       +IE E E  ++    +     +  +P      VP   Y PP P+PQR 
Sbjct: 301 ------------IIEAEKEHVEAQDSEEPTNPLAVELPPKINQLVPTSVYKPPPPYPQRL 360

Query: 454 KPKNQD--------------------------------------------EWAT------ 513
           + + Q+                                            E+ T      
Sbjct: 361 QKQKQEVQFKKFLDVLKQLHINIPLVEALEQMPNYVKFMKDILSKKRKLGEFETVALTKE 420

Query: 514 ----------PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPT 573
                     PK KDPG FTIP +IG    G+ALCDLGASINLMP+S+++KLGIGE RPT
Sbjct: 421 CTTYLQDKLPPKLKDPGCFTIPCNIGVAYCGKALCDLGASINLMPMSIFKKLGIGEFRPT 480

Query: 574 TVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRAL 633
           TVTLQLADRS+ +LEGKI+DVLV+VDKFIFP DF+ILD+EAD++VPIILGRPF ATGR L
Sbjct: 481 TVTLQLADRSLAHLEGKIDDVLVRVDKFIFPADFVILDFEADKEVPIILGRPFLATGRTL 540

Query: 634 IDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVE-------------- 678
           IDVQKGELTM V +++V FNVFK+M++PD ++DCS +  LE+ IVE              
Sbjct: 541 IDVQKGELTMSVQDDQVTFNVFKSMQFPDAIDDCSVVSDLEDLIVEKELNSVEDLLERIL 600

BLAST of Lag0026447 vs. NCBI nr
Match: XP_023521781.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111785639, partial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 453.8 bits (1166), Expect = 3.6e-123
Identity = 273/598 (45.65%), Positives = 360/598 (60.20%), Query Frame = 0

Query: 124 METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSV 183
           METFYNGLN  T+ +VDASA GA+L+KT++EAYEILERI+ N+CQW+DVR    +K + V
Sbjct: 1   METFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWADVRSNPGRKTQGV 60

Query: 184 LEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEF 243
           LEVD +S+I A LA + N L+N+ +       A +  AAV+NQ   E+CV CGE+H ++ 
Sbjct: 61  LEVDALSSINAQLASVTNILQNLALGQDSNIKAPVHTAAVINQTAAESCVCCGEEHTFDQ 120

Query: 244 CPSNPASVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQGN-NVQAQQKV----- 303
           C SN  S+F+VGNQ      +NNP+SN YNPGWRNHPNFSW GQG+ N Q   K      
Sbjct: 121 CSSNLTSIFYVGNQVSQGNPKNNPFSNTYNPGWRNHPNFSWKGQGSYNQQMPPKANYPPG 180

Query: 304 ----NQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQ 363
               NQ  +   Q   Q    +  Q   G SLE+++KE+MA+  A IQS QAS+R LE+Q
Sbjct: 181 FGLQNQLTYGSQQATTQGEGTSQAQHIPGTSLESLIKEYMAKNDAVIQSQQASLRNLEVQ 240

Query: 364 VGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVV 423
           VGQLANEL+ RP            ++    VK +       L   RK ++       +V 
Sbjct: 241 VGQLANELRNRPLA---------LKQMPNYVKFLK----DVLTNRRKFEEF-----KVVP 300

Query: 424 IEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDEWATPKA 483
           + +E            +A     +P                                 K 
Sbjct: 301 LNEEC-----------SAILKNKIP--------------------------------LKE 360

Query: 484 KDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITY 543
           KDPGSFTIP+SIGGK+LGRALCDLG+SINLMPLS+Y+KLGIGEARPTTVTLQLADRS TY
Sbjct: 361 KDPGSFTIPISIGGKKLGRALCDLGSSINLMPLSIYKKLGIGEARPTTVTLQLADRSFTY 420

Query: 544 LEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVC 603
            EGKIED+L++VDKF FP DFIILDYEAD DVPIILGRPF  TGR L+DV KG +T+R+ 
Sbjct: 421 PEGKIEDILIQVDKFTFPADFIILDYEADHDVPIILGRPFLKTGRTLVDVYKGTITLRMG 480

Query: 604 NEEVKFNVFKAMKYPDEMEDCSFIRIL----------------------ENTIVETAIQD 663
           +++V+FN+  +MKYP   ++CS +  L                       + I + A+  
Sbjct: 481 DQKVEFNINDSMKYPAVTKECSAVYELTEQPATEEWDDGESGQEEDSSWNDRIEQLAVLG 537

Query: 664 STNKHLE----KHGEAPPIKSFLIEAPTLDLKPLPDHLKYVYLGEGETVPIIVASDLT 678
             N+  E    +  ++ P++  + EAP LDLKPLP +LKY YLG+ +T+PII+++ L+
Sbjct: 541 EFNRTFESLECEGRKSSPMRPSIEEAPQLDLKPLPPNLKYAYLGDKKTLPIIISATLS 537

BLAST of Lag0026447 vs. ExPASy TrEMBL
Match: A0A6J1EQ90 (uncharacterized protein LOC111436411 OS=Cucurbita moschata OX=3662 GN=LOC111436411 PE=4 SV=1)

HSP 1 Score: 522.3 bits (1344), Expect = 4.0e-144
Identity = 318/681 (46.70%), Positives = 400/681 (58.74%), Query Frame = 0

Query: 40  IARPQIQAANFEMKPVMFQMLQTVGQFHGVP----------------------------- 99
           I RP+IQ   FE+KPVMFQMLQT+GQFHG+P                             
Sbjct: 89  IIRPEIQGTTFELKPVMFQMLQTIGQFHGLPLEDPHLHLKSFLGVSDSFRFHSDSFRFQG 148

Query: 100 --RDALRLTLFPYSLRDGANH--------------------------------------G 159
             +D +RL+LFPY LRDGA                                         
Sbjct: 149 VDKDMIRLSLFPYLLRDGAKSWLNTLAPGTIDSWNSLAENFLIKYFPPTRNARFKNEIVT 208

Query: 160 FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAK 219
           F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN VT+ +VDASA GA+L+K
Sbjct: 209 FQQFEDETLSEACERFKEMLRKCPHHGLPHCIQMETFYNGLNIVTKQVVDASANGAILSK 268

Query: 220 TFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVIS 279
           T++EAYEILERI+ N+CQW+DVR    +K + VLEVD +S+I A LA + N L+N+ +  
Sbjct: 269 TYNEAYEILERIASNNCQWADVRSNPGRKTRGVLEVDALSSINAQLASVTNILQNLALGQ 328

Query: 280 HQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSN 339
                A +  AA +NQ   E+CVYCGE+H ++ CPSNPAS+F+VGNQ      +NNP+SN
Sbjct: 329 DSMIKAPVHTAAAINQTAAESCVYCGEEHTFDQCPSNPASIFYVGNQASQGNLKNNPFSN 388

Query: 340 FYNPGWRNHPNFSWGGQG-NNVQAQQKV---------NQSGFAKAQVMPQQNKPALPQQN 399
            YNPGWRNHPNFSW GQ   N Q   K          NQ  ++  QV  Q       Q  
Sbjct: 389 TYNPGWRNHPNFSWKGQSLYNQQMPPKANYPSGFRLQNQLAYSSQQVNTQGKGTTQAQYT 448

Query: 400 SGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREG 459
           S  S+E+++KE+MA+  A IQS QAS+R LE+Q+G   N  +     +  +DT+  +R  
Sbjct: 449 SETSIESLIKEYMAKNDAVIQSQQASLRNLEVQIGGEKNAEQGDSHSQETADTQ--QRNE 508

Query: 460 KEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASG----- 519
           +  V+    +    +EE  K Q   S+              Q     KE A         
Sbjct: 509 EAAVQKEHSKDYAEVEEQPKMQTTASSEQESRTYTPSPPFPQRIKRKKEEAHFEKFMDIL 568

Query: 520 -----SVPDVE-----PPYVP--PPPYVPPLPFPQRQKPKNQDEWAT-------PKAKDP 579
                ++P VE     P YV       +    F + +     +E +         K KDP
Sbjct: 569 KEIHINIPLVEALKQMPNYVKFLKDVLINRRKFEEFKVVSLNEECSAILKNKIPLKEKDP 628

Query: 580 GSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEG 610
           GSFTIPVSIGGKELGRALCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITY EG
Sbjct: 629 GSFTIPVSIGGKELGRALCDLGANINLMPLSIYKKLGIGEARPTTVTLQLADRSITYPEG 688

BLAST of Lag0026447 vs. ExPASy TrEMBL
Match: A0A6A2WLX1 (Reverse transcriptase OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00116939pilonHSYRG00212 PE=3 SV=1)

HSP 1 Score: 435.3 bits (1118), Expect = 6.4e-118
Identity = 301/828 (36.35%), Positives = 404/828 (48.79%), Query Frame = 0

Query: 39   GIARPQIQAANFEMKPVMFQMLQTVGQFHGVP------------------------RDAL 98
            GI  P+IQAA+FEMKPVMF ML ++GQF G+P                         D L
Sbjct: 428  GIVAPEIQAAHFEMKPVMFNMLNSIGQFGGMPTEDVRQHIRNFLEVCDSFRQEGVHEDFL 487

Query: 99   RLTLFPYSLRDGAN--------------------------------------HGFRQLED 158
            +L LFPYSLRD A                                         FRQ +D
Sbjct: 488  KLKLFPYSLRDRARAWLSGVPVGSMESWVDLCKSFLLRYNPPNMNTQLRNEISSFRQGDD 547

Query: 159  ETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAY 218
            E+  E W+R+K LL+KC +HG     Q+  FYNG+N  T+ ++DASA G LL K+  EA+
Sbjct: 548  ESMYECWDRYKSLLQKCSYHGFHDWTQVVMFYNGVNAPTRMLLDASANGTLLDKSPTEAF 607

Query: 219  EILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA 278
             IL+RI+ N  Q+   R G+ ++     E++   ++ A L++I N LKN+   +  +   
Sbjct: 608  AILDRIANNDYQFPSSRLGSGRRAPGAFELEAKDSVSAQLSVITNMLKNLQCSTDVKEV- 667

Query: 279  MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHP 338
                    + T  AC+ C  +H+   CP+N  S+ FVGN     NNPYSN YN GWR HP
Sbjct: 668  --------KTTSLACLLCQGNHHESECPTNHESINFVGNYNRGSNNPYSNTYNAGWRQHP 727

Query: 339  NFSWGGQG----NNVQAQQKVNQ-SGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMAR 398
            NFSW  QG    N    QQ  N+  G+  A      NK AL    S +SLE  ++EF++ 
Sbjct: 728  NFSWENQGAHNANQPTRQQNHNEPQGYQNAMPWHNANKRAL-SSASISSLEATIQEFIST 787

Query: 399  TY---------------------AAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE 458
            T                      A IQS+ +S+RALE QVGQ+A  L+ R QG+LPSDTE
Sbjct: 788  TKTMLQDHSTSIKNQGALLHSQGALIQSHSSSLRALEGQVGQIATALQERQQGRLPSDTE 847

Query: 459  HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGAS 518
              +  GKE    +TLRSG  +    K +D     D    +             KEN    
Sbjct: 848  VTKGPGKEHCNVLTLRSGTQINRQDKEEDFAKVPDYDAKV-------------KEN---- 907

Query: 519  GSVPDVEPPYVPPPPYV-PPLPFPQRQKPKNQD--------------------------- 578
                     ++P      PP PFPQR K  N +                           
Sbjct: 908  ---------FIPAAKEARPPPPFPQRLKKHNNEVQFKKFVDILDQLHINIPLLEAVEQMP 967

Query: 579  -----------------------------EWATPKAKDPGSFTIPVSIGGKELGRALCDL 638
                                            +PK  DPGSF IP SIG   +G+ALCDL
Sbjct: 968  MYAKFMKDICTKKRKVETVATATEFCSSSSKLSPKRNDPGSFIIPCSIGANFVGKALCDL 1027

Query: 639  GASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIIL 678
            G+S+NL+P S++ KLGIG+ARPT+V LQLAD+S   LEG++EDV+V+VDKF+F VDF+IL
Sbjct: 1028 GSSVNLIPKSIFLKLGIGDARPTSVILQLADKSHVKLEGRVEDVIVRVDKFVFTVDFLIL 1087

BLAST of Lag0026447 vs. ExPASy TrEMBL
Match: A0A6J0ZX64 (LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica OX=108875 GN=LOC110412945 PE=4 SV=1)

HSP 1 Score: 433.0 bits (1112), Expect = 3.2e-117
Identity = 306/838 (36.52%), Positives = 414/838 (49.40%), Query Frame = 0

Query: 40  IARPQIQAANFEMKPVMFQMLQTVGQFHGVPR------------------------DALR 99
           I RP I A NFE+KP   QM+Q+  QF G+P                         DA+R
Sbjct: 74  IRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHLVNFLEICDTFKYNGVTDDAIR 133

Query: 100 LTLFPYSLRDGANH--------------------------------------GFRQLEDE 159
           L LFP+SLRD A                                         F Q + E
Sbjct: 134 LRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFFPPAKTAKMRNDITSFIQFDGE 193

Query: 160 TFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYE 219
           +  EAWERFKELLR+CPHHG+P  +Q++TFYNGL G  + ++DA+AGGAL++K   +AY 
Sbjct: 194 SLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIKTIIDAAAGGALMSKNAVDAYN 253

Query: 220 ILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAME 279
           +LE ++ N+ QW   R  ++K     E+D + T+   +A ++  L  + V          
Sbjct: 254 LLEEMASNNYQWPSERSGSRKAVGAYEIDALGTLTTQVAALSKKLDTLGV---------- 313

Query: 280 PAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---QRNNPYSNFYNPGWRNHPNF 339
             AV N +    C  CG+ H+Y+ CP N  SV FVGN   Q+NNPYSN YNPGWRNHPNF
Sbjct: 314 -HAVQNSLV--VCEMCGDSHSYDQCPYNSESVQFVGNFNRQQNNPYSNTYNPGWRNHPNF 373

Query: 340 SWGGQGNNVQAQQKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQS 399
           SW          + +   GF       QQ +P +P++ S   LE ++ +++++T A IQS
Sbjct: 374 SWSNNA-GPSNPKPIMPPGF------QQQARPQIPEKKS--QLEELLLQYISKTDAIIQS 433

Query: 400 NQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLE--ESR 459
             AS+R LE QVGQLAN +  RPQG LPSDT+ +P+  GKEQ +A+TLRSGK +E    +
Sbjct: 434 QGASLRNLETQVGQLANSINNRPQGSLPSDTQINPK--GKEQCQAITLRSGKEIEGVNQK 493

Query: 460 KTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPL----- 519
             +    + D   + E E+E  Q      EN G S  +        PPPP+   L     
Sbjct: 494 AVESEIEHVDKEGMCENEIEIQQKDDDKAENQGTSQVIH-------PPPPFPQRLQKQKL 553

Query: 520 ------------------PFPQR---------------QKPKNQDEWAT----------- 579
                             PF +                 K +   E+ T           
Sbjct: 554 EKQFQKFLNVFKKLHINIPFAEALEQMPSYVKFLKDILSKKRKLGEFETVFLTEECSAIL 613

Query: 580 -----PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ 639
                PK KDPGSFTIP +IG     +AL DLGASINLMP S++ KLG+GE +PT+VTLQ
Sbjct: 614 QNKLPPKLKDPGSFTIPCTIGNLFFTKALSDLGASINLMPWSIFEKLGLGECKPTSVTLQ 673

Query: 640 LADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQK 699
           LADRS  Y  G IEDVLVKVDKFIFPVDF+ILD E DR +PIILGRPF AT  A+IDV++
Sbjct: 674 LADRSYVYPRGIIEDVLVKVDKFIFPVDFLILDMEEDRQIPIILGRPFLATAGAIIDVRE 733

Query: 700 GELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSTNKHLEKHGE--A 754
           G+++ +V  E V+FN+F A K+P     C  + +++                E  GE  +
Sbjct: 734 GKISFKVGEEVVEFNIFNASKHPSSTNYCDRVELID----------------EGKGELIS 793

BLAST of Lag0026447 vs. ExPASy TrEMBL
Match: A0A6J1DY39 (uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025653 PE=4 SV=1)

HSP 1 Score: 433.0 bits (1112), Expect = 3.2e-117
Identity = 293/755 (38.81%), Positives = 387/755 (51.26%), Query Frame = 0

Query: 48  ANFEMKPVMFQMLQTVGQFHGV----PR--------------------DALRLTLFPYSL 107
           A FE KP+M QML  +GQF G+    PR                    DALRLTLFP+S+
Sbjct: 43  AKFEFKPMMLQMLNNIGQFGGLEHEDPRSHLKSFIKVANTFRLPGISDDALRLTLFPFSV 102

Query: 108 RDGANH--------------------------------------GFRQLEDETFSEAWER 167
              A                                         FRQ E+E  + AWER
Sbjct: 103 SGQATAWLNAFPSDTITTWSDMVDKFLVKYFPPTRNADVREEIISFRQKENEAVNVAWER 162

Query: 168 FKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISIN 227
           FK+L+  CP+ G+P C+Q+E F+ G + +T+ M++ +A G   +K+F+E  EIL+++S +
Sbjct: 163 FKDLIMNCPNIGIPACVQIEHFFRGCDILTKMMLNGAANGKFTSKSFNEIVEILDQLSEH 222

Query: 228 SCQWSDVRGTNKKVKS----VLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPA-- 287
           + QW   +   +  ++    VL +D +++++  +  I   LKN+   +     A      
Sbjct: 223 NYQWCSEKSRTQSKRADPAGVLALDNMTSMQKQIDTITQMLKNMEKNNAXAASAXATTNP 282

Query: 288 AVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSW 347
           + V Q+ +  C YCG+ H  E CPSNP+S+++VG    Q+ NPYSN YNPGW+ HPNFSW
Sbjct: 283 SPVYQIAESTCYYCGDLHPSENCPSNPSSMYYVGQMNQQKFNPYSNTYNPGWKQHPNFSW 342

Query: 348 GGQ------GNNVQAQQKVNQSGFAKAQVMP-------QQNKPALPQQNSGNSLETMMKE 407
            GQ      G+N Q ++     GF  +   P       QQ     P Q + +++E +MKE
Sbjct: 343 SGQGSSNTTGHNQQYKEAYTPPGFPNSPAFPPTPHQYNQQKNYVQPAQQNLSNMEILMKE 402

Query: 408 FMARTYAAIQS---------------------NQASMRALELQVGQLANELKARPQGKLP 467
            + +  A ++                      N  ++R LE+Q+GQL NE++ RPQG LP
Sbjct: 403 LITKNDATMKELMTRTDVTMKDMKDVKDYMGRNDVTVRKLEMQLGQLVNEVRTRPQGSLP 462

Query: 468 SDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKEN 527
           S TE PRR GKE   ++  RSG   E  R                   ES  S    K+ 
Sbjct: 463 SSTEEPRRIGKEHCNSIATRSGLKYEGPRMPD----------------ESSHSPSREKD- 522

Query: 528 AGASGSVPD--VEPPY-VPPPPYV----PPLPFPQRQKPKNQD----------------- 587
              + +VPD  VEP   VP  P V    PP PFPQR   KNQD                 
Sbjct: 523 ---TQAVPDKIVEPAVSVPVAPQVSNSRPPPPFPQRLVRKNQDNNFRKFLDILKQLHINI 582

Query: 588 ---------------------------EWAT----------------PKAKDPGSFTIPV 631
                                      E+ T                PK KDPGSFTIP 
Sbjct: 583 PFVEALEQMPTYAKFIKDIITRKKKLGEYETVALTECSSNVFKSKMPPKLKDPGSFTIPC 642

BLAST of Lag0026447 vs. ExPASy TrEMBL
Match: A0A2G9GK35 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_21798 PE=4 SV=1)

HSP 1 Score: 419.9 bits (1078), Expect = 2.8e-113
Identity = 273/695 (39.28%), Positives = 364/695 (52.37%), Query Frame = 0

Query: 91  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAK 150
           FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D   G + L+ 
Sbjct: 3   FRQGVSETVYEAWSRFRKMLRNCPNHDIPRHIQVHTFYHGLTDGGKDKLDHLNGDSFLSG 62

Query: 151 TFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISH 210
           T  E + +L  +  N  +    R T  K   V+EVD V+ + A +  +  ++KN  V   
Sbjct: 63  TTAECHNLLNNLVANHYEKKSERATPPKAAGVIEVDQVTALNAKIDFLMQSMKNFGVNQV 122

Query: 211 QQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPG 270
           Q  P               C  CGE H  + CP +  S+ FV N R   NNPYSN YNPG
Sbjct: 123 QHTPV-------------TCDECGESHPSDQCPHSVESIQFVSNARKPQNNPYSNTYNPG 182

Query: 271 WRNHPNFSWGGQGNNVQAQ---QKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEF 330
           WR HPNFSW    NN Q Q    +  Q G  + Q   Q+ KP         SLE  + +F
Sbjct: 183 WRQHPNFSW----NNNQGQGSAPRFQQGGQQQVQQPIQEKKP---------SLEETLIQF 242

Query: 331 MARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRS 390
           MA       S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+
Sbjct: 243 MA-------STAANFKTMETQIGQLANAINSRPQGSLPSNTEPNPRQDGKAQCQAVTLRN 302

Query: 391 GKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYV 450
           G+ L+E  K +   S    ++  EKE E                    VE P     P  
Sbjct: 303 GRELQEVVK-EPTKSKEKEVISEEKEKE--------------------VEAPLEVSKPTT 362

Query: 451 PPLPFPQR-QKPKNQDEW------------------------------------------ 510
              PFPQR QK K + ++                                          
Sbjct: 363 LQPPFPQRLQKQKLEKQFLKFLEVFKKLHINIPFAEALEQMPSYVKFMKDILSKKRRLGD 422

Query: 511 -----------------ATPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK 570
                              PK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR 
Sbjct: 423 YETVALTEECSAIIQNKLPPKLKDPGSFTIPCTIGTHFSGRALCDLGASINLMPYSIYRT 482

Query: 571 LGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGR 630
           LG+GEA+PT++TLQLADRS+TY +G IED+LVKVDKFIFP DF++LD E D +VPIILGR
Sbjct: 483 LGLGEAKPTSITLQLADRSLTYPKGVIEDILVKVDKFIFPADFVVLDMEVDIEVPIILGR 542

Query: 631 PFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILEN-----TIVE 683
           PF ATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +N     +I E
Sbjct: 543 PFLATGRTLIDVQKGELTMRVQDQQITFNVFKAMKFPNESDECFAVNLFDNLAGNESIAE 602
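
The Identity and Positives percentages in the HSP headers above appear to be the matched-position counts divided by the alignment length, rounded to two decimals. A minimal sketch under that assumption; the helper name `percent` is hypothetical and not part of any BLAST tool:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the two HSP headers above (A0A6J1DY39 and A0A2G9GK35).
hsp_counts = {
    "A0A6J1DY39": {"identity": (293, 755), "positives": (387, 755)},
    "A0A2G9GK35": {"identity": (273, 695), "positives": (364, 695)},
}

for accession, counts in hsp_counts.items():
    identity = percent(*counts["identity"])
    positives = percent(*counts["positives"])
    print(f"{accession}: identity={identity}%, positives={positives}%")
```

Recomputing the values this way is a quick consistency check between an HSP header and the Identity column of the summary tables.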

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
XP_022929949.1  8.2e-144   46.70     uncharacterized protein LOC111436411 [Cucurbita moschata]
XP_030505184.1  2.5e-132   42.60     uncharacterized protein LOC115720166 [Cannabis sativa]
XP_024042858.1  4.8e-128   42.32     uncharacterized protein LOC112099671 [Citrus clementina]
KAG8501049.1    1.1e-127   42.75     hypothetical protein CXB51_003148 [Gossypium anomalum]
XP_023521781.1  3.6e-123   45.65     LOW QUALITY PROTEIN: uncharacterized protein LOC111785639, partial [Cucurbita pe...
Match Name  E-value    Identity  Description
A0A6J1EQ90  4.0e-144   46.70     uncharacterized protein LOC111436411 OS=Cucurbita moschata OX=3662 GN=LOC1114364...
A0A6A2WLX1  6.4e-118   36.35     Reverse transcriptase OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00116939pilonHS...
A0A6J0ZX64  3.2e-117   36.52     LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica ...
A0A6J1DY39  3.2e-117   38.81     uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A2G9GK35  2.8e-113   39.28     Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_21798 PE=...
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                        Source       Source Term     Source Description   Alignment
IPR005162  Retrotransposon gag domain             PFAM         PF03732         Retrotrans_gag       coord: 90..132; e-value: 7.1E-6; score: 26.2
None       No IPR available                       PFAM         PF13650         Asp_protease_2       coord: 473..565; e-value: 7.9E-6; score: 26.5
None       No IPR available                       MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 432..451
None       No IPR available                       MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 796..899
None       No IPR available                       MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 878..899
None       No IPR available                       MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 353..467
None       No IPR available                       MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 359..395
None       No IPR available                       MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 796..846
None       No IPR available                       MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 856..877
None       No IPR available                       PANTHER      PTHR33067       FAMILY NOT NAMED     coord: 463..610
None       No IPR available                       PANTHER      PTHR33067:SF28  SUBFAMILY NOT NAMED  coord: 463..610
None       No IPR available                       CDD          cd00303         retropepsin_like     coord: 473..565; e-value: 8.88007E-18; score: 76.9916
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases       coord: 452..593; e-value: 1.6E-29; score: 104.4
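
The alignment coordinates listed above can be compared to see which predicted domains nest inside others. A minimal sketch, using a few coordinate pairs copied from this annotation; the function name `contains` and the dictionary layout are illustrative assumptions, not an InterProScan output format:

```python
from typing import Dict, Tuple

Interval = Tuple[int, int]  # (start, end), both inclusive, in protein coordinates


def contains(outer: Interval, inner: Interval) -> bool:
    """Return True if `inner` lies entirely within `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Coordinates copied from the InterPro annotation of Lag0026447.
hits: Dict[str, Interval] = {
    "GENE3D 2.40.70.10": (452, 593),
    "PFAM PF13650": (473, 565),
    "CDD cd00303": (473, 565),
    "PANTHER PTHR33067": (463, 610),
    "PFAM PF03732": (90, 132),
}

reference = "GENE3D 2.40.70.10"
nested = sorted(
    name
    for name, interval in hits.items()
    if name != reference and contains(hits[reference], interval)
)
print(nested)
```

With these coordinates, the PFAM Asp_protease_2 and CDD retropepsin_like hits fall inside the GENE3D acid protease region, while the PANTHER hit extends past its end.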

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Lag0026447.1  Lag0026447.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003824      catalytic activity