PI0023494 (gene) Melon (PI 482460) v1

Overview
NamePI0023494
Typegene
OrganismCucumis metuliferus (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Locationchr06: 17966261 .. 17967622 (+)
RNA-Seq ExpressionPI0023494
SyntenyPI0023494
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAATAACAACAGAAATGCCCCTCCGCCGCAAGCTGACCCAGAACCAAACACCGCCTATATAGCACATGACTTGGATAGACCGATTAGATCTTATGCGGCGCCCAACCTCTATAACTTCAACCCAGGAATCGCCTACCCTGTATTTGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAACGCCGGACAATTCGGCGGACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTAAGATTCGCTCTTTTCCCGTTAACTCTGAGGGATGAGGCGAAGAGGTGGGTGAATGCCCTAGAAGATGGCGAGGTGGGAACATGGGATCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACCTCACGAAAATGCTAGAAGGAGGAAGGAACTTATGAGCTTCCAGCAGAAGGATAAAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTACTTTAGACTAAACAAGGCCACACAGCAGACTGTTGATGCTGTGTTTGTAGACGATATGCTGAAAAGTACATACAACCAGATTAAGACGACGCAGGACACGATGGCCAGCAATAATGAAGAATGGGACGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGATGATGGCATGGATAAGAACGTCGTGGTGGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTTAAATCAATGGCAATATCGCAAGTCAATGCCGCAGGAAGCTATGTGCTCGCGGCTAATCAAATTGATGACATGGGATGTGTGGGATGCGACGGTCATCATAACACTGACGCATGCCCACTCAATACTGAAACCGTCGCGTTCATAAGGAACGACCCCTTCTCCAATACCTACAACCCTGGTTGGAGGAACCATCTCAACTTTGGATGGGGAGGATCGAGTCAACAACAAAGGCGACATGGTGGTCAAAGTGACCATCGCGGGGAAGCACCTGGCTCCCACGCGAGGTACCAAAACAATAGACTCCAACAATCCCATCATCAACAGCATCCCACCACCACCGCCTCGTCCACCTCTCCCATGGAAAACCTCCTCCGCGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAGCTCGCTAGTGATTTCTCCGGAAGACAGCAAGGATCCCTTCCGAGCAATACAGAAACGCCAAATCAGGCGGGAGGATCTGGTAAATAG

mRNA sequence

ATGGAAAATAACAACAGAAATGCCCCTCCGCCGCAAGCTGACCCAGAACCAAACACCGCCTATATAGCACATGACTTGGATAGACCGATTAGATCTTATGCGGCGCCCAACCTCTATAACTTCAACCCAGGAATCGCCTACCCTGTATTTGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAACGCCGGACAATTCGGCGGACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTAAGATTCGCTCTTTTCCCGTTAACTCTGAGGGATGAGGCGAAGAGGTGGGTGAATGCCCTAGAAGATGGCGAGGTGGGAACATGGGATCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACCTCACGAAAATGCTAGAAGGAGGAAGGAACTTATGAGCTTCCAGCAGAAGGATAAAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTACTTTAGACTAAACAAGGCCACACAGCAGACTGTTGATGCTGTGTTTGTAGACGATATGCTGAAAAGTACATACAACCAGATTAAGACGACGCAGGACACGATGGCCAGCAATAATGAAGAATGGGACGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGATGATGGCATGGATAAGAACGTCGTGGTGGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTTAAATCAATGGCAATATCGCAAGTCAATGCCGCAGGAAGCTATGTGCTCGCGGCTAATCAAATTGATGACATGGGATGTGTGGGATGCGACGGTCATCATAACACTGACGCATGCCCACTCAATACTGAAACCGTCGCACTCCAACAATCCCATCATCAACAGCATCCCACCACCACCGCCTCGTCCACCTCTCCCATGGAAAACCTCCTCCGCGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAGCTCGCTAGTGATTTCTCCGGAAGACAGCAAGGATCCCTTCCGAGCAATACAGAAACGCCAAATCAGGCGGGAGGATCTGGTAAATAG

Coding sequence (CDS)

ATGGAAAATAACAACAGAAATGCCCCTCCGCCGCAAGCTGACCCAGAACCAAACACCGCCTATATAGCACATGACTTGGATAGACCGATTAGATCTTATGCGGCGCCCAACCTCTATAACTTCAACCCAGGAATCGCCTACCCTGTATTTGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAACGCCGGACAATTCGGCGGACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTAAGATTCGCTCTTTTCCCGTTAACTCTGAGGGATGAGGCGAAGAGGTGGGTGAATGCCCTAGAAGATGGCGAGGTGGGAACATGGGATCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACCTCACGAAAATGCTAGAAGGAGGAAGGAACTTATGAGCTTCCAGCAGAAGGATAAAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTACTTTAGACTAAACAAGGCCACACAGCAGACTGTTGATGCTGTGTTTGTAGACGATATGCTGAAAAGTACATACAACCAGATTAAGACGACGCAGGACACGATGGCCAGCAATAATGAAGAATGGGACGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGATGATGGCATGGATAAGAACGTCGTGGTGGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTTAAATCAATGGCAATATCGCAAGTCAATGCCGCAGGAAGCTATGTGCTCGCGGCTAATCAAATTGATGACATGGGATGTGTGGGATGCGACGGTCATCATAACACTGACGCATGCCCACTCAATACTGAAACCGTCGCACTCCAACAATCCCATCATCAACAGCATCCCACCACCACCGCCTCGTCCACCTCTCCCATGGAAAACCTCCTCCGCGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAGCTCGCTAGTGATTTCTCCGGAAGACAGCAAGGATCCCTTCCGAGCAATACAGAAACGCCAAATCAGGCGGGAGGATCTGGTAAATAG

Protein sequence

MENNNRNAPPPQADPEPNTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRAKDDGMDKNVVVALQGQMTAMNNLLKSMAISQVNAAGSYVLAANQIDDMGCVGCDGHHNTDACPLNTETVALQQSHHQQHPTTTASSTSPMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTETPNQAGGSGK
Homology
BLAST of PI0023494 vs. ExPASy TrEMBL
Match: A0A6J1G7Q6 (uncharacterized protein LOC111451598 OS=Cucurbita moschata OX=3662 GN=LOC111451598 PE=4 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 1.2e-63
Identity = 152/444 (34.23%), Postives = 227/444 (51.13%), Query Frame = 0

Query: 18  NTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPG 77
           N  ++A D +R IR+YA P +   NP I  P   +   FE+KPVM QM+Q  GQF G   
Sbjct: 10  NAIHVADDRERAIRAYAHPAVEELNPCIIRPEM-QATTFELKPVMFQMLQTIGQFHGLSS 69

Query: 78  EDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIE 137
           +DPH H++SF  +  SF   G+  + +R + F  +LRD AK W+N L  G + +W+ L E
Sbjct: 70  KDPHLHLKSFLGVSDSFRFQGVDKDVIRLSFFSYSLRDGAKSWLNILALGIIDSWNSLAE 129

Query: 138 KFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYF 197
           KF+ K+FPP  +AR R E+++FQ+ + E L +AW RFK  ++ CPH+G+P CI +E FY 
Sbjct: 130 KFLFKYFPPTRSARFRNEIVAFQKFENETLSEAWERFKETLRKCPHHGLPHCIQIETFYN 189

Query: 198 RLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRAKDDGMDK 257
            LN AT+Q VDA    D+L  TYN+     + +ASNN +W +        G+   + ++ 
Sbjct: 190 GLNTATKQVVDASANGDILSKTYNEAYEILERIASNNCQWVD---VRSNPGKKTREVLEV 249

Query: 258 NVVVALQGQMTAMNNLLKSMAISQ---VNAAGSYVLAANQIDDMGCVGCDGHHNTDACPL 317
           + + ++  Q+ +M N+L+++A  Q   + A         Q     CV C   H  D CP 
Sbjct: 250 DALSSINAQLASMTNILQNLAFGQGSMIKAPAHTATVMIQTATESCVYCGEKHTFDQCPS 309

Query: 318 NTETVAL---------------------------------QQSHHQQHP----------- 377
           N  ++                                   Q S++QQ P           
Sbjct: 310 NPASIFYVGNQASQGNPKTNPSSNTYNPGWRNHPNFLCKGQGSYNQQMPPKANYPPGFGL 369

Query: 378 -----------TTTASSTSP--------MENLLREYMQKNDALLQSQASSIRNLEVQLGQ 396
                      TT    TS         +E+L++EYM +NDA++QSQ  S+RNLEVQ+GQ
Sbjct: 370 QNQLTYDSQQATTQGEGTSQAQHISGTLLESLIKEYMARNDAVIQSQQVSLRNLEVQVGQ 429

BLAST of PI0023494 vs. ExPASy TrEMBL
Match: U5CUI2 (Retrotrans_gag domain-containing protein OS=Amborella trichopoda OX=13333 GN=AMTR_s04947p00003620 PE=4 SV=1)

HSP 1 Score: 252.3 bits (643), Expect = 3.4e-63
Identity = 134/304 (44.08%), Postives = 179/304 (58.88%), Query Frame = 0

Query: 18  NTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPG 77
           N   +A D  R IR YAAP     NPGI  P   +  +FE+KPVM QM+Q  GQF G P 
Sbjct: 11  NPIILADDRARAIREYAAPMFNELNPGIVRPEI-QAPQFELKPVMFQMLQTVGQFSGMPT 70

Query: 78  EDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIE 137
           EDPH H+RSF  +  SF + G+S E LR  LFP +LRD A+ W+N L    V  W+ L E
Sbjct: 71  EDPHLHLRSFLEVSDSFKIQGVSEEVLRLKLFPFSLRDRARSWLNTLPPDSVTNWNDLAE 130

Query: 138 KFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYF 197
           KF++K+FPP  NA+ R E+MSFQQ + E+  DAW RFK +++ CPH+GIP CI ME FY 
Sbjct: 131 KFLRKYFPPTRNAKFRSEIMSFQQLEDESTSDAWERFKELLRKCPHHGIPHCIQMETFYN 190

Query: 198 RLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRG--GRAKDDGM 257
            LN A++  +DA     +L  +YN+     +T+ASNN +W      N R    R     +
Sbjct: 191 GLNAASRMVLDASANGAILSKSYNEAFEILETIASNNYQW-----SNTRAPTSRKVAGVL 250

Query: 258 DKNVVVALQGQMTAMNNLLKSMAISQVNAAGSYVLAANQIDDMGCVGCDGHHNTDACPLN 317
           + + + AL  QM +M N+LK+++I   NA      AA Q DD+ CV C   H  + CP N
Sbjct: 251 EVDAITALTAQMASMTNVLKNLSIG--NAKNIQPAAAIQSDDVSCVFCGEGHVFEKCPSN 306

Query: 318 TETV 320
            E+V
Sbjct: 311 PESV 306

BLAST of PI0023494 vs. ExPASy TrEMBL
Match: A0A6J1EEI2 (uncharacterized protein LOC111433394 OS=Cucurbita moschata OX=3662 GN=LOC111433394 PE=4 SV=1)

HSP 1 Score: 239.6 bits (610), Expect = 2.3e-59
Identity = 144/417 (34.53%), Postives = 211/417 (50.60%), Query Frame = 0

Query: 18  NTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPG 77
           N  ++A D +R IR+YA P +   NP I  P   +   FE+KPVM QM+Q  GQF G P 
Sbjct: 40  NAIHLADDRERAIRAYAHPAVEELNPCIIRPEM-QATTFELKPVMFQMLQTIGQFHGLPS 99

Query: 78  EDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIE 137
           EDPH H++SF  +  SF    +  + +R +LFP +LRD AK W+N L  G + +W+ L+E
Sbjct: 100 EDPHLHLKSFLGVSDSFRFQRVDKDVIRLSLFPYSLRDGAKSWLNTLALGTIDSWNSLVE 159

Query: 138 KFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYF 197
           KF+ K+FPP  NAR R E++ FQQ + + L +AW RFK M++ CPH+G+P CI ME FY 
Sbjct: 160 KFLIKYFPPTRNARFRNEIVVFQQFEDDTLSEAWERFKEMLRKCPHHGLPHCIQMETFYN 219

Query: 198 RLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRAKDDGMDK 257
            LN AT+Q VDA     +L  TYN+     + +ASNN +W +        GR     ++ 
Sbjct: 220 GLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWAD---VRSNPGRKTRGVLEV 279

Query: 258 NVVVALQGQMTAMNNLLKSMAISQ---VNAAGSYVLAANQIDDMGCVGCDGHHNTDACPL 317
           + + ++  Q+ ++ N+L+++A+ Q   + A    V   NQ     CV C   H  D CP 
Sbjct: 280 DALSSINAQLASVTNILQNLALGQDSMIKAPVHTVAVINQTAAESCVYCGEEHTFDQCPS 339

Query: 318 NTETVAL---------------------------------QQSHHQQHP----------- 369
           N  ++                                   Q S++QQ P           
Sbjct: 340 NPASIFYVGNQASQGNPKNNPFSNTYNPGWRNHPNFSWKGQGSYNQQMPPKANYPPGFGL 399

BLAST of PI0023494 vs. ExPASy TrEMBL
Match: A0A6J1DW02 (uncharacterized protein LOC111024897 OS=Momordica charantia OX=3673 GN=LOC111024897 PE=4 SV=1)

HSP 1 Score: 234.6 bits (597), Expect = 7.3e-58
Identity = 140/424 (33.02%), Postives = 225/424 (53.07%), Query Frame = 0

Query: 5   NRNAPPPQADPEPNTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQ 64
           N N      + E N   +A + D  +R YAA    NF+ GI  P+   +  FE+KP+M Q
Sbjct: 69  NGNMRDHARNDEFNYIQMADNRDVAMREYAATAFQNFDSGIVNPI-PAHXNFELKPMMFQ 128

Query: 65  MIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNAL 124
           M+Q  G FGG   EDPH+H++SF  I  +F +PGI+ +     LFP +L+D+A+  +NA 
Sbjct: 129 MLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDAXXLTLFPFSLKDQARXXLNAF 188

Query: 125 EDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHN 184
             G + TW  L+EKF+ KFFPP  +A  R+E++SF+Q D+E +H+AW RFK +++ C ++
Sbjct: 189 PXGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCXNH 248

Query: 185 GIPKCILMEVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGN 244
           G+P C  +E F+  L+  T+  ++        K T+N+I    + +AS+NE W      +
Sbjct: 249 GLPACXQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQ--RS 308

Query: 245 RRGGRAKDDG--MDKNVVVALQGQMTAMNNLLKSMAISQVNAAGSYVLAAN--------- 304
           R   + +D    +  ++  ++Q +M  MN  LK MA+   N   + +             
Sbjct: 309 RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATXIQPVQSDYCTHAPV 368

Query: 305 -QIDDMGC------------VGCDGHHNTDA--------CPLNTETVALQQSHHQQHPT- 364
            Q++D+ C             G  G +   +         P        QQ ++Q+  T 
Sbjct: 369 CQVNDLICWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQHIPPPQQQYNQRTQTP 428

Query: 365 TTASSTSPMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTETP 396
              ++ S +EN+++EYM + DA++QSQA+S+RN   QLG LA++   R QGS P +TE P
Sbjct: 429 PIQNNNSNLENMMKEYMARTDAVIQSQAASMRNFGTQLGHLANELKNRPQGSFPGHTELP 488

BLAST of PI0023494 vs. ExPASy TrEMBL
Match: A0A6J1EQ90 (uncharacterized protein LOC111436411 OS=Cucurbita moschata OX=3662 GN=LOC111436411 PE=4 SV=1)

HSP 1 Score: 234.2 bits (596), Expect = 9.6e-58
Identity = 150/449 (33.41%), Postives = 221/449 (49.22%), Query Frame = 0

Query: 18  NTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPG 77
           N  ++A D +R IR+YA P +   NP I  P   +   FE+KPVM QM+Q  GQF G P 
Sbjct: 62  NPIHLADDRERAIRAYAHPAVEELNPCIIRPEI-QGTTFELKPVMFQMLQTIGQFHGLPL 121

Query: 78  EDPHEHIRSFYSI-------CASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVG 137
           EDPH H++SF  +         SF   G+  + +R +LFP  LRD AK W+N L  G + 
Sbjct: 122 EDPHLHLKSFLGVSDSFRFHSDSFRFQGVDKDMIRLSLFPYLLRDGAKSWLNTLAPGTID 181

Query: 138 TWDQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCI 197
           +W+ L E F+ K+FPP  NAR + E+++FQQ + E L +A  RFK M++ CPH+G+P CI
Sbjct: 182 SWNSLAENFLIKYFPPTRNARFKNEIVTFQQFEDETLSEACERFKEMLRKCPHHGLPHCI 241

Query: 198 LMEVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRA 257
            ME FY  LN  T+Q VDA     +L  TYN+     + +ASNN +W +        GR 
Sbjct: 242 QMETFYNGLNIVTKQVVDASANGAILSKTYNEAYEILERIASNNCQWAD---VRSNPGRK 301

Query: 258 KDDGMDKNVVVALQGQMTAMNNLLKSMAISQ---VNAAGSYVLAANQIDDMGCVGCDGHH 317
               ++ + + ++  Q+ ++ N+L+++A+ Q   + A      A NQ     CV C   H
Sbjct: 302 TRGVLEVDALSSINAQLASVTNILQNLALGQDSMIKAPVHTAAAINQTAAESCVYCGEEH 361

Query: 318 NTDACPLNTETVAL---------------------------------QQSHHQQHP---- 377
             D CP N  ++                                   Q  ++QQ P    
Sbjct: 362 TFDQCPSNPASIFYVGNQASQGNLKNNPFSNTYNPGWRNHPNFSWKGQSLYNQQMPPKAN 421

Query: 378 ------------------------TTTASSTS--PMENLLREYMQKNDALLQSQASSIRN 394
                                   TT A  TS   +E+L++EYM KNDA++QSQ +S+RN
Sbjct: 422 YPSGFRLQNQLAYSSQQVNTQGKGTTQAQYTSETSIESLIKEYMAKNDAVIQSQQASLRN 481

BLAST of PI0023494 vs. NCBI nr
Match: XP_030497803.1 (uncharacterized protein LOC115713460 [Cannabis sativa])

HSP 1 Score: 297.4 bits (760), Expect = 1.9e-76
Identity = 177/427 (41.45%), Postives = 229/427 (53.63%), Query Frame = 0

Query: 13  ADPEPNTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQF 72
           A  E N   +A D  R IR YAAP     NPGI  P   +   FE+KPVM QM+Q  GQF
Sbjct: 10  AHNEANPIALADDRTRAIREYAAPMFNELNPGIVRPEI-QAPHFELKPVMFQMLQTVGQF 69

Query: 73  GGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTW 132
           GG P EDPH HIRSF  +  SF + G+S E LR  LFP +LRD A+ W+N L    V  W
Sbjct: 70  GGSPTEDPHLHIRSFLEVSDSFKLQGVSEEALRLKLFPFSLRDRARAWLNTLPPDSVTNW 129

Query: 133 DQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILM 192
           + L EKF++K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +
Sbjct: 130 NDLAEKFLRKYFPPTRNAKFRSEIMSFQQSEDETTSDAWERFKELLRKCPHHGIPHCIQL 189

Query: 193 EVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRAKD 252
           E FY  LN A++  +DA     +L  +YN+     + +ASNN +W      NR     K 
Sbjct: 190 ETFYNGLNAASRMVLDASANGAILSKSYNEAFEILERIASNNYQWST----NRAHTSRKV 249

Query: 253 DG-MDKNVVVALQGQMTAMNNLLKSMAISQVNAAGS-YVLAANQIDDMGCVGCDGHHNTD 312
            G ++ + + AL  QM +M N+LK+M     N  GS    AA Q     CV C   H  +
Sbjct: 250 AGVLEVDALTALTAQMASMTNILKNM-----NMGGSVQPAAAIQRAKNSCVYCGDGHTFE 309

Query: 313 ACPLNTETVAL------------------------------------------QQSHHQQ 372
            CP N  +V                                            QQ   QQ
Sbjct: 310 NCPSNLASVCYVGNQNFNRNNNPYSNSYNPAWKHHPNFSWGGQGKQSFPPGFSQQPRPQQ 369

Query: 373 HPTTTASSTSPMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNT 396
                 S TS +E+L+R+YM KND ++QSQA+S+RNLEVQLGQLA+D   R QG+LPS+T
Sbjct: 370 PHQPQGSQTSSLESLMRDYMAKNDTVIQSQAASLRNLEVQLGQLANDLKNRPQGTLPSDT 426

BLAST of PI0023494 vs. NCBI nr
Match: XP_030505184.1 (uncharacterized protein LOC115720166 [Cannabis sativa])

HSP 1 Score: 286.6 bits (732), Expect = 3.4e-73
Identity = 168/425 (39.53%), Postives = 226/425 (53.18%), Query Frame = 0

Query: 22  IAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPGEDPH 81
           +  D  R IR YAAP     NPGI  P   +  +FE+KPVM QM+Q  GQF   P EDPH
Sbjct: 20  LVDDRARAIREYAAPMFNELNPGIVRPEI-QAPQFELKPVMFQMLQTVGQFSEMPTEDPH 79

Query: 82  EHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIEKFMK 141
            H+RSF  +  SF + G+S E  R  LFP +LRD A+ W+N L    V  W+   EKF++
Sbjct: 80  LHLRSFLEMSDSFKIQGVSEEVRRLKLFPFSLRDRARSWLNTLSPDSVTNWNDFAEKFLR 139

Query: 142 KFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNK 201
           K+FPP  NA+ R E+MSF Q + E+  DAW RFK +++ CPH+GIP CI ME FY  LN 
Sbjct: 140 KYFPPTRNAKFRSEIMSFHQLEDESASDAWERFKELLRKCPHHGIPHCIQMETFYNGLNA 199

Query: 202 ATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRAKDDG-MDKNVV 261
            +Q  +DA     +L  +YN+     +T+ASNN +W       R  G  K  G ++ + +
Sbjct: 200 TSQMVLDASANGAILSKSYNEAFEILETIASNNYQWS----NTRAPGSRKVAGVLEVDAI 259

Query: 262 VALQGQMTAMNNLLKSMAISQVNAAGSYVLAANQIDDMGCVGCDGHHNTDACPLNTETVA 321
            AL  QM +M N+LK+++I   N+      AA Q DD+ CV C   H  + CP N E+V 
Sbjct: 260 TALTTQMASMTNVLKNLSIG--NSKNIQPAAAIQSDDVSCVFCREGHAFEKCPSNPESVC 319

Query: 322 L--------------------------------------------------QQSHHQQHP 381
                                                              QQ  H QH 
Sbjct: 320 YMGNQNFNRNNGAFSNSYNQAWKNHPNLSWGSRSKLKHFDQGRQAYPPGFSQQLRHPQHA 379

Query: 382 TTTASSTSPMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTET 396
               S  S +E+L+R+YM KNDA++QSQA+ +RNLE+QLG LA++   R QGSLPS+TE 
Sbjct: 380 QN--SQPSSLESLMRDYMAKNDAVIQSQAAFLRNLELQLGHLANELKARPQGSLPSDTEN 435

BLAST of PI0023494 vs. NCBI nr
Match: XP_030483210.1 (uncharacterized protein LOC115699807 [Cannabis sativa])

HSP 1 Score: 280.0 bits (715), Expect = 3.1e-71
Identity = 160/429 (37.30%), Postives = 237/429 (55.24%), Query Frame = 0

Query: 22  IAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPGEDPH 81
           +A D D+ IR YAAP     NPGI  P   +  +FE+KPVM QM+Q  GQF G P EDPH
Sbjct: 1   MADDRDQIIRQYAAPLFNELNPGIVRPEI-QAPQFELKPVMFQMLQTVGQFSGIPTEDPH 60

Query: 82  EHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIEKFMK 141
            H+R F  +  SF +PG++ + LR  LFP +LRD+A+ W+N+L    V TW +L E+F+ 
Sbjct: 61  LHLRLFMEVSDSFKLPGVTEDALRLKLFPYSLRDQARAWLNSLPSASVTTWQELAERFLM 120

Query: 142 KFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNK 201
           K+FPP +NA+ RKE+ SFQQ + E+L++AW RFK +++ CPH+GIP CI ME FY  LN 
Sbjct: 121 KYFPPTKNAKLRKEITSFQQFEDESLYEAWERFKELLRKCPHHGIPHCIQMETFYNGLNA 180

Query: 202 ATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDED--DFGNRRGGRAKDDGMDKNV 261
            T+  VDA     +L  +YN+     + +++NN +W       G +  G      ++ + 
Sbjct: 181 HTRMVVDASANGALLAKSYNEAYDIIERISNNNYQWPTTRVPLGKKVAG-----VLEVDA 240

Query: 262 VVALQGQMTAMNNLLKSMAISQVNAAGSYVLAANQIDDMGCVGCDGHHNTDACPLNTETV 321
           + AL  Q+ +M+N++K+M++ Q     +      Q++++ CV C   H  D CP N  +V
Sbjct: 241 ITALSAQVASMSNMIKNMSMGQQMGQQNVSSPVGQLEEVSCVFCSEAHTFDNCPFNPASV 300

Query: 322 ALQQSH------HQQHPT---------TTASS------------------------TSPM 381
               S       ++QHP          T+ SS                        TS +
Sbjct: 301 FYMGSQNAYNQTYKQHPNLAYRNQGAGTSNSSMLPRSNFPPGFSQAFQQRQQQGVQTSSL 360

Query: 382 ENLLRE--------------YMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPS 396
           E+++R+              YM KND  +QSQA+S+R LE Q+GQLA++   R QG+LPS
Sbjct: 361 ESMMRDFMAKTENFMTRTESYMAKNDTAIQSQATSMRTLENQVGQLANELRNRPQGTLPS 420

BLAST of PI0023494 vs. NCBI nr
Match: XP_017233063.1 (PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus])

HSP 1 Score: 278.5 bits (711), Expect = 9.1e-71
Identity = 168/443 (37.92%), Postives = 244/443 (55.08%), Query Frame = 0

Query: 1   MENNNRNAPPPQADPEPNTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKP 60
           M++N  N   P     P  A+I  D DR IR YAAP     N GI  P   +  +FE+KP
Sbjct: 36  MDDNVNNGDIPIV---PRGAFIVDDKDRAIRQYAAPRFEELNSGIIRPNI-QATQFELKP 95

Query: 61  VMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRW 120
           VM QM+Q  GQF G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W
Sbjct: 96  VMFQMLQTIGQFSGMPTEDPHLHLRLFMEISDSFKFQGVPEDALRLKLFPYSVRDRARTW 155

Query: 121 VNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKA 180
           +N+L  G V TW+ L EKF+ K+FPP+ NA+ R E+ SFQQ+D E+L+DAW RFK +++ 
Sbjct: 156 LNSLPAGSVTTWNDLTEKFLSKYFPPNMNAKLRNEINSFQQQDDESLYDAWERFKELLRK 215

Query: 181 CPHNGIPKCILMEVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDED 240
           CPH+GI  CI ME FY  LN  T+  VDA     +L  +YNQ     +T+A+ N +W   
Sbjct: 216 CPHHGILHCIQMETFYNGLNAQTKMVVDASANGALLSKSYNQAYEILETIATKNYQWSS- 275

Query: 241 DFGNRRGGRAKDDGMDKNVVVALQGQMTAMNNLLKSMAISQVNAAGSYVLAA--NQIDDM 300
                + G+      D + + +++ Q+ +M ++LK++++   N +    L++  NQ  ++
Sbjct: 276 --SRAQTGKKVAGIYDVDSITSMKAQLASMEHMLKNLSMGN-NQSKEQSLSSQINQTKNV 335

Query: 301 GCVGCDGHHNTDACPLNTETVALQQSHH-------------QQHP------------TTT 360
            CV C   H  D+CP N E+V    + +             +QHP            T+T
Sbjct: 336 SCVFCGEAHTYDSCPSNPESVFYMGNQNKAGPYSNTYNQSWRQHPNFSWSNQGANSGTST 395

Query: 361 --------------ASSTSPMENLLREYMQKN-------DALLQSQASSIRNLEVQLGQL 396
                         A  ++ +EN+L+EY+ KN       +AL+QSQA+S+RNLE Q+GQL
Sbjct: 396 GNVKSNYPPGFSQQAPQSNSLENMLKEYIIKNEASRSQTEALVQSQAASLRNLENQVGQL 455

BLAST of PI0023494 vs. NCBI nr
Match: XP_038889363.1 (uncharacterized protein LOC120079279 [Benincasa hispida])

HSP 1 Score: 277.3 bits (708), Expect = 2.0e-70
Identity = 151/377 (40.05%), Postives = 224/377 (59.42%), Query Frame = 0

Query: 52  ENARFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPL 111
           EN RF+IK VMLQM+QN GQFGG  GED H H+ SF  +C++F + G++PE +R  LFP 
Sbjct: 4   ENTRFKIKSVMLQMVQNTGQFGGLQGEDLHAHLTSFVEMCSTFSISGVTPEGIRLYLFPY 63

Query: 112 TLRDEAKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAW 171
           TLRDEA  W ++LE  E+ +WDQL+E FMKKFFPP  NARRRK++++F+Q + E L   W
Sbjct: 64  TLRDEANIWAHSLEPNEITSWDQLVEWFMKKFFPPTVNARRRKDVLNFEQMNNETLSTTW 123

Query: 172 SRFKRMVKACPHNGIPKCILMEVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMA 231
              +R+VK C H GIP C+LM+ FY  LN++TQ   DA      +  TY + K     ++
Sbjct: 124 VHLRRLVKNCLHIGIPDCVLMKTFYNGLNRSTQVVADASVARGFMDKTYTEAKVILHRIS 183

Query: 232 SNNEEWDEDDFGNRRGGRAKDDG--MDKNVVVALQGQMTAMNNLLKSMAISQ--VNAAGS 291
            N ++  +D +G R   R ++D   +  + +  L  QM A+ +LL++MA++Q  ++   +
Sbjct: 184 RNTDDCVDDGYGGRGSERRRNDNAIVPLDTMTTLAAQMAAVTSLLQTMALNQGALSQISA 243

Query: 292 YVLAANQIDDMGCVGCDGHH-----------NTDACPLNTETVAL------------QQS 351
              A  Q+  + CV C G H           +    P N ++               Q  
Sbjct: 244 QPNAPAQVAAISCVQCGGGHANHPNFGWGGNHNQGGPSNHQSNNFENRGNSPPFHQNQNQ 303

Query: 352 HHQQHP------TTTASSTSPMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSG 396
            HQ  P      + T++++S +E+LL++Y++KND ++QSQ SSIRNLE+Q+GQLA++   
Sbjct: 304 GHQPQPQNLPSSSNTSANSSSLESLLKQYIEKNDVVMQSQVSSIRNLEIQVGQLATELRN 363

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1G7Q61.2e-6334.23uncharacterized protein LOC111451598 OS=Cucurbita moschata OX=3662 GN=LOC1114515... [more]
U5CUI23.4e-6344.08Retrotrans_gag domain-containing protein OS=Amborella trichopoda OX=13333 GN=AMT... [more]
A0A6J1EEI22.3e-5934.53uncharacterized protein LOC111433394 OS=Cucurbita moschata OX=3662 GN=LOC1114333... [more]
A0A6J1DW027.3e-5833.02uncharacterized protein LOC111024897 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
A0A6J1EQ909.6e-5833.41uncharacterized protein LOC111436411 OS=Cucurbita moschata OX=3662 GN=LOC1114364... [more]
Match NameE-valueIdentityDescription
XP_030497803.11.9e-7641.45uncharacterized protein LOC115713460 [Cannabis sativa][more]
XP_030505184.13.4e-7339.53uncharacterized protein LOC115720166 [Cannabis sativa][more]
XP_030483210.13.1e-7137.30uncharacterized protein LOC115699807 [Cannabis sativa][more]
XP_017233063.19.1e-7137.92PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus][more]
XP_038889363.12.0e-7040.05uncharacterized protein LOC120079279 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Melon (PI 482460) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 108..196
e-value: 7.3E-16
score: 58.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 376..399
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 234..253
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 238..253
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 82..246
NoneNo IPR availablePANTHERPTHR24559:SF334SUBFAMILY NOT NAMEDcoord: 82..246

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
PI0023494.1PI0023494.1mRNA