Cmc07g0189741 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc07g0189741
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CMiso1.1chr07: 8483101 .. 8484471 (+)
RNA-Seq Expression: Cmc07g0189741
Synteny: Cmc07g0189741
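
The Location field packs the sequence ID, start, end, and strand into one string. The sketch below unpacks it and derives the span length. It assumes the coordinates are 1-based and inclusive (a common GFF-style convention that this page does not state explicitly), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("CMiso1.1chr07: 8483101 .. 8484471 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 1,371 bp on the plus strand of CMiso1.1chr07.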
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGTTCGAAAAAAAGCTTCATAATATAAAAAAGGGAGCCATGTCTTTAAAGGAATATTTTATTAAAATTCGACAATATGTTGATGCTTTAGCCTCCATAAATAAACCTATATCAACTGAGGATCACATATTGTACATCTTAGCTGGATTAGGAAATGAATATCAGTCTATGATATCAGTTATTTTTGCTCGTACTGATTTTCCTTCTGTTCAAGATATTATGTCACTCTTATTGACTCAAGAATCACAGATTGAAAATAAGATTACGAGTGAGGTCTCTTTACCTACTGTCAATATGACTACACATACTAGAGACATTCCATCATTGGAAAAAGAGGGTGAGGTTACACACAGAGGAGGTTCGAATAATCTTAGTTATACAACCACCAATTCTCAATATCATCACAAAAGTCGTGGTGGGGGTCGATCTAATAGAGGAGGAAGAGGAAATAGACACAAAACTCAGTGTCAAATTTGCACCAAATTTGGACATATTGCCGATAGATGTTACTTTCGTTATACTCCAAGGAATCCTCCTTCAGGTTATTCTGCCAACACATCTAATGCTTTTCCATATGCTAATGCTTCTCATAATCCACAGATGTGTGCGATGGTCGCCTCCTATGATCTAAACATTGATAGTAATTGGTATCCTGACTCGGGAGCAACAAATCATTTGACACATAGCTTGAGTAATTTATCAATTGGATCTGAATATGGTGGAGGACATCAGATTTATACAGCAAATGGTTTAGGTTTGCCCATACTCCATTAGGGTTCATTACAATTTACCTCCTCATTTGTTCCAATAAAGTCTTTAGTTCTCAAAAATCTGCTTCACGTTCCTTCCAGTACAAAGAATCTGATAAGTGTCTCTCAATTTGCAAAAGATAACAAAGTCTATTTTGAGTTTCATCCATTTGTTTGTTATGTGAAGGATCAGGAAACAGGCCAAATACTTCTACAAGGACATTTATGTGATGGTCTATATCAATTCAATCTCAAATCCTCTCAACAAGGTTTCATGAAGTCTACTACTAATAGTAATCCACGTATTTTAACTACTACTTTATCTAAGTATCATGTGAACACTACTGATGTATGGCATAGGCGATTAGGCCATCCCCACCTGAATGTTATGCGAAATGCTTTGAAACATGTCCATCATGCCAATATCAGAATAAATAAAATGAATTTTTGTGAAGCTTGTGCTTTAGGAAAACATCATGCTCTTCTCTTTCACAATTCAAATACTCAATATATCTATCCTTTGCAACTAATTGTTTGTGATCTTTGGGGTCCTGCATTTGACACATCTAGGAATGATGTTCGATACTATATTAGTTTTGTTGATGCCTATAGTAGATAA

mRNA sequence

ATGCAGTTCGAAAAAAAGCTTCATAATATAAAAAAGGGAGCCATGTCTTTAAAGGAATATTTTATTAAAATTCGACAATATGTTGATGCTTTAGCCTCCATAAATAAACCTATATCAACTGAGGATCACATATTGTACATCTTAGCTGGATTAGGAAATGAATATCAGTCTATGATATCAGTTATTTTTGCTCGTACTGATTTTCCTTCTGTTCAAGATATTATGTCACTCTTATTGACTCAAGAATCACAGATTGAAAATAAGATTACGAGTGAGGTCTCTTTACCTACTGTCAATATGACTACACATACTAGAGACATTCCATCATTGGAAAAAGAGGGTGAGGTTACACACAGAGGAGGTTCGAATAATCTTAGTTATACAACCACCAATTCTCAATATCATCACAAAAGTCGTGGTGGGGGTCGATCTAATAGAGGAGGAAGAGGAAATAGACACAAAACTCAGTGTCAAATTTGCACCAAATTTGGACATATTGCCGATAGATGTTACTTTCGTTATACTCCAAGGAATCCTCCTTCAGGTTATTCTGCCAACACATCTAATGCTTTTCCATATGCTAATGCTTCTCATAATCCACAGATGTGTGCGATGGTCGCCTCCTATGATCTAAACATTGATAGTAATTGGTATCCTGACTCGGGAGCAACAAATCATTTGACACATAGCTTGAGTAATTTATCAATTGGATCTGAATATGGTGGAGGACATCAGATTTATACAGCAAATGGTTTAGGCCAAATACTTCTACAAGGACATTTATGTGATGGTCTATATCAATTCAATCTCAAATCCTCTCAACAAGGTTTCATGAAGTCTACTACTAATAGTAATCCACGTATTTTAACTACTACTTTATCTAAGTATCATGTGAACACTACTGATGTATGGCATAGGCGATTAGGCCATCCCCACCTGAATGTTATGCGAAATGCTTTGAAACATGTCCATCATGCCAATATCAGAATAAATAAAATGAATTTTTGTGAAGCTTGTGCTTTAGGAAAACATCATGCTCTTCTCTTTCACAATTCAAATACTCAATATATCTATCCTTTGCAACTAATTGTTTGTGATCTTTGGGGTCCTGCATTTGACACATCTAGGAATGATGTTCGATACTATATTAGTTTTGTTGATGCCTATAGTAGATAA

Coding sequence (CDS)

ATGCAGTTCGAAAAAAAGCTTCATAATATAAAAAAGGGAGCCATGTCTTTAAAGGAATATTTTATTAAAATTCGACAATATGTTGATGCTTTAGCCTCCATAAATAAACCTATATCAACTGAGGATCACATATTGTACATCTTAGCTGGATTAGGAAATGAATATCAGTCTATGATATCAGTTATTTTTGCTCGTACTGATTTTCCTTCTGTTCAAGATATTATGTCACTCTTATTGACTCAAGAATCACAGATTGAAAATAAGATTACGAGTGAGGTCTCTTTACCTACTGTCAATATGACTACACATACTAGAGACATTCCATCATTGGAAAAAGAGGGTGAGGTTACACACAGAGGAGGTTCGAATAATCTTAGTTATACAACCACCAATTCTCAATATCATCACAAAAGTCGTGGTGGGGGTCGATCTAATAGAGGAGGAAGAGGAAATAGACACAAAACTCAGTGTCAAATTTGCACCAAATTTGGACATATTGCCGATAGATGTTACTTTCGTTATACTCCAAGGAATCCTCCTTCAGGTTATTCTGCCAACACATCTAATGCTTTTCCATATGCTAATGCTTCTCATAATCCACAGATGTGTGCGATGGTCGCCTCCTATGATCTAAACATTGATAGTAATTGGTATCCTGACTCGGGAGCAACAAATCATTTGACACATAGCTTGAGTAATTTATCAATTGGATCTGAATATGGTGGAGGACATCAGATTTATACAGCAAATGGTTTAGGCCAAATACTTCTACAAGGACATTTATGTGATGGTCTATATCAATTCAATCTCAAATCCTCTCAACAAGGTTTCATGAAGTCTACTACTAATAGTAATCCACGTATTTTAACTACTACTTTATCTAAGTATCATGTGAACACTACTGATGTATGGCATAGGCGATTAGGCCATCCCCACCTGAATGTTATGCGAAATGCTTTGAAACATGTCCATCATGCCAATATCAGAATAAATAAAATGAATTTTTGTGAAGCTTGTGCTTTAGGAAAACATCATGCTCTTCTCTTTCACAATTCAAATACTCAATATATCTATCCTTTGCAACTAATTGTTTGTGATCTTTGGGGTCCTGCATTTGACACATCTAGGAATGATGTTCGATACTATATTAGTTTTGTTGATGCCTATAGTAGATAA

Protein sequence

MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANGLGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNFCEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR
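
The protein sequence is the conceptual translation of the CDS. As a minimal sketch, the code below translates a DNA string with the standard genetic code (assumed here; the page does not name the translation table) and checks it against the first 13 codons of the CDS above. `translate` and `CODON_TABLE` are illustrative names.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 13 codons of the CDS, written codon by codon for readability.
cds_prefix = "ATG CAG TTC GAA AAA AAG CTT CAT AAT ATA AAA AAG GGA".replace(" ", "")
print(translate(cds_prefix))
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above, provided the standard table is the one the annotation pipeline used.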
Homology
BLAST of Cmc07g0189741 vs. NCBI nr
Match: KAA0048297.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 448.7 bits (1153), Expect = 5.0e-122
Identity = 249/456 (54.61%), Positives = 297/456 (65.13%), Query Frame = 0

Query: 1   MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMIS 60
           MQF+ KLHNIKKG+M LKEYF+KI Q VDALASINKP+S++DHILYILAGLG++YQSMIS
Sbjct: 140 MQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMIS 199

Query: 61  VIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRG 120
           VI ARTD PSVQ++MSLLLTQESQ E+K+ SE +LP+VN+ T T      EK  E   R 
Sbjct: 200 VISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQT-----TEKGAESYIRT 259

Query: 121 GSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPP 180
             NN  Y   +S      RG GRSNRG RGNR+K QCQIC K G+ ADRC+FRYTPR+  
Sbjct: 260 NQNN--YHNNHSYNQRGGRGNGRSNRGRRGNRNKPQCQICAKLGYSADRCFFRYTPRSNS 319

Query: 181 SGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEY 240
           SGYS N+ N   Y N +++PQM AMVA+ DLNIDSNWYPDSGATNHLTHSLSNLSIGSEY
Sbjct: 320 SGYSPNSHNT-SYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEY 379

Query: 241 GGGHQIYTANG------------------------------------------------- 300
           GGG+QIY ANG                                                 
Sbjct: 380 GGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNH 439

Query: 301 ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSK 360
                            GQ+LLQG L DGLY+F ++ S +    S +N+ P +  T + K
Sbjct: 440 VFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKP-VFNTVVPK 499

Query: 361 YHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNFCEACALGKHHALLFHNSNTQ 392
            +    D+WHRRLGHPHL +++  L H+ +++  INK+NFCEACALGKHHAL F +S T 
Sbjct: 500 SNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTL 559
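
The percentages in each HSP header follow from the counts beside them: matching positions divided by the alignment length, which includes gap columns. A minimal sketch using the figures from the hit above; `percent` is a hypothetical helper name.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned_length, 2)

identity = percent(249, 456)   # identical residues / alignment length
positives = percent(297, 456)  # identical or conservatively substituted residues
print(identity, positives)
```

Because the denominator is the alignment length rather than the query length, a long gapped alignment can report a lower identity than a shorter, gap-free hit to a close relative.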

BLAST of Cmc07g0189741 vs. NCBI nr
Match: TYK10642.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 448.7 bits (1153), Expect = 5.0e-122
Identity = 249/456 (54.61%), Positives = 297/456 (65.13%), Query Frame = 0

Query: 1   MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMIS 60
           MQF+ KLHNIKKG+M LKEYF+KI Q VDALASINKP+S++DHILYILAGLG++YQSMIS
Sbjct: 140 MQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMIS 199

Query: 61  VIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRG 120
           VI ARTD PSVQ++MSLLLTQESQ E+K+ SE +LP+VN+ T T      EK  E   R 
Sbjct: 200 VISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQT-----TEKGAESYIRT 259

Query: 121 GSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPP 180
             NN  Y   +S      RG GRSNRG RGNR+K QCQIC K G+ ADRC+FRYTPR+  
Sbjct: 260 NQNN--YHNNHSYNQRGGRGNGRSNRGRRGNRNKPQCQICAKLGYSADRCFFRYTPRSNS 319

Query: 181 SGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEY 240
           SGYS N+ N   Y N +++PQM AMVA+ DLNIDSNWYPDSGATNHLTHSLSNLSIGSEY
Sbjct: 320 SGYSPNSHNT-SYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEY 379

Query: 241 GGGHQIYTANG------------------------------------------------- 300
           GGG+QIY ANG                                                 
Sbjct: 380 GGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNH 439

Query: 301 ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSK 360
                            GQ+LLQG L DGLY+F ++ S +    S +N+ P +  T + K
Sbjct: 440 VFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKP-VFNTVVPK 499

Query: 361 YHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNFCEACALGKHHALLFHNSNTQ 392
            +    D+WHRRLGHPHL +++  L H+ +++  INK+NFCEACALGKHHAL F +S T 
Sbjct: 500 SNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTL 559

BLAST of Cmc07g0189741 vs. NCBI nr
Match: KAA0059137.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] >TYK21610.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 445.7 bits (1145), Expect = 4.2e-121
Identity = 236/319 (73.98%), Positives = 247/319 (77.43%), Query Frame = 0

Query: 51  LGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSL 110
           LGNEYQSMISVI ARTD  SVQDIMSLLLTQESQIE+KITS+VSLP VN+T HTRDIPSL
Sbjct: 54  LGNEYQSMISVISARTDSLSVQDIMSLLLTQESQIESKITSDVSLPAVNITIHTRDIPSL 113

Query: 111 EKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRC 170
           EK+GEVTHRG SNNL+YTT NSQYHH+S GGGRS RGGRGNR+KTQCQICTKFGHIAD C
Sbjct: 114 EKKGEVTHRGSSNNLNYTTNNSQYHHRSHGGGRSTRGGRGNRNKTQCQICTKFGHIADIC 173

Query: 171 YFRYTPRNPPSGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHS 230
           YFRYTPRN  SGYSAN S+ FPY NAS NPQM AMV  YDLN +SNWYPDSGA+NHLTHS
Sbjct: 174 YFRYTPRNSTSGYSAN-SSTFPYTNASRNPQMSAMVTFYDLNFNSNWYPDSGASNHLTHS 233

Query: 231 LSNLSIGSEYGGGHQIYTANG--------------------------------------- 290
           LSNLS GSEYG GHQIY ANG                                       
Sbjct: 234 LSNLSTGSEYGRGHQIYAANGSDLPLLHHGSLQFTSSFVPSKALFLKNLFHVPSITKNLD 293

Query: 291 --LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLG 329
              GQILLQ HLCDGLYQFNLKSS QG MKST N NP  LTTTLSKYHVNTTDVWHRRLG
Sbjct: 294 QETGQILLQRHLCDGLYQFNLKSSHQGSMKSTPNINPCALTTTLSKYHVNTTDVWHRRLG 353

BLAST of Cmc07g0189741 vs. NCBI nr
Match: KAA0046195.1 (putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa] >TYK14162.1 putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa])

HSP 1 Score: 347.1 bits (889), Expect = 2.0e-91
Identity = 173/194 (89.18%), Positives = 184/194 (94.85%), Query Frame = 0

Query: 1   MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMIS 60
           MQF+ KLHN+KKGAMSLKEYF+KI+Q VDALASINKPIST+DHILYILAGLGNEYQS+IS
Sbjct: 116 MQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIIS 175

Query: 61  VIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRG 120
           +I ARTD PSVQD MSLLLTQESQIE+KITSEVSLPTVNMTTHTRDI SLEKE EVTHRG
Sbjct: 176 IISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRG 235

Query: 121 GSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPP 180
           GSNNL YTTTNSQYHHKSR GGRSNRGGRGNRHKTQCQIC+KFGH+ADRCYFRYTPRNPP
Sbjct: 236 GSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPP 295

Query: 181 SGYSANTSNAFPYA 195
           SGYS N+SNAFPYA
Sbjct: 296 SGYSTNSSNAFPYA 309

BLAST of Cmc07g0189741 vs. NCBI nr
Match: KAA0045111.1 (putative glutathione S-transferase isoform X1 [Cucumis melo var. makuwa] >TYK23627.1 putative glutathione S-transferase isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 228.0 bits (580), Expect = 1.4e-55
Identity = 119/142 (83.80%), Positives = 128/142 (90.14%), Query Frame = 0

Query: 1   MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMIS 60
           MQF+ KLHN+KKG +SLKEYF+KI+Q VDALASINKPIST+DHILYILAGLGNEYQSMIS
Sbjct: 275 MQFKNKLHNMKKGVISLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSMIS 334

Query: 61  VIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRG 120
           VI ARTD PSVQD+MSLLLTQESQIE+KITSEVSLPTVNMTTH RDI SL KE  VTHRG
Sbjct: 335 VISARTDSPSVQDVMSLLLTQESQIESKITSEVSLPTVNMTTHARDISSLAKEDAVTHRG 394

Query: 121 GSNNLSYTTTNSQYHHKSRGGG 143
           G NNLSY  TNSQYHH+SRG G
Sbjct: 395 GLNNLSYPPTNSQYHHRSRGRG 416

BLAST of Cmc07g0189741 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 3.0e-21
Identity = 111/432 (25.69%), Positives = 174/432 (40.28%), Query Frame = 0

Query: 29  DALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENK 88
           D LA + KP+  ++ +  +L  L ++Y+ +I  I A+   PS+ +I   L+ +ES++   
Sbjct: 135 DQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLLAL 194

Query: 89  ITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGG 148
            ++EV   T N+ TH     +        +RG + N +     S     S  G RS+   
Sbjct: 195 NSAEVVPITANVVTHR----NTNTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSRSD-NR 254

Query: 149 RGNRHKTQCQICTKFGHIADRC----YFRYTPRNPPSGYSANTSNAFPYANASHNPQMCA 208
           +   +  +CQIC+  GH A RC     F+ T     S     TS   P+   ++     A
Sbjct: 255 QPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQQS-----TSPFTPWQPRAN----LA 314

Query: 209 MVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG------------- 268
           + + Y+ N   NW  DSGAT+H+T   +NLS    Y GG  +  A+G             
Sbjct: 315 VNSPYNAN---NWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASL 374

Query: 269 -------------------------------------------------LGQILLQGHLC 328
                                                             G  LLQG   
Sbjct: 375 PTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTK 434

Query: 329 DGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKH 388
           D LY++ + SSQ   M ++  S               T   WH RLGHP L ++ + +  
Sbjct: 435 DELYEWPIASSQAVSMFASPCSKA-------------THSSWHSRLGHPSLAILNSVIS- 494

Query: 389 VHHANIRIN---KMNFCEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDV 392
            +H+   +N   K+  C  C + K H + F NS      PL+ I  D+W      S ++ 
Sbjct: 495 -NHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSSPI-LSIDNY 533

BLAST of Cmc07g0189741 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 97.8 bits (242), Expect = 2.8e-19
Identity = 108/467 (23.13%), Positives = 176/467 (37.69%), Query Frame = 0

Query: 2   QFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISV 61
           Q   +L    KG  ++ +Y   +    D LA + KP+  ++ +  +L  L  EY+ +I  
Sbjct: 127 QLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQ 186

Query: 62  IFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRGG 121
           I A+   P++ +I   LL  ES+I    ++ V   T N  +H     +        +  G
Sbjct: 187 IAAKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTT------NNNNNG 246

Query: 122 SNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKT---QCQICTKFGHIADRC------YF 181
           + N  Y   N+  + K      +N     N+ K    +CQIC   GH A RC        
Sbjct: 247 NRNNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLS 306

Query: 182 RYTPRNPPSGYS-----ANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHL 241
               + PPS ++     AN +   PY++                   +NW  DSGAT+H+
Sbjct: 307 SVNSQQPPSPFTPWQPRANLALGSPYSS-------------------NNWLLDSGATHHI 366

Query: 242 THSLSNLSIGSEYGGGHQIYTANG------------------------------------ 301
           T   +NLS+   Y GG  +  A+G                                    
Sbjct: 367 TSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLI 426

Query: 302 --------------------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSN 361
                                      G  LLQG   D LY++ + SSQ   + ++ +S 
Sbjct: 427 SVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQPVSLFASPSSK 486

Query: 362 PRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIR-INKMNFCEACALGKH 392
                         T   WH RLGHP  +++ + + +   + +   +K   C  C + K 
Sbjct: 487 A-------------THSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKS 546

BLAST of Cmc07g0189741 vs. ExPASy TrEMBL
Match: A0A5A7U233 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G00060 PE=4 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 2.4e-122
Identity = 249/456 (54.61%), Positives = 297/456 (65.13%), Query Frame = 0

Query: 1   MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMIS 60
           MQF+ KLHNIKKG+M LKEYF+KI Q VDALASINKP+S++DHILYILAGLG++YQSMIS
Sbjct: 140 MQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMIS 199

Query: 61  VIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRG 120
           VI ARTD PSVQ++MSLLLTQESQ E+K+ SE +LP+VN+ T T      EK  E   R 
Sbjct: 200 VISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQT-----TEKGAESYIRT 259

Query: 121 GSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPP 180
             NN  Y   +S      RG GRSNRG RGNR+K QCQIC K G+ ADRC+FRYTPR+  
Sbjct: 260 NQNN--YHNNHSYNQRGGRGNGRSNRGRRGNRNKPQCQICAKLGYSADRCFFRYTPRSNS 319

Query: 181 SGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEY 240
           SGYS N+ N   Y N +++PQM AMVA+ DLNIDSNWYPDSGATNHLTHSLSNLSIGSEY
Sbjct: 320 SGYSPNSHNT-SYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEY 379

Query: 241 GGGHQIYTANG------------------------------------------------- 300
           GGG+QIY ANG                                                 
Sbjct: 380 GGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNH 439

Query: 301 ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSK 360
                            GQ+LLQG L DGLY+F ++ S +    S +N+ P +  T + K
Sbjct: 440 VFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKP-VFNTVVPK 499

Query: 361 YHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNFCEACALGKHHALLFHNSNTQ 392
            +    D+WHRRLGHPHL +++  L H+ +++  INK+NFCEACALGKHHAL F +S T 
Sbjct: 500 SNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTL 559

BLAST of Cmc07g0189741 vs. ExPASy TrEMBL
Match: A0A5D3CH97 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00040 PE=4 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 2.4e-122
Identity = 249/456 (54.61%), Positives = 297/456 (65.13%), Query Frame = 0

Query: 1   MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMIS 60
           MQF+ KLHNIKKG+M LKEYF+KI Q VDALASINKP+S++DHILYILAGLG++YQSMIS
Sbjct: 140 MQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMIS 199

Query: 61  VIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRG 120
           VI ARTD PSVQ++MSLLLTQESQ E+K+ SE +LP+VN+ T T      EK  E   R 
Sbjct: 200 VISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQT-----TEKGAESYIRT 259

Query: 121 GSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPP 180
             NN  Y   +S      RG GRSNRG RGNR+K QCQIC K G+ ADRC+FRYTPR+  
Sbjct: 260 NQNN--YHNNHSYNQRGGRGNGRSNRGRRGNRNKPQCQICAKLGYSADRCFFRYTPRSNS 319

Query: 181 SGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEY 240
           SGYS N+ N   Y N +++PQM AMVA+ DLNIDSNWYPDSGATNHLTHSLSNLSIGSEY
Sbjct: 320 SGYSPNSHNT-SYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEY 379

Query: 241 GGGHQIYTANG------------------------------------------------- 300
           GGG+QIY ANG                                                 
Sbjct: 380 GGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNH 439

Query: 301 ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSK 360
                            GQ+LLQG L DGLY+F ++ S +    S +N+ P +  T + K
Sbjct: 440 VFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKP-VFNTVVPK 499

Query: 361 YHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNFCEACALGKHHALLFHNSNTQ 392
            +    D+WHRRLGHPHL +++  L H+ +++  INK+NFCEACALGKHHAL F +S T 
Sbjct: 500 SNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTL 559

BLAST of Cmc07g0189741 vs. ExPASy TrEMBL
Match: A0A5D3DDT9 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold150G00070 PE=4 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 2.0e-121
Identity = 236/319 (73.98%), Positives = 247/319 (77.43%), Query Frame = 0

Query: 51  LGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSL 110
           LGNEYQSMISVI ARTD  SVQDIMSLLLTQESQIE+KITS+VSLP VN+T HTRDIPSL
Sbjct: 54  LGNEYQSMISVISARTDSLSVQDIMSLLLTQESQIESKITSDVSLPAVNITIHTRDIPSL 113

Query: 111 EKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRC 170
           EK+GEVTHRG SNNL+YTT NSQYHH+S GGGRS RGGRGNR+KTQCQICTKFGHIAD C
Sbjct: 114 EKKGEVTHRGSSNNLNYTTNNSQYHHRSHGGGRSTRGGRGNRNKTQCQICTKFGHIADIC 173

Query: 171 YFRYTPRNPPSGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHS 230
           YFRYTPRN  SGYSAN S+ FPY NAS NPQM AMV  YDLN +SNWYPDSGA+NHLTHS
Sbjct: 174 YFRYTPRNSTSGYSAN-SSTFPYTNASRNPQMSAMVTFYDLNFNSNWYPDSGASNHLTHS 233

Query: 231 LSNLSIGSEYGGGHQIYTANG--------------------------------------- 290
           LSNLS GSEYG GHQIY ANG                                       
Sbjct: 234 LSNLSTGSEYGRGHQIYAANGSDLPLLHHGSLQFTSSFVPSKALFLKNLFHVPSITKNLD 293

Query: 291 --LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLG 329
              GQILLQ HLCDGLYQFNLKSS QG MKST N NP  LTTTLSKYHVNTTDVWHRRLG
Sbjct: 294 QETGQILLQRHLCDGLYQFNLKSSHQGSMKSTPNINPCALTTTLSKYHVNTTDVWHRRLG 353

BLAST of Cmc07g0189741 vs. ExPASy TrEMBL
Match: A0A5D3CRZ7 (Putative Ty1-copia-like retrotransposon OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold688G00160 PE=4 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 9.9e-92
Identity = 173/194 (89.18%), Positives = 184/194 (94.85%), Query Frame = 0

Query: 1   MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMIS 60
           MQF+ KLHN+KKGAMSLKEYF+KI+Q VDALASINKPIST+DHILYILAGLGNEYQS+IS
Sbjct: 116 MQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIIS 175

Query: 61  VIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRG 120
           +I ARTD PSVQD MSLLLTQESQIE+KITSEVSLPTVNMTTHTRDI SLEKE EVTHRG
Sbjct: 176 IISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRG 235

Query: 121 GSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPP 180
           GSNNL YTTTNSQYHHKSR GGRSNRGGRGNRHKTQCQIC+KFGH+ADRCYFRYTPRNPP
Sbjct: 236 GSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPP 295

Query: 181 SGYSANTSNAFPYA 195
           SGYS N+SNAFPYA
Sbjct: 296 SGYSTNSSNAFPYA 309

BLAST of Cmc07g0189741 vs. ExPASy TrEMBL
Match: A0A5A7TUB3 (Putative glutathione S-transferase isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold500G001440 PE=4 SV=1)

HSP 1 Score: 228.0 bits (580), Expect = 6.7e-56
Identity = 119/142 (83.80%), Positives = 128/142 (90.14%), Query Frame = 0

Query: 1   MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMIS 60
           MQF+ KLHN+KKG +SLKEYF+KI+Q VDALASINKPIST+DHILYILAGLGNEYQSMIS
Sbjct: 275 MQFKNKLHNMKKGVISLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSMIS 334

Query: 61  VIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRG 120
           VI ARTD PSVQD+MSLLLTQESQIE+KITSEVSLPTVNMTTH RDI SL KE  VTHRG
Sbjct: 335 VISARTDSPSVQDVMSLLLTQESQIESKITSEVSLPTVNMTTHARDISSLAKEDAVTHRG 394

Query: 121 GSNNLSYTTTNSQYHHKSRGGG 143
           G NNLSY  TNSQYHH+SRG G
Sbjct: 395 GLNNLSYPPTNSQYHHRSRGRG 416

BLAST of Cmc07g0189741 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 55.8 bits (133), Expect = 8.8e-08
Identity = 44/156 (28.21%), Positives = 83/156 (53.21%), Query Frame = 0

Query: 1   MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMIS 60
           +QFE +L       +S+ EY  K++   D L +++ PIS    ++++L GL  +Y  +++
Sbjct: 117 LQFENELRTTTIDDLSVHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYILN 176

Query: 61  VIFARTDFPSVQDIMSLLLTQESQIENKITSEVS---LPTVNMTTHTRDIPSLEKEGEVT 120
           VI  ++ FPS  +  S+LL +ES++ NK  S +S    P+++    T  +P  ++     
Sbjct: 177 VIKHKSPFPSFTEARSMLLMEESRLSNKSKSSLSHTNHPSLSNVLFT--VPRQQERYPQE 236

Query: 121 HRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRH 154
           +   ++N+       +   K+RGGG S+  GR N +
Sbjct: 237 YHNNNSNMG----RGRSKKKNRGGGSSD--GRYNNN 264

BLAST of Cmc07g0189741 vs. TAIR 10
Match: AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 50.8 bits (120), Expect = 2.8e-06
Identity = 38/154 (24.68%), Positives = 78/154 (50.65%), Query Frame = 0

Query: 1   MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMIS 60
           ++ + +L     G M + +Y+ K+++  D+L +++ P++  + ++Y+L GL  ++ ++I+
Sbjct: 115 LRLDSELRTKDIGDMRVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIIN 174

Query: 61  VIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTH-- 120
           VI  R  FPS  D  ++L  +E +++  I      PT    + +  + +  +   VT+  
Sbjct: 175 VIKHRQPFPSFDDAATMLQEEEDRLKRAIKPN---PTHVDHSSSSTVLACSEAPPVTNFQ 234

Query: 121 RGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNR 153
           R G N + Y         + RG G +   GRG R
Sbjct: 235 RSGGNQMGY---------RGRGRGNNIFRGRGGR 256

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name      E-value    Identity (%)  Description
KAA0048297.1    5.0e-122   54.61         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
TYK10642.1      5.0e-122   54.61         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
KAA0059137.1    4.2e-121   73.98         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
KAA0046195.1    2.0e-91    89.18         putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa] >TYK14162.1 p...
KAA0045111.1    1.4e-55    83.80         putative glutathione S-transferase isoform X1 [Cucumis melo var. makuwa] >TYK236...

vs. ExPASy Swiss-Prot
Match Name      E-value    Identity (%)  Description
Q9ZT94          3.0e-21    25.69         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2          2.8e-19    23.13         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...

vs. ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A5A7U233      2.4e-122   54.61         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5D3CH97      2.4e-122   54.61         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5D3DDT9      2.0e-121   73.98         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5D3CRZ7      9.9e-92    89.18         Putative Ty1-copia-like retrotransposon OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7TUB3      6.7e-56    83.80         Putative glutathione S-transferase isoform X1 OS=Cucumis melo var. makuwa OX=119...

vs. TAIR 10
Match Name      E-value    Identity (%)  Description
AT5G48050.1     8.8e-08    28.21         CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
AT1G34070.1     2.8e-06    24.68         CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term   IPR Description                     Source       Source Term     Source Description                   Alignment
None       No IPR available                    PFAM         PF14223         Retrotran_gag_2                      coord: 4..84; e-value: 6.8E-9; score: 35.6
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 119..138
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 112..153
None       No IPR available                    PANTHER      PTHR34222:SF48  SUBFAMILY NOT NAMED                  coord: 2..271
None       No IPR available                    PANTHER      PTHR34222       FAMILY NOT NAMED                     coord: 2..271
IPR025724  GAG-pre-integrase domain            PFAM         PF13976         gag_pre-integrs                      coord: 264..344; e-value: 1.2E-13; score: 50.7
IPR036875  Zinc finger, CCHC-type superfamily  SUPERFAMILY  57756           Retrovirus zinc finger-like domains  coord: 134..171

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Cmc07g0189741.1  Cmc07g0189741.1  mRNA

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006259      DNA metabolic process
biological_process  GO:0006749      glutathione metabolic process
biological_process  GO:0042221      response to chemical
molecular_function  GO:0004364      glutathione transferase activity
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding