MELO3C021644 (gene) Melon (DHL92) v3.5.1

NameMELO3C021644
Typegene
OrganismCucumis melo (Melon (DHL92) v3.5.1)
DescriptionGag-pro-like protein
Locationchr9 : 4884084 .. 4885622 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTGCCGAGGTTCAGCCACCGTTAACAGACAAAGAAATGACATCTATGTTTATGAATACCTTGCGGGCTCCATTCTATGATCGAATGATTGGTAATGCATTAAAAAATTTTTCTGATATTATTGTTATTGGCGAAAGAATTGAATATGGGATACAACACGGGAGGTTAGCAGAGGCTACAACTGAATATGATGGAATAAAGAAAGGAACAATATCTAAGAAGAAAGAAGGAGAGGTTCATGCAATTGGTTTTCCTAATTCATGGAAGCACAAATCAATTTTTTGTCAGAGAAAATATGAACAAAACTTTCCGTCATATATAAGCAATGTTTCTCATATCCCTTATAATAGCTATGTACCAGCCCATGCCGTCTCTGAAACTCCAAAACCCATTAACTCAAATTCTCCTCGACCATTTGTACAGAGTCAAGGTAGCAAAACCAATTCAGATACATGGCGGTTTGATCCGATTCCCATAACTTATAAAGAGCTTTTACCTCAACTAATTCAAAATCGACAGTTAGCTCCTATTCCAATGATTCCTATACCATCTCCTTACCCAAAATGGTATGATTCAAATGCTCGATGTGAATATCATGCTGGAGGAGCGGGACACTCAAATGAAAATTGTTTAGCTTTGAGAAGAAAGGTGCAATCTCTAATTAATGCTAGATGCTTGAGCTTCAAAAAATCTAGTGAGAAGCCAAATGTCAATGAAAACCCACGGCCTAATCATGAAAATACAAAAGTGAATGTTGTGGATCGCCTTGTTGAAAAGTGTAAAAATGAAGTCCATGAGATAATGATGCCTATGGAAGAACTTTTTAAAGGTCTTTTTGAAGCAGGATATGTTAGTCAGAAATATCTAGACCCCAATATAAAATATGAAGGATATGATGAAAGCAGACATTGTATATTCCATCAAGGAGTTGTAGGCCACGTTGTCCAACAGTGCAAAAAATTTAGATCCAAAGTACAACAACTTATGGATTCAAAGATACTCACGGTATATAGAGGACAAGGAAAAGACGAGATGAAAGACAGTAAAATATGTGCTTTAATGGATGAAGTTTCAGAAAAGGAATATTCCTTTTTACCAAGACCTTTGACAGTTTTTTTTTATCAAGAAAGTCGTAGTGAGTCAACATTCTACAATCCTAAAAAACTCACAATCCATGTGTCGAGTCCTTTCAAATGTAAGGATCTAAAAGTAGTGCCATGGTGGTATGATTGTCAAGTCATAACAGGTCCTGTTGATAACATTATAGGAATAAGTGGGATAACCCGAAGTGGAAGATGCTACAAACCAGATAATTTAACAGTCCCTTCAGATGGTCTAATACTGCAGCAAGGTAGCAAAAATGAGAAAAGAAATGTGAAGGAGCTTTGCAAAGACCAAGATGTGGAGATGCCTATCATTGCAAAATATATAGAATACAAAAAGTTTGTCACGGATGAGGAAGCAAATGAATTCTTGAAAATAGTAAAACAAAGTGAGTATAAGATCATAGAGCAAATGCATCATACTCCAGCTTGA

mRNA sequence

ATGGCTGCCGAGGTTCAGCCACCGTTAACAGACAAAGAAATGACATCTATGTTTATGAATACCTTGCGGGCTCCATTCTATGATCGAATGATTGGTAATGCATTAAAAAATTTTTCTGATATTATTGTTATTGGCGAAAGAATTGAATATGGGATACAACACGGGAGGTTAGCAGAGGCTACAACTGAATATGATGGAATAAAGAAAGGAACAATATCTAAGAAGAAAGAAGGAGAGGTTCATGCAATTGGTTTTCCTAATTCATGGAAGCACAAATCAATTTTTTGTCAGAGAAAATATGAACAAAACTTTCCGTCATATATAAGCAATGTTTCTCATATCCCTTATAATAGCTATGTACCAGCCCATGCCGTCTCTGAAACTCCAAAACCCATTAACTCAAATTCTCCTCGACCATTTGTACAGAGTCAAGGTAGCAAAACCAATTCAGATACATGGCGGTTTGATCCGATTCCCATAACTTATAAAGAGCTTTTACCTCAACTAATTCAAAATCGACAGTTAGCTCCTATTCCAATGATTCCTATACCATCTCCTTACCCAAAATGGTATGATTCAAATGCTCGATGTGAATATCATGCTGGAGGAGCGGGACACTCAAATGAAAATTGTTTAGCTTTGAGAAGAAAGGTGCAATCTCTAATTAATGCTAGATGCTTGAGCTTCAAAAAATCTAGTGAGAAGCCAAATGTCAATGAAAACCCACGGCCTAATCATGAAAATACAAAAGTGAATGTTGTGGATCGCCTTGTTGAAAAGTGTAAAAATGAAGTCCATGAGATAATGATGCCTATGGAAGAACTTTTTAAAGGTCTTTTTGAAGCAGGATATGTTAGTCAGAAATATCTAGACCCCAATATAAAATATGAAGGATATGATGAAAGCAGACATTGTATATTCCATCAAGGAGTTGTAGGCCACGTTGTCCAACAGTGCAAAAAATTTAGATCCAAAGTACAACAACTTATGGATTCAAAGATACTCACGGTATATAGAGGACAAGGAAAAGACGAGATGAAAGACAGTAAAATATGTGCTTTAATGGATGAAGTTTCAGAAAAGGAATATTCCTTTTTACCAAGACCTTTGACAGTTTTTTTTTATCAAGAAAGTCGTAGTGAGTCAACATTCTACAATCCTAAAAAACTCACAATCCATGTGTCGAGTCCTTTCAAATGTAAGGATCTAAAAGTAGTGCCATGGTGGTATGATTGTCAAGTCATAACAGGTCCTGTTGATAACATTATAGGAATAAGTGGGATAACCCGAAGTGGAAGATGCTACAAACCAGATAATTTAACAGTCCCTTCAGATGGTCTAATACTGCAGCAAGGTAGCAAAAATGAGAAAAGAAATGTGAAGGAGCTTTGCAAAGACCAAGATGTGGAGATGCCTATCATTGCAAAATATATAGAATACAAAAAGTTTGTCACGGATGAGGAAGCAAATGAATTCTTGAAAATAGTAAAACAAAGTGAGTATAAGATCATAGAGCAAATGCATCATACTCCAGCTTGA

Coding sequence (CDS)

ATGGCTGCCGAGGTTCAGCCACCGTTAACAGACAAAGAAATGACATCTATGTTTATGAATACCTTGCGGGCTCCATTCTATGATCGAATGATTGGTAATGCATTAAAAAATTTTTCTGATATTATTGTTATTGGCGAAAGAATTGAATATGGGATACAACACGGGAGGTTAGCAGAGGCTACAACTGAATATGATGGAATAAAGAAAGGAACAATATCTAAGAAGAAAGAAGGAGAGGTTCATGCAATTGGTTTTCCTAATTCATGGAAGCACAAATCAATTTTTTGTCAGAGAAAATATGAACAAAACTTTCCGTCATATATAAGCAATGTTTCTCATATCCCTTATAATAGCTATGTACCAGCCCATGCCGTCTCTGAAACTCCAAAACCCATTAACTCAAATTCTCCTCGACCATTTGTACAGAGTCAAGGTAGCAAAACCAATTCAGATACATGGCGGTTTGATCCGATTCCCATAACTTATAAAGAGCTTTTACCTCAACTAATTCAAAATCGACAGTTAGCTCCTATTCCAATGATTCCTATACCATCTCCTTACCCAAAATGGTATGATTCAAATGCTCGATGTGAATATCATGCTGGAGGAGCGGGACACTCAAATGAAAATTGTTTAGCTTTGAGAAGAAAGGTGCAATCTCTAATTAATGCTAGATGCTTGAGCTTCAAAAAATCTAGTGAGAAGCCAAATGTCAATGAAAACCCACGGCCTAATCATGAAAATACAAAAGTGAATGTTGTGGATCGCCTTGTTGAAAAGTGTAAAAATGAAGTCCATGAGATAATGATGCCTATGGAAGAACTTTTTAAAGGTCTTTTTGAAGCAGGATATGTTAGTCAGAAATATCTAGACCCCAATATAAAATATGAAGGATATGATGAAAGCAGACATTGTATATTCCATCAAGGAGTTGTAGGCCACGTTGTCCAACAGTGCAAAAAATTTAGATCCAAAGTACAACAACTTATGGATTCAAAGATACTCACGGTATATAGAGGACAAGGAAAAGACGAGATGAAAGACAGTAAAATATGTGCTTTAATGGATGAAGTTTCAGAAAAGGAATATTCCTTTTTACCAAGACCTTTGACAGTTTTTTTTTATCAAGAAAGTCGTAGTGAGTCAACATTCTACAATCCTAAAAAACTCACAATCCATGTGTCGAGTCCTTTCAAATGTAAGGATCTAAAAGTAGTGCCATGGTGGTATGATTGTCAAGTCATAACAGGTCCTGTTGATAACATTATAGGAATAAGTGGGATAACCCGAAGTGGAAGATGCTACAAACCAGATAATTTAACAGTCCCTTCAGATGGTCTAATACTGCAGCAAGGTAGCAAAAATGAGAAAAGAAATGTGAAGGAGCTTTGCAAAGACCAAGATGTGGAGATGCCTATCATTGCAAAATATATAGAATACAAAAAGTTTGTCACGGATGAGGAAGCAAATGAATTCTTGAAAATAGTAAAACAAAGTGAGTATAAGATCATAGAGCAAATGCATCATACTCCAGCTTGA

Protein sequence

MAAEVQPPLTDKEMTSMFMNTLRAPFYDRMIGNALKNFSDIIVIGERIEYGIQHGRLAEATTEYDGIKKGTISKKKEGEVHAIGFPNSWKHKSIFCQRKYEQNFPSYISNVSHIPYNSYVPAHAVSETPKPINSNSPRPFVQSQGSKTNSDTWRFDPIPITYKELLPQLIQNRQLAPIPMIPIPSPYPKWYDSNARCEYHAGGAGHSNENCLALRRKVQSLINARCLSFKKSSEKPNVNENPRPNHENTKVNVVDRLVEKCKNEVHEIMMPMEELFKGLFEAGYVSQKYLDPNIKYEGYDESRHCIFHQGVVGHVVQQCKKFRSKVQQLMDSKILTVYRGQGKDEMKDSKICALMDEVSEKEYSFLPRPLTVFFYQESRSESTFYNPKKLTIHVSSPFKCKDLKVVPWWYDCQVITGPVDNIIGISGITRSGRCYKPDNLTVPSDGLILQQGSKNEKRNVKELCKDQDVEMPIIAKYIEYKKFVTDEEANEFLKIVKQSEYKIIEQMHHTPA*
BLAST of MELO3C021644 vs. TrEMBL
Match: A0A061EXR3_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_024883 PE=4 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 7.1e-91
Identity = 207/534 (38.76%), Postives = 305/534 (57.12%), Query Frame = 1

Query: 2    AAEVQPPLTDKEMTSMFMNTLRAPFYDRMIGNALKNFSDIIVIGERIEYGIQHGRLAEAT 61
            AA+VQPPLTDKEMT +F+NTLRAPFY+R+IGNA KNF+D+++ GE IE  I+ G++    
Sbjct: 893  AAQVQPPLTDKEMTVLFINTLRAPFYERLIGNATKNFTDLVLSGEIIEGAIKSGKIEGH- 952

Query: 62   TEYDGIKKGTISKKKEGEVHAIGFPNSWKHK-SIFCQRKYEQNFPSYISNVSHIPYNSYV 121
             E    KKG+  +KKEG+V A+   +   H  +++      Q F  +I N++  PY  Y 
Sbjct: 953  -EVASSKKGSTPRKKEGDVQAVAHDSQQAHNFNLYYPYPPYQPFYPHIGNITQNPY-VYQ 1012

Query: 122  P-------AHAVSETP--KPINSNSPRPFVQSQGSKTNSDTWRFDPIPITYKELLPQLIQ 181
            P        + + +TP  +PI S +  P    +G KT  +  +FD IP+ Y  LLPQLI+
Sbjct: 1013 PIPQPTFQTNVLPQTPPPRPIASTN-NPGHGQRGPKTTPERPKFDHIPVPYTTLLPQLIE 1072

Query: 182  NRQLAPIPMIPIPSPYPKWYDSNARCEYHAGGAGHSNENCLALRRKVQSLINARCLSFKK 241
             R L   P+ P+  P+PKWYD NA C+YH G  GHS ENC AL+ KVQ+LI A  L+F K
Sbjct: 1073 KRLLTQTPLEPLRPPFPKWYDPNAHCDYHFGIQGHSTENCTALKHKVQALIKAGLLNFTK 1132

Query: 242  SSEKPNVNENPRPNHENTKVNVV-DRLVEKCKNEVHEIMMPMEELFKGLFEAGYVSQKYL 301
              +  +V+ NP  NH    VN + + ++ + K  + EI  PM+++F+ L +   ++ + +
Sbjct: 1133 -KDSSSVDGNPLLNHGRPTVNAIHEGMIRRVKKGIDEIQTPMDKVFEALSKINAITPEPI 1192

Query: 302  DPNIKYEGYDESRHCIFHQGVVGHVVQQCKKFRSKVQQLMDSKILTVYRGQGKDEMKDSK 361
            D   K  G+D +  C FH G +GH +Q C  FR K+Q+LMDS I+  Y G  ++ +  + 
Sbjct: 1193 D--TKELGHDLAYSCKFHMGAIGHSIQNCDGFRRKLQELMDSSIIEFYEG-AEENLVGTI 1252

Query: 362  ICALMDEVSEKEY-SFLPRPLTVFFYQESRSESTFYNP----KKLTIHVSSPFKCKDLKV 421
                  EV+   + +  P+PLT+ FY+E++S     +P      +TI V SPF  K+ K 
Sbjct: 1253 YGDTPAEVASSSFGANKPKPLTI-FYEENKSPMNDTSPTMIRNGITIEVPSPFPYKNDKA 1312

Query: 422  VPWWYDCQVI-------TGPVDNIIGISGITRSGRCYKPDNLTVPSDGLILQQGSKNEKR 481
            VPW Y+C ++           ++I G+ GITRSGRCY P+       G   Q   +   +
Sbjct: 1313 VPWNYECNILGTASSAPQASFEDITGVGGITRSGRCYSPEVAERVEKGKPAQ--GEGGLK 1372

Query: 482  NVKELCKDQDVEMPIIAKYIEYKKFVTDEEANEFLKIVKQSEYKIIEQMHHTPA 513
                  KDQ V+  ++A   E K  VT++EA EFLK +K SEY ++EQ+   PA
Sbjct: 1373 KADTFSKDQ-VDEFVVAPNNEVKSPVTEKEAGEFLKFIKHSEYSVVEQLTKMPA 1414

BLAST of MELO3C021644 vs. TrEMBL
Match: A0A061E6J4_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_010507 PE=4 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 1.0e-81
Identity = 199/550 (36.18%), Postives = 287/550 (52.18%), Query Frame = 1

Query: 1   MAAEVQPPLTDKEMTSMFMNTLRAPFYDRMIGNALKNFSDIIVIGERIEYGIQHGRLAEA 60
           +A++VQPPLT+KE T MF+NTLRAP+Y+R++G+A KNF+D+++ GE IE  I+ G++   
Sbjct: 275 VASQVQPPLTEKETTVMFVNTLRAPYYERLVGSATKNFADMVISGEMIETAIKQGKIEGG 334

Query: 61  TTEYDGIKKGTISKKKEGEVHAIGFPNSWKHKS-IFCQRKYEQNFPSY--ISNVSHIPY- 120
             +    +KG   K+KEGE   I    S +H+   +   +    +P Y  + N S  PY 
Sbjct: 335 --DMANTRKGGTFKRKEGEAQVI---TSGQHQGGTYNPYQPYLPYPYYPAVHNTSQSPYP 394

Query: 121 ---------NSYVPAHAVSETPKPINSNSPRPFVQSQGSKTNSDT------WR------- 180
                    N Y P + +  TP P  S         Q + +N+ T      WR       
Sbjct: 395 YPLMPNAFPNPY-PYNPIQRTPYPPASTPVTASTTQQTTPSNNHTTGESRGWRNKQEKVQ 454

Query: 181 FDPIPITYKELLPQLIQNRQLAPIPMIPIPSPYPKWYDSNARCEYHAGGAGHSNENCLAL 240
           FDPIPI Y EL  QL+ N  +AP+ + P+  P+P+WYD++A C+YH G  GHS ENC A 
Sbjct: 455 FDPIPIPYAELFTQLVANHLVAPLYIEPLKPPFPRWYDTSAHCDYHYGIEGHSIENCTAF 514

Query: 241 RRKVQSLINARCLSFKKSSEKPNVNENPRPNHENTKVNVVDRLVEKCKNEVHEIMMPMEE 300
           + KVQ LI A  L+F+K  E+ NVN NP PNH    VN ++R V   K  + E+   ME+
Sbjct: 515 KHKVQGLIKAGILNFEKKPEQ-NVNNNPLPNHAGAGVNAIEREV-YVKRNIREVETSMEK 574

Query: 301 LFKGLFEAGYVSQKYLDPNIKYEGYDESRHCIFHQGVVGHVVQQCKKFRSKVQQLMDSKI 360
           +F+ L +A  +      PN+      +   C++H+G VGH +Q C  FR +VQ++MD   
Sbjct: 575 VFEALVKADMLKVWPECPNVNDSRDIQRLCCLYHKGCVGHSIQGCSSFRKEVQRMMDESK 634

Query: 361 LTVYRGQGKDEMKDSKICALMDEVSEKEYSFLPRPLTVFFYQESRSESTFYNPKKLTIHV 420
           +  Y      E  +S +  +     E  +    +PLT+ FY+         N  K+ I V
Sbjct: 635 IEFY-----TEASESAVNMIS---KESTHPMKIKPLTI-FYEPKGELVEDKNHAKMVIEV 694

Query: 421 SSPFKCKDLKVVPWWYDCQVITGPVD-----------NIIGISGITRSGRCYKPDNLTVP 480
             PF  KD K VPW Y+C V                 NI G+ GITRSGRCY P+     
Sbjct: 695 PKPFPYKDNKAVPWNYNCNVQVSEAKKWIAESQDDAANITGVGGITRSGRCYSPEAF--- 754

Query: 481 SDGLILQQGSKNEKRNVKELC-KDQDVEMPIIAKYIEYKKFVTDEEANEFLKIVKQSEYK 513
                  +  KNEK   KE   +++ V+ P        K+ VT++EA EFLK +K SEY 
Sbjct: 755 -------ENLKNEKGGEKEQSPREEKVQPP--ESTDGSKRSVTEKEAAEFLKFIKHSEYN 795

BLAST of MELO3C021644 vs. TrEMBL
Match: A0A061E378_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_008095 PE=4 SV=1)

HSP 1 Score: 303.9 bits (777), Expect = 3.7e-79
Identity = 194/547 (35.47%), Postives = 290/547 (53.02%), Query Frame = 1

Query: 2    AAEVQPPLTDKEMTSMFMNTLRAPFYDRMIGNALKNFSDIIVIGERIEYGIQHGRLAEAT 61
            AA+VQPPLTDKEMT +F+NTLRAPFY+R+IGNA+KNF+D+++ GE IE  I+ G++    
Sbjct: 954  AAQVQPPLTDKEMTVLFINTLRAPFYERLIGNAMKNFADLVLSGEIIEGAIKSGKI--EG 1013

Query: 62   TEYDGIKKGTISKKKEGEVHAIGFPNSWKH--KSIFCQRKYEQNFPSYISNVSHIPYNSY 121
             E    KKG+  KKKEG+V A+   +   H     +    Y+  +P +I NV+  PY  Y
Sbjct: 1014 HEVASSKKGSTPKKKEGDVQAVAHDSQQAHNFNPYYPYPPYQPFYP-HIGNVTQNPY-VY 1073

Query: 122  VP-------AHAVSET--PKPINSNSPRPFVQSQGSKTNSDTWRFDPIPITYKELLPQLI 181
             P        + + +T  P+PI S +  P    +G KT  +  +FDPIP+ Y  LLPQLI
Sbjct: 1074 QPVPQPTFQTNVLPQTPPPRPIASTN-NPGHGQRGPKTTPERPKFDPIPVPYTTLLPQLI 1133

Query: 182  QNRQLAPIPMIPIPSPYPKWYD-------------SNARCEYHAGGAGHSNENCLALRRK 241
            +NR +A  P+ P+  P+PKWYD                                   + K
Sbjct: 1134 ENRLIARTPLEPLRPPFPKWYDPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKHK 1193

Query: 242  VQSLINARCLSFKKSSEKPNVNENPRPNHENTKVNVV-DRLVEKCKNEVHEIMMPMEELF 301
            VQ+LI A  L+F K  +  +V+ NP PNH    VN + + ++ + K  + EI  PM+++F
Sbjct: 1194 VQALIKAGLLNFAK-KDSSSVDGNPLPNHGRPTVNAIHEGMIRRVKKGIDEIQTPMDKVF 1253

Query: 302  KGLFEAGYVSQKYLDPNIKYEGYDESRHCIFHQGVVGHVVQQCKKFRSKVQQLMDSKILT 361
            + L +   ++ + +D   K  G+D +  C FH G +GH +Q C  FR K+Q+LMDS ++ 
Sbjct: 1254 EALSKINAITPEPID--TKEFGHDLAYSCKFHMGAIGHSIQNCDGFRRKLQELMDSSVIE 1313

Query: 362  VYRGQGKDEMKDSKICALMDEVSEKEYSFLPRPLTVFFYQESRSESTFYNP----KKLTI 421
             Y G  ++ +         +  S    +  P+PLT+ FY+E++S     +P      +TI
Sbjct: 1314 FYEGAEENLVGTINRDTPAEVASSSFGANKPKPLTI-FYEENKSPMNDTSPTMSRNGITI 1373

Query: 422  HVSSPFKCKDLKVVPWWYDCQVI-------TGPVDNIIGISGITRSGRCYKPDNLTVPSD 481
             V SPF  K  K VPW Y+C ++           ++I G+ GITRSGRCY P+       
Sbjct: 1374 EVPSPFPYKSDKAVPWNYECNILGTVSSTPQASFEDITGVGGITRSGRCYSPEAAEKVGK 1433

Query: 482  GLILQQGSKNEKRNVKELCKDQDVEMPIIAKYIEYKKFVTDEEANEFLKIVKQSEYKIIE 513
            G   Q   +   +      K+Q V+  ++A   E K  VT++E  EFLK +K SEY ++E
Sbjct: 1434 GKPAQ--GEGGLKKADTFSKNQ-VDESVVAPNNEVKNPVTEKEEGEFLKFIKHSEYSVVE 1488

BLAST of MELO3C021644 vs. TrEMBL
Match: A0A061FBD9_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_033215 PE=4 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 1.5e-69
Identity = 157/381 (41.21%), Postives = 223/381 (58.53%), Query Frame = 1

Query: 2   AAEVQPPLTDKEMTSMFMNTLRAPFYDRMIGNALKNFSDIIVIGERIEYGIQHGRLAEAT 61
           AA+VQPPLTDKEMT +F+NTLRAPFY+R+IGNA KNF+D+++ GE IE  I+ G++    
Sbjct: 234 AAQVQPPLTDKEMTVLFINTLRAPFYERLIGNATKNFADLVLSGEIIEGAIKSGKIEGH- 293

Query: 62  TEYDGIKKGTISKKKEGEVHAIGFPNSWKHK--SIFCQRKYEQNFPSYISNVSHIPYNSY 121
            E    KK +  KKKEG+V A+   +   H     +    Y+  +P +I N++  PY  Y
Sbjct: 294 -EVANSKKWSTPKKKEGDVQAVAHDSQQAHNFNPYYPYPPYQPFYP-HIGNITQNPY-VY 353

Query: 122 VPA-------HAVSETPKPIN-SNSPRPFVQSQGSKTNSDTWRFDPIPITYKELLPQLIQ 181
            P        + + +TP P   +++  P    +G KT  +  +FDPIP+ Y  LLPQLI+
Sbjct: 354 QPVPQPTFQTNVLPQTPPPRPVASTNNPGHGQRGPKTTPERPKFDPIPVPYTTLLPQLIE 413

Query: 182 NRQLAPIPMIPIPSPYPKWYDSNARCEYHAGGAGHSNENCLALRRKVQSLINARCLSFKK 241
           NR LA  P+ P+  P+PKWYD NA C+YH G  GHS ENC AL+ KVQ+LI A  L+F K
Sbjct: 414 NRLLARTPLEPLRPPFPKWYDPNAHCDYHFGIQGHSTENCTALKHKVQALIKAGLLNFAK 473

Query: 242 SSEKPNVNENPRPNHENTKVNVV-DRLVEKCKNEVHEIMMPMEELFKGLFEAGYVSQKYL 301
             +  NV+ NP PNH    VN + +R++ + K  V+EI  PM+ +F+ L +   ++ K +
Sbjct: 474 -KDNSNVDGNPLPNHGGPTVNAIHERMIRRVKKNVNEIRTPMDRVFEALSKIKAITPKPI 533

Query: 302 DPNIKYEGYDESRHCIFHQGVVGHVVQQCKKFRSKVQQLMDSKILTVYRGQGKDEMKDSK 361
           +  IK  G+D +  C FH GVVGH +Q C  FR K+Q+LMD   +  Y    ++E     
Sbjct: 534 E--IKEVGHDLTLSCKFHMGVVGHSIQNCDGFRLKLQELMDLSEIEFYEESEEEEFWKKT 593

BLAST of MELO3C021644 vs. TrEMBL
Match: A0A061E2I6_THECC (RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein OS=Theobroma cacao GN=TCM_007834 PE=4 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 2.2e-68
Identity = 177/514 (34.44%), Postives = 264/514 (51.36%), Query Frame = 1

Query: 26  FYDRMIGNALKNF-----SDIIVIGERIEYGIQHGRLAEATTEYDGIKKGTISKKKEGEV 85
           F D + G+  + +     + +I+ GE IE  I+ G++     E    KKG+  +KKEG+V
Sbjct: 221 FQDSLTGSVARWYIQLDRNHLILSGEIIEGAIKSGKIEGH--EVASSKKGSTHRKKEGDV 280

Query: 86  HAIGFPNSWKHK--SIFCQRKYEQNFPSYISNVSHIPYNSYVPA-------HAVSETPKP 145
            A+   +   H     +    Y+  +P +I NV+  PY  Y P        + + +TP P
Sbjct: 281 QAVAHDSQQAHNFNPYYPYPPYQPFYP-HIGNVTQNPY-VYQPVPQPTFQTNVLPQTPPP 340

Query: 146 IN-SNSPRPFVQSQGSKTNSDTWRFDPIPITYKELLPQLIQNRQLAPIPMIPIPSPYPKW 205
              +++  P    +G KT  +  +FDPIP+ Y  LLPQLI+NR LA  P+ P+  P+PKW
Sbjct: 341 RPVASTNNPGHGQRGPKTTPERPKFDPIPVPYTTLLPQLIENRLLARTPLEPLRPPFPKW 400

Query: 206 YDSNARCEYHAGGAGHSNENCLALRRKVQSLINARCLSFKKSSEKPNVNENPRPNHENTK 265
           YD NA C+YH G  GHS ENC AL+ KVQ+LI A  L+F K  +  NV+ NP PNH    
Sbjct: 401 YDPNAHCDYHFGIQGHSTENCTALKHKVQALIKAGLLNFAK-KDSSNVDGNPLPNHGRPT 460

Query: 266 VNVV-DRLVEKCKNEVHEIMMPMEELFKGLFEAGYVSQKYLDPNIKYEGYDESRHCIFHQ 325
           VN + + ++ + K  + EI MPM+++F+ L +   ++ + +D   K  G+D +  C FH 
Sbjct: 461 VNAIHEGMIRRVKKGIDEIQMPMDKVFEALSKINAITPEPID--TKELGHDLTYSCKFHM 520

Query: 326 GVVGHVVQQCKKFRSKVQQLMDSKILTVYRGQGKDEMKDSKICALMDEVSEKEYSFLPRP 385
           G +GH +Q C  FR    ++  S       G  K                       P+P
Sbjct: 521 GAIGHSIQNCDGFRHTPAEVASSSF-----GANK-----------------------PKP 580

Query: 386 LTVFFYQESRSESTFYNP----KKLTIHVSSPFKCKDLKVVPWWYDCQVI-------TGP 445
           LT+ FY+E++S     +P      +TI V SPF  K  K VPW Y C +           
Sbjct: 581 LTI-FYEENKSPMNDTSPTMIRNGITIEVPSPFPYKSDKAVPWNYQCNISGTASSAPQAS 640

Query: 446 VDNIIGISGITRSGRCYKPDNLTVPSDGLILQQGSKNEKRNVKELCKDQDVEMPIIAKYI 505
            +++ G+ GITRSGRCY P+ +     G   Q+    +K +     KDQ V+  ++A   
Sbjct: 641 FEDLTGVGGITRSGRCYSPEVVERVGKGKPAQEEGGLKKADT--FSKDQ-VDESVVAPNN 695

Query: 506 EYKKFVTDEEANEFLKIVKQSEYKIIEQMHHTPA 513
           E K  VT++EA EFLK +K SEY ++EQ+   PA
Sbjct: 701 EVKNPVTEKEAGEFLKFIKHSEYSVVEQLTKMPA 695

BLAST of MELO3C021644 vs. NCBI nr
Match: gi|659129309|ref|XP_008464622.1| (PREDICTED: uncharacterized protein LOC103502461 [Cucumis melo])

HSP 1 Score: 449.5 bits (1155), Expect = 7.8e-123
Identity = 226/298 (75.84%), Postives = 243/298 (81.54%), Query Frame = 1

Query: 1   MAAEVQPPLTDKEMTSMFMNTLRAPFYDRMIGNALKNFSDIIVIGERIEYGIQHGRLAEA 60
           MAAEVQPPLTDKEMTS+                              IEYGI+HGRLA+A
Sbjct: 226 MAAEVQPPLTDKEMTSI------------------------------IEYGIKHGRLAKA 285

Query: 61  TTEYDGIKKGTISKKKEGEVHAIGFPNSWKHKSIFCQRKYEQNFPSYISNVSHIPYNSYV 120
           TTEY  IKKGTISKKKE EVHAIGFPNS KHKSIF QRKYEQNFPSYISNVS+IPYNSYV
Sbjct: 286 TTEYGRIKKGTISKKKEEEVHAIGFPNSGKHKSIFGQRKYEQNFPSYISNVSYIPYNSYV 345

Query: 121 PAHAVSETPKPINSNSPRPFVQSQGSKTNSDTWRFDPIPITYKELLPQLIQNRQLAPIPM 180
            AH VSETPKP+NSNSP+PFV+ QGSK NSDTWRFDPIP+TY ELL QLIQNRQLA IPM
Sbjct: 346 LAHTVSETPKPVNSNSPQPFVKRQGSKINSDTWRFDPIPMTYIELLRQLIQNRQLALIPM 405

Query: 181 IPIPSPYPKWYDSNARCEYHAGGAGHSNENCLALRRKVQSLINARCLSFKKSSEKPNVNE 240
           IPI  PYPKW+DSNARC+YHAGG GHS ENCLAL+RKVQSLINA  LSFKKS EKPNVNE
Sbjct: 406 IPIRPPYPKWFDSNARCDYHAGGVGHSTENCLALKRKVQSLINAGWLSFKKSGEKPNVNE 465

Query: 241 NPRPNHENTKVNVVDRLVEKCKNEVHEIMMPMEELFKGLFEAGYVSQKYLDPNIKYEG 299
           NP  +HEN KVNVVD LVEKCK+EVHEI+MPME LF+GLFEAGYVS +YLDPNI+YEG
Sbjct: 466 NPLSDHENPKVNVVDSLVEKCKSEVHEIVMPMEALFEGLFEAGYVSHEYLDPNIRYEG 493

BLAST of MELO3C021644 vs. NCBI nr
Match: gi|590636870|ref|XP_007028966.1| (Uncharacterized protein TCM_024883 [Theobroma cacao])

HSP 1 Score: 342.8 bits (878), Expect = 1.0e-90
Identity = 207/534 (38.76%), Postives = 305/534 (57.12%), Query Frame = 1

Query: 2    AAEVQPPLTDKEMTSMFMNTLRAPFYDRMIGNALKNFSDIIVIGERIEYGIQHGRLAEAT 61
            AA+VQPPLTDKEMT +F+NTLRAPFY+R+IGNA KNF+D+++ GE IE  I+ G++    
Sbjct: 893  AAQVQPPLTDKEMTVLFINTLRAPFYERLIGNATKNFTDLVLSGEIIEGAIKSGKIEGH- 952

Query: 62   TEYDGIKKGTISKKKEGEVHAIGFPNSWKHK-SIFCQRKYEQNFPSYISNVSHIPYNSYV 121
             E    KKG+  +KKEG+V A+   +   H  +++      Q F  +I N++  PY  Y 
Sbjct: 953  -EVASSKKGSTPRKKEGDVQAVAHDSQQAHNFNLYYPYPPYQPFYPHIGNITQNPY-VYQ 1012

Query: 122  P-------AHAVSETP--KPINSNSPRPFVQSQGSKTNSDTWRFDPIPITYKELLPQLIQ 181
            P        + + +TP  +PI S +  P    +G KT  +  +FD IP+ Y  LLPQLI+
Sbjct: 1013 PIPQPTFQTNVLPQTPPPRPIASTN-NPGHGQRGPKTTPERPKFDHIPVPYTTLLPQLIE 1072

Query: 182  NRQLAPIPMIPIPSPYPKWYDSNARCEYHAGGAGHSNENCLALRRKVQSLINARCLSFKK 241
             R L   P+ P+  P+PKWYD NA C+YH G  GHS ENC AL+ KVQ+LI A  L+F K
Sbjct: 1073 KRLLTQTPLEPLRPPFPKWYDPNAHCDYHFGIQGHSTENCTALKHKVQALIKAGLLNFTK 1132

Query: 242  SSEKPNVNENPRPNHENTKVNVV-DRLVEKCKNEVHEIMMPMEELFKGLFEAGYVSQKYL 301
              +  +V+ NP  NH    VN + + ++ + K  + EI  PM+++F+ L +   ++ + +
Sbjct: 1133 -KDSSSVDGNPLLNHGRPTVNAIHEGMIRRVKKGIDEIQTPMDKVFEALSKINAITPEPI 1192

Query: 302  DPNIKYEGYDESRHCIFHQGVVGHVVQQCKKFRSKVQQLMDSKILTVYRGQGKDEMKDSK 361
            D   K  G+D +  C FH G +GH +Q C  FR K+Q+LMDS I+  Y G  ++ +  + 
Sbjct: 1193 D--TKELGHDLAYSCKFHMGAIGHSIQNCDGFRRKLQELMDSSIIEFYEG-AEENLVGTI 1252

Query: 362  ICALMDEVSEKEY-SFLPRPLTVFFYQESRSESTFYNP----KKLTIHVSSPFKCKDLKV 421
                  EV+   + +  P+PLT+ FY+E++S     +P      +TI V SPF  K+ K 
Sbjct: 1253 YGDTPAEVASSSFGANKPKPLTI-FYEENKSPMNDTSPTMIRNGITIEVPSPFPYKNDKA 1312

Query: 422  VPWWYDCQVI-------TGPVDNIIGISGITRSGRCYKPDNLTVPSDGLILQQGSKNEKR 481
            VPW Y+C ++           ++I G+ GITRSGRCY P+       G   Q   +   +
Sbjct: 1313 VPWNYECNILGTASSAPQASFEDITGVGGITRSGRCYSPEVAERVEKGKPAQ--GEGGLK 1372

Query: 482  NVKELCKDQDVEMPIIAKYIEYKKFVTDEEANEFLKIVKQSEYKIIEQMHHTPA 513
                  KDQ V+  ++A   E K  VT++EA EFLK +K SEY ++EQ+   PA
Sbjct: 1373 KADTFSKDQ-VDEFVVAPNNEVKSPVTEKEAGEFLKFIKHSEYSVVEQLTKMPA 1414

BLAST of MELO3C021644 vs. NCBI nr
Match: gi|659123179|ref|XP_008461534.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103500105 [Cucumis melo])

HSP 1 Score: 326.2 bits (835), Expect = 9.9e-86
Identity = 157/180 (87.22%), Postives = 167/180 (92.78%), Query Frame = 1

Query: 1   MAAEVQPPLTDKEMTSMFMNTLRAPFYDRMIGNALKNFSDIIVIGERIEYGIQHGRLAEA 60
           +A E+QPPLT+KEMTSMFMNTLRAPFY+RMIGNA  NFSDIIVIGERIEYGI+HGRL EA
Sbjct: 195 IAVELQPPLTEKEMTSMFMNTLRAPFYERMIGNASTNFSDIIVIGERIEYGIKHGRLVEA 254

Query: 61  TTEYDGIKKGTISKKKEGEVHAIGFPNSWKHKSIFCQRKYEQNFPSYISNVSHIPYNSYV 120
           TTEY GIKKGTISKKKEGEVHAIGFPN  KHKSIF QRKYEQNFPSYISNV HIPYNSY+
Sbjct: 255 TTEYGGIKKGTISKKKEGEVHAIGFPNLGKHKSIFGQRKYEQNFPSYISNVYHIPYNSYI 314

Query: 121 PAHAVSETPKPINSNSPRPFVQSQGSKTNSDTWRFDPIPITYKELLPQLIQNRQLAPIPM 180
            AH VSETPKP+NSNSPRPFVQ QGSKTNS+TWRFDPIP+TY ELLPQLIQNRQLAPIP+
Sbjct: 315 SAHTVSETPKPVNSNSPRPFVQGQGSKTNSNTWRFDPIPMTYIELLPQLIQNRQLAPIPI 374

BLAST of MELO3C021644 vs. NCBI nr
Match: gi|659131240|ref|XP_008465582.1| (PREDICTED: uncharacterized protein LOC103503218 [Cucumis melo])

HSP 1 Score: 312.8 bits (800), Expect = 1.1e-81
Identity = 151/168 (89.88%), Postives = 157/168 (93.45%), Query Frame = 1

Query: 4   EVQPPLTDKEMTSMFMNTLRAPFYDRMIGNALKNFSDIIVIGERIEYGIQHGRLAEATTE 63
           EVQPPLTDKEMT +FMNTLRAPFY+RMIGNA  NFSDIIVIGERIEYGI+HGRLAEATTE
Sbjct: 216 EVQPPLTDKEMTYVFMNTLRAPFYERMIGNASTNFSDIIVIGERIEYGIKHGRLAEATTE 275

Query: 64  YDGIKKGTISKKKEGEVHAIGFPNSWKHKSIFCQRKYEQNFPSYISNVSHIPYNSYVPAH 123
           Y GIKKGTISKKKEGEVH IGFPNS KHKSIF QRKYEQNFPSYISNVSHIPYNSYVPAH
Sbjct: 276 YGGIKKGTISKKKEGEVHGIGFPNSGKHKSIFGQRKYEQNFPSYISNVSHIPYNSYVPAH 335

Query: 124 AVSETPKPINSNSPRPFVQSQGSKTNSDTWRFDPIPITYKELLPQLIQ 172
            VSET KP+NSNSPRPFVQ QGSKTNSDTWRFDPIP+TY ELLPQLI+
Sbjct: 336 TVSETSKPVNSNSPRPFVQGQGSKTNSDTWRFDPIPMTYTELLPQLIK 383

BLAST of MELO3C021644 vs. NCBI nr
Match: gi|590695072|ref|XP_007044788.1| (Uncharacterized protein TCM_010507 [Theobroma cacao])

HSP 1 Score: 312.4 bits (799), Expect = 1.5e-81
Identity = 199/550 (36.18%), Postives = 287/550 (52.18%), Query Frame = 1

Query: 1   MAAEVQPPLTDKEMTSMFMNTLRAPFYDRMIGNALKNFSDIIVIGERIEYGIQHGRLAEA 60
           +A++VQPPLT+KE T MF+NTLRAP+Y+R++G+A KNF+D+++ GE IE  I+ G++   
Sbjct: 275 VASQVQPPLTEKETTVMFVNTLRAPYYERLVGSATKNFADMVISGEMIETAIKQGKIEGG 334

Query: 61  TTEYDGIKKGTISKKKEGEVHAIGFPNSWKHKS-IFCQRKYEQNFPSY--ISNVSHIPY- 120
             +    +KG   K+KEGE   I    S +H+   +   +    +P Y  + N S  PY 
Sbjct: 335 --DMANTRKGGTFKRKEGEAQVI---TSGQHQGGTYNPYQPYLPYPYYPAVHNTSQSPYP 394

Query: 121 ---------NSYVPAHAVSETPKPINSNSPRPFVQSQGSKTNSDT------WR------- 180
                    N Y P + +  TP P  S         Q + +N+ T      WR       
Sbjct: 395 YPLMPNAFPNPY-PYNPIQRTPYPPASTPVTASTTQQTTPSNNHTTGESRGWRNKQEKVQ 454

Query: 181 FDPIPITYKELLPQLIQNRQLAPIPMIPIPSPYPKWYDSNARCEYHAGGAGHSNENCLAL 240
           FDPIPI Y EL  QL+ N  +AP+ + P+  P+P+WYD++A C+YH G  GHS ENC A 
Sbjct: 455 FDPIPIPYAELFTQLVANHLVAPLYIEPLKPPFPRWYDTSAHCDYHYGIEGHSIENCTAF 514

Query: 241 RRKVQSLINARCLSFKKSSEKPNVNENPRPNHENTKVNVVDRLVEKCKNEVHEIMMPMEE 300
           + KVQ LI A  L+F+K  E+ NVN NP PNH    VN ++R V   K  + E+   ME+
Sbjct: 515 KHKVQGLIKAGILNFEKKPEQ-NVNNNPLPNHAGAGVNAIEREV-YVKRNIREVETSMEK 574

Query: 301 LFKGLFEAGYVSQKYLDPNIKYEGYDESRHCIFHQGVVGHVVQQCKKFRSKVQQLMDSKI 360
           +F+ L +A  +      PN+      +   C++H+G VGH +Q C  FR +VQ++MD   
Sbjct: 575 VFEALVKADMLKVWPECPNVNDSRDIQRLCCLYHKGCVGHSIQGCSSFRKEVQRMMDESK 634

Query: 361 LTVYRGQGKDEMKDSKICALMDEVSEKEYSFLPRPLTVFFYQESRSESTFYNPKKLTIHV 420
           +  Y      E  +S +  +     E  +    +PLT+ FY+         N  K+ I V
Sbjct: 635 IEFY-----TEASESAVNMIS---KESTHPMKIKPLTI-FYEPKGELVEDKNHAKMVIEV 694

Query: 421 SSPFKCKDLKVVPWWYDCQVITGPVD-----------NIIGISGITRSGRCYKPDNLTVP 480
             PF  KD K VPW Y+C V                 NI G+ GITRSGRCY P+     
Sbjct: 695 PKPFPYKDNKAVPWNYNCNVQVSEAKKWIAESQDDAANITGVGGITRSGRCYSPEAF--- 754

Query: 481 SDGLILQQGSKNEKRNVKELC-KDQDVEMPIIAKYIEYKKFVTDEEANEFLKIVKQSEYK 513
                  +  KNEK   KE   +++ V+ P        K+ VT++EA EFLK +K SEY 
Sbjct: 755 -------ENLKNEKGGEKEQSPREEKVQPP--ESTDGSKRSVTEKEAAEFLKFIKHSEYN 795

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A061EXR3_THECC7.1e-9138.76Uncharacterized protein OS=Theobroma cacao GN=TCM_024883 PE=4 SV=1[more]
A0A061E6J4_THECC1.0e-8136.18Uncharacterized protein OS=Theobroma cacao GN=TCM_010507 PE=4 SV=1[more]
A0A061E378_THECC3.7e-7935.47Uncharacterized protein OS=Theobroma cacao GN=TCM_008095 PE=4 SV=1[more]
A0A061FBD9_THECC1.5e-6941.21Uncharacterized protein OS=Theobroma cacao GN=TCM_033215 PE=4 SV=1[more]
A0A061E2I6_THECC2.2e-6834.44RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein... [more]
Match NameE-valueIdentityDescription
gi|659129309|ref|XP_008464622.1|7.8e-12375.84PREDICTED: uncharacterized protein LOC103502461 [Cucumis melo][more]
gi|590636870|ref|XP_007028966.1|1.0e-9038.76Uncharacterized protein TCM_024883 [Theobroma cacao][more]
gi|659123179|ref|XP_008461534.1|9.9e-8687.22PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103500105 [Cucumis me... [more]
gi|659131240|ref|XP_008465582.1|1.1e-8189.88PREDICTED: uncharacterized protein LOC103503218 [Cucumis melo][more]
gi|590695072|ref|XP_007044788.1|1.5e-8136.18Uncharacterized protein TCM_010507 [Theobroma cacao][more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MELO3C021644T1MELO3C021644T1mRNA