ClCG04G001080 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG04G001080
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CG_Chr04: 3363474 .. 3365202 (-)
RNA-Seq Expression: ClCG04G001080
Synteny: ClCG04G001080
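
The Location field above packs the sequence ID, start, end, and strand into one string. A minimal sketch of how it could be parsed follows. The helper name parse_location is hypothetical, and the length calculation assumes 1-based, inclusive coordinates (the GFF3 convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into its parts.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr04: 3363474 .. 3365202 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 1729 bases on the minus strand of CG_Chr04.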
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCAACGTCTTTCCTTCTGTCAACTCAAATGGAGTATTTACTAACCCTACCTACAACAGCCCACCTCTAAATCAACTTTTAAACCAAATTACTTCCATCAAACTAGATAGGAGTAATTTTCTATTGTGGAAGAACTTAACCCATCCTATTTTGAAGAGTTATCGGCTGTTCGGACATCTCATTAGAGAGAAGGAATCTCCCCCTATGTTTGTTCAACCGGAAGCTACTGCCGTTTTGTCGCCTTTGCCAGCGACACCAGCTGCTCCCTCGTATGTCACCGTCACTGCACAATGAGGGATGACAGGGACTGAAACTCCTGTAGCGTCAAGCTCAGTAACCACGGCCCAAACCATTATTAATCCTCACTATGAATCGTGGATAGCAAATGATCAACTACTGCTGGGATGGCTATACAACTCCATGATGCCCGAAGTTGCAACTCAAGTCATGGGTCGTGATACGGCCAAAGGGGTTGTGGAATGCATTGCAAGAACTGTTCAGTGTGCAATCTCGCGCTGAGGAAAACTATCTAAGGCAAGTTTTTCAAACTACCCGTAAGGGATCTTTAAAAATGGTTGATTATCTCAGAACTATGAAAACACATGCTGATAATCTTGAACAAGTCGAAAGTCCTGTTGCGCTTAGAGCCTTAGTCTCTCAAGTTCTCCTAGGCCTAGATGAAGAGTACAGCCCGATAGTTGCTACAATCCAAGGAAAAATGGATGTCAAGTGGACTAACTTGCAATATGAGCTATTGATTTAGAACCGCCTTGAACATCAAAACACACAAACGGTAAGCACACTTCGTGAATCCACCTCAGTTAACATGGCAACAAGCAAGAACCCTATCTCTACCAACCAAAGATAGAATCAAAATTCCTTCAATGGCAATAGACCACCTCAACAGAACAATGGGCAATGCAGTAATGGTCGAGGAAGAGGCAGAGGGCGAGGAAATAACCGTCCAATCTTTCAAGTTTGCTAAAAAATTGGACATACTGCTCTACAGTGTTACCAACTGTACAACAAATCGTTTGGGCAAAATCAACATGGTGAGCAAGGGAACCGAGGCCAAAATTTTCAAGGGAAGCAGATTCCAAACCAAGGCCATAATTTTCCAGGACCGACACCTAACCAAGCATTCATGGCAACTCAGAATACCAACCCATTTGTTGTTACACCTGAATCTGTCATTGATTCAGGATGGTATGTCAACAGTGGGGCTTCAAACCATGTTGCCAATGATCACAACAGTCTAACCAATGCATATGAATATGGAGGTAAAGAAAGAGTAATTGTTGGTAATGGTAGGGCTCTACCTATAACTCACATTGGTTCTTGCCATATTCCTGCGGATGTTGCCTTATTAAATTTACATAATATGTTATGTGTGCCTGCTATTGCAAAGAATCTCATAAGTGTCTCTAAGCTTGTTCAAAATAATAAAGTTTTCATTGAATTTCACTCTAGCTCTTGTTTCATTAAGGATATCAACACGGGTCAAATGGTGCTGAAGGGGGAGCTTGATGATGGGCTGTACATATTTGAGAAAACTAAAGCCACTGGTAGTGCTTCAAATGTGGGAAAAACCAACCTGAAGTCAATTGGAAGAATAATGGCGTGCTCTACTGAAGTAAAAAATATTTGTGTAATTTTTGTGTCTAAGTCTATTTTGCATCAAATGCTTGGACATCCTTCCTCAAAAGTCCTGAATGATGTTGTTTAA

mRNA sequence

ATGGCCAACGTCTTTCCTTCTGTCAACTCAAATGGAGTATTTACTAACCCTACCTACAACAGCCCACCTCTAAATCAACTTTTAAACCAAATTACTTCCATCAAACTAGATAGGAGTAATTTTCTATTGTGGAAGAACTTAACCCATCCTATTTTGAAGAGTTATCGGCTGTTCGGACATCTCATTAGAGAGAAGGAATCTCCCCCTATGTTTGTTCAACCGGAAGCTACTGCCGTTTTGTCGCCTTTGCCAGCGACACCAGCTGCTCCCTCCTCAGTAACCACGGCCCAAACCATTATTAATCCTCACTATGAATCGTGGATAGCAAATGATCAACTACTGCTGGGATGGCTATACAACTCCATGATGCCCGAAGTTGCAACTCAAGTCATGGGTCGTGATACGGCCAAAGGGGTTGTGGAATGCATTGCAAGAACTGTTCAGCAAGTTTTTCAAACTACCCGTAAGGGATCTTTAAAAATGGTTGATTATCTCAGAACTATGAAAACACATGCTGATAATCTTGAACAAGTCGAAAGTCCTGTTGCGCTTAGAGCCTTAGTCTCTCAAGTTCTCCTAGGCCTAGATGAAGAGTACAGCCCGATAGTTGCTACAATCCAAGGAAAAATGGATTTAACATGGCAACAAGCAAGAACCCTATCTCTACCAACCAAAGATAGAATCAAAATTCCTTCAATGGCAATAGACCACCTCAACAGAACAATGGGCAATGCAGTAATGGTCGAGGAAGAGGCAGAGGGCGAGGAAATAACCTGTTACCAACTGTACAACAAATCGTTTGGGCAAAATCAACATGGTGAGCAAGGGAACCGAGGCCAAAATTTTCAAGGGAAGCAGATTCCAAACCAAGGCCATAATTTTCCAGGACCGACACCTAACCAAGCATTCATGGCAACTCAGAATACCAACCCATTTGTTGTTACACCTGAATCTGTCATTGATTCAGGATGGTATGTCAACAGTGGGGCTTCAAACCATGTTGCCAATGATCACAACAGTCTAACCAATGCATATGAATATGGAGGTAAAGAAAGAGTAATTGTTGGTAATGGTAGGGCTCTACCTATAACTCACATTGGTTCTTGCCATATTCCTGCGGATGATATCAACACGGGTCAAATGGTGCTGAAGGGGGAGCTTGATGATGGGCTGTACATATTTGAGAAAACTAAAGCCACTGGTAGTGCTTCAAATGTGGGAAAAACCAACCTGAAGTCAATTGGAAGAATAATGGCGTGCTCTACTGAATCTATTTTGCATCAAATGCTTGGACATCCTTCCTCAAAAGTCCTGAATGATGTTGTTTAA

Coding sequence (CDS)

ATGGCCAACGTCTTTCCTTCTGTCAACTCAAATGGAGTATTTACTAACCCTACCTACAACAGCCCACCTCTAAATCAACTTTTAAACCAAATTACTTCCATCAAACTAGATAGGAGTAATTTTCTATTGTGGAAGAACTTAACCCATCCTATTTTGAAGAGTTATCGGCTGTTCGGACATCTCATTAGAGAGAAGGAATCTCCCCCTATGTTTGTTCAACCGGAAGCTACTGCCGTTTTGTCGCCTTTGCCAGCGACACCAGCTGCTCCCTCCTCAGTAACCACGGCCCAAACCATTATTAATCCTCACTATGAATCGTGGATAGCAAATGATCAACTACTGCTGGGATGGCTATACAACTCCATGATGCCCGAAGTTGCAACTCAAGTCATGGGTCGTGATACGGCCAAAGGGGTTGTGGAATGCATTGCAAGAACTGTTCAGCAAGTTTTTCAAACTACCCGTAAGGGATCTTTAAAAATGGTTGATTATCTCAGAACTATGAAAACACATGCTGATAATCTTGAACAAGTCGAAAGTCCTGTTGCGCTTAGAGCCTTAGTCTCTCAAGTTCTCCTAGGCCTAGATGAAGAGTACAGCCCGATAGTTGCTACAATCCAAGGAAAAATGGATTTAACATGGCAACAAGCAAGAACCCTATCTCTACCAACCAAAGATAGAATCAAAATTCCTTCAATGGCAATAGACCACCTCAACAGAACAATGGGCAATGCAGTAATGGTCGAGGAAGAGGCAGAGGGCGAGGAAATAACCTGTTACCAACTGTACAACAAATCGTTTGGGCAAAATCAACATGGTGAGCAAGGGAACCGAGGCCAAAATTTTCAAGGGAAGCAGATTCCAAACCAAGGCCATAATTTTCCAGGACCGACACCTAACCAAGCATTCATGGCAACTCAGAATACCAACCCATTTGTTGTTACACCTGAATCTGTCATTGATTCAGGATGGTATGTCAACAGTGGGGCTTCAAACCATGTTGCCAATGATCACAACAGTCTAACCAATGCATATGAATATGGAGGTAAAGAAAGAGTAATTGTTGGTAATGGTAGGGCTCTACCTATAACTCACATTGGTTCTTGCCATATTCCTGCGGATGATATCAACACGGGTCAAATGGTGCTGAAGGGGGAGCTTGATGATGGGCTGTACATATTTGAGAAAACTAAAGCCACTGGTAGTGCTTCAAATGTGGGAAAAACCAACCTGAAGTCAATTGGAAGAATAATGGCGTGCTCTACTGAATCTATTTTGCATCAAATGCTTGGACATCCTTCCTCAAAAGTCCTGAATGATGTTGTTTAA

Protein sequence

MANVFPSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECIARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQGKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEKTKATGSASNVGKTNLKSIGRIMACSTESILHQMLGHPSSKVLNDVV
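
The protein sequence above corresponds to a translation of the coding sequence. A minimal sketch of that step follows, assuming the standard nuclear genetic code (NCBI translation table 1). The function name translate is illustrative and is not the annotation pipeline's actual tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last codons of the CDS shown above.
print(translate("ATGGCCAACGTCTTTCCTTCTGTC"))
print(translate("AATGATGTTGTTTAA"))
```

The two fragments translate to MANVFPSV and NDVV, matching the start and end of the listed protein.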
Homology
BLAST of ClCG04G001080 vs. NCBI nr
Match: XP_022151683.1 (uncharacterized protein LOC111019598 [Momordica charantia])

HSP 1 Score: 268.9 bits (686), Expect = 8.0e-68
Identity = 183/498 (36.75%), Positives = 255/498 (51.20%), Query Frame = 0

Query: 6   PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREK 65
           P V    V +   + SPPLNQLLNQITSIK+DR NFLLW+NL  PIL+SY+LF +L  +K
Sbjct: 9   PIVTPPAVVSGAVFTSPPLNQLLNQITSIKMDRGNFLLWQNLALPILRSYKLFDYLTGDK 68

Query: 66  ESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPE 125
             PP  + P  T        T    S+ + +   +NP YE+WI  D+LLLGWLYNSM  +
Sbjct: 69  PCPPTHLVPTDT-------PTNIEGSTSSQSSPTLNPTYEAWIVVDKLLLGWLYNSMAAD 128

Query: 126 VATQVMGRDTAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRTMKTHADN 185
           VA QVMG  T++ +   +              ++QVFQ T KGSL+M++YL+ MK+HADN
Sbjct: 129 VAMQVMGFSTSRELWTAVQELFGVQSRAEVDYLKQVFQQTCKGSLQMIEYLKLMKSHADN 188

Query: 186 LEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMA 245
           L    S V++R LVSQVL GLDEEY+PIV  +QGK++L+W +     L  + R++  +  
Sbjct: 189 LALAGSSVSVRDLVSQVLTGLDEEYNPIVVAVQGKVNLSWSEMHAELLTYEKRLEYQNSL 248

Query: 246 ID--HLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQ--------- 305
                +N+T   +V      +G      Q  N   G N HG   +RG  +Q         
Sbjct: 249 KSGIPINQTQTPSV---NYVDGRSFQTNQRTNN--GNNSHGSNTHRGGGYQRGSFGQRNR 308

Query: 306 --GKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSL 365
             G Q P Q  NF          A  +T+  V TPE+VID  WY +SGA++HV  + N++
Sbjct: 309 GRGPQ-PTQHKNFTPSNSGPNVFAAHHTSTTVTTPETVIDPSWYADSGATSHVTANPNNV 368

Query: 366 TNAYEYGGKERVIVGNGRALPITHIGSCHIPAD-------------------DINTGQMV 425
               +Y G E VIV NG  L I+HIGS +I A                    D  +G+ +
Sbjct: 369 EQKVDYSGTENVIVANGNKLSISHIGSTNIHASGGSLKLKDVLRVPDIAKNLDKASGRTL 428

Query: 426 LKGELDDGLYIFEKTKATGSAS-------------NVGKTNLKS---------IGRIMAC 439
           LKG L D LY  +++  +  A+             ++    L S            I   
Sbjct: 429 LKGTLKDNLYRLDRSHRSPPATPTLTAPLFAHTVVSLSNNTLSSEKPTPSFPFAEHINVV 488

BLAST of ClCG04G001080 vs. NCBI nr
Match: KAA0026100.1 (uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa])

HSP 1 Score: 211.5 bits (537), Expect = 1.5e-50
Identity = 159/444 (35.81%), Positives = 220/444 (49.55%), Query Frame = 0

Query: 6   PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREK 65
           PS++S G      +++PPLNQ+LNQ+ ++KLDR N+LLWK L  PILK Y+L GHL  E 
Sbjct: 11  PSLSSAG------FSNPPLNQILNQLATVKLDRKNYLLWKTLALPILKGYKLEGHLTGET 70

Query: 66  ESPPMFV----QPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNS 125
             P  FV        T       AT  A SS+T    I+N  +E W+  D LLLGWLYNS
Sbjct: 71  PCPSHFVLSASSSNTTVTEEGADATIGASSSIT--PRIVNSLFEQWVTTDLLLLGWLYNS 130

Query: 126 MMPEVATQVMGRDTAKGV---------VECIART--VQQVFQTTRKGSLKMVDYLRTMKT 185
           M P+VA Q+MG    + +         V+  A    ++Q+ QTTRKG+ KM +YL  MKT
Sbjct: 131 MTPDVAIQLMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQTTRKGNTKMEEYLLVMKT 190

Query: 186 HADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKI 245
           + DNL QV SPV  RAL+SQVLLGLDE Y+ ++  IQGK D++W       L  + ++ I
Sbjct: 191 NVDNLGQVGSPVPRRALISQVLLGLDEVYNLVIVVIQGKPDISW-------LDMQSKLLI 250

Query: 246 PSMAIDHLN-----RTMGNAVMVEEEAEGEEITCYQLYN----KSFGQN-QH--GEQGNR 305
               + H N     +  GN          +        N    K +G N QH  G++GN 
Sbjct: 251 FEKILKHQNTQKKKKKKGNITQSPALNMAQRFALNGQRNHSNKKFYGYNRQHFSGQRGNL 310

Query: 306 GQNFQGKQIPNQGHN-----------FPGP--------------TPNQA-FMATQNTNPF 365
                 +     GH+           F  P              +PN A F++TQN  PF
Sbjct: 311 NNGPTCQLCGKYGHSALVCYNRFNKEFSSPLVQDRNEHSSNGSVSPNPAVFVSTQNATPF 370

Query: 366 VVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIP 397
             TP++V+D  WY++SGA+NHV  + +++TN  EY G+                      
Sbjct: 371 -ATPDTVVDPNWYIDSGATNHVTRECSNMTNPTEYSGQ---------------------- 412

BLAST of ClCG04G001080 vs. NCBI nr
Match: KAA0057475.1 (uncharacterized protein E6C27_scaffold280G003560 [Cucumis melo var. makuwa] >TYK30171.1 uncharacterized protein E5676_scaffold216G001590 [Cucumis melo var. makuwa])

HSP 1 Score: 199.5 bits (506), Expect = 6.0e-47
Identity = 134/351 (38.18%), Positives = 192/351 (54.70%), Query Frame = 0

Query: 75  EATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRD 134
           EA++V+S       A SS +    I+NP YE W+ +D LLLG +YNSM+P+VA Q+MG +
Sbjct: 43  EASSVVS---EGTVASSSTSMNSKIVNPKYEQWVTSDMLLLGLIYNSMVPDVALQLMGFN 102

Query: 135 TAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVA 194
           TAK + E I              ++  FQTTR+G+ KM DYLR MK +ADNL Q  SPV 
Sbjct: 103 TAKDLWEAIQNLFGIKSRAEEYFLRHTFQTTREGNYKMEDYLRIMKINADNLGQAGSPVP 162

Query: 195 LRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMG 254
            R L+SQVLLGLDE Y+P+ A IQGK D++W   ++  L  ++ ++I  + ++     M 
Sbjct: 163 HRYLISQVLLGLDEVYNPVTAVIQGKPDISWLDMQSELLIFENLVEIVLIKMESETILMA 222

Query: 255 NAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQGKQIPNQGHNFPGPTPNQAF 314
            A +VEEE            N+ F  NQ+G+Q                       P+ AF
Sbjct: 223 -ADVVEEE------------NRGFNPNQNGKQ----------------------IPDDAF 282

Query: 315 MATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPI 374
           + TQ ++  + TPE+V+D+  YV+SGA+NHV +DH++L N  +Y G E V+VGN   L I
Sbjct: 283 ITTQKSSS-LATPETVVDTNRYVDSGATNHVTSDHSNLWNIDDYSGNENVVVGNENKLQI 342

Query: 375 THIGSCHIP-------ADDI------------NTGQMVLKGELDDGLYIFE 396
           + +G   +         D I            +TG+++LKG L DGLY  E
Sbjct: 343 SCVGYASLTDGKNCLRLDKILCVPEIKKNLAKDTGRVLLKGTLCDGLYHLE 354

BLAST of ClCG04G001080 vs. NCBI nr
Match: TXG69253.1 (hypothetical protein EZV62_004188 [Acer yangbiense])

HSP 1 Score: 196.4 bits (498), Expect = 5.1e-46
Identity = 144/418 (34.45%), Positives = 197/418 (47.13%), Query Frame = 0

Query: 6   PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREK 65
           P+V   G  +N +  S P    LNQ  +IKLDR NF+LWK +   I+K +RL GHL   +
Sbjct: 21  PTVLQEG--SNSSNESSPFGNKLNQSFAIKLDRQNFILWKTMVTTIIKGHRLDGHLYSTR 80

Query: 66  ESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPE 125
             PP F+ P  T    P P TP     V+ + +  NP YE W+ NDQLL+GWLY+SM   
Sbjct: 81  PCPPEFL-PSPTTPGVPSPTTP----GVSDSGSCSNPEYEKWLVNDQLLMGWLYSSMTEN 140

Query: 126 VATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADN 185
           VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+
Sbjct: 141 VALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSSTMEEYLTQMKTWADS 200

Query: 186 LEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMA 245
           L     P     L +  L GLD EY PIV  I+ +   TWQ+         D +      
Sbjct: 201 LAIAGDPYPENLLFANSLAGLDSEYMPIVVLIEAREHFTWQE-------IYDTLLSYDSK 260

Query: 246 IDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN---------- 305
           ++H+N       ++   +      +       NK+  Q    + GNR  N          
Sbjct: 261 LEHINNVSAKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRGGFRGGGGR 320

Query: 306 FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVT 365
           F+G+   N             GH           N+ G  P     A  N N    FV T
Sbjct: 321 FRGRGGRNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPT----ANSNANSPSVFVAT 380

Query: 366 PESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPA 374
           PE+V D+ WY +SGA+NHV ND  +L    +Y G E ++VGNG+ L I+H+G   +P+
Sbjct: 381 PETVDDTTWYADSGATNHVTNDAGNLDLKSDYRGDESLMVGNGKQLDISHVGLKSLPS 420

BLAST of ClCG04G001080 vs. NCBI nr
Match: TXG67243.1 (hypothetical protein EZV62_008518 [Acer yangbiense])

HSP 1 Score: 194.5 bits (493), Expect = 1.9e-45
Identity = 143/418 (34.21%), Positives = 197/418 (47.13%), Query Frame = 0

Query: 6   PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREK 65
           P+V   G  +N +  S P    LNQ  +IKLDR NF+LWK +   I+K +RL GHL   +
Sbjct: 21  PTVLQEG--SNSSNESSPFGNKLNQSFAIKLDRQNFILWKTMVTTIIKGHRLDGHLYSTR 80

Query: 66  ESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPE 125
             PP F+ P  T    P P TP     V+ + +  NP YE W+ NDQLL+GWLY+SM   
Sbjct: 81  PCPPEFL-PSPTTPGVPSPTTP----GVSDSGSCSNPEYEKWLVNDQLLMGWLYSSMTEN 140

Query: 126 VATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADN 185
           VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+
Sbjct: 141 VALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSSTMEEYLTQMKTWADS 200

Query: 186 LEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMA 245
           L     P     L +  L GLD EY PIV  I+ +   TWQ+         D +      
Sbjct: 201 LAIAGDPYPENLLFANSLAGLDSEYMPIVVLIEAREHFTWQE-------IYDTLLSYDSK 260

Query: 246 IDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN---------- 305
           ++H+N       ++   +      +       NK+  Q    + GNR  N          
Sbjct: 261 LEHINNVSAKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRGGFRGGGGR 320

Query: 306 FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVT 365
           F+G+   N             GH           N+ G  P     A  N N    FV T
Sbjct: 321 FRGRGGRNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPT----ANSNANSPSVFVAT 380

Query: 366 PESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPA 374
           PE+V D+ WY +SGA++HV ND  +L    +Y G E ++VGNG+ L I+H+G   +P+
Sbjct: 381 PETVDDTTWYADSGATDHVTNDAGNLDLKSDYRGDESLMVGNGKQLDISHVGLKSLPS 420

BLAST of ClCG04G001080 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 5.1e-17
Identity = 121/511 (23.68%), Positives = 194/511 (37.96%), Query Frame = 0

Query: 20  NSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAV 79
           N+  LN  ++ +T  KL  +N+L+W    H +   Y L G L      PP  +       
Sbjct: 12  NTSILNVNMSNVT--KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATI------- 71

Query: 80  LSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGV 139
                 T AAP         +NP Y  W   D+L+   +  ++   V   V    TA  +
Sbjct: 72  -----GTDAAPR--------VNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQI 131

Query: 140 VECIAR-----------TVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALV 199
            E + +            ++   +   KG+  + DY++ + T  D L  +  P+     V
Sbjct: 132 WETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQV 191

Query: 200 SQVLLGLDEEYSPIVATIQGK----------MDLTWQQARTLSLPTKDRIKIPSMAIDHL 259
            +VL  L EEY P++  I  K            L   +++ L++ +   I I + A+ H 
Sbjct: 192 ERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHR 251

Query: 260 NRTMGNAVMVEEEAEGEEITCYQLYN-----KSFGQNQHGEQGNRGQN--FQGK-QIPN- 319
           N T  N         G     Y   N     K + Q+      N  Q+  + GK QI   
Sbjct: 252 NTTTTN-----NNNNGNRNNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGV 311

Query: 320 QGHNFPGPTPNQAFMATQNTN--PFVVTP----------ESVIDSGWYVNSGASNHVAND 379
           QGH+    +  Q F+++ N+   P   TP               + W ++SGA++H+ +D
Sbjct: 312 QGHSAKRCSQLQHFLSSVNSQQPPSPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSD 371

Query: 380 HNSLTNAYEYGGKERVIVGNGRALPITHIGSCHI-------------------------- 439
            N+L+    Y G + V+V +G  +PI+H GS  +                          
Sbjct: 372 FNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVY 431

Query: 440 -------------PAD----DINTGQMVLKGELDDGLY---IFEKTKATGSASNVGKTNL 443
                        PA     D+NTG  +L+G+  D LY   I      +  AS   K   
Sbjct: 432 RLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQPVSLFASPSSK--- 483

BLAST of ClCG04G001080 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 5.1e-17
Identity = 114/497 (22.94%), Positives = 192/497 (38.63%), Query Frame = 0

Query: 20  NSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAV 79
           N+  LN  ++ +T  KL  +N+L+W    H +   Y L G L      PP  +       
Sbjct: 12  NTNILNVNMSNVT--KLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPPATI------- 71

Query: 80  LSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGV 139
                 T A P         +NP Y  W   D+L+   +  ++   V   V    TA  +
Sbjct: 72  -----GTDAVPR--------VNPDYTRWRRQDKLIYSAILGAISMSVQPAVSRATTAAQI 131

Query: 140 VECIARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEY 199
            E    T+++++     G +  + ++    T  D L  +  P+     V +VL  L ++Y
Sbjct: 132 WE----TLRKIYANPSYGHVTQLRFI----TRFDQLALLGKPMDHDEQVERVLENLPDDY 191

Query: 200 SPIVATIQGK----------MDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVE 259
            P++  I  K            L  ++++ L+L + + + I +  + H N          
Sbjct: 192 KPVIDQIAAKDTPPSLTEIHERLINRESKLLALNSAEVVPITANVVTHRNTNTNR----N 251

Query: 260 EEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQGK------QIPN-QGHNFPGPTPNQA 319
           +   G+    Y   N      Q    G+R  N Q K      QI + QGH+         
Sbjct: 252 QNNRGDNRN-YNNNNNRSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQ 311

Query: 320 FMAT----QNTNPF----------VVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYG 379
           F +T    Q+T+PF          V +P +   + W ++SGA++H+ +D N+L+    Y 
Sbjct: 312 FQSTTNQQQSTSPFTPWQPRANLAVNSPYNA--NNWLLDSGATHHITSDFNNLSFHQPYT 371

Query: 380 GKERVIVGNGRALPITHIGSCHIPAD---------------------------------- 439
           G + V++ +G  +PITH GS  +P                                    
Sbjct: 372 GGDDVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVE 431

Query: 440 ---------DINTGQMVLKGELDDGLYIFEKTKATGSASNVGKTNLKSIGRIMACSTESI 443
                    D+NTG  +L+G+  D LY  E   A+  A ++  +           +T S 
Sbjct: 432 FFPASFQVKDLNTGVPLLQGKTKDELY--EWPIASSQAVSMFASPCSK-------ATHSS 462

BLAST of ClCG04G001080 vs. ExPASy TrEMBL
Match: A0A6J1DCW4 (uncharacterized protein LOC111019598 OS=Momordica charantia OX=3673 GN=LOC111019598 PE=4 SV=1)

HSP 1 Score: 268.9 bits (686), Expect = 3.9e-68
Identity = 183/498 (36.75%), Positives = 255/498 (51.20%), Query Frame = 0

Query: 6   PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREK 65
           P V    V +   + SPPLNQLLNQITSIK+DR NFLLW+NL  PIL+SY+LF +L  +K
Sbjct: 9   PIVTPPAVVSGAVFTSPPLNQLLNQITSIKMDRGNFLLWQNLALPILRSYKLFDYLTGDK 68

Query: 66  ESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPE 125
             PP  + P  T        T    S+ + +   +NP YE+WI  D+LLLGWLYNSM  +
Sbjct: 69  PCPPTHLVPTDT-------PTNIEGSTSSQSSPTLNPTYEAWIVVDKLLLGWLYNSMAAD 128

Query: 126 VATQVMGRDTAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRTMKTHADN 185
           VA QVMG  T++ +   +              ++QVFQ T KGSL+M++YL+ MK+HADN
Sbjct: 129 VAMQVMGFSTSRELWTAVQELFGVQSRAEVDYLKQVFQQTCKGSLQMIEYLKLMKSHADN 188

Query: 186 LEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMA 245
           L    S V++R LVSQVL GLDEEY+PIV  +QGK++L+W +     L  + R++  +  
Sbjct: 189 LALAGSSVSVRDLVSQVLTGLDEEYNPIVVAVQGKVNLSWSEMHAELLTYEKRLEYQNSL 248

Query: 246 ID--HLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQ--------- 305
                +N+T   +V      +G      Q  N   G N HG   +RG  +Q         
Sbjct: 249 KSGIPINQTQTPSV---NYVDGRSFQTNQRTNN--GNNSHGSNTHRGGGYQRGSFGQRNR 308

Query: 306 --GKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSL 365
             G Q P Q  NF          A  +T+  V TPE+VID  WY +SGA++HV  + N++
Sbjct: 309 GRGPQ-PTQHKNFTPSNSGPNVFAAHHTSTTVTTPETVIDPSWYADSGATSHVTANPNNV 368

Query: 366 TNAYEYGGKERVIVGNGRALPITHIGSCHIPAD-------------------DINTGQMV 425
               +Y G E VIV NG  L I+HIGS +I A                    D  +G+ +
Sbjct: 369 EQKVDYSGTENVIVANGNKLSISHIGSTNIHASGGSLKLKDVLRVPDIAKNLDKASGRTL 428

Query: 426 LKGELDDGLYIFEKTKATGSAS-------------NVGKTNLKS---------IGRIMAC 439
           LKG L D LY  +++  +  A+             ++    L S            I   
Sbjct: 429 LKGTLKDNLYRLDRSHRSPPATPTLTAPLFAHTVVSLSNNTLSSEKPTPSFPFAEHINVV 488

BLAST of ClCG04G001080 vs. ExPASy TrEMBL
Match: A0A5A7SIT7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold19G00360 PE=4 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 7.3e-51
Identity = 159/444 (35.81%), Positives = 220/444 (49.55%), Query Frame = 0

Query: 6   PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREK 65
           PS++S G      +++PPLNQ+LNQ+ ++KLDR N+LLWK L  PILK Y+L GHL  E 
Sbjct: 11  PSLSSAG------FSNPPLNQILNQLATVKLDRKNYLLWKTLALPILKGYKLEGHLTGET 70

Query: 66  ESPPMFV----QPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNS 125
             P  FV        T       AT  A SS+T    I+N  +E W+  D LLLGWLYNS
Sbjct: 71  PCPSHFVLSASSSNTTVTEEGADATIGASSSIT--PRIVNSLFEQWVTTDLLLLGWLYNS 130

Query: 126 MMPEVATQVMGRDTAKGV---------VECIART--VQQVFQTTRKGSLKMVDYLRTMKT 185
           M P+VA Q+MG    + +         V+  A    ++Q+ QTTRKG+ KM +YL  MKT
Sbjct: 131 MTPDVAIQLMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQTTRKGNTKMEEYLLVMKT 190

Query: 186 HADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKI 245
           + DNL QV SPV  RAL+SQVLLGLDE Y+ ++  IQGK D++W       L  + ++ I
Sbjct: 191 NVDNLGQVGSPVPRRALISQVLLGLDEVYNLVIVVIQGKPDISW-------LDMQSKLLI 250

Query: 246 PSMAIDHLN-----RTMGNAVMVEEEAEGEEITCYQLYN----KSFGQN-QH--GEQGNR 305
               + H N     +  GN          +        N    K +G N QH  G++GN 
Sbjct: 251 FEKILKHQNTQKKKKKKGNITQSPALNMAQRFALNGQRNHSNKKFYGYNRQHFSGQRGNL 310

Query: 306 GQNFQGKQIPNQGHN-----------FPGP--------------TPNQA-FMATQNTNPF 365
                 +     GH+           F  P              +PN A F++TQN  PF
Sbjct: 311 NNGPTCQLCGKYGHSALVCYNRFNKEFSSPLVQDRNEHSSNGSVSPNPAVFVSTQNATPF 370

Query: 366 VVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIP 397
             TP++V+D  WY++SGA+NHV  + +++TN  EY G+                      
Sbjct: 371 -ATPDTVVDPNWYIDSGATNHVTRECSNMTNPTEYSGQ---------------------- 412

BLAST of ClCG04G001080 vs. ExPASy TrEMBL
Match: A0A5D3E3L7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold216G001590 PE=4 SV=1)

HSP 1 Score: 199.5 bits (506), Expect = 2.9e-47
Identity = 134/351 (38.18%), Positives = 192/351 (54.70%), Query Frame = 0

Query: 75  EATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRD 134
           EA++V+S       A SS +    I+NP YE W+ +D LLLG +YNSM+P+VA Q+MG +
Sbjct: 43  EASSVVS---EGTVASSSTSMNSKIVNPKYEQWVTSDMLLLGLIYNSMVPDVALQLMGFN 102

Query: 135 TAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVA 194
           TAK + E I              ++  FQTTR+G+ KM DYLR MK +ADNL Q  SPV 
Sbjct: 103 TAKDLWEAIQNLFGIKSRAEEYFLRHTFQTTREGNYKMEDYLRIMKINADNLGQAGSPVP 162

Query: 195 LRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMG 254
            R L+SQVLLGLDE Y+P+ A IQGK D++W   ++  L  ++ ++I  + ++     M 
Sbjct: 163 HRYLISQVLLGLDEVYNPVTAVIQGKPDISWLDMQSELLIFENLVEIVLIKMESETILMA 222

Query: 255 NAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQGKQIPNQGHNFPGPTPNQAF 314
            A +VEEE            N+ F  NQ+G+Q                       P+ AF
Sbjct: 223 -ADVVEEE------------NRGFNPNQNGKQ----------------------IPDDAF 282

Query: 315 MATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPI 374
           + TQ ++  + TPE+V+D+  YV+SGA+NHV +DH++L N  +Y G E V+VGN   L I
Sbjct: 283 ITTQKSSS-LATPETVVDTNRYVDSGATNHVTSDHSNLWNIDDYSGNENVVVGNENKLQI 342

Query: 375 THIGSCHIP-------ADDI------------NTGQMVLKGELDDGLYIFE 396
           + +G   +         D I            +TG+++LKG L DGLY  E
Sbjct: 343 SCVGYASLTDGKNCLRLDKILCVPEIKKNLAKDTGRVLLKGTLCDGLYHLE 354

BLAST of ClCG04G001080 vs. ExPASy TrEMBL
Match: A0A5C7IJ06 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_004188 PE=4 SV=1)

HSP 1 Score: 196.4 bits (498), Expect = 2.4e-46
Identity = 144/418 (34.45%), Positives = 197/418 (47.13%), Query Frame = 0

Query: 6   PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREK 65
           P+V   G  +N +  S P    LNQ  +IKLDR NF+LWK +   I+K +RL GHL   +
Sbjct: 21  PTVLQEG--SNSSNESSPFGNKLNQSFAIKLDRQNFILWKTMVTTIIKGHRLDGHLYSTR 80

Query: 66  ESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPE 125
             PP F+ P  T    P P TP     V+ + +  NP YE W+ NDQLL+GWLY+SM   
Sbjct: 81  PCPPEFL-PSPTTPGVPSPTTP----GVSDSGSCSNPEYEKWLVNDQLLMGWLYSSMTEN 140

Query: 126 VATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADN 185
           VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+
Sbjct: 141 VALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSSTMEEYLTQMKTWADS 200

Query: 186 LEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMA 245
           L     P     L +  L GLD EY PIV  I+ +   TWQ+         D +      
Sbjct: 201 LAIAGDPYPENLLFANSLAGLDSEYMPIVVLIEAREHFTWQE-------IYDTLLSYDSK 260

Query: 246 IDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN---------- 305
           ++H+N       ++   +      +       NK+  Q    + GNR  N          
Sbjct: 261 LEHINNVSAKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRGGFRGGGGR 320

Query: 306 FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVT 365
           F+G+   N             GH           N+ G  P     A  N N    FV T
Sbjct: 321 FRGRGGRNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPT----ANSNANSPSVFVAT 380

Query: 366 PESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPA 374
           PE+V D+ WY +SGA+NHV ND  +L    +Y G E ++VGNG+ L I+H+G   +P+
Sbjct: 381 PETVDDTTWYADSGATNHVTNDAGNLDLKSDYRGDESLMVGNGKQLDISHVGLKSLPS 420

BLAST of ClCG04G001080 vs. ExPASy TrEMBL
Match: A0A5C7ID32 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_008518 PE=4 SV=1)

HSP 1 Score: 194.5 bits (493), Expect = 9.3e-46
Identity = 143/418 (34.21%), Positives = 197/418 (47.13%), Query Frame = 0

Query: 6   PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREK 65
           P+V   G  +N +  S P    LNQ  +IKLDR NF+LWK +   I+K +RL GHL   +
Sbjct: 21  PTVLQEG--SNSSNESSPFGNKLNQSFAIKLDRQNFILWKTMVTTIIKGHRLDGHLYSTR 80

Query: 66  ESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPE 125
             PP F+ P  T    P P TP     V+ + +  NP YE W+ NDQLL+GWLY+SM   
Sbjct: 81  PCPPEFL-PSPTTPGVPSPTTP----GVSDSGSCSNPEYEKWLVNDQLLMGWLYSSMTEN 140

Query: 126 VATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADN 185
           VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+
Sbjct: 141 VALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSSTMEEYLTQMKTWADS 200

Query: 186 LEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMA 245
           L     P     L +  L GLD EY PIV  I+ +   TWQ+         D +      
Sbjct: 201 LAIAGDPYPENLLFANSLAGLDSEYMPIVVLIEAREHFTWQE-------IYDTLLSYDSK 260

Query: 246 IDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN---------- 305
           ++H+N       ++   +      +       NK+  Q    + GNR  N          
Sbjct: 261 LEHINNVSAKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRGGFRGGGGR 320

Query: 306 FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVT 365
           F+G+   N             GH           N+ G  P     A  N N    FV T
Sbjct: 321 FRGRGGRNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPT----ANSNANSPSVFVAT 380

Query: 366 PESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPA 374
           PE+V D+ WY +SGA++HV ND  +L    +Y G E ++VGNG+ L I+H+G   +P+
Sbjct: 381 PETVDDTTWYADSGATDHVTNDAGNLDLKSDYRGDESLMVGNGKQLDISHVGLKSLPS 420

BLAST of ClCG04G001080 vs. TAIR 10
Match: AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 45.1 bits (105), Expect = 1.8e-04
Identity = 54/187 (28.88%), Positives = 85/187 (45.45%), Query Frame = 0

Query: 157 GSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDL-TWQ 216
           G +++ DY R MK  AD+L  V+ PV  R LV  VL GL+ ++  I+  I+ +    ++ 
Sbjct: 127 GDMRVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIINVIKHRQPFPSFD 186

Query: 217 QARTLSLPTKDRIKIPSMAI----DHLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQ 276
            A T+    +DR+K    AI     H++ +  + V+   EA    +T +Q      G NQ
Sbjct: 187 DAATMLQEEEDRLK---RAIKPNPTHVDHSSSSTVLACSEA--PPVTNFQ----RSGGNQ 246

Query: 277 HGEQG-NRGQN-FQGKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGW----Y 333
            G +G  RG N F+G+      +N P          + N  PF      + +  W    Y
Sbjct: 247 MGYRGRGRGNNIFRGRGGRFSYYNMP-------TFNSWNRPPFYQNSYQMWNHPWGYPPY 297

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022151683.1  8.0e-68   36.75         uncharacterized protein LOC111019598 [Momordica charantia]
KAA0026100.1    1.5e-50   35.81         uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]
KAA0057475.1    6.0e-47   38.18         uncharacterized protein E6C27_scaffold280G003560 [Cucumis melo var. makuwa] >TYK...
TXG69253.1      5.1e-46   34.45         hypothetical protein EZV62_004188 [Acer yangbiense]
TXG67243.1      1.9e-45   34.21         hypothetical protein EZV62_008518 [Acer yangbiense]

Match Name      E-value   Identity (%)  Description
Q94HW2          5.1e-17   23.68         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94          5.1e-17   22.94         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...

Match Name      E-value   Identity (%)  Description
A0A6J1DCW4      3.9e-68   36.75         uncharacterized protein LOC111019598 OS=Momordica charantia OX=3673 GN=LOC111019...
A0A5A7SIT7      7.3e-51   35.81         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3E3L7      2.9e-47   38.18         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5C7IJ06      2.4e-46   34.45         Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_004188 PE=4 SV=1
A0A5C7ID32      9.3e-46   34.21         Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_008518 PE=4 SV=1

Match Name      E-value   Identity (%)  Description
AT1G34070.1     1.8e-04   28.88         CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 270..300
None      No IPR available  PANTHER      PTHR47481    FAMILY NOT NAMED     coord: 27..206, coord: 299..368
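
The coord ranges above can be compared directly to see whether annotated regions intersect. The sketch below is illustrative only: it assumes the ranges are 1-based, inclusive residue positions on the protein, and the helper names are hypothetical.

```python
def overlap(a, b):
    """Return the inclusive overlap of two (start, end) ranges, or None."""
    start, end = max(a[0], b[0]), min(a[1], b[1])
    return (start, end) if start <= end else None

# Ranges copied from the InterPro table above.
disorder = (270, 300)                 # MOBIDB_LITE disorder prediction
panther = [(27, 206), (299, 368)]     # PANTHER PTHR47481 matches

for region in panther:
    print(region, overlap(disorder, region))
```

Under that assumption, the predicted disordered stretch touches only the second PANTHER match, at residues 299 to 300.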

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG04G001080.1  ClCG04G001080.1  mRNA