Cla013544 (gene) Watermelon (97103) v1

Name: Cla013544
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Unknown Protein (AHRD V1); contains InterPro domain(s) IPR005162 Retrotransposon gag protein
Location: Chr2 : 28185970 .. 28187100 (-)
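
As a quick consistency check, the span implied by the location can be computed directly. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state the convention), and `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr2 : 28185970 .. 28187100 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so both endpoints count.
        "length": end - start + 1,
    }

loc = parse_location("Chr2 : 28185970 .. 28187100 (-)")
print(loc["length"])  # 1131
```

Under that assumption the gene spans 1131 bp, which is a multiple of three (377 codons) and so is consistent with a single uninterrupted coding region.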
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGCGAAAACTCAGGCGTTCACCGTCGCCATTGCCATTCCGGCGACGTAATGCCGATTATACCACCGCCACCGATTACGATGCTTCTCCATCTCAATCTCTCTATGCATCGAACGAAGACGACTATGACCCCTCTGAATCTGTTAACTCCCACCCCACTGACCCCAAATCAAAATCCCTAGAAATTAAGCCCTCTGATTTAAGAACCGCCGCAGAATCCGCCTCCAAAAACAGCTTAGCGTATTTACAGACTCCAAACGCCGCCCAAACTGTATTTCCATACATCAACGTTGCACCGTTGCCGATTTTTCACGGCAGCGCCGATGAGTGTCCGGTGATACATTTAAGCAGATTCGCCAAAGTCTGCCGTGCGAACAACGCAACCTCCATCGACATGATGATGAGAATCTTCCCGGTGACGTTAGAGGGTGAGGCAGCGCTTTGGTACGACTTGAACATCGAGCCCTACCCTCCAATTTCTTGGGAAGAATTGAAGTCTTGTTTCTTGGATGCTTTCAATAAAATTGAATTGACTGACCAGTTGCGATCGGAGCTTATGACGATAAAACAACGGGAAGAGGAGAGTGTACGTTTGTATTTTCTGAGGTTGCAGTTGATTTTGAAGAAATGGCCACCGGGTAATTCACTTTCCGATGGCTTGTTGAAGACGATTTTTGTTGACGGATTGAGGGAAGAGTTCAAGGAATGGATGATTCTACAGAAACCGAGTTCATTGAACGAGGCATTGAGACTTGCATTTGGGTTTGAACAAGTAAGGACCGTCAGTACATCTGGCAAAAGGGGGTTTCTTCGGTGTGGGTTTTGTGAGGGGCCGCACGAGGAATTGGTTTGTGAGGTTAGGGAGAGAATGAGACAGTTGTGGAAGAGTAGGGAAAAGAAGAATACGGTTGACGTGGTGCAGAGTGACGGCCGTGAAGCGGCAATGGCAACGGCGGAGCTTATGCGATCGTCTTCGGCAATTAGTAGAAACGAATCGGAGGTTGAAAATGATGGCGGGGAGATGGTGGGTTTGAAGAAGAAGAGTCAGTGTCAATGTTGGAAGCATCAGTGTGGGATGAAGAAATTGGATCGAAACCTTAGCATGGTATCAAAAAATTCTAAAGGCTGA

mRNA sequence

ATGGCGCGAAAACTCAGGCGTTCACCGTCGCCATTGCCATTCCGGCGACGTAATGCCGATTATACCACCGCCACCGATTACGATGCTTCTCCATCTCAATCTCTCTATGCATCGAACGAAGACGACTATGACCCCTCTGAATCTGTTAACTCCCACCCCACTGACCCCAAATCAAAATCCCTAGAAATTAAGCCCTCTGATTTAAGAACCGCCGCAGAATCCGCCTCCAAAAACAGCTTAGCGTATTTACAGACTCCAAACGCCGCCCAAACTGTATTTCCATACATCAACGTTGCACCGTTGCCGATTTTTCACGGCAGCGCCGATGAGTGTCCGGTGATACATTTAAGCAGATTCGCCAAAGTCTGCCGTGCGAACAACGCAACCTCCATCGACATGATGATGAGAATCTTCCCGGTGACGTTAGAGGGTGAGGCAGCGCTTTGGTACGACTTGAACATCGAGCCCTACCCTCCAATTTCTTGGGAAGAATTGAAGTCTTGTTTCTTGGATGCTTTCAATAAAATTGAATTGACTGACCAGTTGCGATCGGAGCTTATGACGATAAAACAACGGGAAGAGGAGAGTGTACGTTTGTATTTTCTGAGGTTGCAGTTGATTTTGAAGAAATGGCCACCGGGTAATTCACTTTCCGATGGCTTGTTGAAGACGATTTTTGTTGACGGATTGAGGGAAGAGTTCAAGGAATGGATGATTCTACAGAAACCGAGTTCATTGAACGAGGCATTGAGACTTGCATTTGGGTTTGAACAAGTAAGGACCGTCAGTACATCTGGCAAAAGGGGGTTTCTTCGGTGTGGGTTTTGTGAGGGGCCGCACGAGGAATTGGTTTGTGAGGTTAGGGAGAGAATGAGACAGTTGTGGAAGAGTAGGGAAAAGAAGAATACGGTTGACGTGGTGCAGAGTGACGGCCGTGAAGCGGCAATGGCAACGGCGGAGCTTATGCGATCGTCTTCGGCAATTAGTAGAAACGAATCGGAGGTTGAAAATGATGGCGGGGAGATGGTGGGTTTGAAGAAGAAGAGTCAGTGTCAATGTTGGAAGCATCAGTGTGGGATGAAGAAATTGGATCGAAACCTTAGCATGGTATCAAAAAATTCTAAAGGCTGA

Coding sequence (CDS)

ATGGCGCGAAAACTCAGGCGTTCACCGTCGCCATTGCCATTCCGGCGACGTAATGCCGATTATACCACCGCCACCGATTACGATGCTTCTCCATCTCAATCTCTCTATGCATCGAACGAAGACGACTATGACCCCTCTGAATCTGTTAACTCCCACCCCACTGACCCCAAATCAAAATCCCTAGAAATTAAGCCCTCTGATTTAAGAACCGCCGCAGAATCCGCCTCCAAAAACAGCTTAGCGTATTTACAGACTCCAAACGCCGCCCAAACTGTATTTCCATACATCAACGTTGCACCGTTGCCGATTTTTCACGGCAGCGCCGATGAGTGTCCGGTGATACATTTAAGCAGATTCGCCAAAGTCTGCCGTGCGAACAACGCAACCTCCATCGACATGATGATGAGAATCTTCCCGGTGACGTTAGAGGGTGAGGCAGCGCTTTGGTACGACTTGAACATCGAGCCCTACCCTCCAATTTCTTGGGAAGAATTGAAGTCTTGTTTCTTGGATGCTTTCAATAAAATTGAATTGACTGACCAGTTGCGATCGGAGCTTATGACGATAAAACAACGGGAAGAGGAGAGTGTACGTTTGTATTTTCTGAGGTTGCAGTTGATTTTGAAGAAATGGCCACCGGGTAATTCACTTTCCGATGGCTTGTTGAAGACGATTTTTGTTGACGGATTGAGGGAAGAGTTCAAGGAATGGATGATTCTACAGAAACCGAGTTCATTGAACGAGGCATTGAGACTTGCATTTGGGTTTGAACAAGTAAGGACCGTCAGTACATCTGGCAAAAGGGGGTTTCTTCGGTGTGGGTTTTGTGAGGGGCCGCACGAGGAATTGGTTTGTGAGGTTAGGGAGAGAATGAGACAGTTGTGGAAGAGTAGGGAAAAGAAGAATACGGTTGACGTGGTGCAGAGTGACGGCCGTGAAGCGGCAATGGCAACGGCGGAGCTTATGCGATCGTCTTCGGCAATTAGTAGAAACGAATCGGAGGTTGAAAATGATGGCGGGGAGATGGTGGGTTTGAAGAAGAAGAGTCAGTGTCAATGTTGGAAGCATCAGTGTGGGATGAAGAAATTGGATCGAAACCTTAGCATGGTATCAAAAAATTCTAAAGGCTGA

Protein sequence

MARKLRRSPSPLPFRRRNADYTTATDYDASPSQSLYASNEDDYDPSESVNSHPTDPKSKSLEIKPSDLRTAAESASKNSLAYLQTPNAAQTVFPYINVAPLPIFHGSADECPVIHLSRFAKVCRANNATSIDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSCFLDAFNKIELTDQLRSELMTIKQREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKEWMILQKPSSLNEALRLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLWKSREKKNTVDVVQSDGREAAMATAELMRSSSAISRNESEVENDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSMVSKNSKG
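
The protein sequence above should follow from the CDS by translation. The sketch below is a minimal illustration using the standard genetic code; `translate` is a hypothetical helper written for this example, and it is shown only on the first and last few codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGCGAAAACTCAGG"))  # MARKLR (first six codons of the CDS)
print(translate("AAAGGCTGA"))           # KG (last two codons before the TGA stop)
```

Both fragments agree with the start (MARKLR...) and end (...KG) of the listed protein sequence.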
BLAST of Cla013544 vs. TrEMBL
Match: W9R9S0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_004813 PE=4 SV=1)

HSP 1 Score: 343.2 bits (879), Expect = 4.0e-91
Identity = 196/394 (49.75%), Positives = 256/394 (64.97%), Query Frame = 1

Query: 6   RRSPSPLPFRRR-NADYTTATDYDASPSQSL-YASNEDDYDPSESVNSHPTDPKSKSLEI 65
           RR+P+P  +    + DYTT      SP+ S  +   E+D D ++  +  PTD  +  L  
Sbjct: 19  RRTPTPQDYSSTYDDDYTTVV---RSPNDSTEFDQPENDDDDNDDASDAPTDSATNPLSD 78

Query: 66  KPSDL------RTAAESASKNSLAYLQTPNAAQTVF-PYINVAPLPIFHGSADECPVIHL 125
           + S +      R  + SAS + + +L     +QT +  Y+N+A  PIF G ++ECP  HL
Sbjct: 79  QFSSVSERINARKKSCSASHSPILHLPQQPVSQTGYNSYMNIAQFPIFRGGSEECPFAHL 138

Query: 126 SRFAKVCRANNATSIDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSCFLDAFNKI 185
           SRFAKVCRANN +SIDMMM+IFPVTLE EAALWYDLN+EPY  +SWEE+KS F  A+ KI
Sbjct: 139 SRFAKVCRANNVSSIDMMMKIFPVTLEDEAALWYDLNVEPYEELSWEEIKSSFYHAYGKI 198

Query: 186 ELTDQLRSELMTIKQREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKE 245
           ELT+QLRS+LMTI Q + ESVR YFLRLQ ILKKWP  + LSD LLK +FVDGLR +F+E
Sbjct: 199 ELTEQLRSQLMTINQGDAESVRSYFLRLQWILKKWPE-HGLSDDLLKGVFVDGLRGDFQE 258

Query: 246 WMILQKPSSLNEALRLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLW- 305
           WM  QKP SLN+ALRLAF FEQV+++    +   ++CGFC G HEE  CEVRERMR+LW 
Sbjct: 259 WMAPQKPGSLNKALRLAFCFEQVKSIRNVRRNASVKCGFCGGLHEERGCEVRERMRELWL 318

Query: 306 ---------KSREKKNTVDV---VQSDGREAAMATAELMRSSSAISRNESEVENDG---- 365
                    K   ++N ++    V+  GR  +MAT+   RS+  + +N+ +VE DG    
Sbjct: 319 KSNKDDGLGKGMLERNLIEKSEGVKELGRSVSMATS---RSTCVVGKND-QVEEDGKEEE 378

Query: 366 GEMVGLKKKSQCQCWKHQCGMKKLDRNLSMVSKN 374
            E+   KK+SQCQC KHQC  K ++RN S VS N
Sbjct: 379 DELGSKKKRSQCQCGKHQCWKKNIERNNSTVSGN 404
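
The percentages in each HSP header are the ratio of matching columns to the alignment length (gaps included). A minimal sketch for recomputing them from a header line follows; `hsp_percentages` is a hypothetical helper, and the line format is assumed to match the headers shown in this record.

```python
import re

def hsp_percentages(stats_line):
    """Recompute percentages from 'Label = matches/length' fields."""
    result = {}
    for label, matches, length in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 196/394 (49.75%), Positives = 256/394 (64.97%), Query Frame = 1"
print(hsp_percentages(line))  # {'Identity': 49.75, 'Positives': 64.97}
```

Fields without a `matches/length` fraction, such as the query frame, are ignored by the pattern.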

BLAST of Cla013544 vs. TrEMBL
Match: B9RWN5_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1022950 PE=4 SV=1)

HSP 1 Score: 337.8 bits (865), Expect = 1.7e-89
Identity = 185/378 (48.94%), Positives = 248/378 (65.61%), Query Frame = 1

Query: 1   MARKLRRSPSPLPFRRRNADYTTATDYDASPSQSLYASNEDDYDPSESVNSHPTDPKSKS 60
           M RK + S   L F  R+ DY+ +T    SPSQS Y SN+DD +  +     P   +S +
Sbjct: 1   MTRKAKNSRKSLQFSSRH-DYSEST----SPSQSPYDSNDDDDEIEDDDEEQPIISESVT 60

Query: 61  LEIKPSDLRTAAESASKNSLAYLQTPNAAQTVFPYINVAPLPIFHGSADECPVIHLSRFA 120
             +    L +++ S S+        PN +     YINVAPLP+FHG+++ECP+ HLSRF 
Sbjct: 61  NSLNADQLSSSSYSNSQ--------PNNS-----YINVAPLPVFHGNSNECPIAHLSRFV 120

Query: 121 KVCRANNATSIDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSCFLDAFNKIELTD 180
           KVCRANNA+S DMMMRIFPVTLE EAALWYDLNI+PYP +SW+E+   FL+A+ +I+L D
Sbjct: 121 KVCRANNASSTDMMMRIFPVTLENEAALWYDLNIQPYPSLSWDEIMLSFLEAYQRIKLVD 180

Query: 181 QLRSELMTIKQREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKEWMIL 240
           QLRS+LM + Q  +ESVR YF+RLQ ILK+W P + LSD +LK IF+DGL   FK+W+I 
Sbjct: 181 QLRSDLMMLNQGSDESVRSYFMRLQWILKRW-PDHGLSDNMLKWIFIDGLMGNFKDWIIP 240

Query: 241 QKPSSLNEALRLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLWKSREK 300
            KP+SLNEALRLAF FEQV+++  + K+  ++CGFCEG HEE  C VRE+MR+L+++ +K
Sbjct: 241 HKPNSLNEALRLAFSFEQVKSIRGT-KQKVVKCGFCEGSHEENCCVVREKMRELFRNSKK 300

Query: 301 KNTVDVVQSDGREAAMATAELMRSSSAISRNESEVENDGGE--MVGLKK--KSQCQCWKH 360
           K  +    S+  EA    AE           E +V +D  E  M+   K  KS CQC KH
Sbjct: 301 KMMIPKEASERSEAGNEMAENKDGKEGEEEEEVDVGDDKEEKRMLSSSKTGKSPCQCSKH 358

Query: 361 QCGMKKLDRNLSMVSKNS 375
            C MKK +R+ S+ ++NS
Sbjct: 361 HCWMKKFERSNSVTTRNS 358

BLAST of Cla013544 vs. TrEMBL
Match: A5C7E6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_007470 PE=4 SV=1)

HSP 1 Score: 330.9 bits (847), Expect = 2.1e-87
Identity = 190/370 (51.35%), Positives = 242/370 (65.41%), Query Frame = 1

Query: 20  DYTTATDYDASPSQSLYASNEDDYDPSESVNSHPTDPKSKSLEIKPSD------LRTAAE 79
           DYT     + SPSQS Y  +E++ D      S  TD +S S    P D      L +  +
Sbjct: 161 DYT-----EQSPSQSPYEFDEEEEDEQ----SXYTDNESASGTNAPGDQFSLPALESIPK 220

Query: 80  SASKNSLAYLQTPNAAQTVFP---YINVAPLPIFHGSADECPVIHLSRFAKVCRANNATS 139
             S    + L + + +   F    YIN+APLPIF GS+DECPV HLSRF KVCRANN +S
Sbjct: 221 GKSFRPSSSLNSSSNSLNPFXQSSYINIAPLPIFRGSSDECPVTHLSRFTKVCRANNVSS 280

Query: 140 IDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSCFLDAFNKIELTDQLRSELMTIK 199
           ++M+MRIFPVTL+GEAALWYDLNIEPY  +SWEE+KS FL A+++  LTD+LRSELM I 
Sbjct: 281 VEMIMRIFPVTLDGEAALWYDLNIEPYSSLSWEEIKSSFLQAYHRXGLTDELRSELMMIN 340

Query: 200 QREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKEWMILQKPSSLNEAL 259
           Q  EESVR YFLRLQ ILK+W P + L DGLL+ IF+DGLR++F++W+I QKPSSLNEAL
Sbjct: 341 QGTEESVRSYFLRLQWILKRW-PDHGLPDGLLEGIFIDGLRKDFQDWIIPQKPSSLNEAL 400

Query: 260 RLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLWKSREKKNTVD----- 319
           RLAF +E+V+++    ++    CGFC G H+E  CE+RERMR LW  + KK T D     
Sbjct: 401 RLAFAWEKVQSIRGGREK---ECGFCSGGHDEEGCEIRERMRXLW-VKSKKQTRDYSGRI 460

Query: 320 VVQSDGREAAMATAELMRSSSAISRNESEVENDGGEMVGLKKKSQCQCWKHQCGMKKLDR 375
           V   DG +       +   S  + +NE E E      +G KKKSQCQC KHQC  KKL+R
Sbjct: 461 VNDEDGEKEFERRVSVGGESRBVGKNEEEGEEG---XMGWKKKSQCQCGKHQCWKKKLER 513

BLAST of Cla013544 vs. TrEMBL
Match: A0A061DJI4_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_001704 PE=4 SV=1)

HSP 1 Score: 328.9 bits (842), Expect = 7.8e-87
Identity = 185/370 (50.00%), Positives = 238/370 (64.32%), Query Frame = 1

Query: 20  DYTTATDYDASPSQSL-----YASNEDDYDP-------SESVNSHPTDPKSKSLEIKPSD 79
           +Y   T    SP +S         NE+DYD        SES+ + P  PK+    ++ + 
Sbjct: 41  NYVDNTSLSHSPDESNGDDLEQPRNENDYDDFDASDFQSESMTNAPNAPKTL---LRGNG 100

Query: 80  LRTAAESASKNSLAYLQTPNAAQTVFPYINVAPLPIFHGSADECPVIHLSRFAKVCRANN 139
           L  AA   S ++ A     N  +    YIN+APLPIF GS  +CPV HLSRFAKVCRANN
Sbjct: 101 LSAAASLNSVSNSAIWSRSNLIEAT-SYINIAPLPIFQGSPSDCPVTHLSRFAKVCRANN 160

Query: 140 ATSIDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSCFLDAFNKIELTDQLRSELM 199
            +S+DMMMRIFPVTLE EA LWYDLNIEPYP + WEE+KS FL A++K ++T+QLR ELM
Sbjct: 161 VSSVDMMMRIFPVTLENEAGLWYDLNIEPYPSLRWEEIKSSFLQAYHKTQVTEQLRHELM 220

Query: 200 TIKQREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKEWMILQKPSSLN 259
            I Q  EE VR YFLRLQ  L++W P + + + LLK IFVDGLRE+F++W++ QKP SL 
Sbjct: 221 MINQGSEERVRSYFLRLQWSLQRW-PDHGIPENLLKEIFVDGLREDFQDWIVPQKPDSLV 280

Query: 260 EALRLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLWKSREKKNTVDVV 319
           EALRLA  FEQ++++  S K+  L+C FCEG HEE  C+VRERM++LW+  + K  +D  
Sbjct: 281 EALRLAIAFEQLKSIKISRKKD-LKCDFCEGSHEERNCQVRERMKELWRKTKDKEWMDSS 340

Query: 320 Q-SDGREAAMATAELMRSSSAISRNESEVENDGGEMVG--LKKKSQCQCWKHQCGMKKLD 375
           + +   EA   +AE     SA  R E E   +G  + G   KKKS CQC KHQC  K+LD
Sbjct: 341 EKNQSNEAVNESAE----GSAEDRIEEENVVEGEMLSGRKQKKKSPCQCCKHQCWKKQLD 400

BLAST of Cla013544 vs. TrEMBL
Match: K7KXT1_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_06G277800 PE=4 SV=1)

HSP 1 Score: 245.7 bits (626), Expect = 8.7e-62
Identity = 158/369 (42.82%), Positives = 209/369 (56.64%), Query Frame = 1

Query: 21  YTTATDYDASP--SQSLYASNEDDYDP---------------SESVNSHPTDPK--SKSL 80
           Y    D DAS    +  Y S+E++ +P               SESV+++ T P     S 
Sbjct: 42  YNVDDDADASEYSQEDEYESDEEEEEPEQDIDEDDNNVNGISSESVSNNSTPPNVPETST 101

Query: 81  EIKPSDLRTAAESASKNSLAYLQTPNAAQTVFPYINVAPLPIFHGSADECPVIHLSRFAK 140
            I  SDLR  + ++S +                Y+ +APLPIF G + E PV HLSRF K
Sbjct: 102 SISASDLRNPSSNSSSS----------------YVKIAPLPIFRGMSSESPVTHLSRFNK 161

Query: 141 VCRANNATSIDMMMRIFPVTLEGEAALWYDLNIEP-YPPISWEELKSCFLDAFNKIELTD 200
           VCRANNA+S+DM MRIFPVTLE EAALWYDLN+EP Y  +SWEE K  FL A+  +E  +
Sbjct: 162 VCRANNASSVDMQMRIFPVTLEDEAALWYDLNVEPYYGSLSWEETKLSFLQAYYDVEPVE 221

Query: 201 QLRSELMTIKQREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKEWMIL 260
           +LRS+L+ I+Q + ESVR YFLRLQ ILK+WP  + L + +LK +FVDGLREEF+EW+++
Sbjct: 222 ELRSKLVGIRQDQRESVRSYFLRLQWILKRWPE-HGLGEDVLKGVFVDGLREEFREWVLM 281

Query: 261 QKPSSLNEALRLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLWKSREK 320
           QKP SLN+AL+LAF FE+V  V   GK G        GP     CEVR+ +         
Sbjct: 282 QKPGSLNDALKLAFEFEKVWRV--RGKEGV-------GP----TCEVRDVL--------- 341

Query: 321 KNTVDVVQSDGREAAMATAELMRSSSAISRNESEVENDGGEMVGLKKKSQCQCWKHQCGM 370
                     G++  + +  +   SS   R E    ++G  +VG  KK QC    H+CG 
Sbjct: 342 ----------GKDLIVGSCSVGEISSGQERVEDAKGSEG--LVGSVKKKQC----HKCGK 355

BLAST of Cla013544 vs. NCBI nr
Match: gi|703109765|ref|XP_010099386.1| (hypothetical protein L484_004813 [Morus notabilis])

HSP 1 Score: 343.2 bits (879), Expect = 5.8e-91
Identity = 196/394 (49.75%), Positives = 256/394 (64.97%), Query Frame = 1

Query: 6   RRSPSPLPFRRR-NADYTTATDYDASPSQSL-YASNEDDYDPSESVNSHPTDPKSKSLEI 65
           RR+P+P  +    + DYTT      SP+ S  +   E+D D ++  +  PTD  +  L  
Sbjct: 19  RRTPTPQDYSSTYDDDYTTVV---RSPNDSTEFDQPENDDDDNDDASDAPTDSATNPLSD 78

Query: 66  KPSDL------RTAAESASKNSLAYLQTPNAAQTVF-PYINVAPLPIFHGSADECPVIHL 125
           + S +      R  + SAS + + +L     +QT +  Y+N+A  PIF G ++ECP  HL
Sbjct: 79  QFSSVSERINARKKSCSASHSPILHLPQQPVSQTGYNSYMNIAQFPIFRGGSEECPFAHL 138

Query: 126 SRFAKVCRANNATSIDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSCFLDAFNKI 185
           SRFAKVCRANN +SIDMMM+IFPVTLE EAALWYDLN+EPY  +SWEE+KS F  A+ KI
Sbjct: 139 SRFAKVCRANNVSSIDMMMKIFPVTLEDEAALWYDLNVEPYEELSWEEIKSSFYHAYGKI 198

Query: 186 ELTDQLRSELMTIKQREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKE 245
           ELT+QLRS+LMTI Q + ESVR YFLRLQ ILKKWP  + LSD LLK +FVDGLR +F+E
Sbjct: 199 ELTEQLRSQLMTINQGDAESVRSYFLRLQWILKKWPE-HGLSDDLLKGVFVDGLRGDFQE 258

Query: 246 WMILQKPSSLNEALRLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLW- 305
           WM  QKP SLN+ALRLAF FEQV+++    +   ++CGFC G HEE  CEVRERMR+LW 
Sbjct: 259 WMAPQKPGSLNKALRLAFCFEQVKSIRNVRRNASVKCGFCGGLHEERGCEVRERMRELWL 318

Query: 306 ---------KSREKKNTVDV---VQSDGREAAMATAELMRSSSAISRNESEVENDG---- 365
                    K   ++N ++    V+  GR  +MAT+   RS+  + +N+ +VE DG    
Sbjct: 319 KSNKDDGLGKGMLERNLIEKSEGVKELGRSVSMATS---RSTCVVGKND-QVEEDGKEEE 378

Query: 366 GEMVGLKKKSQCQCWKHQCGMKKLDRNLSMVSKN 374
            E+   KK+SQCQC KHQC  K ++RN S VS N
Sbjct: 379 DELGSKKKRSQCQCGKHQCWKKNIERNNSTVSGN 404

BLAST of Cla013544 vs. NCBI nr
Match: gi|223542750|gb|EEF44287.1| (conserved hypothetical protein [Ricinus communis])

HSP 1 Score: 337.8 bits (865), Expect = 2.4e-89
Identity = 185/378 (48.94%), Positives = 248/378 (65.61%), Query Frame = 1

Query: 1   MARKLRRSPSPLPFRRRNADYTTATDYDASPSQSLYASNEDDYDPSESVNSHPTDPKSKS 60
           M RK + S   L F  R+ DY+ +T    SPSQS Y SN+DD +  +     P   +S +
Sbjct: 1   MTRKAKNSRKSLQFSSRH-DYSEST----SPSQSPYDSNDDDDEIEDDDEEQPIISESVT 60

Query: 61  LEIKPSDLRTAAESASKNSLAYLQTPNAAQTVFPYINVAPLPIFHGSADECPVIHLSRFA 120
             +    L +++ S S+        PN +     YINVAPLP+FHG+++ECP+ HLSRF 
Sbjct: 61  NSLNADQLSSSSYSNSQ--------PNNS-----YINVAPLPVFHGNSNECPIAHLSRFV 120

Query: 121 KVCRANNATSIDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSCFLDAFNKIELTD 180
           KVCRANNA+S DMMMRIFPVTLE EAALWYDLNI+PYP +SW+E+   FL+A+ +I+L D
Sbjct: 121 KVCRANNASSTDMMMRIFPVTLENEAALWYDLNIQPYPSLSWDEIMLSFLEAYQRIKLVD 180

Query: 181 QLRSELMTIKQREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKEWMIL 240
           QLRS+LM + Q  +ESVR YF+RLQ ILK+W P + LSD +LK IF+DGL   FK+W+I 
Sbjct: 181 QLRSDLMMLNQGSDESVRSYFMRLQWILKRW-PDHGLSDNMLKWIFIDGLMGNFKDWIIP 240

Query: 241 QKPSSLNEALRLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLWKSREK 300
            KP+SLNEALRLAF FEQV+++  + K+  ++CGFCEG HEE  C VRE+MR+L+++ +K
Sbjct: 241 HKPNSLNEALRLAFSFEQVKSIRGT-KQKVVKCGFCEGSHEENCCVVREKMRELFRNSKK 300

Query: 301 KNTVDVVQSDGREAAMATAELMRSSSAISRNESEVENDGGE--MVGLKK--KSQCQCWKH 360
           K  +    S+  EA    AE           E +V +D  E  M+   K  KS CQC KH
Sbjct: 301 KMMIPKEASERSEAGNEMAENKDGKEGEEEEEVDVGDDKEEKRMLSSSKTGKSPCQCSKH 358

Query: 361 QCGMKKLDRNLSMVSKNS 375
            C MKK +R+ S+ ++NS
Sbjct: 361 HCWMKKFERSNSVTTRNS 358

BLAST of Cla013544 vs. NCBI nr
Match: gi|147817046|emb|CAN62167.1| (hypothetical protein VITISV_007470 [Vitis vinifera])

HSP 1 Score: 332.4 bits (851), Expect = 1.0e-87
Identity = 190/370 (51.35%), Positives = 243/370 (65.68%), Query Frame = 1

Query: 20  DYTTATDYDASPSQSLYASNEDDYDPSESVNSHPTDPKSKSLEIKPSD------LRTAAE 79
           DYT     + SPSQS Y  +E++ D      S  TD +S S    P D      L +  +
Sbjct: 161 DYT-----EQSPSQSPYEFDEEEEDEQ----SXYTDNESASGTNAPGDQFSLPALESIPK 220

Query: 80  SASKNSLAYLQTPNAAQTVFP---YINVAPLPIFHGSADECPVIHLSRFAKVCRANNATS 139
             S    + L + + +   F    YIN+APLPIF GS+DECPV HLSRF KVCRANN +S
Sbjct: 221 GKSFRPSSSLNSSSNSLNPFXQSSYINIAPLPIFRGSSDECPVTHLSRFTKVCRANNVSS 280

Query: 140 IDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSCFLDAFNKIELTDQLRSELMTIK 199
           ++M+MRIFPVTL+GEAALWYDLNIEPY  +SWEE+KS FL A++++ LTD+LRSELM I 
Sbjct: 281 VEMIMRIFPVTLDGEAALWYDLNIEPYSSLSWEEIKSSFLQAYHRJGLTDELRSELMMIN 340

Query: 200 QREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKEWMILQKPSSLNEAL 259
           Q  EESVR YFLRLQ ILK+W P + L DGLL+ IF+DGLR++F++W+I QKPSSLNEAL
Sbjct: 341 QGTEESVRSYFLRLQWILKRW-PDHGLPDGLLEGIFIDGLRKDFQDWIIPQKPSSLNEAL 400

Query: 260 RLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLWKSREKKNTVD----- 319
           RLAF +E+V+++    ++    CGFC G H+E  CE+RERMR LW  + KK T D     
Sbjct: 401 RLAFAWEKVQSIRGGREK---ECGFCSGGHDEEGCEIRERMRXLW-VKSKKQTRDYSGRI 460

Query: 320 VVQSDGREAAMATAELMRSSSAISRNESEVENDGGEMVGLKKKSQCQCWKHQCGMKKLDR 375
           V   DG +       +   S  + +NE E E      +G KKKSQCQC KHQC  KKL+R
Sbjct: 461 VNDEDGEKEFERRVSVGGESRBVGKNEEEGEEG---XMGWKKKSQCQCGKHQCWKKKLER 513

BLAST of Cla013544 vs. NCBI nr
Match: gi|590709920|ref|XP_007048687.1| (Uncharacterized protein TCM_001704 [Theobroma cacao])

HSP 1 Score: 328.9 bits (842), Expect = 1.1e-86
Identity = 185/370 (50.00%), Positives = 238/370 (64.32%), Query Frame = 1

Query: 20  DYTTATDYDASPSQSL-----YASNEDDYDP-------SESVNSHPTDPKSKSLEIKPSD 79
           +Y   T    SP +S         NE+DYD        SES+ + P  PK+    ++ + 
Sbjct: 41  NYVDNTSLSHSPDESNGDDLEQPRNENDYDDFDASDFQSESMTNAPNAPKTL---LRGNG 100

Query: 80  LRTAAESASKNSLAYLQTPNAAQTVFPYINVAPLPIFHGSADECPVIHLSRFAKVCRANN 139
           L  AA   S ++ A     N  +    YIN+APLPIF GS  +CPV HLSRFAKVCRANN
Sbjct: 101 LSAAASLNSVSNSAIWSRSNLIEAT-SYINIAPLPIFQGSPSDCPVTHLSRFAKVCRANN 160

Query: 140 ATSIDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSCFLDAFNKIELTDQLRSELM 199
            +S+DMMMRIFPVTLE EA LWYDLNIEPYP + WEE+KS FL A++K ++T+QLR ELM
Sbjct: 161 VSSVDMMMRIFPVTLENEAGLWYDLNIEPYPSLRWEEIKSSFLQAYHKTQVTEQLRHELM 220

Query: 200 TIKQREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKEWMILQKPSSLN 259
            I Q  EE VR YFLRLQ  L++W P + + + LLK IFVDGLRE+F++W++ QKP SL 
Sbjct: 221 MINQGSEERVRSYFLRLQWSLQRW-PDHGIPENLLKEIFVDGLREDFQDWIVPQKPDSLV 280

Query: 260 EALRLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLWKSREKKNTVDVV 319
           EALRLA  FEQ++++  S K+  L+C FCEG HEE  C+VRERM++LW+  + K  +D  
Sbjct: 281 EALRLAIAFEQLKSIKISRKKD-LKCDFCEGSHEERNCQVRERMKELWRKTKDKEWMDSS 340

Query: 320 Q-SDGREAAMATAELMRSSSAISRNESEVENDGGEMVG--LKKKSQCQCWKHQCGMKKLD 375
           + +   EA   +AE     SA  R E E   +G  + G   KKKS CQC KHQC  K+LD
Sbjct: 341 EKNQSNEAVNESAE----GSAEDRIEEENVVEGEMLSGRKQKKKSPCQCCKHQCWKKQLD 400

BLAST of Cla013544 vs. NCBI nr
Match: gi|731408992|ref|XP_010657036.1| (PREDICTED: uncharacterized protein LOC104880826 [Vitis vinifera])

HSP 1 Score: 282.0 bits (720), Expect = 1.6e-72
Identity = 181/371 (48.79%), Positives = 226/371 (60.92%), Query Frame = 1

Query: 20  DYTTATDYDASPSQSLYASNEDDYDP--------SESVNSHPTDPKS-KSLEIKPSDLRT 79
           DYT     + SPSQS Y  +E++ D         S S  + P D  S  +LE  P   ++
Sbjct: 40  DYT-----EQSPSQSPYEFDEEEEDEQSVYTDNESASGTNAPGDQFSLPALESIPKG-KS 99

Query: 80  AAESASKNSLAYLQTPNAAQTVFPYINVAPLPIFHGSADECPVIHLSRFAKVCRANNATS 139
              S+S NS +    P    +   YIN+APLPIF GS+DECPV HLSRF KVCRANN +S
Sbjct: 100 FRPSSSLNSSSNSLNPFIQSS---YINIAPLPIFRGSSDECPVTHLSRFTKVCRANNVSS 159

Query: 140 IDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSCFLDAFNKIELTDQLRSELMTIK 199
           ++M+MRIFPVTL+GEAALW                          + LTD+LRSELM I 
Sbjct: 160 VEMIMRIFPVTLDGEAALW--------------------------LGLTDELRSELMMIN 219

Query: 200 QREEESVRLYFLRLQLILKKWPPGNSLSDGLLKTIFVDGLREEFKEWMILQKPSSLNEAL 259
           Q  EESVR YFLRLQ ILK+W P + L DGLL+ IF+DGLR++F++W+I QKPSSLNEAL
Sbjct: 220 QGTEESVRSYFLRLQWILKRW-PDHGLPDGLLEGIFIDGLRKDFQDWIIPQKPSSLNEAL 279

Query: 260 RLAFGFEQVRTVSTSGKRGFLRCGFCEGPHEELVCEVRERMRQLWKSREKKNTVD----- 319
           RLAF +E+V+++    ++    CGFC G HEE  CE+RERMR LW  + KK T D     
Sbjct: 280 RLAFAWEKVQSIRGGREK---ECGFCSGGHEEEGCEIRERMRGLW-VKSKKQTRDYSGRI 339

Query: 320 VVQSDGREAAMATAELMRSSSAISRNESEVENDGGE-MVGLKKKSQCQCWKHQCGMKKLD 375
           V   DG E      E   S    SRN  + E +G E M+G KKKSQCQC KHQC  KKL+
Sbjct: 340 VNDEDGEE-----FERRVSVGGESRNVGKNEEEGEEGMMGWKKKSQCQCGKHQCWKKKLE 365

The following BLAST results are available for this feature:
Match Name        E-value  Identity  Description
W9R9S0_9ROSA      4.0e-91  49.75     Uncharacterized protein OS=Morus notabilis GN=L484_004813 PE=4 SV=1
B9RWN5_RICCO      1.7e-89  48.94     Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1022950 PE=4 SV=1
A5C7E6_VITVI      2.1e-87  51.35     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_007470 PE=4 SV=1
A0A061DJI4_THECC  7.8e-87  50.00     Uncharacterized protein OS=Theobroma cacao GN=TCM_001704 PE=4 SV=1
K7KXT1_SOYBN      8.7e-62  42.82     Uncharacterized protein OS=Glycine max GN=GLYMA_06G277800 PE=4 SV=1

Match Name                        E-value  Identity  Description
gi|703109765|ref|XP_010099386.1|  5.8e-91  49.75     hypothetical protein L484_004813 [Morus notabilis]
gi|223542750|gb|EEF44287.1|       2.4e-89  48.94     conserved hypothetical protein [Ricinus communis]
gi|147817046|emb|CAN62167.1|      1.0e-87  51.35     hypothetical protein VITISV_007470 [Vitis vinifera]
gi|590709920|ref|XP_007048687.1|  1.1e-86  50.00     Uncharacterized protein TCM_001704 [Theobroma cacao]
gi|731408992|ref|XP_010657036.1|  1.6e-72  48.79     PREDICTED: uncharacterized protein LOC104880826 [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR005162  Retrotrans_gag_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla013544     Cla013544.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description             Source   Source Term  Source Description  Alignment
IPR005162  Retrotransposon gag domain  PFAM     PF03732      Retrotrans_gag      coord: 136..232, score: 2.9
None       No IPR available            PANTHER  PTHR33223    FAMILY NOT NAMED    coord: 92..259, score: 4.2

The following gene(s) are paralogous to this gene:

None