Cla020841 (gene) Watermelon (97103) v1

NameCla020841
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionCytomatrix protein-like protein (AHRD V1 ***- Q84JE5_ARATH)
LocationChr5 : 26323918 .. 26325735 (-)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAAAGAGCGATACATCTCAAGCTTCCATTGAACGGCGAAATTGGGGCAAAATCTTCAATGGTCTCACACAAATGCTACAGACGCAACAAAATCAGCTCGAAACGCTCGTCAACGAGCGAAAACTTCTCGAAGACCGCATTAAAATGCAGCACGAACGATGGATCGCTGATATTCGTCTTTACGAGGATCATATCTCTCAGGTCATTTTATATCTCTGAAATTTTGTGGAATTTTTCGTCTAATTATTGAATTGGAACTTCCCCCTTTCCTCTAAAAATTAGGGTTTCTGATGAATTTTGGAATTTGCTTCTTGTGTAGATGAAGGATGATTTGTTGTTGCAAGATATGGAACGCTCACTTCAAACCTCGAAATCAGATCTGCTAACTGGAATGAAGCAGTCGGAGTTGTACCTCTGCCGACTGAAAATAGGTTTTTGTTTATTTTCCCCTTTTAATCTTTTGAAGTTACGAATAGTCAAAGCATTGATTGGATAAAATTCATGATGCCTCATGGGCGAAATCTTCTTCCCTTGGTACTTTGGTCAAATTTCAAGGAATGTAAATCGAAATACGATACTTAGAAGTAAGTTCAAGGACGCTTGTTTATACACTAGCAACAAAGTCGGTGTGACAACGATTTGGCGATTGAAAAATTTTCTCTAATTGGAATATGGTTCAAAATACTTCGGTAAACTCTTGTTTGTATTGGTTGTACTGTAATTTGTGTTGCGATGATAGGAAGAACTATATGAACGTTTTTACCTTTGGCAAAGTAGGGATTCTCAGATCTGACAATACTATACTGGCTTACGTTCTCCCAATGCAGAACATTCAGAAGCAGAGTTGGAAGATTTCAAATCTTTCTTTGACGATCTTATCTCTCATAGAAACTCCAATCCACAAGTATGGTTCATGGACTTCTTTTCTGGTTAAGAGTATTGGGACAAGTTCTAATCTTCTGTCTCTGGTTGCAGGACTCATCTTTGAGAAGTGCATCAGAACCAGCTCAGGCAAATGGTGGAAGAGAAAGTGGTTTGTCCGCATGTGGAAATACAGACGAAGCGAGACGTTCTAAGGCATTGGAAGGTGAAGTAAGGAGGTTGAGGTGTGAATATGAAAAACTTGCCTCAGAAAAGAGTTTGGAGGTGTCTGCGCTGGTGGCCGAGAAGAAATTTGTATGGAATCAGTATAATGTTATACAAGATGACTTCTCAAGTAAATTGGAGACTAAGCAGTTAGAGCTTGAACGTGCACACCTAAAGGTAGAGAAACTTCTAGCCACATTGGAACAATTACAAAGCTCAAACAATGAGAAGGATGGTGTTATTGCAACGTTAAGAAACCAAATGGGGAAGATGGAAACTGACTCATGTAAATTAAAAGACGAAATTTCCAGACTCTCACACAATTTAGAACTGCAAAGGAAGTCTATGAATGCAACTGCCACACCTGTGCTAAACCCATGCAAGGCAGGAACTAGGCCATCTAGTTTGGGAGGCAAAAATGGCACGAAAAGCAGAAGTAATGTCACTGTTAACAAAGACGCATCTTCTGCACAACCTTCTCATTCGGTAAGCAGACTATGATAATATGTGTATTATTTATGTAAATAATGGTTTTGACTTTGAACATTTTTACATTTTGTGGTCCATTTTGTGTGTTCATATGGAAATTCTAACTCAAATTCAACTTTTCAATGTGGGTAGGGAAACCAAAAGAAGAGAGGCGCTGATGATATTTCAAATCTAGGGACTCCAAGGTTGTTTACCTCTAGTTTCAAGGTCCCTAAACTGAAGAACGAACTCAATTTGTAG

mRNA sequence

ATGGGAAAGAGCGATACATCTCAAGCTTCCATTGAACGGCGAAATTGGGGCAAAATCTTCAATGGTCTCACACAAATGCTACAGACGCAACAAAATCAGCTCGAAACGCTCGTCAACGAGCGAAAACTTCTCGAAGACCGCATTAAAATGCAGCACGAACGATGGATCGCTGATATTCGTCTTTACGAGGATCATATCTCTCAGATGAAGGATGATTTGTTGTTGCAAGATATGGAACGCTCACTTCAAACCTCGAAATCAGATCTGCTAACTGGAATGAAGCAGTCGGAGTTGTACCTCTGCCGACTGAAAATAGAACATTCAGAAGCAGAGTTGGAAGATTTCAAATCTTTCTTTGACGATCTTATCTCTCATAGAAACTCCAATCCACAAGACTCATCTTTGAGAAGTGCATCAGAACCAGCTCAGGCAAATGGTGGAAGAGAAAGTGGTTTGTCCGCATGTGGAAATACAGACGAAGCGAGACGTTCTAAGGCATTGGAAGGTGAAGTAAGGAGGTTGAGGTGTGAATATGAAAAACTTGCCTCAGAAAAGAGTTTGGAGGTGTCTGCGCTGGTGGCCGAGAAGAAATTTGTATGGAATCAGTATAATGTTATACAAGATGACTTCTCAAGTAAATTGGAGACTAAGCAGTTAGAGCTTGAACGTGCACACCTAAAGGTAGAGAAACTTCTAGCCACATTGGAACAATTACAAAGCTCAAACAATGAGAAGGATGGTGTTATTGCAACGTTAAGAAACCAAATGGGGAAGATGGAAACTGACTCATGTAAATTAAAAGACGAAATTTCCAGACTCTCACACAATTTAGAACTGCAAAGGAAGTCTATGAATGCAACTGCCACACCTGTGCTAAACCCATGCAAGGCAGGAACTAGGCCATCTAGTTTGGGAGGCAAAAATGGCACGAAAAGCAGAAGTAATGTCACTGTTAACAAAGACGCATCTTCTGCACAACCTTCTCATTCGGGAAACCAAAAGAAGAGAGGCGCTGATGATATTTCAAATCTAGGGACTCCAAGGTTGTTTACCTCTAGTTTCAAGGTCCCTAAACTGAAGAACGAACTCAATTTGTAG

Coding sequence (CDS)

ATGGGAAAGAGCGATACATCTCAAGCTTCCATTGAACGGCGAAATTGGGGCAAAATCTTCAATGGTCTCACACAAATGCTACAGACGCAACAAAATCAGCTCGAAACGCTCGTCAACGAGCGAAAACTTCTCGAAGACCGCATTAAAATGCAGCACGAACGATGGATCGCTGATATTCGTCTTTACGAGGATCATATCTCTCAGATGAAGGATGATTTGTTGTTGCAAGATATGGAACGCTCACTTCAAACCTCGAAATCAGATCTGCTAACTGGAATGAAGCAGTCGGAGTTGTACCTCTGCCGACTGAAAATAGAACATTCAGAAGCAGAGTTGGAAGATTTCAAATCTTTCTTTGACGATCTTATCTCTCATAGAAACTCCAATCCACAAGACTCATCTTTGAGAAGTGCATCAGAACCAGCTCAGGCAAATGGTGGAAGAGAAAGTGGTTTGTCCGCATGTGGAAATACAGACGAAGCGAGACGTTCTAAGGCATTGGAAGGTGAAGTAAGGAGGTTGAGGTGTGAATATGAAAAACTTGCCTCAGAAAAGAGTTTGGAGGTGTCTGCGCTGGTGGCCGAGAAGAAATTTGTATGGAATCAGTATAATGTTATACAAGATGACTTCTCAAGTAAATTGGAGACTAAGCAGTTAGAGCTTGAACGTGCACACCTAAAGGTAGAGAAACTTCTAGCCACATTGGAACAATTACAAAGCTCAAACAATGAGAAGGATGGTGTTATTGCAACGTTAAGAAACCAAATGGGGAAGATGGAAACTGACTCATGTAAATTAAAAGACGAAATTTCCAGACTCTCACACAATTTAGAACTGCAAAGGAAGTCTATGAATGCAACTGCCACACCTGTGCTAAACCCATGCAAGGCAGGAACTAGGCCATCTAGTTTGGGAGGCAAAAATGGCACGAAAAGCAGAAGTAATGTCACTGTTAACAAAGACGCATCTTCTGCACAACCTTCTCATTCGGGAAACCAAAAGAAGAGAGGCGCTGATGATATTTCAAATCTAGGGACTCCAAGGTTGTTTACCTCTAGTTTCAAGGTCCCTAAACTGAAGAACGAACTCAATTTGTAG

Protein sequence

MGKSDTSQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFDDLISHRNSNPQDSSLRSASEPAQANGGRESGLSACGNTDEARRSKALEGEVRRLRCEYEKLASEKSLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLATLEQLQSSNNEKDGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLNPCKAGTRPSSLGGKNGTKSRSNVTVNKDASSAQPSHSGNQKKRGADDISNLGTPRLFTSSFKVPKLKNELNL
BLAST of Cla020841 vs. TrEMBL
Match: A0A0A0LH58_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G872190 PE=4 SV=1)

HSP 1 Score: 593.6 bits (1529), Expect = 1.7e-166
Identity = 300/365 (82.19%), Postives = 327/365 (89.59%), Query Frame = 1

Query: 1   MGKSDTSQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIR 60
           MGKSD S+ SIERRNWGKIFNGLTQML+TQQNQLETLV ERKLLEDR+KMQHERW+ADIR
Sbjct: 1   MGKSDRSKPSIERRNWGKIFNGLTQMLRTQQNQLETLVTERKLLEDRVKMQHERWVADIR 60

Query: 61  LYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFD 120
           LYEDH+SQM+D+L LQDMERS Q SKSDLL GMKQ+ELY+CRLKIEHSEAELEDFKSFFD
Sbjct: 61  LYEDHVSQMRDELFLQDMERSFQASKSDLLAGMKQTELYVCRLKIEHSEAELEDFKSFFD 120

Query: 121 DLISHRNSNPQDSSLRSASEPAQANGGRESGLSACGNTDEARRSKALEGEVRRLRCEYEK 180
           D I+H+NS  Q+S LRSASEPA+ANGG E G+S  GNTDE RRS+ALE EVRR R EYEK
Sbjct: 121 DFIAHKNSKLQESFLRSASEPAEANGGGEGGMSKFGNTDEVRRSEALESEVRRFRSEYEK 180

Query: 181 LASEKSLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLATLEQLQS 240
           LASEKS EVSALV E KFVWNQYNVI+ D+SSKL+ K  ELERAHLKVE+LLATLEQLQS
Sbjct: 181 LASEKSSEVSALVTENKFVWNQYNVIEADYSSKLKNKHSELERAHLKVEELLATLEQLQS 240

Query: 241 SNNEKDGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLNPCKAGTR 300
           SNNEKD VIA LRNQ+GKMETDS KLKDEISRLSH+LE+QRKS+NATATPVL PCKAG R
Sbjct: 241 SNNEKDDVIAMLRNQVGKMETDSFKLKDEISRLSHDLEVQRKSVNATATPVLKPCKAGLR 300

Query: 301 PSSLGGKNGTKSRSNVTVNKDASSAQPSHSGNQKKRGADDISNLGTPRLFTSSFKVPKLK 360
            S LGGKNG++SRSNV VNKDA SAQPSHSGNQ KRGA DIS+ GTPRLFTSSFKVPKLK
Sbjct: 301 TSGLGGKNGSRSRSNVIVNKDAYSAQPSHSGNQMKRGAGDISDPGTPRLFTSSFKVPKLK 360

Query: 361 NELNL 366
           NE+NL
Sbjct: 361 NEINL 365

BLAST of Cla020841 vs. TrEMBL
Match: M5Y9L9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa016441mg PE=4 SV=1)

HSP 1 Score: 334.3 bits (856), Expect = 1.8e-88
Identity = 185/376 (49.20%), Postives = 253/376 (67.29%), Query Frame = 1

Query: 1   MGKSDTSQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIR 60
           M   + S+AS + +NW   FNGL QM++ QQ+QLETL  +R+LLEDR++ Q+ERW  DIR
Sbjct: 1   MAVKERSRASSQSQNWQNTFNGLVQMIRDQQSQLETLAKDRQLLEDRVRTQNERWTYDIR 60

Query: 61  LYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFD 120
           L ED ISQMK DL  Q++  SLQ +K +L+ G+KQ E YL +LK+E++++ELE FK +FD
Sbjct: 61  LLEDQISQMKGDLEFQELAGSLQAAKFELVLGLKQREGYLTKLKLEYTDSELEGFKGWFD 120

Query: 121 DLISHRNSNPQDSSLRSASEPAQANGGRESGLSACGNT-------DEARRSKALEGEVRR 180
                             ++ +   GG E G     NT        E +RSK LE ++RR
Sbjct: 121 ----------------LYNKFSDLKGGGEDGDKRISNTKSPKKSKQEKQRSKQLEDDLRR 180

Query: 181 LRCEYEKLASEKSLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLA 240
           L  +Y+KLASEKS EVSAL+AEKKFVWNQY ++++++++KL +K  E+E+A  K++  LA
Sbjct: 181 LNQQYDKLASEKSSEVSALLAEKKFVWNQYKIMEENYTTKLRSKHSEVEQAEAKIQNFLA 240

Query: 241 TLEQLQSSNNEKDGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLN 300
            +E LQSSN EKDG IA L +++ KME+DS KLK+EIS+LS  L+L RKS +A++TP+LN
Sbjct: 241 HMEHLQSSNKEKDGKIAILISKVAKMESDSNKLKEEISKLSTELDLLRKSTSASSTPLLN 300

Query: 301 PCKAGTRPSSLGGKNGTKSRSNVTVNKDASSAQ------PSHSGNQ-KKRGADDISNLG- 360
            C AGTR  SL G N  K R+NVTV KD+S+AQ       +  G+   KR  DD+  +  
Sbjct: 301 HCTAGTRTCSLRGNNSAKDRTNVTVKKDSSAAQLPDPIKDTRKGSSISKRKIDDVITISE 360

Query: 361 TPRLFTSSFKVPKLKN 362
           TP+LF+S FKVPKLKN
Sbjct: 361 TPKLFSSRFKVPKLKN 360

BLAST of Cla020841 vs. TrEMBL
Match: A0A061FKJ5_THECC (Cytomatrix protein-related, putative OS=Theobroma cacao GN=TCM_034227 PE=4 SV=1)

HSP 1 Score: 325.1 bits (832), Expect = 1.1e-85
Identity = 172/363 (47.38%), Postives = 250/363 (68.87%), Query Frame = 1

Query: 7   SQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIRLYEDHI 66
           SQ S ER+NW KIF GL +ML+TQQ QLETL  ERK+LEDRIKMQ+ERW++D+RLYEDHI
Sbjct: 7   SQVSSERQNWDKIFEGLVEMLKTQQEQLETLAKERKILEDRIKMQYERWVSDVRLYEDHI 66

Query: 67  SQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFDDLISHR 126
           SQ+K D+  ++M R+L+ +K+DL+ G+K  E YLC+LK+E +E EL DF+ +FD L    
Sbjct: 67  SQVKSDMESKEMARALEAAKADLMVGLKHRETYLCKLKLEETEDELTDFRIWFDIL---- 126

Query: 127 NSNPQDSSLRSASEPAQANGGRESGLSACGNT-DEARRSKALEGEVRRLRCEYEKLASEK 186
           + N +D S R   E        + G+S C ++  ++   + LEG+VRRL+ +YE LASEK
Sbjct: 127 SKNSKDISQRDPEE-------TKRGMSGCKDSGSKSVTVRTLEGDVRRLKLKYENLASEK 186

Query: 187 SLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLATLEQLQSSNNEK 246
           + +++AL+AE KF WNQ+NV++  ++ KL +K  ELE+A+ K+E L++ +E+L SSN EK
Sbjct: 187 NSQITALLAENKFAWNQFNVLETQYTDKLNSKDSELEKANRKIEALISNMEELNSSNAEK 246

Query: 247 DGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLNPCKAGTRPSSLG 306
           D +I  L+ ++ + E ++ K   E+S+ S  +EL RKS NA+ TPV+  C AG R S LG
Sbjct: 247 DAIIERLKAEVSQKEANASKF-HEVSKKSREVELLRKSRNASCTPVIKRCSAGGRTSVLG 306

Query: 307 GKNGTKSRSNVTVNKDASS------AQPSHSGNQ-KKRGADDISNLG-TPRLFTSSFKVP 361
           GK+G +   NV V K+ S+       + S  G++  KR  DD++ +  TP+LFTS+FKVP
Sbjct: 307 GKHGGRDGGNVIVKKETSARNDPDLLKDSGKGSRSSKRKKDDVTPISETPKLFTSTFKVP 357

BLAST of Cla020841 vs. TrEMBL
Match: A0A0B0PTE6_GOSAR (Keratin, type II cytoskeletal 8 OS=Gossypium arboreum GN=F383_13705 PE=4 SV=1)

HSP 1 Score: 309.7 bits (792), Expect = 4.8e-81
Identity = 166/370 (44.86%), Postives = 243/370 (65.68%), Query Frame = 1

Query: 1   MGKSDTSQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIR 60
           MG    SQ S ER+ W KIF GL +ML+TQQ QLETL  ERK+LEDRIKMQ+ERW++D+R
Sbjct: 1   MGAKTRSQDSSERQKWDKIFEGLVKMLKTQQQQLETLSKERKILEDRIKMQYERWVSDVR 60

Query: 61  LYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFD 120
           LYEDHISQMK DL  ++M R L+ +K+D++ G+K  E YLC++K+E +E EL DF+ +FD
Sbjct: 61  LYEDHISQMKSDLESEEMARVLEATKADMMVGLKHREAYLCKMKLEEAEDELTDFRIWFD 120

Query: 121 DLISHRNSNPQDSSLRSASEPAQANGGRESGLSACGNTDEARRSKALEGEVRRLRCEYEK 180
            L      N +D S R+  E  +    R+   S      ++   +ALE ++RRL+ +Y+ 
Sbjct: 121 IL----GKNSKDISQRAPIETKKGTSARKRSGS------KSVTLEALEDDIRRLQLQYKN 180

Query: 181 LASEKSLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLATLEQLQS 240
           L SEKS +V+ALVAE KF WNQ+NV++  ++ KL TKQ EL++A+ ++E L++ +E+L+S
Sbjct: 181 LVSEKSCQVTALVAENKFAWNQFNVLESQYTDKLNTKQSELDKANKRIEALMSDMEELRS 240

Query: 241 SNNEKDGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLNPCKAGTR 300
           SN EKD +I  L+ ++ + + D+ + + ++S  S  +E  RKS +A+ TPV+  C AG R
Sbjct: 241 SNAEKDEIIERLKAELSRKKADASRFQ-QVSITSGEVESLRKSRSASCTPVIKRCAAGGR 300

Query: 301 PSSLGGKNGTKSRSNVTVNKDASSA----------QPSHSGNQKKRGADDISNLGTPRLF 360
              + GKN  +   N+TV K+ S++          + S S  +KK  A  IS   TP+LF
Sbjct: 301 TYVMSGKNSGRDPCNITVKKENSASHVPDLQKENEKGSRSSKRKKEDAKPISE--TPKLF 357

BLAST of Cla020841 vs. TrEMBL
Match: A0A0D2TW23_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G270000 PE=4 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 1.3e-78
Identity = 161/370 (43.51%), Postives = 241/370 (65.14%), Query Frame = 1

Query: 1   MGKSDTSQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIR 60
           MG    S+ S ER+ W KIF GL +ML+TQQ QLETL  ERK+LEDRIKMQ+ERW++D+R
Sbjct: 1   MGAKTRSEGSSERQKWDKIFEGLVKMLKTQQQQLETLSKERKILEDRIKMQYERWVSDVR 60

Query: 61  LYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFD 120
           LYEDHISQMK  L  +++ R L+ +K+D++ G+K  E YLC++K+E +E EL DF+ +FD
Sbjct: 61  LYEDHISQMKSGLESEEIARVLEATKADMMVGLKHREAYLCKMKLEEAEDELTDFRIWFD 120

Query: 121 DLISHRNSNPQDSSLRSASEPAQANGGRESGLSACGNTDEARRSKALEGEVRRLRCEYEK 180
            L      N +D S R + E  +    R+   S      ++   +ALE ++RRL+ +Y+ 
Sbjct: 121 IL----GKNSKDISQRDSIETKKGTSARKRSGS------KSVTLEALEDDIRRLQLQYKN 180

Query: 181 LASEKSLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLATLEQLQS 240
           L SEKS +V+ALVAE KF WNQ+N ++  ++  L TKQ EL++A+ ++E L++ +E+L+S
Sbjct: 181 LVSEKSCQVTALVAENKFAWNQFNALESRYTDNLNTKQSELDKANKRIEALMSDMEELRS 240

Query: 241 SNNEKDGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLNPCKAGTR 300
           S  EKD +I  L+ ++ + + D+ + + ++S+ S ++E  RKS +A+ TPV+  C AG R
Sbjct: 241 SIAEKDEIIERLKAELSRKKADASRFQ-QVSKTSGDVESLRKSRSASCTPVIKRCAAGGR 300

Query: 301 PSSLGGKNGTKSRSNVTVNKDAS----------SAQPSHSGNQKKRGADDISNLGTPRLF 360
              +GGKN  +   N+TV K+ S          + + S S  +KK  A  IS   TP+LF
Sbjct: 301 TYVMGGKNSGRDPCNITVKKENSAPHVPDIQKENEKGSRSSKRKKEDAKPISE--TPKLF 357

BLAST of Cla020841 vs. NCBI nr
Match: gi|449437422|ref|XP_004136491.1| (PREDICTED: uncharacterized protein LOC101222062 [Cucumis sativus])

HSP 1 Score: 593.6 bits (1529), Expect = 2.4e-166
Identity = 300/365 (82.19%), Postives = 327/365 (89.59%), Query Frame = 1

Query: 1   MGKSDTSQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIR 60
           MGKSD S+ SIERRNWGKIFNGLTQML+TQQNQLETLV ERKLLEDR+KMQHERW+ADIR
Sbjct: 1   MGKSDRSKPSIERRNWGKIFNGLTQMLRTQQNQLETLVTERKLLEDRVKMQHERWVADIR 60

Query: 61  LYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFD 120
           LYEDH+SQM+D+L LQDMERS Q SKSDLL GMKQ+ELY+CRLKIEHSEAELEDFKSFFD
Sbjct: 61  LYEDHVSQMRDELFLQDMERSFQASKSDLLAGMKQTELYVCRLKIEHSEAELEDFKSFFD 120

Query: 121 DLISHRNSNPQDSSLRSASEPAQANGGRESGLSACGNTDEARRSKALEGEVRRLRCEYEK 180
           D I+H+NS  Q+S LRSASEPA+ANGG E G+S  GNTDE RRS+ALE EVRR R EYEK
Sbjct: 121 DFIAHKNSKLQESFLRSASEPAEANGGGEGGMSKFGNTDEVRRSEALESEVRRFRSEYEK 180

Query: 181 LASEKSLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLATLEQLQS 240
           LASEKS EVSALV E KFVWNQYNVI+ D+SSKL+ K  ELERAHLKVE+LLATLEQLQS
Sbjct: 181 LASEKSSEVSALVTENKFVWNQYNVIEADYSSKLKNKHSELERAHLKVEELLATLEQLQS 240

Query: 241 SNNEKDGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLNPCKAGTR 300
           SNNEKD VIA LRNQ+GKMETDS KLKDEISRLSH+LE+QRKS+NATATPVL PCKAG R
Sbjct: 241 SNNEKDDVIAMLRNQVGKMETDSFKLKDEISRLSHDLEVQRKSVNATATPVLKPCKAGLR 300

Query: 301 PSSLGGKNGTKSRSNVTVNKDASSAQPSHSGNQKKRGADDISNLGTPRLFTSSFKVPKLK 360
            S LGGKNG++SRSNV VNKDA SAQPSHSGNQ KRGA DIS+ GTPRLFTSSFKVPKLK
Sbjct: 301 TSGLGGKNGSRSRSNVIVNKDAYSAQPSHSGNQMKRGAGDISDPGTPRLFTSSFKVPKLK 360

Query: 361 NELNL 366
           NE+NL
Sbjct: 361 NEINL 365

BLAST of Cla020841 vs. NCBI nr
Match: gi|659132965|ref|XP_008466482.1| (PREDICTED: paramyosin [Cucumis melo])

HSP 1 Score: 582.4 bits (1500), Expect = 5.5e-163
Identity = 299/365 (81.92%), Postives = 331/365 (90.68%), Query Frame = 1

Query: 1   MGKSDTSQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIR 60
           MGK+D S+ASIER+NWGKIFNGLTQML+TQQNQLETLV ERKLLEDR+KMQHERW+ADIR
Sbjct: 1   MGKTDRSKASIERQNWGKIFNGLTQMLRTQQNQLETLVTERKLLEDRVKMQHERWVADIR 60

Query: 61  LYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFD 120
           LYEDHISQ++D+LLL DMERSLQ SKSDLL+GMKQ+ELY+CRLKIE SEAE EDFKS FD
Sbjct: 61  LYEDHISQLRDELLLLDMERSLQASKSDLLSGMKQTELYVCRLKIEQSEAEFEDFKSVFD 120

Query: 121 DLISHRNSNPQDSSLRSASEPAQANGGRESGLSACGNTDEARRSKALEGEVRRLRCEYEK 180
           D I+H+NSN Q+S LRSASEPA+ANGGRE  LS  GNTDE RRSKALE EVRRLR EYEK
Sbjct: 121 DFIAHKNSNLQESFLRSASEPAEANGGREGVLSTFGNTDEVRRSKALEKEVRRLRSEYEK 180

Query: 181 LASEKSLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLATLEQLQS 240
           LASEKS EVSALV EKKFVW+QYNVI+ D+SSKL+ KQ ELE A LKVE+LLATLEQLQ+
Sbjct: 181 LASEKSSEVSALVTEKKFVWHQYNVIEADYSSKLKNKQSELEHARLKVEELLATLEQLQN 240

Query: 241 SNNEKDGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLNPCKAGTR 300
           SN+EKD VIATL+NQ+GKMETDSCKLKDEISRLS++LE+QRKS+N TATPVL PCKA TR
Sbjct: 241 SNDEKDDVIATLQNQVGKMETDSCKLKDEISRLSNDLEVQRKSVNPTATPVLKPCKA-TR 300

Query: 301 PSSLGGKNGTKSRSNVTVNKDASSAQPSHSGNQKKRGADDISNLGTPRLFTSSFKVPKLK 360
            S LG KN +KSRSNVTVNKD SSAQPSHSGNQKKRGADDIS+ GTPRLFTSSFKVPKLK
Sbjct: 301 SSGLGVKNVSKSRSNVTVNKDVSSAQPSHSGNQKKRGADDISDPGTPRLFTSSFKVPKLK 360

Query: 361 NELNL 366
           NE+NL
Sbjct: 361 NEINL 364

BLAST of Cla020841 vs. NCBI nr
Match: gi|596299046|ref|XP_007227591.1| (hypothetical protein PRUPE_ppa016441mg [Prunus persica])

HSP 1 Score: 334.3 bits (856), Expect = 2.6e-88
Identity = 185/376 (49.20%), Postives = 253/376 (67.29%), Query Frame = 1

Query: 1   MGKSDTSQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIR 60
           M   + S+AS + +NW   FNGL QM++ QQ+QLETL  +R+LLEDR++ Q+ERW  DIR
Sbjct: 1   MAVKERSRASSQSQNWQNTFNGLVQMIRDQQSQLETLAKDRQLLEDRVRTQNERWTYDIR 60

Query: 61  LYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFD 120
           L ED ISQMK DL  Q++  SLQ +K +L+ G+KQ E YL +LK+E++++ELE FK +FD
Sbjct: 61  LLEDQISQMKGDLEFQELAGSLQAAKFELVLGLKQREGYLTKLKLEYTDSELEGFKGWFD 120

Query: 121 DLISHRNSNPQDSSLRSASEPAQANGGRESGLSACGNT-------DEARRSKALEGEVRR 180
                             ++ +   GG E G     NT        E +RSK LE ++RR
Sbjct: 121 ----------------LYNKFSDLKGGGEDGDKRISNTKSPKKSKQEKQRSKQLEDDLRR 180

Query: 181 LRCEYEKLASEKSLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLA 240
           L  +Y+KLASEKS EVSAL+AEKKFVWNQY ++++++++KL +K  E+E+A  K++  LA
Sbjct: 181 LNQQYDKLASEKSSEVSALLAEKKFVWNQYKIMEENYTTKLRSKHSEVEQAEAKIQNFLA 240

Query: 241 TLEQLQSSNNEKDGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLN 300
            +E LQSSN EKDG IA L +++ KME+DS KLK+EIS+LS  L+L RKS +A++TP+LN
Sbjct: 241 HMEHLQSSNKEKDGKIAILISKVAKMESDSNKLKEEISKLSTELDLLRKSTSASSTPLLN 300

Query: 301 PCKAGTRPSSLGGKNGTKSRSNVTVNKDASSAQ------PSHSGNQ-KKRGADDISNLG- 360
            C AGTR  SL G N  K R+NVTV KD+S+AQ       +  G+   KR  DD+  +  
Sbjct: 301 HCTAGTRTCSLRGNNSAKDRTNVTVKKDSSAAQLPDPIKDTRKGSSISKRKIDDVITISE 360

Query: 361 TPRLFTSSFKVPKLKN 362
           TP+LF+S FKVPKLKN
Sbjct: 361 TPKLFSSRFKVPKLKN 360

BLAST of Cla020841 vs. NCBI nr
Match: gi|645229002|ref|XP_008221258.1| (PREDICTED: nuclear distribution protein nudE homolog 1 [Prunus mume])

HSP 1 Score: 332.8 bits (852), Expect = 7.6e-88
Identity = 182/376 (48.40%), Postives = 255/376 (67.82%), Query Frame = 1

Query: 1   MGKSDTSQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIR 60
           M   + S+AS +R+NW   F+GL QM++ QQ+QLETL  +R+LLEDR+++Q+ERW  DIR
Sbjct: 1   MAVKERSRASSQRQNWQNTFSGLVQMIRYQQSQLETLAKDRQLLEDRVRIQNERWTYDIR 60

Query: 61  LYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFD 120
           L ED ISQMK DL  Q++  SLQ +K +L+ G+KQ E YL +LK+E++++ELE FK +FD
Sbjct: 61  LLEDQISQMKGDLEFQELAGSLQAAKFELVLGLKQREGYLTKLKLEYTDSELEGFKGWFD 120

Query: 121 DLISHRNSNPQDSSLRSASEPAQANGGRESGLSACGNT-------DEARRSKALEGEVRR 180
                             ++ +   GG E G     NT        E +RSK LE ++RR
Sbjct: 121 ----------------LYNKFSDLKGGGEDGDKRISNTKSPKKSKQEKQRSKQLEDDLRR 180

Query: 181 LRCEYEKLASEKSLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLA 240
           L+ +Y+KLASEKS EVSAL+AEKKFVWNQY ++++++++KL +K  E+E+A   ++  LA
Sbjct: 181 LKQQYDKLASEKSSEVSALLAEKKFVWNQYKIMEENYTTKLRSKHSEVEQAEANIQNFLA 240

Query: 241 TLEQLQSSNNEKDGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLN 300
            +E LQSSN EKDG +A L +++ KME+DS KLK+E+S+LS  L+L RKS +A++TP+LN
Sbjct: 241 HMEHLQSSNKEKDGKVAILISKIAKMESDSNKLKEEVSKLSTELDLLRKSTSASSTPLLN 300

Query: 301 PCKAGTRPSSLGGKNGTKSRSNVTVNKDASSAQ------PSHSGNQ-KKRGADDISNLG- 360
            C AGTR  SL G N  K R+NVTV KD+S+AQ       +  G+   KR  DD+  +  
Sbjct: 301 HCTAGTRTYSLRGNNSAKDRTNVTVKKDSSAAQLPDPIKDTRKGSSISKRKIDDVITISE 360

Query: 361 TPRLFTSSFKVPKLKN 362
           TP+LF+S FKVPKLKN
Sbjct: 361 TPKLFSSRFKVPKLKN 360

BLAST of Cla020841 vs. NCBI nr
Match: gi|590594269|ref|XP_007017804.1| (Cytomatrix protein-related, putative [Theobroma cacao])

HSP 1 Score: 325.1 bits (832), Expect = 1.6e-85
Identity = 172/363 (47.38%), Postives = 250/363 (68.87%), Query Frame = 1

Query: 7   SQASIERRNWGKIFNGLTQMLQTQQNQLETLVNERKLLEDRIKMQHERWIADIRLYEDHI 66
           SQ S ER+NW KIF GL +ML+TQQ QLETL  ERK+LEDRIKMQ+ERW++D+RLYEDHI
Sbjct: 7   SQVSSERQNWDKIFEGLVEMLKTQQEQLETLAKERKILEDRIKMQYERWVSDVRLYEDHI 66

Query: 67  SQMKDDLLLQDMERSLQTSKSDLLTGMKQSELYLCRLKIEHSEAELEDFKSFFDDLISHR 126
           SQ+K D+  ++M R+L+ +K+DL+ G+K  E YLC+LK+E +E EL DF+ +FD L    
Sbjct: 67  SQVKSDMESKEMARALEAAKADLMVGLKHRETYLCKLKLEETEDELTDFRIWFDIL---- 126

Query: 127 NSNPQDSSLRSASEPAQANGGRESGLSACGNT-DEARRSKALEGEVRRLRCEYEKLASEK 186
           + N +D S R   E        + G+S C ++  ++   + LEG+VRRL+ +YE LASEK
Sbjct: 127 SKNSKDISQRDPEE-------TKRGMSGCKDSGSKSVTVRTLEGDVRRLKLKYENLASEK 186

Query: 187 SLEVSALVAEKKFVWNQYNVIQDDFSSKLETKQLELERAHLKVEKLLATLEQLQSSNNEK 246
           + +++AL+AE KF WNQ+NV++  ++ KL +K  ELE+A+ K+E L++ +E+L SSN EK
Sbjct: 187 NSQITALLAENKFAWNQFNVLETQYTDKLNSKDSELEKANRKIEALISNMEELNSSNAEK 246

Query: 247 DGVIATLRNQMGKMETDSCKLKDEISRLSHNLELQRKSMNATATPVLNPCKAGTRPSSLG 306
           D +I  L+ ++ + E ++ K   E+S+ S  +EL RKS NA+ TPV+  C AG R S LG
Sbjct: 247 DAIIERLKAEVSQKEANASKF-HEVSKKSREVELLRKSRNASCTPVIKRCSAGGRTSVLG 306

Query: 307 GKNGTKSRSNVTVNKDASS------AQPSHSGNQ-KKRGADDISNLG-TPRLFTSSFKVP 361
           GK+G +   NV V K+ S+       + S  G++  KR  DD++ +  TP+LFTS+FKVP
Sbjct: 307 GKHGGRDGGNVIVKKETSARNDPDLLKDSGKGSRSSKRKKDDVTPISETPKLFTSTFKVP 357

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0LH58_CUCSA1.7e-16682.19Uncharacterized protein OS=Cucumis sativus GN=Csa_3G872190 PE=4 SV=1[more]
M5Y9L9_PRUPE1.8e-8849.20Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa016441mg PE=4 SV=1[more]
A0A061FKJ5_THECC1.1e-8547.38Cytomatrix protein-related, putative OS=Theobroma cacao GN=TCM_034227 PE=4 SV=1[more]
A0A0B0PTE6_GOSAR4.8e-8144.86Keratin, type II cytoskeletal 8 OS=Gossypium arboreum GN=F383_13705 PE=4 SV=1[more]
A0A0D2TW23_GOSRA1.3e-7843.51Uncharacterized protein OS=Gossypium raimondii GN=B456_009G270000 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|449437422|ref|XP_004136491.1|2.4e-16682.19PREDICTED: uncharacterized protein LOC101222062 [Cucumis sativus][more]
gi|659132965|ref|XP_008466482.1|5.5e-16381.92PREDICTED: paramyosin [Cucumis melo][more]
gi|596299046|ref|XP_007227591.1|2.6e-8849.20hypothetical protein PRUPE_ppa016441mg [Prunus persica][more]
gi|645229002|ref|XP_008221258.1|7.6e-8848.40PREDICTED: nuclear distribution protein nudE homolog 1 [Prunus mume][more]
gi|590594269|ref|XP_007017804.1|1.6e-8547.38Cytomatrix protein-related, putative [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
WMU48111watermelon EST collection version 2.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla020841Cla020841.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
WMU48111WMU48111transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableunknownCoilCoilcoord: 20..54
score: -coord: 164..184
score: -coord: 214..255
scor
NoneNo IPR availablePANTHERPTHR35992FAMILY NOT NAMEDcoord: 1..364
score: 8.7
NoneNo IPR availablePANTHERPTHR35992:SF1CYTOMATRIX PROTEIN-LIKE PROTEINcoord: 1..364
score: 8.7

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla020841Cla97C08G157850Watermelon (97103) v2wmwmbB222
Cla020841Cla97C05G099580Watermelon (97103) v2wmwmbB207
Cla020841Csa3G872190Cucumber (Chinese Long) v2cuwmB244
Cla020841Csa2G368790Cucumber (Chinese Long) v2cuwmB126
Cla020841MELO3C003750Melon (DHL92) v3.5.1mewmB368
Cla020841MELO3C011012Melon (DHL92) v3.5.1mewmB317
Cla020841ClCG05G018260Watermelon (Charleston Gray)wcgwmB285
Cla020841ClCG08G014280Watermelon (Charleston Gray)wcgwmB412
Cla020841CSPI02G20860Wild cucumber (PI 183967)cpiwmB129
Cla020841CSPI03G43430Wild cucumber (PI 183967)cpiwmB243
Cla020841Cucsa.159900Cucumber (Gy14) v1cgywmB294
Cla020841Cucsa.343050Cucumber (Gy14) v1cgywmB615
Cla020841CmaCh01G019730Cucurbita maxima (Rimu)cmawmB440
Cla020841CmaCh05G003060Cucurbita maxima (Rimu)cmawmB759
Cla020841CmoCh05G003090Cucurbita moschata (Rifu)cmowmB753
Cla020841CmoCh01G020310Cucurbita moschata (Rifu)cmowmB434
Cla020841Lsi08G013180Bottle gourd (USVL1VR-Ls)lsiwmB481
Cla020841Cp4.1LG02g07920Cucurbita pepo (Zucchini)cpewmB559
Cla020841Cp4.1LG11g02430Cucurbita pepo (Zucchini)cpewmB109
Cla020841CsGy3G040980Cucumber (Gy14) v2cgybwmB230
Cla020841MELO3C003750.2Melon (DHL92) v3.6.1medwmB364
Cla020841MELO3C011012.2Melon (DHL92) v3.6.1medwmB311
Cla020841Carg17199Silver-seed gourdcarwmB0945
Cla020841CsaV3_3G045880Cucumber (Chinese Long) v3cucwmB253
Cla020841CsaV3_2G029530Cucumber (Chinese Long) v3cucwmB138
Cla020841Bhi04G000128Wax gourdwgowmB334
Cla020841Bhi09G001784Wax gourdwgowmB618
The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cla020841Cla022423Watermelon (97103) v1wmwmB059