Cla005101 (gene) Watermelon (97103) v1

Name: Cla005101
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Os03g0859900 protein (Fragment) (AHRD V1 **-- Q0DLK3_ORYSJ); contains InterPro domain(s) IPR006869 Protein of unknown function DUF547
Location: Chr3 : 3185069 .. 3190914 (+)
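
The Location field gives a start and an end coordinate on Chr3, so the genomic span of the gene follows from the two endpoints. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of display, but not stated on the page). The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr3 : 3185069 .. 3190914 (+)'.

    Returns (chromosome, start, end, strand). The expected layout is an
    assumption based on the single record shown here.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of an inclusive coordinate range (assumes 1-based, inclusive)."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr3 : 3185069 .. 3190914 (+)")
print(chrom, strand, span_length(start, end))  # prints: Chr3 + 5846
```

Under that assumption the gene spans 5846 bp on the forward strand, introns included.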
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGGTAAAGGAGGTGTTGGCAGAGCTAGCAATGGTGGAAAGTGAGATAGCAAGGCTTGAGATCCAAATAACCCAACTCCAAAAGGACTTGAAAACTGAGCAACAACAAACCACAAAGTCCAAGCAATGGAGCTCTGAACAACCTCAAACCAATAATAATAATCCCAACAATAAACCACCATTGCATTGGAACCCAATTAGCAAAGCAACTTTTGACACTAAGGCTCTCCACTTCATTAGCAAAGCCATCAAGGGAGATTATGTGCTCAATCACTTTAAGTTGGATAATGCAAAAAATAGTGAATTAGATCCTAGAGATACCAACGACACTCATCATCTTCTTCATGAGGTTAAACTCCATGAAAGAGTTTCTAGAAAGAGTGGCCTTCTCGTTGCTTCGTCTCCGTTGCGAGACCCTCGACACCCTTCTCCAAAGGTACATCATATTTCGTTTTATTTGATTCGGTCTTTCAAATTTGCTATTATTCCTTTTATCCTTTGGTCATTCTTCTATCGATCCTTTTTCTTGTTTGTATTATACATTTACGAGATTTTTGTACTGATGAACCAATCCTAAATGTGAATGCAAAATGACCAAATCTTAGAAGCTAATGAAAACTAACTTTCTAATTACATGATGGAGCACTAAACAACATGGAACTCCTTCTCATGATCACGACTTCTCACCCATCGTGAAATATTCGAATCCCTAGCGCGATTGACATTAAAATCGTGATCATACTCCACTATTATTTCGTGGTGATTCGGCACCCACGACTGCGAATGGCCAACAAGGAGATTAAAAAAAATCGTGATGCACATTTAAAGCGTATTAGGGATTTGAATATTTCGTGATGGGTTAGAAGTCGCAATCGCGAGAAGAAGTTTCAGAACGTAAGTAGAAGGTTAGTTTTCTGGTTCCAACGACATGGTCATTTTACATTAGCATATTTAGGATTAGGTCATTTGTACAAAAATATCCACATATTTACCATGTTTCCTAATATTGAGTTGATAGATTATAGAATAGTTAATCATTGGAACAACCACCATTTTAGTCCCAAGTTTGGAAGAATAGGGACATTTCATAATGGATTTTCAAATCATATATTTTTAGTATTTGAGTATTTTAGAATATGAACCTTCTATCCCTGAGTTTTCAAAATGCAACATTTTTAGTCTATGAGTTCTTAAGAATAAGTTTAAAAGGTATTTGAAGTGATTTCTTTTTATTATTTTAAATAACCATATGACATTCGAATTCGATTAAAAAATAATGTTAGAGAGAAGAGATAACTTCTCAAAATTATTTTTTTAATAGAAAATAATTAAAAAAGTTTACTGCATAGACCCTTTTTTTAAACCTATTTTAAAAACTCGGGAACTAAGAAAGTATATTTTGAAAATGTACACACTCTTAAAAAACTCTAAGGCTAAAAAAATACATTTTAAAGAATCAAATGCACATATTCTTCTAAATTCGTAATTTTTTTCCCTTAAATCTTTTATATATATATATATATATATATTATATTGATAAGGTTTTTTGGTTAAATCACTTTTGTGTTAGCAACGGGAGCGAAATCCATTAGACATGCCACCACCAAAGTCTATGCCAATGCCAATTCAAGTAGAAGAAAACATCAAAAATTGGCACCCCAACAAGCTATCAGAGAGTATCATGAAGTGCTTGAATTTCATATATGTGAGACTGCTGAGAGCCTCAAGAACAATGGAGTTAGAGAAGTCAGGTCCCATTTCAAGATCTTTGCATTATTCTTCTTTGAGCTCGAGAAGCTTCCGAGTCGAGAACGGTTTAAACTCGAGCCTTTTGATACACAAAGAACTGAGGCAACAAGATCCTTATGGCATCTTCGAAAATGAAGAATCGATACCGAGGGATATTGGTCCTTACAAGAACTTGGTCATATTCACATCAACTTCCATGGATCCCAAATCCATATCTAGTGCCACCTTCATCCCTCTCATAAGGAAGCT
AAGGTATATTTTAAAGCCATCTTTTAAGCTTCATGAGTGTTTAATTAATATTTGAAGATGACTAAAATCTAATTGCTTTAATGTCTTATGACTCGAATTTTGAAAACAATGAAATAACATTTTGCTTACGATGTTATAAAGCATGCACTTGGCAGCGTGAAATTTCCTCTTGTAGTTGTGAGTTCTCATCCATCGCAAATACTTGGGATCCTTGACATGAGTTAAAAAATACGAATACAAGCCCATTATTAAAGTTTGGTGATGGGTGAGAACTTGCGACCGTTGGTTTAAATGGTCATCACGAGCATTGGTCATTGTTTTGTTTGACTCTACATCGGCCATTGTTAAAATTTCACATTGCCCATCAATTTGTGATATAAGCAAAATGTTAGTTTTCATTGTCTCTAGAATTTGGGTCTTTCGACAATAGCGCAATCATATTTGGGTCATCCATACAAAAAAACTCATATCTAAGAGCCTTTGCCCATTGAAGCAACAAAGATTGAGACTTTTTTGTTTTTATGTTTTTTTTTTTAAAATACCTTTTTGGTCTCTAAATTTTTTTTTGTTAGGTTTCAAAATGTAATACTTTTAGTCCTCTAGTTTTGAGTTTAATTTCAATCTAATTCTTCCTAGGTTACATTTCAAAATGTTATAATTTTATAATTGAGGTTTGAGTTTTGTTTCAATTGGTTTCTAAGTTTCAGGATTTACACTTTTAACTTTGATTTTCACTAAATACTCACTTGTAGTCTTTAGTATTAATGTCTACTAGTTAATTTAAATGGATTAAAGAATTATAATTAATTAAATTTCACTATTTTTTTATGACTATTAAAATTAATTTTAAAATTTTACTTTATAATTGTTTTTAAATTAATTAATAGACATTAAAATCAAAGATTAAAAGTGAATATTTTGAGATTAAAAATATAAATCTTGAAACACAGGCACGAAATTAAGACAAAACTCAAACCTTAAATGTAAAATCGTAACATTTTTTTGAAACTCAGGTACTAAACTGATATAAACTCAAAACTTAAGAGCTAAAAATTGTAACAATTTGAAACTTAAAAACCAAATAGAAACTAAACCTAAACCTACCTCATTCTTGGGTTTGTGGTCACTTTATGGTGATGAATGCAGGGTCCTAATGAGCAATTTGCAAAGAGTGGATCTAAAGCCATTGAATTACCAACAAAAACTGGCATTTTGGATCAACATGTACAATGCTTGTATCATGAATGTAAATTTATTGGATCAAAACACCATTTTTTTTCTTTCCAATTCAACTTTAATTCCTTTTTAATGCTTTTCTAACAAATTGTTTGTTTTTTTTTAGGGATTCCTCCAATATGGAGTGCCTTCTTCTCCAGAAACACTAGCCACTTTGATGAATAAGGCAAGTAACTTATCGATCATATGGATTTTTTTCTTTCTTTCTAGGTGTACTTGTCCATTTTGAATCTAAGTTCTCATAAATCTTAGTTTTTATAGTCTTTTACTGATTTTTCCAAGTCAATTCATCCATGTTACACTACGTGCCCAACAAACTATTTTTGGATAAATTATTCAACGAATCTGTCACAATATTTAATTATTATCTTATAATAACGGTTGGATGTTCATTCAACCGTTATTATAAAAAAATGCGTTATAGTTTAATAACATATCGAATGTGATCAATTTGTTTGTTCGTTTTTTCCTGATTCAGGCAATGGTTAACATCGGAGGCAACGCCATAAATGCACAAGCCATAGAGCATTACATTCTAAGGAAACCAAAGTCTAGTAAAAAAGAGGTAATTGATCATGTCCCAAAAATATTTGATAATTAGAATATATAATTCAGTAGATAAGACCTCATTTATCATTTCAATAATCGATGGTTTAAATTAGCTTTCCAAGAATTAATGGTTCAATAATCCTCGCAATCGCAATTGATACACTTGAAAAAATCCCAAAAATCTTGTCCTTACGACAAAGTTTAGGCAAATAATA
TAAATGTTACTATTTACTCCTAATCCACCAATTTTTTCCTTCTTTCTTTCATTTTCATTGAAAAATACTTAGGTGCATTACAAGTTATACCTATTTTTGTGCATCCCATCAAATTTTCCAATCAATAATCTGTTAAGTGGAAAATTATTTTTACTTCTTTTTTAAAATATTTTAATCTTTTCCTAAAAACATTCTCTCTTTGAAACTTTGGTTTCTCTCTGACATATCATTAATCCCAACTTTTTTTCTTTCTCATTCATTTTTCTCACTAATAAATACATTTCTAACTTTATTGTTTCTACACTAAAAAACAATCTATTATCTTAATTTTTTTTAGTTATTTTAGTACATTTTGAGAAATACAAACCACAAACCTCTTAGCATATGATGTCACATCTTAATATGTTTTTATTAACTTTTTTCTTAATTTTTTTAATTCATCTTTATTTCTATCACATATCTTTTATTCTCAACTTTACTCTTTTTATCTCCGTAATATCTCTTTTAACTTTTTTTTTCTCTAAATCAATATCTTTTCCTTTAATTTAATATATTTCTCCTTTTAAGCATTATCATTGGATATATTGAGCATAAGTTAAATTACAAATTAAATTCAAATCTCTCAAAATCATTCCAAAATTGTGTTTAATAGAATTTATAAACTTTAAGATGCTAATGTGTACGCTAATATACATCCTTATATTTTTTAATTATGTGTCTAAATATACACCATTTACCTAATCTATATTTTAAAAAGAATTATACTTACGTGTATTATAGGTCGTACTTATCTTTGTGCTACTACTTCAATCACACTAACAAAAAGAAAAAAATCAAAAAATATTTTTAATAGAAAAATTGGTTTGTCACGTAATAGATTATGGTTGGACAATTGTGTGAATTGTACAAATATGAGAGAAGCTAGGGTGCATCTAAATATCTTTCTTTTGAAGAATAGACAAGTGTGTTAAACTTATAATTGAATTTTCTTTCTAATAGAATTCTAACCTCACTAGTGTGTCTAATAGGTTCATTACTTCTAAAAAATGACTTATATCTATCAATGACATTAAAAATAAAATTAAAAGTTTCAAAAGTCTTAATTATTAGACACATATTAAACTTCAACATACAAAATTGAAGATTTGAAACTTATTATAATATTATGAAATTGTTGGTTACTCTTTAGCATAAATTAAAATAAAGTCTTTTTCTAGAACCAACTCACTAATATAAATATGTATATATAGTAAAATGATCGATAACGAATAGTAATAAATATTGGTAGACTACTAGATACCGTAATACACTATGATAATAATAAATTATAGTAATTTACTAATGAACTATTTATCACAATTTTTTTATATTTAAAAATATTTCTACTAATTTTATAGTTTCAAACAACTAACCTTAAAATAATTAGGACGACAACAAAGAAGCCGTTGTCCGGAAGCTGTACGGCCTAGAATCATCAGAGCCGAACGTGACATTCGCCCTATGTTGTGGGACCCGGTCATCGCCGGCGGTGAGAATATACAGTGGCGAGGCGGTGGCGGCGGAGCTTGAAAGATCGAAGCTCGAGTATCTGCAGGCATCGGTGGTGGTCACCAGCTCCAGAAGGGTGGCGGTGCCGGAGCTTCTAGTTCGAAGCTTGCCAGAATTTGCAGGGGCGGCGGCGGCGGCAGACATGAAGGCGGTGGTGGAGTGGGTGTGCCACCAGCTTCCGACGTCGGGGAGTCTGAGGAAATCAATGGTGGAATGCTTTCGAGGACACCCCAAAACACAGCCCACCATCGACACCTTGCCTTATGACTTCGAGTTTCAATATCTTTTGCCTTTGTGA

mRNA sequence

ATGCAGGTAAAGGAGGTGTTGGCAGAGCTAGCAATGGTGGAAAGTGAGATAGCAAGGCTTGAGATCCAAATAACCCAACTCCAAAAGGACTTGAAAACTGAGCAACAACAAACCACAAAGTCCAAGCAATGGAGCTCTGAACAACCTCAAACCAATAATAATAATCCCAACAATAAACCACCATTGCATTGGAACCCAATTAGCAAAGCAACTTTTGACACTAAGGCTCTCCACTTCATTAGCAAAGCCATCAAGGGAGATTATGTGCTCAATCACTTTAAGTTGGATAATGCAAAAAATAGTGAATTAGATCCTAGAGATACCAACGACACTCATCATCTTCTTCATGAGGTTAAACTCCATGAAAGAGTTTCTAGAAAGAGTGGCCTTCTCGTTGCTTCGTCTCCGTTGCGAGACCCTCGACACCCTTCTCCAAAGCAACGGGAGCGAAATCCATTAGACATGCCACCACCAAAGTCTATGCCAATGCCAATTCAAGTAGAAGAAAACATCAAAAATTGGCACCCCAACAAGCTATCAGAGAGTATCATGAAGTGCTTGAATTTCATATATGTGAGACTGCTGAGAGCCTCAAGAACAATGGAGTTAGAGAAGTCAGGTCCCATTTCAAGATCTTTGCATTATTCTTCTTTGAGCTCGAGAAGCTTCCGAGTCGAGAACGGTTTAAACTCGAGCCTTTTGATACACAAAGAACTGAGGCAACAAGATCCTTATGGCATCTTCGAAAATGAAGAATCGATACCGAGGGATATTGGTCCTTACAAGAACTTGGTCATATTCACATCAACTTCCATGGATCCCAAATCCATATCTAGTGCCACCTTCATCCCTCTCATAAGGAAGCTAAGGGTCCTAATGAGCAATTTGCAAAGAGTGGATCTAAAGCCATTGAATTACCAACAAAAACTGGCATTTTGGATCAACATGTACAATGCTTGTATCATGAATGGATTCCTCCAATATGGAGTGCCTTCTTCTCCAGAAACACTAGCCACTTTGATGAATAAGGCAAGCAACGCCATAAATGCACAAGCCATAGAGCATTACATTCTAAGGAAACCAAAGTCTAGTAAAAAAGAGGACGACAACAAAGAAGCCGTTGTCCGGAAGCTGTACGGCCTAGAATCATCAGAGCCGAACGTGACATTCGCCCTATGTTGTGGGACCCGGTCATCGCCGGCGGTGAGAATATACAGTGGCGAGGCGGTGGCGGCGGAGCTTGAAAGATCGAAGCTCGAGTATCTGCAGGCATCGGTGGTGGTCACCAGCTCCAGAAGGGTGGCGGTGCCGGAGCTTCTAGTTCGAAGCTTGCCAGAATTTGCAGGGGCGGCGGCGGCGGCAGACATGAAGGCGGTGGTGGAGTGGGTGTGCCACCAGCTTCCGACGTCGGGGAGTCTGAGGAAATCAATGGTGGAATGCTTTCGAGGACACCCCAAAACACAGCCCACCATCGACACCTTGCCTTATGACTTCGAGTTTCAATATCTTTTGCCTTTGTGA

Coding sequence (CDS)

ATGCAGGTAAAGGAGGTGTTGGCAGAGCTAGCAATGGTGGAAAGTGAGATAGCAAGGCTTGAGATCCAAATAACCCAACTCCAAAAGGACTTGAAAACTGAGCAACAACAAACCACAAAGTCCAAGCAATGGAGCTCTGAACAACCTCAAACCAATAATAATAATCCCAACAATAAACCACCATTGCATTGGAACCCAATTAGCAAAGCAACTTTTGACACTAAGGCTCTCCACTTCATTAGCAAAGCCATCAAGGGAGATTATGTGCTCAATCACTTTAAGTTGGATAATGCAAAAAATAGTGAATTAGATCCTAGAGATACCAACGACACTCATCATCTTCTTCATGAGGTTAAACTCCATGAAAGAGTTTCTAGAAAGAGTGGCCTTCTCGTTGCTTCGTCTCCGTTGCGAGACCCTCGACACCCTTCTCCAAAGCAACGGGAGCGAAATCCATTAGACATGCCACCACCAAAGTCTATGCCAATGCCAATTCAAGTAGAAGAAAACATCAAAAATTGGCACCCCAACAAGCTATCAGAGAGTATCATGAAGTGCTTGAATTTCATATATGTGAGACTGCTGAGAGCCTCAAGAACAATGGAGTTAGAGAAGTCAGGTCCCATTTCAAGATCTTTGCATTATTCTTCTTTGAGCTCGAGAAGCTTCCGAGTCGAGAACGGTTTAAACTCGAGCCTTTTGATACACAAAGAACTGAGGCAACAAGATCCTTATGGCATCTTCGAAAATGAAGAATCGATACCGAGGGATATTGGTCCTTACAAGAACTTGGTCATATTCACATCAACTTCCATGGATCCCAAATCCATATCTAGTGCCACCTTCATCCCTCTCATAAGGAAGCTAAGGGTCCTAATGAGCAATTTGCAAAGAGTGGATCTAAAGCCATTGAATTACCAACAAAAACTGGCATTTTGGATCAACATGTACAATGCTTGTATCATGAATGGATTCCTCCAATATGGAGTGCCTTCTTCTCCAGAAACACTAGCCACTTTGATGAATAAGGCAAGCAACGCCATAAATGCACAAGCCATAGAGCATTACATTCTAAGGAAACCAAAGTCTAGTAAAAAAGAGGACGACAACAAAGAAGCCGTTGTCCGGAAGCTGTACGGCCTAGAATCATCAGAGCCGAACGTGACATTCGCCCTATGTTGTGGGACCCGGTCATCGCCGGCGGTGAGAATATACAGTGGCGAGGCGGTGGCGGCGGAGCTTGAAAGATCGAAGCTCGAGTATCTGCAGGCATCGGTGGTGGTCACCAGCTCCAGAAGGGTGGCGGTGCCGGAGCTTCTAGTTCGAAGCTTGCCAGAATTTGCAGGGGCGGCGGCGGCGGCAGACATGAAGGCGGTGGTGGAGTGGGTGTGCCACCAGCTTCCGACGTCGGGGAGTCTGAGGAAATCAATGGTGGAATGCTTTCGAGGACACCCCAAAACACAGCCCACCATCGACACCTTGCCTTATGACTTCGAGTTTCAATATCTTTTGCCTTTGTGA

Protein sequence

MQVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQPQTNNNNPNNKPPLHWNPISKATFDTKALHFISKAIKGDYVLNHFKLDNAKNSELDPRDTNDTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMPMPIQVEENIKNWHPNKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLLIHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQRVDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKASNAINAQAIEHYILRKPKSSKKEDDNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAAAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTQPTIDTLPYDFEFQYLLPL
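
The protein sequence above should correspond to a codon-by-codon translation of the CDS. As a rough check, the sketch below translates a nucleotide string with the standard genetic code and stops at the first stop codon. It assumes the standard nuclear code applies and that the CDS starts in frame; the function name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nt of the CDS listed above.
print(translate("ATGCAGGTAAAGGAGGTGTTGGCA"))  # prints: MQVKEVLA
```

The first eight residues match the start of the listed protein (MQVKEVLA). Passing the full CDS string to the same function would allow the whole protein to be compared.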
BLAST of Cla005101 vs. TrEMBL
Match: A0A0A0LSP4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G118870 PE=4 SV=1)

HSP 1 Score: 810.4 bits (2092), Expect = 1.2e-231
Identity = 424/511 (82.97%), Positives = 451/511 (88.26%), Query Frame = 1

Query: 2   QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQ-PQTNNNNPNNKP 61
           +VKE+LAELAMVESEIARLEIQITQLQKDLK EQQQTTKSKQWSSEQ PQTNNN    KP
Sbjct: 72  KVKEMLAELAMVESEIARLEIQITQLQKDLKFEQQQTTKSKQWSSEQQPQTNNN----KP 131

Query: 62  PLHWNPISKATFDTKALHFISKAIKGDYV-LNH-FKLDNAKNSELDPRDTNDTHHLLHEV 121
           PL+WNPISK TFDTKALHFISKAIKGDY  LNH FKLD +KN+ELDPRD  D+HH LHEV
Sbjct: 132 PLNWNPISKTTFDTKALHFISKAIKGDYAPLNHHFKLDTSKNNELDPRDAKDSHHPLHEV 191

Query: 122 KLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMPMPIQVEENIKNWHPN 181
           KLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKS+PM  Q EENI+NWHPN
Sbjct: 192 KLHERSVSRKSGLLVASSPLRDPRHPSPKQRERNPLDIPLPKSIPMLTQAEENIQNWHPN 251

Query: 182 KLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLLIHK 241
           KLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSL  HK
Sbjct: 252 KLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSAHK 311

Query: 242 ELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQ 301
           ELRQQDPYGIFENEES+PRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQ
Sbjct: 312 ELRQQDPYGIFENEESLPRDIGPYKNLVIFTSTSMDPKSISSATFIPLMRKLRVLMSNLQ 371

Query: 302 RVDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKASNAINAQAIEHYI 361
           +VDL+PL+YQQKLAFWINMYNACIMN        +        ++   ++  ++   H+ 
Sbjct: 372 KVDLRPLSYQQKLAFWINMYNACIMNVNSYQLCYNKRMIFFVGISSIWSSFVSRKTSHFD 431

Query: 362 L--RKPKSSKKEDDNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEAVAAELE 421
              +KP S  KEDDNKEA+VRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGE V  ELE
Sbjct: 432 EQGKKPMSINKEDDNKEAIVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEGVGVELE 491

Query: 422 RSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAAAAADMKAVVEWVCHQLPTSGSLRK 481
           RSKLEYLQASVVVTSS+RVAVPELLVRSLPEF    ++ADMK VVEWVCHQLPTSGSLRK
Sbjct: 492 RSKLEYLQASVVVTSSKRVAVPELLVRSLPEF----SSADMKTVVEWVCHQLPTSGSLRK 551

Query: 482 SMVECFRGHPKTQPTIDTLPYDFEFQYLLPL 507
           SMVECFRGHPKTQPTIDTLPYDFEFQYLLPL
Sbjct: 552 SMVECFRGHPKTQPTIDTLPYDFEFQYLLPL 574
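
The percentages on each HSP summary line are the count of identical (or positively scoring) columns divided by the alignment length, gap columns included. The sketch below recomputes them from the raw counts as a sanity check. The regular expression is an assumption about the line layout shown on this page (it also accepts the "Postives" spelling some BLAST reports emit), and `parse_hsp_stats` is a hypothetical helper name.

```python
import re

# Matches e.g. "Identity = 424/511 (82.97%), Positives = 451/511 (88.26%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Recompute identity and positives percentages from the reported counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, _, positive, pos_length, _ = m.groups()
    identical, length = int(identical), int(length)
    positive, pos_length = int(positive), int(pos_length)
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / pos_length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 424/511 (82.97%), Positives = 451/511 (88.26%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])  # prints: 82.97 88.26
```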

BLAST of Cla005101 vs. TrEMBL
Match: A0A067FAJ0_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g037790mg PE=4 SV=1)

HSP 1 Score: 598.2 bits (1541), Expect = 9.3e-168
Identity = 332/539 (61.60%), Positives = 404/539 (74.95%), Query Frame = 1

Query: 2   QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQT--TKSKQWS-----SEQPQTNNN 61
           +VKE+LAELA+VE EI RLE QI+QLQ  LK EQ+ T  TKSKQW      + Q  +   
Sbjct: 174 KVKELLAELALVEGEIKRLEGQISQLQLGLKHEQEVTKETKSKQWQLGSLGNLQGHSTYM 233

Query: 62  NPNNKPPLHWNPISKATFDTKALHFISKAIKGDYVLNHF-----KLDNAKNSELDPRDTN 121
              + P ++     K  F+TKALHFISKAIKGDY L+ F     K+ N+K   +D ++  
Sbjct: 234 ANISSPLINKVGNEKVAFETKALHFISKAIKGDYNLSDFSVNEKKMGNSKVVFVDQKENQ 293

Query: 122 DTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMP---PPKSMPMPIQ 181
                  EVK  +RV RKSG++  +SPLRDPRHP+PK RERN  ++    PPKS+   I 
Sbjct: 294 FQQQ--QEVKFQDRVPRKSGMIKPASPLRDPRHPTPKPRERNAAEISFDLPPKSLSNSIL 353

Query: 182 VEENIKNWHPNKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVE 241
           +EE+I+NW PNKLSESIMKCLNFIYVRLLR SR +ELEK+GPISRS+H SS++SRSFR +
Sbjct: 354 LEESIQNWQPNKLSESIMKCLNFIYVRLLRTSRAIELEKAGPISRSMH-SSITSRSFRAD 413

Query: 242 NGLNS--SLLIHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIP 301
             LNS  S+++ K+ RQQDPYGIF+ EESIPRDIGPYKNLVIF+S+SMDPK ISS++ +P
Sbjct: 414 TSLNSKSSIVLQKDSRQQDPYGIFDMEESIPRDIGPYKNLVIFSSSSMDPKCISSSSSVP 473

Query: 302 LIRKLRVLMSNLQRVDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKA 361
           LIRKLR+LM+NLQ VDLK L YQQKLAFWINM+NACIM+GFLQYGVP+SPE L  LMNKA
Sbjct: 474 LIRKLRILMNNLQTVDLKALTYQQKLAFWINMFNACIMHGFLQYGVPNSPEKLIALMNKA 533

Query: 362 S-----NAINAQAIEHYILRKPKSSK--------KEDDNKEAVVRKLYGLESSEPNVTFA 421
           +     + INAQAIEHYILR  +SS          E D KEA+VRKLYGLES++PNVTFA
Sbjct: 534 TLSIGGSTINAQAIEHYILRGQESSNLKEVDQKAGEKDEKEAIVRKLYGLESTDPNVTFA 593

Query: 422 LCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAA 481
           LC GTRSSPAVRIY+ + V AELE+SKLEYLQASVVVT++R++A PELL R++ +F    
Sbjct: 594 LCYGTRSSPAVRIYTADGVIAELEKSKLEYLQASVVVTNTRKIAFPELLFRNMLDF---- 653

Query: 482 AAADMKAVVEWVCHQLPTSGSLRKSMVECFR--GH--PKTQPTIDTLPYDFEFQYLLPL 507
            A D+  +VEWVCHQLPTSGSLRKSMV+CFR  GH   K   T++ +PYDFEFQYLL +
Sbjct: 654 -AMDIDTLVEWVCHQLPTSGSLRKSMVDCFRHQGHNNGKISITVEKIPYDFEFQYLLAI 704

BLAST of Cla005101 vs. TrEMBL
Match: W9S2M1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_007502 PE=4 SV=1)

HSP 1 Score: 597.0 bits (1538), Expect = 2.1e-167
Identity = 326/530 (61.51%), Positives = 394/530 (74.34%), Query Frame = 1

Query: 2   QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQ--QTTKSKQWSSEQPQTNNNNPNNK 61
           ++KE+LAELAMVE EI RLE QI++L+  LK EQ+  + +KSKQW    P       ++ 
Sbjct: 73  KMKELLAELAMVEDEIERLEGQISKLKLGLKQEQEVNKESKSKQWRYGSPLRPGPGGHSS 132

Query: 62  P-----PLHWNPIS-KATFDTKALHFISKAIKGDYVLNHFKLDNAKNSELDPRDTNDTHH 121
                 P+H    S +  ++TK LHFISKAIKGDY LN F L+  +            ++
Sbjct: 133 SFTFPSPMHNGVHSDRMAYETKTLHFISKAIKGDYNLNDFSLNERRGMNFKAFGDQKENY 192

Query: 122 LLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERN--PLDMPPPKSMPMPIQVEENI 181
              EVK+ ERV RKSG+L  +SP+RDPR+PSP+       PLD  PPK +   I  EENI
Sbjct: 193 FHEEVKIQERVPRKSGMLKPASPMRDPRNPSPRPTRNPGMPLDHLPPKPVSASIHSEENI 252

Query: 182 KNWHPNKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNS 241
           +NW PNKLSE I+KCLNFIY+RLLR +RTMELEKSGPISRSLH SSLSSRSFRVE G N 
Sbjct: 253 QNWQPNKLSEEILKCLNFIYIRLLRTTRTMELEKSGPISRSLH-SSLSSRSFRVEAGSNL 312

Query: 242 SLLIHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRV 301
                KE RQQDPYGIF  EESIPRDIGPYKNLV+FT++SMDPK +SS + +PL+RKLR 
Sbjct: 313 -----KESRQQDPYGIFNVEESIPRDIGPYKNLVMFTASSMDPKCVSSPSSLPLLRKLRQ 372

Query: 302 LMSNLQRVDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKA-----SN 361
           LM+ LQ VDL+ L YQQKLAFWINMYNACIM+GFLQYGVPSSPE L TLMNKA      N
Sbjct: 373 LMNILQTVDLRSLTYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLLTLMNKAILNIGGN 432

Query: 362 AINAQAIEHYILRK--PKSSKK-----EDDNKEAVVRKLYGLESSEPNVTFALCCGTRSS 421
            INAQAIEHYILRK  P S K+     E D+KEA+VR+LYGLES++PNVTFALCCGTRSS
Sbjct: 433 IINAQAIEHYILRKSAPSSMKEAYKNGEKDDKEAIVRELYGLESTDPNVTFALCCGTRSS 492

Query: 422 PAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAAAAADMKAV 481
           PAVR+Y+ E++ AELERSKLEYLQASV+VTS++++A+PELL+R+L +F     AAD + +
Sbjct: 493 PAVRMYTAESIVAELERSKLEYLQASVIVTSTKKIALPELLLRNLLDF-----AADKELL 552

Query: 482 VEWVCHQLPTSGSLRKSMVECFRGHPK---TQPTIDTLPYDFEFQYLLPL 507
           VEWVCHQLPTSGSLRKS+V+CFR H     T  T++ +PYD+EFQYLL +
Sbjct: 553 VEWVCHQLPTSGSLRKSIVDCFRSHSSGRVTSATVEKIPYDYEFQYLLAI 591

BLAST of Cla005101 vs. TrEMBL
Match: A0A0B0PR44_GOSAR (Topoisomerase 1-associated factor 1 OS=Gossypium arboreum GN=F383_07577 PE=4 SV=1)

HSP 1 Score: 595.1 bits (1533), Expect = 7.9e-167
Identity = 330/532 (62.03%), Positives = 399/532 (75.00%), Query Frame = 1

Query: 2   QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWS--------SEQPQTNN 61
           + KE+LAELAMVE EIARLE QI+QLQ DLK E++  T++KQW          + P T +
Sbjct: 53  RTKELLAELAMVEGEIARLESQISQLQLDLKQEKE-ATQAKQWQPGSLMSYLQDHPSTTS 112

Query: 62  NNPNNKPPLHWNPISKATFDTKALHFISKAIKGDYVLNHFKLDNAKNSELDPRDTNDTHH 121
           N PN   P+      K  F+TKALHFISKAIKGDY L+ F L+   +S L        + 
Sbjct: 113 N-PN---PIKQGGQEKVVFETKALHFISKAIKGDYTLSDFSLNERMDSRL--LSEQKENQ 172

Query: 122 LLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNP---LDMPPPKSMPMPIQVEEN 181
              EVK  ERV RKS LL A+SPLRDPRHPSPK RER P    D+PP KS+   +  EE+
Sbjct: 173 FQGEVKFQERVPRKSSLLKAASPLRDPRHPSPKLRERIPESNWDLPP-KSLSSTLLSEES 232

Query: 182 IKNWHPNKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLN 241
            +NWHPNKLSE+IMKCLNFI+VRLLR SR MELEKSGPI+R +  + LSSRSFRVE+ LN
Sbjct: 233 SQNWHPNKLSENIMKCLNFIFVRLLRTSRAMELEKSGPITRFMS-TPLSSRSFRVESTLN 292

Query: 242 --SSLLIHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRK 301
             SSL   KE RQQDPYGIF+ EESIPRDIGPYKNLVIF S SMDPK ISS+  IPL++K
Sbjct: 293 PKSSLGSQKESRQQDPYGIFDMEESIPRDIGPYKNLVIFASNSMDPKCISSS--IPLLKK 352

Query: 302 LRVLMSNLQRVDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKAS--- 361
           LRVLMSNLQ+VDL+ L YQQKLAFWIN+YNACIM+G+LQYGVP++PE   TLMNKA+   
Sbjct: 353 LRVLMSNLQKVDLRSLTYQQKLAFWINIYNACIMHGYLQYGVPNTPEKFLTLMNKATLNV 412

Query: 362 --NAINAQAIEHYILRKPKSS-------KKEDDNKEAVVRKLYGLESSEPNVTFALCCGT 421
             N I+AQA+EHYILRKP SS       K + DN+EA+VRKLYGL+  +PNVTFAL CGT
Sbjct: 413 GGNTISAQAMEHYILRKPASSNMKEAYQKDDKDNQEAIVRKLYGLQLMDPNVTFALSCGT 472

Query: 422 RSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAAAAADM 481
           RSSPAVRIY+ + VAAELE+SKLEYLQAS+ VT+++++A+PELL+R++ +F     + DM
Sbjct: 473 RSSPAVRIYTADGVAAELEKSKLEYLQASIAVTNTKKIALPELLLRNMFDF-----SVDM 532

Query: 482 KAVVEWVCHQLPTSGSLRKSMVECFRGH--PKTQPTIDTLPYDFEFQYLLPL 507
            ++V+WVC QLPTSGSLRKSMV+CFR H   K   T++ +PYDFEFQYLL +
Sbjct: 533 TSLVQWVCQQLPTSGSLRKSMVDCFRSHNSGKVSITVEKIPYDFEFQYLLAM 568

BLAST of Cla005101 vs. TrEMBL
Match: V4SL49_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028060mg PE=4 SV=1)

HSP 1 Score: 595.1 bits (1533), Expect = 7.9e-167
Identity = 330/535 (61.68%), Positives = 403/535 (75.33%), Query Frame = 1

Query: 2   QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQT--TKSKQWS-----SEQPQTNNN 61
           +VKE+LAELA+VE EI RLE QI+QLQ  LK EQ+ T  TKSKQW      + Q  +   
Sbjct: 73  KVKELLAELALVEGEIKRLEGQISQLQLGLKHEQEVTKETKSKQWQLGSLGNLQGHSTYM 132

Query: 62  NPNNKPPLHWNPISKATFDTKALHFISKAIKGDYVLNHF-----KLDNAKNSELDPRDTN 121
              + P ++     K  F+TKALHFISKAIKGDY L+ F     K+ N+K   +D ++  
Sbjct: 133 ANISSPLINKVGNEKVAFETKALHFISKAIKGDYNLSDFSVNEKKMGNSKVVFVDQKENQ 192

Query: 122 DTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMP---PPKSMPMPIQ 181
                  EVK  +RV RKSG++  +SPLRDPRHP+PK RER+  ++    PPKS+   I 
Sbjct: 193 FQQQ--QEVKFQDRVPRKSGMIKPASPLRDPRHPTPKPRERSAAEISFDLPPKSLSNSIL 252

Query: 182 VEENIKNWHPNKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVE 241
           +EE+I+NW PNKLSESIM+CLNFIYVRLLR SR +ELEK+GPISRS+H SS++SRSFR +
Sbjct: 253 LEESIQNWQPNKLSESIMRCLNFIYVRLLRTSRAIELEKAGPISRSIH-SSIASRSFRAD 312

Query: 242 NGLNS--SLLIHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIP 301
             LNS  S+++ K+ RQQDPYGIF  EESIPRDIGPYKNLVIF+S+SMDPK ISS++ +P
Sbjct: 313 TSLNSKSSIVLQKDSRQQDPYGIFYMEESIPRDIGPYKNLVIFSSSSMDPKCISSSSSVP 372

Query: 302 LIRKLRVLMSNLQRVDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKA 361
           LIRKLR+LM+NLQ VDLK L YQQKLAFWINM+NACIM+GFLQYGVP+SPE L  LMNKA
Sbjct: 373 LIRKLRILMNNLQTVDLKALTYQQKLAFWINMFNACIMHGFLQYGVPNSPEKLIALMNKA 432

Query: 362 S-----NAINAQAIEHYILRKPKSSK----KEDDNKEAVVRKLYGLESSEPNVTFALCCG 421
           +     + INAQAIEHYILR  +SS      E D KEA+VRKLYGLES++PNVTFALC G
Sbjct: 433 TLSIGGSTINAQAIEHYILRGQESSNLKEAGEKDEKEAIVRKLYGLESTDPNVTFALCYG 492

Query: 422 TRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAAAAAD 481
           TRSSPAVRIY+ + V AELE+SKLEYLQASVVVT++R++A PELL R++ +F     A D
Sbjct: 493 TRSSPAVRIYTADGVIAELEKSKLEYLQASVVVTNTRKIAFPELLFRNMLDF-----AMD 552

Query: 482 MKAVVEWVCHQLPTSGSLRKSMVECFR--GH--PKTQPTIDTLPYDFEFQYLLPL 507
           +  +VEWVCHQLPTSGSLRKSMV+CFR  GH   K   T++ +PYDFEFQYLL +
Sbjct: 553 IDTLVEWVCHQLPTSGSLRKSMVDCFRHQGHNNGKISITVEKIPYDFEFQYLLAI 599

BLAST of Cla005101 vs. NCBI nr
Match: gi|659071989|ref|XP_008462925.1| (PREDICTED: uncharacterized protein LOC103501181 isoform X2 [Cucumis melo])

HSP 1 Score: 884.0 bits (2283), Expect = 1.2e-253
Identity = 454/513 (88.50%), Positives = 474/513 (92.40%), Query Frame = 1

Query: 2   QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQ-PQTNNNNPNNKP 61
           +VKE+LAELAMVESEIARLEIQITQL+KDLK EQQ TTKSKQWSSEQ PQTNNNN   KP
Sbjct: 37  KVKEMLAELAMVESEIARLEIQITQLRKDLKIEQQHTTKSKQWSSEQQPQTNNNN---KP 96

Query: 62  PLHWNPISKATFDTKALHFISKAIKGDYVLNH-FKLDNAKNSELDPRDTNDTHHLLHEVK 121
           PL+WNPISK TFDTKALHFISKAIKGDY LNH FKLDN+KN+ELDPRD  D+HH LHEVK
Sbjct: 97  PLNWNPISKTTFDTKALHFISKAIKGDYALNHHFKLDNSKNNELDPRDAKDSHHPLHEVK 156

Query: 122 LHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMPMPIQVEENIKNWHPNK 181
           LHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKSMPM  Q EENI+NWHPNK
Sbjct: 157 LHERSVSRKSGLLVASSPLRDPRHPSPKQRERNPLDIPLPKSMPMLTQAEENIQNWHPNK 216

Query: 182 LSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLLIHKE 241
           LSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSL  HKE
Sbjct: 217 LSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSAHKE 276

Query: 242 LRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQR 301
           LRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQ+
Sbjct: 277 LRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLMRKLRVLMSNLQK 336

Query: 302 VDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKA-----SNAINAQAI 361
           VDL+PL+YQQKLAFWINMYNACIMNGFLQYGVPSSPE LATLMNKA      N INAQAI
Sbjct: 337 VDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKAMVNIGGNTINAQAI 396

Query: 362 EHYILRKPKSSKKEDDNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEAVAAE 421
           +HYILRKP S   EDDNKEA+VRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGE V AE
Sbjct: 397 DHYILRKPMSINIEDDNKEAIVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEGVVAE 456

Query: 422 LERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAAAAADMKAVVEWVCHQLPTSGSL 481
           LERSKLEYLQASVVVTSS+RVAVPELL+RSLPEF    ++ADMK VVEWVCHQLPTSGSL
Sbjct: 457 LERSKLEYLQASVVVTSSKRVAVPELLIRSLPEF----SSADMKTVVEWVCHQLPTSGSL 516

Query: 482 RKSMVECFRGHPKTQPTIDTLPYDFEFQYLLPL 507
           RKS+VECFRGHPKTQPTIDTL YDFEFQYLLPL
Sbjct: 517 RKSIVECFRGHPKTQPTIDTLSYDFEFQYLLPL 542

BLAST of Cla005101 vs. NCBI nr
Match: gi|659071987|ref|XP_008462917.1| (PREDICTED: uncharacterized protein LOC103501181 isoform X1 [Cucumis melo])

HSP 1 Score: 884.0 bits (2283), Expect = 1.2e-253
Identity = 454/513 (88.50%), Positives = 474/513 (92.40%), Query Frame = 1

Query: 2   QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQ-PQTNNNNPNNKP 61
           +VKE+LAELAMVESEIARLEIQITQL+KDLK EQQ TTKSKQWSSEQ PQTNNNN   KP
Sbjct: 71  KVKEMLAELAMVESEIARLEIQITQLRKDLKIEQQHTTKSKQWSSEQQPQTNNNN---KP 130

Query: 62  PLHWNPISKATFDTKALHFISKAIKGDYVLNH-FKLDNAKNSELDPRDTNDTHHLLHEVK 121
           PL+WNPISK TFDTKALHFISKAIKGDY LNH FKLDN+KN+ELDPRD  D+HH LHEVK
Sbjct: 131 PLNWNPISKTTFDTKALHFISKAIKGDYALNHHFKLDNSKNNELDPRDAKDSHHPLHEVK 190

Query: 122 LHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMPMPIQVEENIKNWHPNK 181
           LHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKSMPM  Q EENI+NWHPNK
Sbjct: 191 LHERSVSRKSGLLVASSPLRDPRHPSPKQRERNPLDIPLPKSMPMLTQAEENIQNWHPNK 250

Query: 182 LSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLLIHKE 241
           LSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSL  HKE
Sbjct: 251 LSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSAHKE 310

Query: 242 LRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQR 301
           LRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQ+
Sbjct: 311 LRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLMRKLRVLMSNLQK 370

Query: 302 VDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKA-----SNAINAQAI 361
           VDL+PL+YQQKLAFWINMYNACIMNGFLQYGVPSSPE LATLMNKA      N INAQAI
Sbjct: 371 VDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKAMVNIGGNTINAQAI 430

Query: 362 EHYILRKPKSSKKEDDNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEAVAAE 421
           +HYILRKP S   EDDNKEA+VRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGE V AE
Sbjct: 431 DHYILRKPMSINIEDDNKEAIVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEGVVAE 490

Query: 422 LERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAAAAADMKAVVEWVCHQLPTSGSL 481
           LERSKLEYLQASVVVTSS+RVAVPELL+RSLPEF    ++ADMK VVEWVCHQLPTSGSL
Sbjct: 491 LERSKLEYLQASVVVTSSKRVAVPELLIRSLPEF----SSADMKTVVEWVCHQLPTSGSL 550

Query: 482 RKSMVECFRGHPKTQPTIDTLPYDFEFQYLLPL 507
           RKS+VECFRGHPKTQPTIDTL YDFEFQYLLPL
Sbjct: 551 RKSIVECFRGHPKTQPTIDTLSYDFEFQYLLPL 576

BLAST of Cla005101 vs. NCBI nr
Match: gi|449443572|ref|XP_004139551.1| (PREDICTED: uncharacterized protein LOC101221529 [Cucumis sativus])

HSP 1 Score: 881.7 bits (2277), Expect = 6.1e-253
Identity = 455/514 (88.52%), Positives = 474/514 (92.22%), Query Frame = 1

Query: 2   QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQ-PQTNNNNPNNKP 61
           +VKE+LAELAMVESEIARLEIQITQLQKDLK EQQQTTKSKQWSSEQ PQTNNN    KP
Sbjct: 72  KVKEMLAELAMVESEIARLEIQITQLQKDLKFEQQQTTKSKQWSSEQQPQTNNN----KP 131

Query: 62  PLHWNPISKATFDTKALHFISKAIKGDYV-LNH-FKLDNAKNSELDPRDTNDTHHLLHEV 121
           PL+WNPISK TFDTKALHFISKAIKGDY  LNH FKLD +KN+ELDPRD  D+HH LHEV
Sbjct: 132 PLNWNPISKTTFDTKALHFISKAIKGDYAPLNHHFKLDTSKNNELDPRDAKDSHHPLHEV 191

Query: 122 KLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMPMPIQVEENIKNWHPN 181
           KLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKS+PM  Q EENI+NWHPN
Sbjct: 192 KLHERSVSRKSGLLVASSPLRDPRHPSPKQRERNPLDIPLPKSIPMLTQAEENIQNWHPN 251

Query: 182 KLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLLIHK 241
           KLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSL  HK
Sbjct: 252 KLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSAHK 311

Query: 242 ELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQ 301
           ELRQQDPYGIFENEES+PRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQ
Sbjct: 312 ELRQQDPYGIFENEESLPRDIGPYKNLVIFTSTSMDPKSISSATFIPLMRKLRVLMSNLQ 371

Query: 302 RVDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKA-----SNAINAQA 361
           +VDL+PL+YQQKLAFWINMYNACIMNGFLQYGVPSSPE LATLMNKA      N INAQA
Sbjct: 372 KVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKAMINVGGNTINAQA 431

Query: 362 IEHYILRKPKSSKKEDDNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEAVAA 421
           I+HYILRKP S  KEDDNKEA+VRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGE V  
Sbjct: 432 IDHYILRKPMSINKEDDNKEAIVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEGVGV 491

Query: 422 ELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAAAAADMKAVVEWVCHQLPTSGS 481
           ELERSKLEYLQASVVVTSS+RVAVPELLVRSLPEF    ++ADMK VVEWVCHQLPTSGS
Sbjct: 492 ELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEF----SSADMKTVVEWVCHQLPTSGS 551

Query: 482 LRKSMVECFRGHPKTQPTIDTLPYDFEFQYLLPL 507
           LRKSMVECFRGHPKTQPTIDTLPYDFEFQYLLPL
Sbjct: 552 LRKSMVECFRGHPKTQPTIDTLPYDFEFQYLLPL 577

BLAST of Cla005101 vs. NCBI nr
Match: gi|700209722|gb|KGN64818.1| (hypothetical protein Csa_1G118870 [Cucumis sativus])

HSP 1 Score: 810.4 bits (2092), Expect = 1.7e-231
Identity = 424/511 (82.97%), Positives = 451/511 (88.26%), Query Frame = 1

Query: 2   QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQ-PQTNNNNPNNKP 61
           +VKE+LAELAMVESEIARLEIQITQLQKDLK EQQQTTKSKQWSSEQ PQTNNN    KP
Sbjct: 72  KVKEMLAELAMVESEIARLEIQITQLQKDLKFEQQQTTKSKQWSSEQQPQTNNN----KP 131

Query: 62  PLHWNPISKATFDTKALHFISKAIKGDYV-LNH-FKLDNAKNSELDPRDTNDTHHLLHEV 121
           PL+WNPISK TFDTKALHFISKAIKGDY  LNH FKLD +KN+ELDPRD  D+HH LHEV
Sbjct: 132 PLNWNPISKTTFDTKALHFISKAIKGDYAPLNHHFKLDTSKNNELDPRDAKDSHHPLHEV 191

Query: 122 KLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMPMPIQVEENIKNWHPN 181
           KLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKS+PM  Q EENI+NWHPN
Sbjct: 192 KLHERSVSRKSGLLVASSPLRDPRHPSPKQRERNPLDIPLPKSIPMLTQAEENIQNWHPN 251

Query: 182 KLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLLIHK 241
           KLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSL  HK
Sbjct: 252 KLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSAHK 311

Query: 242 ELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQ 301
           ELRQQDPYGIFENEES+PRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQ
Sbjct: 312 ELRQQDPYGIFENEESLPRDIGPYKNLVIFTSTSMDPKSISSATFIPLMRKLRVLMSNLQ 371

Query: 302 RVDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKASNAINAQAIEHYI 361
           +VDL+PL+YQQKLAFWINMYNACIMN        +        ++   ++  ++   H+ 
Sbjct: 372 KVDLRPLSYQQKLAFWINMYNACIMNVNSYQLCYNKRMIFFVGISSIWSSFVSRKTSHFD 431

Query: 362 L--RKPKSSKKEDDNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEAVAAELE 421
              +KP S  KEDDNKEA+VRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGE V  ELE
Sbjct: 432 EQGKKPMSINKEDDNKEAIVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGEGVGVELE 491

Query: 422 RSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAAAAADMKAVVEWVCHQLPTSGSLRK 481
           RSKLEYLQASVVVTSS+RVAVPELLVRSLPEF    ++ADMK VVEWVCHQLPTSGSLRK
Sbjct: 492 RSKLEYLQASVVVTSSKRVAVPELLVRSLPEF----SSADMKTVVEWVCHQLPTSGSLRK 551

Query: 482 SMVECFRGHPKTQPTIDTLPYDFEFQYLLPL 507
           SMVECFRGHPKTQPTIDTLPYDFEFQYLLPL
Sbjct: 552 SMVECFRGHPKTQPTIDTLPYDFEFQYLLPL 574

BLAST of Cla005101 vs. NCBI nr
Match: gi|985462946|ref|XP_015388820.1| (PREDICTED: uncharacterized protein LOC102616627 isoform X3 [Citrus sinensis])

HSP 1 Score: 599.0 bits (1543), Expect = 7.8e-168
Identity = 332/539 (61.60%), Positives = 404/539 (74.95%), Query Frame = 1

Query: 2   QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQT--TKSKQWS-----SEQPQTNNN 61
           +VKE+LAELA+VE EI RLE QI+QLQ  LK EQ+ T  TKSKQW      + Q  +   
Sbjct: 37  KVKELLAELALVEGEIKRLEGQISQLQLGLKHEQEVTKETKSKQWQLGSLGNLQGHSTYM 96

Query: 62  NPNNKPPLHWNPISKATFDTKALHFISKAIKGDYVLNHF-----KLDNAKNSELDPRDTN 121
              + P ++     K  F+TKALHFISKAIKGDY L+ F     K+ N+K   +D ++  
Sbjct: 97  ANISSPLINKVGNEKVAFETKALHFISKAIKGDYNLSDFSVNEKKMGNSKVVFVDQKENQ 156

Query: 122 DTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMP---PPKSMPMPIQ 181
                  EVK  +RV RKSG++  +SPLRDPRHP+PK RERN  ++    PPKS+   I 
Sbjct: 157 FQQQ--QEVKFQDRVPRKSGMIKPASPLRDPRHPTPKPRERNAAEISFDLPPKSLSNSIL 216

Query: 182 VEENIKNWHPNKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVE 241
           +EE+I+NW PNKLSESIMKCLNFIYVRLLR SR +ELEK+GPISRS+H SS++SRSFR +
Sbjct: 217 LEESIQNWQPNKLSESIMKCLNFIYVRLLRTSRAIELEKAGPISRSMH-SSITSRSFRAD 276

Query: 242 NGLNS--SLLIHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIP 301
             LNS  S+++ K+ RQQDPYGIF+ EESIPRDIGPYKNLVIF+S+SMDPK ISS++ +P
Sbjct: 277 TSLNSKSSIVLQKDSRQQDPYGIFDMEESIPRDIGPYKNLVIFSSSSMDPKCISSSSSVP 336

Query: 302 LIRKLRVLMSNLQRVDLKPLNYQQKLAFWINMYNACIMNGFLQYGVPSSPETLATLMNKA 361
           LIRKLR+LM+NLQ VDLK L YQQKLAFWINM+NACIM+GFLQYGVP+SPE L  LMNKA
Sbjct: 337 LIRKLRILMNNLQTVDLKALTYQQKLAFWINMFNACIMHGFLQYGVPNSPEKLIALMNKA 396

Query: 362 S-----NAINAQAIEHYILRKPKSSK--------KEDDNKEAVVRKLYGLESSEPNVTFA 421
           +     + INAQAIEHYILR  +SS          E D KEA+VRKLYGLES++PNVTFA
Sbjct: 397 TLSIGGSTINAQAIEHYILRGQESSNLKEVDQKASEKDEKEAIVRKLYGLESTDPNVTFA 456

Query: 422 LCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFAGAA 481
           LC GTRSSPAVRIY+ + V AELE+SKLEYLQASVVVT++R++A PELL R++ +F    
Sbjct: 457 LCYGTRSSPAVRIYTADGVIAELEKSKLEYLQASVVVTNTRKIAFPELLFRNMLDF---- 516

Query: 482 AAADMKAVVEWVCHQLPTSGSLRKSMVECFR--GH--PKTQPTIDTLPYDFEFQYLLPL 507
            A D+  +VEWVCHQLPTSGSLRKSMV+CFR  GH   K   T++ +PYDFEFQYLL +
Sbjct: 517 -AMDIDTLVEWVCHQLPTSGSLRKSMVDCFRHQGHNNGKISITVEKIPYDFEFQYLLAI 567

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0LSP4_CUCSA  1.2e-231  82.97     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G118870 PE=4 SV=1
A0A067FAJ0_CITSI  9.3e-168  61.60     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g037790mg PE=4 SV=1
W9S2M1_9ROSA      2.1e-167  61.51     Uncharacterized protein OS=Morus notabilis GN=L484_007502 PE=4 SV=1
A0A0B0PR44_GOSAR  7.9e-167  62.03     Topoisomerase 1-associated factor 1 OS=Gossypium arboreum GN=F383_07577 PE=4 SV=...
V4SL49_9ROSI      7.9e-167  61.68     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028060mg PE=4 SV=1
Match Name                        E-value   Identity  Description
gi|659071989|ref|XP_008462925.1|  1.2e-253  88.50     PREDICTED: uncharacterized protein LOC103501181 isoform X2 [Cucumis melo]
gi|659071987|ref|XP_008462917.1|  1.2e-253  88.50     PREDICTED: uncharacterized protein LOC103501181 isoform X1 [Cucumis melo]
gi|449443572|ref|XP_004139551.1|  6.1e-253  88.52     PREDICTED: uncharacterized protein LOC101221529 [Cucumis sativus]
gi|700209722|gb|KGN64818.1|       1.7e-231  82.97     hypothetical protein Csa_1G118870 [Cucumis sativus]
gi|985462946|ref|XP_015388820.1|  7.8e-168  61.60     PREDICTED: uncharacterized protein LOC102616627 isoform X3 [Citrus sinensis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR006869  DUF547
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0042546      cell wall biogenesis
biological_process  GO:0045491      xylan metabolic process
biological_process  GO:0044036      cell wall macromolecule metabolic process
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0009507      chloroplast
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0016853      isomerase activity
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU55413      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla005101     Cla005101.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU55413      WMU55413     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                    Source   Source Term     Source Description   Alignment
IPR006869  Domain of unknown function DUF547  PFAM     PF04784         DUF547               coord: 304..422; score: 2.1
None       No IPR available                   unknown  Coil            Coil                 coord: 3..37; scor
None       No IPR available                   PANTHER  PTHR23054       UNCHARACTERIZED      coord: 4..506; score: 8.1E
None       No IPR available                   PANTHER  PTHR23054:SF16  SUBFAMILY NOT NAMED  coord: 4..506; score: 8.1E
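
The `coord` values in the InterPro table locate each feature on the protein. The sketch below shows how such a range could be turned into a length and a subsequence, assuming the coordinates are 1-based and inclusive residue positions (an assumption, since the page does not state the convention). The helper names are hypothetical, and only the first 37 residues of the protein are used to keep the example short.

```python
def coord_length(start, end):
    """Number of residues in an inclusive range (assumes 1-based, inclusive)."""
    return end - start + 1

def slice_1based(seq, start, end):
    """Return seq[start..end] for 1-based, inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"range {start}..{end} outside sequence of length {len(seq)}")
    return seq[start - 1:end]

# N-terminal 37 residues of the Cla005101 protein listed on this page.
n_term = "MQVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQ"

coil = slice_1based(n_term, 3, 37)   # predicted coil, coord: 3..37
print(len(coil), coil[:4])           # prints: 35 VKEV
print(coord_length(304, 422))        # DUF547 (PF04784) span; prints: 119
```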