Cla021666 (gene) Watermelon (97103) v1

NameCla021666
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionUnknown Protein (AHRD V1)
LocationChr5 : 5101123 .. 5102931 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCAGAAAAGGGGTTACTTCCGGTGACATCAAATCCATGGCTTTTGAGGTCGTCTCCTTTAGTACATTCCCGTTTTGGGTTTTTAACGGCGTTGATTCTTCTCAGTATGTTAGCCATTTGGAGTATCGATGGTTGTAACATCAGAACTTTCATTAAAGCTTGGAGTTCCCCTCAAGAATTCATCTCTGTTTCTTCCAATTTCACCAATACCCTTCAAACTTTATCCACTAATGATTCCAATTTCACCCAATTCATCTCTTACAAACCCGAAAAAAAATACCCAGAATCGGTGTTGATTGAACCCCTTCCGCCGCAGAATTCCACCGTCCAGCCTCGCCGGAGCCAGAGCAAACCGGCGGTTCACCATGAGCTTGCCTCTGATTGGTTTTCAGCTGAACTTGAGCCGAATTTCACCTCCCATCTTCTTGCTCTATGGATGGCTCCCGGCGGTGAGCCTTGCAGAGACTCAAAGACGGCGGACATTGCCATCTCCGGCATGGAAAGTCCGGCAATGGTGGAGCTTTCAACAGGAGATGTTCATGAGTTTCGATTCCAAGCTGTTGATGAATCTGGAAACCCTCGTTGTTTAGGTGGGGATTATTTTGAGACTGATCTTTCTGGGGATTTGTGGAAATCTCGGCCGTTCGTTAAAGATTTTGGTAATGGGACTTACTCGTTTTGGCTTCAGGTTCACCCTGATTTCGCCGGAGATTATAATCTCACTGTGATTCTTCTGTTTCGACATTTCGAGGGGCTTAGGTTTTCCCCAACCCGATTCGCCTATGATCAAGAGCTTCGGAGAATCAAGGTTCGATTTGTTAAGAATTCGGTTGTGTTGCCGGAGATTAAAATGTGTCGGAGCTCTGACTTCGATAGGGATATTTGGACCGGGCGGTGGACTCGACATGGTCGAAATGACCGTTGCAAGATCAGCGATGACGGTCGCTACCGGTGTTTTGCGCCGGATTATCCTTGTCAAAGCCCTTGGTGTAATGGCTCCTTAGGGTTGTTGGAGAGTAATGGATGGGTTTATTCTGCTCATTGTTCGTTTCAGATGTTTTCTAGTAGGTCTGCTTGGGATTGCTTGAAGGATAGATGGATCTTCTTTTGGGGTGATTCGAATCATGTCGATACGATAAGAAACCTTCTCAATTTCGTGTTGGATTTGCCCGAAATCCCTGCAGTTCCGAGGCGATTCGATAGGAATTTTTCGAACCCGAAGGACCCGTCTCAAACGGTTCGCATTACCAGCATTTTCAATGGGCATTGGAATGATACACAAAACTATGAAGGTTTGAATTCATTGAGAAATGAAGGGTTTAGAAATCTTCTGTACAAATACTTCTCGGAAGAAACTGTTCCTGACACGATCATCATGAACTCGGGCCTCCACGACGGTGTTCATTGGTTAAATATTCGAGCTTTCTCAGTTGGGGCAACCTATGCTGCATCATTCTGGAAACAAGTTTTGGAATCCATTAAGCAAAGGGGGTTACCAGTCCCGAAAGTGTTCTATCGAACCACGGTCGCAACCGGTGGCTATGCTCGAACGCTTGCCTTCAATCCTAACAAAATGGAAATGTTCAACTGGGTGGTACTGGAGAAGTTGAAGGAAGCGGGGATCGTCCATGGCGTGATTGACAACTTCGACATGACATTCCCTTGGCATTTTGACAATCGGTGCAACGACGGAGTCCATTACGGGCGAGCACCAGCGAAGTTGAAATGGAGGGACGGTGAAATTGGGCACCAATATTTCTTAGACCTCATGTTGGCTCATATTCTTCTCAATGCACTCTGCACTTGA

mRNA sequence

ATGCCAGAAAAGGGGTTACTTCCGGTGACATCAAATCCATGGCTTTTGAGGTCGTCTCCTTTAGTACATTCCCGTTTTGGGTTTTTAACGGCGTTGATTCTTCTCAGTATGTTAGCCATTTGGAGTATCGATGGTTGTAACATCAGAACTTTCATTAAAGCTTGGAGTTCCCCTCAAGAATTCATCTCTGTTTCTTCCAATTTCACCAATACCCTTCAAACTTTATCCACTAATGATTCCAATTTCACCCAATTCATCTCTTACAAACCCGAAAAAAAATACCCAGAATCGGTGTTGATTGAACCCCTTCCGCCGCAGAATTCCACCGTCCAGCCTCGCCGGAGCCAGAGCAAACCGGCGGTTCACCATGAGCTTGCCTCTGATTGGTTTTCAGCTGAACTTGAGCCGAATTTCACCTCCCATCTTCTTGCTCTATGGATGGCTCCCGGCGGTGAGCCTTGCAGAGACTCAAAGACGGCGGACATTGCCATCTCCGGCATGGAAAGTCCGGCAATGGTGGAGCTTTCAACAGGAGATGTTCATGAGTTTCGATTCCAAGCTGTTGATGAATCTGGAAACCCTCGTTGTTTAGGTGGGGATTATTTTGAGACTGATCTTTCTGGGGATTTGTGGAAATCTCGGCCGTTCGTTAAAGATTTTGGTAATGGGACTTACTCGTTTTGGCTTCAGGTTCACCCTGATTTCGCCGGAGATTATAATCTCACTGTGATTCTTCTGTTTCGACATTTCGAGGGGCTTAGGTTTTCCCCAACCCGATTCGCCTATGATCAAGAGCTTCGGAGAATCAAGGTTCGATTTGTTAAGAATTCGGTTGTGTTGCCGGAGATTAAAATGTGTCGGAGCTCTGACTTCGATAGGGATATTTGGACCGGGCGGTGGACTCGACATGGTCGAAATGACCGTTGCAAGATCAGCGATGACGGTCGCTACCGGTGTTTTGCGCCGGATTATCCTTGTCAAAGCCCTTGGTGTAATGGCTCCTTAGGGTTGTTGGAGAGTAATGGATGGGTTTATTCTGCTCATTGTTCGTTTCAGATGTTTTCTAGTAGGTCTGCTTGGGATTGCTTGAAGGATAGATGGATCTTCTTTTGGGGTGATTCGAATCATGTCGATACGATAAGAAACCTTCTCAATTTCGTGTTGGATTTGCCCGAAATCCCTGCAGTTCCGAGGCGATTCGATAGGAATTTTTCGAACCCGAAGGACCCGTCTCAAACGGTTCGCATTACCAGCATTTTCAATGGGCATTGGAATGATACACAAAACTATGAAGGTTTGAATTCATTGAGAAATGAAGGGTTTAGAAATCTTCTGTACAAATACTTCTCGGAAGAAACTGTTCCTGACACGATCATCATGAACTCGGGCCTCCACGACGGTGTTCATTGGTTAAATATTCGAGCTTTCTCAGTTGGGGCAACCTATGCTGCATCATTCTGGAAACAAGTTTTGGAATCCATTAAGCAAAGGGGGTTACCAGTCCCGAAAGTGTTCTATCGAACCACGGTCGCAACCGGTGGCTATGCTCGAACGCTTGCCTTCAATCCTAACAAAATGGAAATGTTCAACTGGGTGGTACTGGAGAAGTTGAAGGAAGCGGGGATCGTCCATGGCGTGATTGACAACTTCGACATGACATTCCCTTGGCATTTTGACAATCGGTGCAACGACGGAGTCCATTACGGGCGAGCACCAGCGAAGTTGAAATGGAGGGACGGTGAAATTGGGCACCAATATTTCTTAGACCTCATGTTGGCTCATATTCTTCTCAATGCACTCTGCACTTGA

Coding sequence (CDS)

ATGCCAGAAAAGGGGTTACTTCCGGTGACATCAAATCCATGGCTTTTGAGGTCGTCTCCTTTAGTACATTCCCGTTTTGGGTTTTTAACGGCGTTGATTCTTCTCAGTATGTTAGCCATTTGGAGTATCGATGGTTGTAACATCAGAACTTTCATTAAAGCTTGGAGTTCCCCTCAAGAATTCATCTCTGTTTCTTCCAATTTCACCAATACCCTTCAAACTTTATCCACTAATGATTCCAATTTCACCCAATTCATCTCTTACAAACCCGAAAAAAAATACCCAGAATCGGTGTTGATTGAACCCCTTCCGCCGCAGAATTCCACCGTCCAGCCTCGCCGGAGCCAGAGCAAACCGGCGGTTCACCATGAGCTTGCCTCTGATTGGTTTTCAGCTGAACTTGAGCCGAATTTCACCTCCCATCTTCTTGCTCTATGGATGGCTCCCGGCGGTGAGCCTTGCAGAGACTCAAAGACGGCGGACATTGCCATCTCCGGCATGGAAAGTCCGGCAATGGTGGAGCTTTCAACAGGAGATGTTCATGAGTTTCGATTCCAAGCTGTTGATGAATCTGGAAACCCTCGTTGTTTAGGTGGGGATTATTTTGAGACTGATCTTTCTGGGGATTTGTGGAAATCTCGGCCGTTCGTTAAAGATTTTGGTAATGGGACTTACTCGTTTTGGCTTCAGGTTCACCCTGATTTCGCCGGAGATTATAATCTCACTGTGATTCTTCTGTTTCGACATTTCGAGGGGCTTAGGTTTTCCCCAACCCGATTCGCCTATGATCAAGAGCTTCGGAGAATCAAGGTTCGATTTGTTAAGAATTCGGTTGTGTTGCCGGAGATTAAAATGTGTCGGAGCTCTGACTTCGATAGGGATATTTGGACCGGGCGGTGGACTCGACATGGTCGAAATGACCGTTGCAAGATCAGCGATGACGGTCGCTACCGGTGTTTTGCGCCGGATTATCCTTGTCAAAGCCCTTGGTGTAATGGCTCCTTAGGGTTGTTGGAGAGTAATGGATGGGTTTATTCTGCTCATTGTTCGTTTCAGATGTTTTCTAGTAGGTCTGCTTGGGATTGCTTGAAGGATAGATGGATCTTCTTTTGGGGTGATTCGAATCATGTCGATACGATAAGAAACCTTCTCAATTTCGTGTTGGATTTGCCCGAAATCCCTGCAGTTCCGAGGCGATTCGATAGGAATTTTTCGAACCCGAAGGACCCGTCTCAAACGGTTCGCATTACCAGCATTTTCAATGGGCATTGGAATGATACACAAAACTATGAAGGTTTGAATTCATTGAGAAATGAAGGGTTTAGAAATCTTCTGTACAAATACTTCTCGGAAGAAACTGTTCCTGACACGATCATCATGAACTCGGGCCTCCACGACGGTGTTCATTGGTTAAATATTCGAGCTTTCTCAGTTGGGGCAACCTATGCTGCATCATTCTGGAAACAAGTTTTGGAATCCATTAAGCAAAGGGGGTTACCAGTCCCGAAAGTGTTCTATCGAACCACGGTCGCAACCGGTGGCTATGCTCGAACGCTTGCCTTCAATCCTAACAAAATGGAAATGTTCAACTGGGTGGTACTGGAGAAGTTGAAGGAAGCGGGGATCGTCCATGGCGTGATTGACAACTTCGACATGACATTCCCTTGGCATTTTGACAATCGGTGCAACGACGGAGTCCATTACGGGCGAGCACCAGCGAAGTTGAAATGGAGGGACGGTGAAATTGGGCACCAATATTTCTTAGACCTCATGTTGGCTCATATTCTTCTCAATGCACTCTGCACTTGA

Protein sequence

MPEKGLLPVTSNPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSSPQEFISVSSNFTNTLQTLSTNDSNFTQFISYKPEKKYPESVLIEPLPPQNSTVQPRRSQSKPAVHHELASDWFSAELEPNFTSHLLALWMAPGGEPCRDSKTADIAISGMESPAMVELSTGDVHEFRFQAVDESGNPRCLGGDYFETDLSGDLWKSRPFVKDFGNGTYSFWLQVHPDFAGDYNLTVILLFRHFEGLRFSPTRFAYDQELRRIKVRFVKNSVVLPEIKMCRSSDFDRDIWTGRWTRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGSLGLLESNGWVYSAHCSFQMFSSRSAWDCLKDRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKDPSQTVRITSIFNGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEETVPDTIIMNSGLHDGVHWLNIRAFSVGATYAASFWKQVLESIKQRGLPVPKVFYRTTVATGGYARTLAFNPNKMEMFNWVVLEKLKEAGIVHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLNALCT
BLAST of Cla021666 vs. TrEMBL
Match: A0A061EKY8_THECC (F28L1.9 protein OS=Theobroma cacao GN=TCM_020507 PE=4 SV=1)

HSP 1 Score: 868.2 bits (2242), Expect = 5.7e-249
Identity = 401/624 (64.26%), Postives = 491/624 (78.69%), Query Frame = 1

Query: 1   MPEKG--LLPVTSNPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSSP 60
           MPEKG   LP  S+ WLLRSSPL   RFG LTAL+ + M+ +WSIDGC ++ FI++W   
Sbjct: 1   MPEKGGSFLPPPSSAWLLRSSPLHQWRFGLLTALVFVGMVVVWSIDGCTVKNFIQSWQFK 60

Query: 61  QEFISVSSNFTNTLQTLSTNDSNFTQFISYKPE---KKYPESVLIEPLPPQNSTVQPRR- 120
           Q++I++  N    L     N ++  + ++  P     ++P   +   + P NS+++ +  
Sbjct: 61  QDYITMKVNSLANLNHPYQNPTHSLRNLTVNPTLNTSRFPIYSINSSVFPLNSSLESKNV 120

Query: 121 -----------------SQSKPAVHHELASDWFSAELEPNFTSHLLALWMAPGGEPCRDS 180
                             ++        +  W SAELE N+TS+LLA W+APGGEPC+DS
Sbjct: 121 TQISSREMANFSSVENSDENLTTFKDSSSLKWVSAELEQNYTSNLLARWLAPGGEPCKDS 180

Query: 181 KTADIAISGMESPAMVELSTGDVHEFRFQAVDESGNPRCLGGDYFETDLSGDLWKSRPFV 240
           +T +I I G++  ++VELS G++HEF FQAVDESGN RCLGGDYFE DLSG+ WKSRP V
Sbjct: 181 QTVEIKIPGLDGESLVELSAGEIHEFMFQAVDESGNARCLGGDYFEADLSGESWKSRPPV 240

Query: 241 KDFGNGTYSFWLQVHPDFAGDYNLTVILLFRHFEGLRFSPTRFAYDQELRRIKVRFVKNS 300
           KDFGNG+YS  LQVHPDFAG+YNLTVILLFRHF+GL+FSP RFAYD++LR I +RF +  
Sbjct: 241 KDFGNGSYSVSLQVHPDFAGEYNLTVILLFRHFQGLKFSPARFAYDRQLRHIGIRFYRTK 300

Query: 301 VVLPEIKMCRSSDFDRDIWTGRWTRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGSLGL 360
             L E+  C+ SDF +D+W+GRWTRHG+ND C+IS+DGRYRC A D+PCQ+PWCNGSLGL
Sbjct: 301 ARLTELPSCQKSDFSKDVWSGRWTRHGKNDDCQISNDGRYRCLAADFPCQNPWCNGSLGL 360

Query: 361 LESNGWVYSAHCSFQMFSSRSAWDCLKDRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVP 420
           LESNGWVYS+HCSFQ+F + SAW+CLK+RWIFFWGDSNHVDTIRN+LNFVL LPEI +VP
Sbjct: 361 LESNGWVYSSHCSFQLFLADSAWNCLKNRWIFFWGDSNHVDTIRNMLNFVLGLPEIKSVP 420

Query: 421 RRFDRNFSNPKDPSQTVRITSIFNGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEETVPDT 480
           RRFD NFSNPKDPSQTVRITSIFNGHWN TQNY GL+SL++EGFRNLL KYFSE+TVPDT
Sbjct: 421 RRFDMNFSNPKDPSQTVRITSIFNGHWNGTQNYLGLDSLKDEGFRNLLKKYFSEDTVPDT 480

Query: 481 IIMNSGLHDGVHWLNIRAFSVGATYAASFWKQVLESIKQRGLPVPKVFYRTTVATGGYAR 540
           IIMNSGLHDGVHW  IRAFS GA YAA+FWK+V++S++QRGL VP++ +R T+ATGGYAR
Sbjct: 481 IIMNSGLHDGVHWSTIRAFSHGAEYAATFWKEVMDSVRQRGLVVPQIIFRNTIATGGYAR 540

Query: 541 TLAFNPNKMEMFNWVVLEKLKEAGIVHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKW 600
           +LAFNPNK+E FN V+LEKL+ AG+V GVIDNFDMTFPWHFDNRCNDGVHYGRAP K+KW
Sbjct: 541 SLAFNPNKIEAFNGVLLEKLRRAGLVSGVIDNFDMTFPWHFDNRCNDGVHYGRAPLKMKW 600

Query: 601 RDGEIGHQYFLDLMLAHILLNALC 602
           RDGE+GHQYF+DLML H+LLN LC
Sbjct: 601 RDGEVGHQYFVDLMLCHVLLNVLC 624

BLAST of Cla021666 vs. TrEMBL
Match: A0A0B2P0Z3_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_009912 PE=4 SV=1)

HSP 1 Score: 864.8 bits (2233), Expect = 6.3e-248
Identity = 400/601 (66.56%), Postives = 482/601 (80.20%), Query Frame = 1

Query: 1   MPEKGLLPVTSNPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSSPQE 60
           MPEK    V S  W L S P++H R G LTAL+++ M+ +WSIDGC ++  I+AW   Q+
Sbjct: 1   MPEK---VVISPQWALWSIPMLHWRVGLLTALVMVGMVVVWSIDGCTVKNIIQAWRYQQD 60

Query: 61  FISVSSNFTNTLQTLSTNDSNFTQFISYKPEKKYPESVLIEPLPPQNSTVQPRRSQSKPA 120
           +++V S+                           P    + P  P N TV  +    KP 
Sbjct: 61  YLAVKSHT--------------------------PNLTFVSPYSPVNFTVYGK----KPL 120

Query: 121 VHHELASDWFSAELEPNFTSHLLALWMAPGGEPCRDSKTADIAISGMESPAMVELSTGDV 180
           +    AS W S+ELEPN TS+L+A W A GGEPC+DSK  +IAI G++   ++ELS GDV
Sbjct: 121 LVKGHAS-WVSSELEPNLTSNLIARWSARGGEPCKDSKAVEIAIPGLDGGEVIELSAGDV 180

Query: 181 HEFRFQAVDESGNPRCLGGDYFETDLSGDLWKSRPFVKDFGNGTYSFWLQVHPDFAGDYN 240
           HEF FQA+D+SG P C+GGDYFETDLSGD WKSRP VKDF NG+Y   LQVHPDF G YN
Sbjct: 181 HEFGFQALDDSGKPLCVGGDYFETDLSGDSWKSRPLVKDFSNGSYLISLQVHPDFDGVYN 240

Query: 241 LTVILLFRHFEGLRFSPTRFAYDQELRRIKVRFVKNSVVLPEIKMCRSSDFDRDIWTGRW 300
           LT+ILL+RHFEGL+F+P RF+YDQ LR + +RF K+SV LPE++ C++SDFDRD+W GRW
Sbjct: 241 LTIILLYRHFEGLKFTPWRFSYDQMLRSVAIRFYKSSVRLPELQGCKASDFDRDVWIGRW 300

Query: 301 TRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGSLGLLESNGWVYSAHCSFQMFSSRSAW 360
           TRHG+ND C I +DGRYRC APD+PCQ+PWC+GSLG+LESNGWVYS HCSF+++S+ SAW
Sbjct: 301 TRHGKNDDCTIGNDGRYRCLAPDFPCQAPWCDGSLGILESNGWVYSTHCSFKLYSAESAW 360

Query: 361 DCLKDRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKDPSQTVRITSIF 420
           +CLK+RWIFFWGDSNHVDTIRNLLNFVLDLPEIP+VPRRFD NFSNP+DPSQTVRITSIF
Sbjct: 361 NCLKNRWIFFWGDSNHVDTIRNLLNFVLDLPEIPSVPRRFDMNFSNPRDPSQTVRITSIF 420

Query: 421 NGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEETVPDTIIMNSGLHDGVHWLNIRAFSVGA 480
           NGHWN+TQNY GL+SLR+EGF++LL KYFSE+T+PDT+IMNSGLHDGVHW NIRAFSVGA
Sbjct: 421 NGHWNETQNYLGLDSLRDEGFQDLLKKYFSEDTIPDTVIMNSGLHDGVHWRNIRAFSVGA 480

Query: 481 TYAASFWKQVLESIKQRGLPVPKVFYRTTVATGGYARTLAFNPNKMEMFNWVVLEKLKEA 540
            YAASFW  V++++KQRGL  P+VF+R TVATGGYAR+LAFNPNKME+FN V+LEKLK++
Sbjct: 481 DYAASFWGDVMKTVKQRGLAWPRVFFRNTVATGGYARSLAFNPNKMEVFNGVLLEKLKQS 540

Query: 541 GIVHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLNAL 600
           G+V GVIDNFDMTFPWHFDNRCNDGVHYGRAPAK+KWRDG+IGHQYF+DLMLAH+LLNAL
Sbjct: 541 GVVSGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKMKWRDGQIGHQYFVDLMLAHVLLNAL 567

Query: 601 C 602
           C
Sbjct: 601 C 567

BLAST of Cla021666 vs. TrEMBL
Match: K7L3U9_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_07G256900 PE=4 SV=1)

HSP 1 Score: 864.8 bits (2233), Expect = 6.3e-248
Identity = 400/601 (66.56%), Postives = 482/601 (80.20%), Query Frame = 1

Query: 1   MPEKGLLPVTSNPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSSPQE 60
           MPEK    V S  W L S P++H R G LTAL+++ M+ +WSIDGC ++  I+AW   Q+
Sbjct: 100 MPEK---VVISPQWALWSIPMLHWRVGLLTALVMVGMVVVWSIDGCTVKNIIQAWRYQQD 159

Query: 61  FISVSSNFTNTLQTLSTNDSNFTQFISYKPEKKYPESVLIEPLPPQNSTVQPRRSQSKPA 120
           +++V S+                           P    + P  P N TV  +    KP 
Sbjct: 160 YLAVKSHT--------------------------PNLTFVSPYSPVNFTVYGK----KPL 219

Query: 121 VHHELASDWFSAELEPNFTSHLLALWMAPGGEPCRDSKTADIAISGMESPAMVELSTGDV 180
           +    AS W S+ELEPN TS+L+A W A GGEPC+DSK  +IAI G++   ++ELS GDV
Sbjct: 220 LVKGHAS-WVSSELEPNLTSNLIARWSARGGEPCKDSKAVEIAIPGLDGGEVIELSAGDV 279

Query: 181 HEFRFQAVDESGNPRCLGGDYFETDLSGDLWKSRPFVKDFGNGTYSFWLQVHPDFAGDYN 240
           HEF FQA+D+SG P C+GGDYFETDLSGD WKSRP VKDF NG+Y   LQVHPDF G YN
Sbjct: 280 HEFGFQALDDSGKPLCVGGDYFETDLSGDSWKSRPLVKDFSNGSYLISLQVHPDFDGVYN 339

Query: 241 LTVILLFRHFEGLRFSPTRFAYDQELRRIKVRFVKNSVVLPEIKMCRSSDFDRDIWTGRW 300
           LT+ILL+RHFEGL+F+P RF+YDQ LR + +RF K+SV LPE++ C++SDFDRD+W GRW
Sbjct: 340 LTIILLYRHFEGLKFTPWRFSYDQMLRSVAIRFYKSSVRLPELQGCKASDFDRDVWIGRW 399

Query: 301 TRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGSLGLLESNGWVYSAHCSFQMFSSRSAW 360
           TRHG+ND C I +DGRYRC APD+PCQ+PWC+GSLG+LESNGWVYS HCSF+++S+ SAW
Sbjct: 400 TRHGKNDDCTIGNDGRYRCLAPDFPCQAPWCDGSLGILESNGWVYSTHCSFKLYSAESAW 459

Query: 361 DCLKDRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKDPSQTVRITSIF 420
           +CLK+RWIFFWGDSNHVDTIRNLLNFVLDLPEIP+VPRRFD NFSNP+DPSQTVRITSIF
Sbjct: 460 NCLKNRWIFFWGDSNHVDTIRNLLNFVLDLPEIPSVPRRFDMNFSNPRDPSQTVRITSIF 519

Query: 421 NGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEETVPDTIIMNSGLHDGVHWLNIRAFSVGA 480
           NGHWN+TQNY GL+SLR+EGF++LL KYFSE+T+PDT+IMNSGLHDGVHW NIRAFSVGA
Sbjct: 520 NGHWNETQNYLGLDSLRDEGFQDLLKKYFSEDTIPDTVIMNSGLHDGVHWRNIRAFSVGA 579

Query: 481 TYAASFWKQVLESIKQRGLPVPKVFYRTTVATGGYARTLAFNPNKMEMFNWVVLEKLKEA 540
            YAASFW  V++++KQRGL  P+VF+R TVATGGYAR+LAFNPNKME+FN V+LEKLK++
Sbjct: 580 DYAASFWGDVMKTVKQRGLAWPRVFFRNTVATGGYARSLAFNPNKMEVFNGVLLEKLKQS 639

Query: 541 GIVHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLNAL 600
           G+V GVIDNFDMTFPWHFDNRCNDGVHYGRAPAK+KWRDG+IGHQYF+DLMLAH+LLNAL
Sbjct: 640 GVVSGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKMKWRDGQIGHQYFVDLMLAHVLLNAL 666

Query: 601 C 602
           C
Sbjct: 700 C 666

BLAST of Cla021666 vs. TrEMBL
Match: B9RTT4_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0912600 PE=4 SV=1)

HSP 1 Score: 861.7 bits (2225), Expect = 5.4e-247
Identity = 406/637 (63.74%), Postives = 493/637 (77.39%), Query Frame = 1

Query: 1   MPEKGLLPVTS---NPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSS 60
           MPEKGL    S     WL RSSPL+  RF  LTAL+ + M+ +WSIDGC I+  I++W  
Sbjct: 1   MPEKGLSSSPSPNHQSWLFRSSPLLQWRFHILTALVFVGMVTVWSIDGCTIKNVIESWRF 60

Query: 61  PQE-----FISVSSNFTN-----TLQTLSTNDSNFTQF----ISYKPEKKYPESVLIEPL 120
            QE       S  SN TN     T  T  T+ S++        SY P   Y +  L +PL
Sbjct: 61  RQEEYLRKVTSQPSNATNLTQSNTTDTTETSSSSYNILNNDNSSYVPASNYSDGFLEKPL 120

Query: 121 PP---------------QNSTVQPRRSQS--KPAVHHELASD--WFSAELEPNFTSHLLA 180
                            QN T     S    +  + +EL ++  W SA LEP+ T +LL+
Sbjct: 121 EKESEEAHAKWVSADVQQNITSHQNLSAGFLEKPIENELEANVKWVSAILEPDLTPNLLS 180

Query: 181 LWMAPGGEPCRDSKTADIAISGMESPAMVELSTGDVHEFRFQAVDESGNPRCLGGDYFET 240
            W+APGGEPC+DS+T DI I G++   ++EL+ G+ HEF FQAVDE  NP CLGGDYFET
Sbjct: 181 RWLAPGGEPCKDSRTVDIVIPGLDGRNLIELTAGNSHEFIFQAVDEFKNPLCLGGDYFET 240

Query: 241 DLSGDLWKSRPFVKDFGNGTYSFWLQVHPDFAGDYNLTVILLFRHFEGLRFSPTRFAYDQ 300
           DLSG+ WKSRP V+DFGNG+YS  LQVHPDF GDYNLTVILLFRHFEGL+FSP+RF YD+
Sbjct: 241 DLSGEEWKSRPLVRDFGNGSYSISLQVHPDFVGDYNLTVILLFRHFEGLKFSPSRFVYDR 300

Query: 301 ELRRIKVRFVKNSVVLPEIKMCRSSDFDRDIWTGRWTRHGRNDRCKISDDGRYRCFAPDY 360
           ELR++++RFVK    LPE+++C+ SDF +D+W GRWTRHG+ND C+IS+DGRYRC   D+
Sbjct: 301 ELRKVQIRFVKAHYKLPELQICQKSDFTKDLWLGRWTRHGKNDGCEISNDGRYRCLPSDF 360

Query: 361 PCQSPWCNGSLGLLESNGWVYSAHCSFQMFSSRSAWDCLKDRWIFFWGDSNHVDTIRNLL 420
           PCQSPWCNGSLGLLESNGWVYS+HCSF++FS+ SAW+CLK RWIFFWGDSNHVDTIRN+L
Sbjct: 361 PCQSPWCNGSLGLLESNGWVYSSHCSFRLFSADSAWNCLKGRWIFFWGDSNHVDTIRNML 420

Query: 421 NFVLDLPEIPAVPRRFDRNFSNPKDPSQTVRITSIFNGHWNDTQNYEGLNSLRNEGFRNL 480
           NF+LDLP+I +VPRRFDRNFSNPKD SQ+VRITSIFNGHWN+TQNY GL+SL++EGFRNL
Sbjct: 421 NFLLDLPDIKSVPRRFDRNFSNPKDASQSVRITSIFNGHWNETQNYLGLDSLKDEGFRNL 480

Query: 481 LYKYFSEETVPDTIIMNSGLHDGVHWLNIRAFSVGATYAASFWKQVLESIKQRGLPVPKV 540
           L KYFSE+TVPDTII+NSGLHDG++W N+R FS GA YA SFWK+V++S+KQRGL  PK+
Sbjct: 481 LKKYFSEDTVPDTIILNSGLHDGIYWKNVRRFSAGADYAVSFWKEVVDSVKQRGLVAPKI 540

Query: 541 FYRTTVATGGYARTLAFNPNKMEMFNWVVLEKLKEAGIVHGVIDNFDMTFPWHFDNRCND 600
           FYRTT++TGGYAR LAFNP KME+FNWVVL+K +++G++ GVIDNFDMTFPWH+DNRCND
Sbjct: 541 FYRTTISTGGYARALAFNPYKMEVFNWVVLDKFRQSGLLSGVIDNFDMTFPWHYDNRCND 600

Query: 601 GVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLNALC 602
           GVHYGRAPAK+KWRDGEIGHQYF+DLMLAH+LLN LC
Sbjct: 601 GVHYGRAPAKMKWRDGEIGHQYFVDLMLAHVLLNVLC 637

BLAST of Cla021666 vs. TrEMBL
Match: V7C9U1_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_003G092800g PE=4 SV=1)

HSP 1 Score: 857.8 bits (2215), Expect = 7.8e-246
Identity = 395/601 (65.72%), Postives = 482/601 (80.20%), Query Frame = 1

Query: 1   MPEKGLLPVTSNPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSSPQE 60
           MP+K    V S  W+L S P++H R G LTAL+L+ M+ IWSIDGC I++ I+ W   Q+
Sbjct: 1   MPKK---VVISPQWVLWSIPMLHWRVGLLTALVLVGMVVIWSIDGCTIKSIIQVWRYQQD 60

Query: 61  FISVSSNFTNTLQTLSTNDSNFTQFISYKPEKKYPESVLIEPLPPQNSTVQPRRSQSKPA 120
           +++V  +  N       +  NFT +     EK   +S+L+E                   
Sbjct: 61  YLAVRFHTPNLTLVSPYSPVNFTLY-----EKPGNKSLLLE------------------- 120

Query: 121 VHHELASDWFSAELEPNFTSHLLALWMAPGGEPCRDSKTADIAISGMESPAMVELSTGDV 180
             H   + W S+ELEPNFTS+L+A W A GG PC+DSK  +IA+ G++   ++ELS GDV
Sbjct: 121 -RH---TSWISSELEPNFTSNLIARWSARGGVPCKDSKAVEIAVPGLDGGEVIELSAGDV 180

Query: 181 HEFRFQAVDESGNPRCLGGDYFETDLSGDLWKSRPFVKDFGNGTYSFWLQVHPDFAGDYN 240
           HEF FQA+D+SGNPRCLGGDYFE DLSG+ WKSRP VKDF NG+YS  +QVHPDF G YN
Sbjct: 181 HEFGFQALDDSGNPRCLGGDYFEADLSGESWKSRPLVKDFSNGSYSILVQVHPDFDGVYN 240

Query: 241 LTVILLFRHFEGLRFSPTRFAYDQELRRIKVRFVKNSVVLPEIKMCRSSDFDRDIWTGRW 300
           LT+ILL+RHFEGL+F+P RF+YD+ LR + +RF K SV LPE++ C++SDF RD+W+GRW
Sbjct: 241 LTIILLYRHFEGLKFTPWRFSYDRMLRNVAIRFYKISVQLPELQACKASDFGRDVWSGRW 300

Query: 301 TRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGSLGLLESNGWVYSAHCSFQMFSSRSAW 360
           TRHG+ND C I +DGRYRC   D+PC++PWC+GSLG+LESNGWVYS HCSF+++S+ SAW
Sbjct: 301 TRHGKNDDCAIGNDGRYRCLGSDFPCKAPWCDGSLGILESNGWVYSTHCSFKLYSAESAW 360

Query: 361 DCLKDRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKDPSQTVRITSIF 420
           +CLK+RWIFFWGDSNHVDTIRN+LNFVLDLPEIP+VPRRFD NFSNPKDPSQ VRITSIF
Sbjct: 361 NCLKNRWIFFWGDSNHVDTIRNMLNFVLDLPEIPSVPRRFDMNFSNPKDPSQKVRITSIF 420

Query: 421 NGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEETVPDTIIMNSGLHDGVHWLNIRAFSVGA 480
           NGHWN+TQNY GL+SLR+EGF+NL+ KYF+E+T+PD +IMNSGLHDGVHW NIRAFSVGA
Sbjct: 421 NGHWNETQNYLGLDSLRDEGFQNLIKKYFTEDTIPDAVIMNSGLHDGVHWRNIRAFSVGA 480

Query: 481 TYAASFWKQVLESIKQRGLPVPKVFYRTTVATGGYARTLAFNPNKMEMFNWVVLEKLKEA 540
            YAASFW  V++++KQRGL  P+VFYRTTVATGGYAR+LAFNPNKME FN V+LEKLK+A
Sbjct: 481 DYAASFWTDVMKTVKQRGLAWPRVFYRTTVATGGYARSLAFNPNKMEAFNGVLLEKLKQA 540

Query: 541 GIVHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLNAL 600
           G+V GVIDNFDMTFPWHFDNRCNDGVHYGRAPAK+KWRDG+IGHQYFLDLML+H+LLNAL
Sbjct: 541 GVVSGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKMKWRDGQIGHQYFLDLMLSHVLLNAL 570

Query: 601 C 602
           C
Sbjct: 601 C 570

BLAST of Cla021666 vs. NCBI nr
Match: gi|659076520|ref|XP_008438725.1| (PREDICTED: uncharacterized protein LOC103483748 [Cucumis melo])

HSP 1 Score: 1159.4 bits (2998), Expect = 0.0e+00
Identity = 549/604 (90.89%), Postives = 572/604 (94.70%), Query Frame = 1

Query: 1   MPEKGLLPVTSNPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSSPQE 60
           M  KGLLPVTSNPWLLRSSPLVHSRFG LTALILLSM+AIWSIDG NI T I+AW SPQ+
Sbjct: 1   MQGKGLLPVTSNPWLLRSSPLVHSRFGVLTALILLSMVAIWSIDGSNINTVIEAWRSPQD 60

Query: 61  FISVSSNFTNTLQTLSTNDSNFTQFISYKPEKKYPESVLIEPLPPQNSTVQPRRSQSKPA 120
           F+SVSSNFTNT    +T+DSNFT  ISYKPE  + ESVL EP+PPQN+ V+PRR+QSKPA
Sbjct: 61  FVSVSSNFTNT----NTHDSNFTPVISYKPEINFSESVLSEPVPPQNALVEPRRNQSKPA 120

Query: 121 VHHELASDWFSAELEPNFTSHLLALWMAPGGEPCRDSKTADIAISGMES--PAMVELSTG 180
           VHH+  SDWFSAELEPNFTSHLLALW+APGGEPCRD KT DIAISGMES  PAMV LSTG
Sbjct: 121 VHHDSVSDWFSAELEPNFTSHLLALWLAPGGEPCRDLKTTDIAISGMESQSPAMVTLSTG 180

Query: 181 DVHEFRFQAVDESGNPRCLGGDYFETDLSGDLWKSRPFVKDFGNGTYSFWLQVHPDFAGD 240
           DVHEFRFQA+DESGNPRCLGGDYFETDLSGDLWKSRPFVKDFGNGTYSFWLQVHPDFAGD
Sbjct: 181 DVHEFRFQALDESGNPRCLGGDYFETDLSGDLWKSRPFVKDFGNGTYSFWLQVHPDFAGD 240

Query: 241 YNLTVILLFRHFEGLRFSPTRFAYDQELRRIKVRFVKNSVVLPEIKMCRSSDFDRDIWTG 300
           YNLTVILLFRHFEGLRFSPTRFAYD+ELRRIKVRFVKNSVVLPEIKMCRSSDF RDIWTG
Sbjct: 241 YNLTVILLFRHFEGLRFSPTRFAYDRELRRIKVRFVKNSVVLPEIKMCRSSDFSRDIWTG 300

Query: 301 RWTRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGSLGLLESNGWVYSAHCSFQMFSSRS 360
           RWTRHG+ND+C+ISDDGRYRCFAPDYPCQSPWCNG LGLLESNGWVYSAHCSFQMFSS S
Sbjct: 301 RWTRHGQNDQCEISDDGRYRCFAPDYPCQSPWCNGPLGLLESNGWVYSAHCSFQMFSSSS 360

Query: 361 AWDCLKDRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKDPSQTVRITS 420
           AWDCLK RWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPK+PSQTVRITS
Sbjct: 361 AWDCLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITS 420

Query: 421 IFNGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEETVPDTIIMNSGLHDGVHWLNIRAFSV 480
           IFNGHWN+TQNYEGLNSLRNEGFR+LL KYFSEETVPDTIIMNSGLHDGVHWLNIRAFSV
Sbjct: 421 IFNGHWNETQNYEGLNSLRNEGFRSLLQKYFSEETVPDTIIMNSGLHDGVHWLNIRAFSV 480

Query: 481 GATYAASFWKQVLESIKQRGLPVPKVFYRTTVATGGYARTLAFNPNKMEMFNWVVLEKLK 540
           GATYAASFWKQVLESIKQRGL VPKVFYRTTVATGGYARTLAFNPNKME+FNWVVLEKLK
Sbjct: 481 GATYAASFWKQVLESIKQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLK 540

Query: 541 EAGIVHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLN 600
           EAGI+HGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLN
Sbjct: 541 EAGIIHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLN 600

Query: 601 ALCT 603
            LCT
Sbjct: 601 VLCT 600

BLAST of Cla021666 vs. NCBI nr
Match: gi|778678593|ref|XP_011650994.1| (PREDICTED: uncharacterized protein LOC101204185 [Cucumis sativus])

HSP 1 Score: 1152.9 bits (2981), Expect = 0.0e+00
Identity = 546/602 (90.70%), Postives = 567/602 (94.19%), Query Frame = 1

Query: 1   MPEKGLLPVTSNPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSSPQE 60
           M EKGLLPVTSNPWLLRSSPLVHSRFG LTALIL SMLAIWSIDG +I+ FIKAWSSPQ+
Sbjct: 1   MQEKGLLPVTSNPWLLRSSPLVHSRFGVLTALILFSMLAIWSIDGGHIKIFIKAWSSPQD 60

Query: 61  FISVSSNFTNTLQTLSTNDSNFTQFISYKPEKKYPESVLIEPLPPQNSTVQPRRSQSKPA 120
           F+SVSSNFT+T      +D NFT  ISYKPE  Y ESVL EP+PPQN+ V+PRR QSKPA
Sbjct: 61  FVSVSSNFTDT------HDFNFTPVISYKPEINYQESVLNEPVPPQNAPVEPRRKQSKPA 120

Query: 121 VHHELASDWFSAELEPNFTSHLLALWMAPGGEPCRDSKTADIAISGMESPAMVELSTGDV 180
           V H+  SDWFSAELEPNFTSHLLA W+APGGEPCRD KT DIAISGMESPA+V LSTGDV
Sbjct: 121 VRHDSFSDWFSAELEPNFTSHLLAQWLAPGGEPCRDLKTTDIAISGMESPAIVTLSTGDV 180

Query: 181 HEFRFQAVDESGNPRCLGGDYFETDLSGDLWKSRPFVKDFGNGTYSFWLQVHPDFAGDYN 240
           HEFRFQA+DESGNPRCLGGDYFETDLSG+LWKSRPFVKDFGNGTYSFWLQVHPDFAGDYN
Sbjct: 181 HEFRFQALDESGNPRCLGGDYFETDLSGNLWKSRPFVKDFGNGTYSFWLQVHPDFAGDYN 240

Query: 241 LTVILLFRHFEGLRFSPTRFAYDQELRRIKVRFVKNSVVLPEIKMCRSSDFDRDIWTGRW 300
           LTVILLFRHFEGLRFSPTRFAYD+ELRRIKVRFVKNSVVLP+IKMCRSSDF RDIWTGRW
Sbjct: 241 LTVILLFRHFEGLRFSPTRFAYDRELRRIKVRFVKNSVVLPKIKMCRSSDFSRDIWTGRW 300

Query: 301 TRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGSLGLLESNGWVYSAHCSFQMFSSRSAW 360
           TRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNG LGLLESNGWVYSAHCSF MFSS SAW
Sbjct: 301 TRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGPLGLLESNGWVYSAHCSFTMFSSSSAW 360

Query: 361 DCLKDRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKDPSQTVRITSIF 420
           DCLK RWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPK+PSQTVRITSIF
Sbjct: 361 DCLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIF 420

Query: 421 NGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEETVPDTIIMNSGLHDGVHWLNIRAFSVGA 480
           NGHWNDTQNYEGLNSLRNEGFR+LL KYFSEETVPDTIIMNSGLHDGVHWLNIR+FSVGA
Sbjct: 421 NGHWNDTQNYEGLNSLRNEGFRSLLQKYFSEETVPDTIIMNSGLHDGVHWLNIRSFSVGA 480

Query: 481 TYAASFWKQVLESIKQRGLPVPKVFYRTTVATGGYARTLAFNPNKMEMFNWVVLEKLKEA 540
           TYAASFWKQVL+SIKQRGL VPKVFYRTTVATGGYARTLAFNPNKME+FNWVVLEKLKEA
Sbjct: 481 TYAASFWKQVLDSIKQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKEA 540

Query: 541 GIVHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLNAL 600
           GI HGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLNAL
Sbjct: 541 GITHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLNAL 596

Query: 601 CT 603
           CT
Sbjct: 601 CT 596

BLAST of Cla021666 vs. NCBI nr
Match: gi|1009141588|ref|XP_015888274.1| (PREDICTED: uncharacterized protein LOC107423261 [Ziziphus jujuba])

HSP 1 Score: 887.1 bits (2291), Expect = 1.7e-254
Identity = 416/622 (66.88%), Postives = 500/622 (80.39%), Query Frame = 1

Query: 4   KGLLPVTSNPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSSPQEFIS 63
           KGL   ++NPW  RSSPL+H RFG LTAL+L+ M+ + S DG  I++F+ AW S Q+++S
Sbjct: 2   KGLNWPSTNPWTPRSSPLLHWRFGLLTALVLVGMVVVLSADGRTIKSFVDAWRSRQDYLS 61

Query: 64  VSSNFTNTLQ----------TLSTNDSNFTQFISYKPEKKYPESVLIEPLPPQNST---- 123
                TN  +          ++S ++S   +F +   + ++  ++  + L P N T    
Sbjct: 62  TVKVSTNNGRAFPPQTYEGMSISPSESAIAEFDNQINQTQF--NITFDSLNPVNLTQNEQ 121

Query: 124 ----------VQPRRSQSKPAVHHELASDWFSAELEPNFTSHLLALWMAPGGEPCRDSKT 183
                     +   ++Q++      L+  W + ELE + T +LLA W+APGGEPC+DSKT
Sbjct: 122 SFSENSTGVFMNSEKNQTQLVFSSSLS--WVANELERDLTKNLLARWLAPGGEPCKDSKT 181

Query: 184 ADIAISGMESPAMVELSTGDVHEFRFQAVDESGNPRCLGGDYFETDLSGDLWKSRPFVKD 243
            DIAI G++   ++ELS G++HEF F ++D++ NPRCLGGDYFETDLSGD WKSRP VKD
Sbjct: 182 VDIAIPGLDGQDLIELSAGEIHEFGFLSLDDAKNPRCLGGDYFETDLSGDSWKSRPLVKD 241

Query: 244 FGNGTYSFWLQVHPDFAGDYNLTVILLFRHFEGLRFSPTRFAYDQELRRIKVRFVKNSVV 303
           F NG+YS  LQVHPDF G YNLT+ILLFRHFEGLRFSP RFAYDQELR+I++RF K S  
Sbjct: 242 FNNGSYSISLQVHPDFVGIYNLTIILLFRHFEGLRFSPPRFAYDQELRKIQIRFYKGSAQ 301

Query: 304 LPEIKMCRSSDFDRDIWTGRWTRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGSLGLLE 363
           LPE++ C+ SDF RD+W+GRWTRHGRND C+IS+DGRYRC   D+ CQ+PWCNGSLGLLE
Sbjct: 302 LPELQTCQESDFGRDLWSGRWTRHGRNDDCQISNDGRYRCLPSDFQCQNPWCNGSLGLLE 361

Query: 364 SNGWVYSAHCSFQMFSSRSAWDCLKDRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRR 423
           SNGWVYSAHCSF++F++ SAW+CLK+RWIFFWGDSNHVDTIRN+L+F+LDL EIP+VPRR
Sbjct: 362 SNGWVYSAHCSFRLFTADSAWNCLKNRWIFFWGDSNHVDTIRNILHFILDLHEIPSVPRR 421

Query: 424 FDRNFSNPKDPSQTVRITSIFNGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEETVPDTII 483
           FDRNFSNPKDPSQTVRITSIFNGHWNDTQNY+GLNSL++EGFRNL+ KYFSEETVPDTII
Sbjct: 422 FDRNFSNPKDPSQTVRITSIFNGHWNDTQNYQGLNSLKDEGFRNLVKKYFSEETVPDTII 481

Query: 484 MNSGLHDGVHWLNIRAFSVGATYAASFWKQVLESIKQRGLPVPKVFYRTTVATGGYARTL 543
           MNSGLHDGV W NIRAFS GA YAASFWK+VLESIKQRGL  PKV+YRTT+ATGGYARTL
Sbjct: 482 MNSGLHDGVFWTNIRAFSKGANYAASFWKEVLESIKQRGLMFPKVYYRTTIATGGYARTL 541

Query: 544 AFNPNKMEMFNWVVLEKLKEAGIVHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRD 602
            FNPNKME FNWVVLEKLKEAG++ GVIDNFDMTFPWHFDNRCNDGVHYGRAPAK++WRD
Sbjct: 542 QFNPNKMEAFNWVVLEKLKEAGVISGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKMRWRD 601

BLAST of Cla021666 vs. NCBI nr
Match: gi|590657568|ref|XP_007034600.1| (F28L1.9 protein [Theobroma cacao])

HSP 1 Score: 868.2 bits (2242), Expect = 8.2e-249
Identity = 401/624 (64.26%), Postives = 491/624 (78.69%), Query Frame = 1

Query: 1   MPEKG--LLPVTSNPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSSP 60
           MPEKG   LP  S+ WLLRSSPL   RFG LTAL+ + M+ +WSIDGC ++ FI++W   
Sbjct: 1   MPEKGGSFLPPPSSAWLLRSSPLHQWRFGLLTALVFVGMVVVWSIDGCTVKNFIQSWQFK 60

Query: 61  QEFISVSSNFTNTLQTLSTNDSNFTQFISYKPE---KKYPESVLIEPLPPQNSTVQPRR- 120
           Q++I++  N    L     N ++  + ++  P     ++P   +   + P NS+++ +  
Sbjct: 61  QDYITMKVNSLANLNHPYQNPTHSLRNLTVNPTLNTSRFPIYSINSSVFPLNSSLESKNV 120

Query: 121 -----------------SQSKPAVHHELASDWFSAELEPNFTSHLLALWMAPGGEPCRDS 180
                             ++        +  W SAELE N+TS+LLA W+APGGEPC+DS
Sbjct: 121 TQISSREMANFSSVENSDENLTTFKDSSSLKWVSAELEQNYTSNLLARWLAPGGEPCKDS 180

Query: 181 KTADIAISGMESPAMVELSTGDVHEFRFQAVDESGNPRCLGGDYFETDLSGDLWKSRPFV 240
           +T +I I G++  ++VELS G++HEF FQAVDESGN RCLGGDYFE DLSG+ WKSRP V
Sbjct: 181 QTVEIKIPGLDGESLVELSAGEIHEFMFQAVDESGNARCLGGDYFEADLSGESWKSRPPV 240

Query: 241 KDFGNGTYSFWLQVHPDFAGDYNLTVILLFRHFEGLRFSPTRFAYDQELRRIKVRFVKNS 300
           KDFGNG+YS  LQVHPDFAG+YNLTVILLFRHF+GL+FSP RFAYD++LR I +RF +  
Sbjct: 241 KDFGNGSYSVSLQVHPDFAGEYNLTVILLFRHFQGLKFSPARFAYDRQLRHIGIRFYRTK 300

Query: 301 VVLPEIKMCRSSDFDRDIWTGRWTRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGSLGL 360
             L E+  C+ SDF +D+W+GRWTRHG+ND C+IS+DGRYRC A D+PCQ+PWCNGSLGL
Sbjct: 301 ARLTELPSCQKSDFSKDVWSGRWTRHGKNDDCQISNDGRYRCLAADFPCQNPWCNGSLGL 360

Query: 361 LESNGWVYSAHCSFQMFSSRSAWDCLKDRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVP 420
           LESNGWVYS+HCSFQ+F + SAW+CLK+RWIFFWGDSNHVDTIRN+LNFVL LPEI +VP
Sbjct: 361 LESNGWVYSSHCSFQLFLADSAWNCLKNRWIFFWGDSNHVDTIRNMLNFVLGLPEIKSVP 420

Query: 421 RRFDRNFSNPKDPSQTVRITSIFNGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEETVPDT 480
           RRFD NFSNPKDPSQTVRITSIFNGHWN TQNY GL+SL++EGFRNLL KYFSE+TVPDT
Sbjct: 421 RRFDMNFSNPKDPSQTVRITSIFNGHWNGTQNYLGLDSLKDEGFRNLLKKYFSEDTVPDT 480

Query: 481 IIMNSGLHDGVHWLNIRAFSVGATYAASFWKQVLESIKQRGLPVPKVFYRTTVATGGYAR 540
           IIMNSGLHDGVHW  IRAFS GA YAA+FWK+V++S++QRGL VP++ +R T+ATGGYAR
Sbjct: 481 IIMNSGLHDGVHWSTIRAFSHGAEYAATFWKEVMDSVRQRGLVVPQIIFRNTIATGGYAR 540

Query: 541 TLAFNPNKMEMFNWVVLEKLKEAGIVHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKW 600
           +LAFNPNK+E FN V+LEKL+ AG+V GVIDNFDMTFPWHFDNRCNDGVHYGRAP K+KW
Sbjct: 541 SLAFNPNKIEAFNGVLLEKLRRAGLVSGVIDNFDMTFPWHFDNRCNDGVHYGRAPLKMKW 600

Query: 601 RDGEIGHQYFLDLMLAHILLNALC 602
           RDGE+GHQYF+DLML H+LLN LC
Sbjct: 601 RDGEVGHQYFVDLMLCHVLLNVLC 624

BLAST of Cla021666 vs. NCBI nr
Match: gi|571467864|ref|XP_003528671.2| (PREDICTED: uncharacterized protein LOC100783018 [Glycine max])

HSP 1 Score: 864.8 bits (2233), Expect = 9.1e-248
Identity = 400/601 (66.56%), Postives = 482/601 (80.20%), Query Frame = 1

Query: 1   MPEKGLLPVTSNPWLLRSSPLVHSRFGFLTALILLSMLAIWSIDGCNIRTFIKAWSSPQE 60
           MPEK    V S  W L S P++H R G LTAL+++ M+ +WSIDGC ++  I+AW   Q+
Sbjct: 100 MPEK---VVISPQWALWSIPMLHWRVGLLTALVMVGMVVVWSIDGCTVKNIIQAWRYQQD 159

Query: 61  FISVSSNFTNTLQTLSTNDSNFTQFISYKPEKKYPESVLIEPLPPQNSTVQPRRSQSKPA 120
           +++V S+                           P    + P  P N TV  +    KP 
Sbjct: 160 YLAVKSHT--------------------------PNLTFVSPYSPVNFTVYGK----KPL 219

Query: 121 VHHELASDWFSAELEPNFTSHLLALWMAPGGEPCRDSKTADIAISGMESPAMVELSTGDV 180
           +    AS W S+ELEPN TS+L+A W A GGEPC+DSK  +IAI G++   ++ELS GDV
Sbjct: 220 LVKGHAS-WVSSELEPNLTSNLIARWSARGGEPCKDSKAVEIAIPGLDGGEVIELSAGDV 279

Query: 181 HEFRFQAVDESGNPRCLGGDYFETDLSGDLWKSRPFVKDFGNGTYSFWLQVHPDFAGDYN 240
           HEF FQA+D+SG P C+GGDYFETDLSGD WKSRP VKDF NG+Y   LQVHPDF G YN
Sbjct: 280 HEFGFQALDDSGKPLCVGGDYFETDLSGDSWKSRPLVKDFSNGSYLISLQVHPDFDGVYN 339

Query: 241 LTVILLFRHFEGLRFSPTRFAYDQELRRIKVRFVKNSVVLPEIKMCRSSDFDRDIWTGRW 300
           LT+ILL+RHFEGL+F+P RF+YDQ LR + +RF K+SV LPE++ C++SDFDRD+W GRW
Sbjct: 340 LTIILLYRHFEGLKFTPWRFSYDQMLRSVAIRFYKSSVRLPELQGCKASDFDRDVWIGRW 399

Query: 301 TRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGSLGLLESNGWVYSAHCSFQMFSSRSAW 360
           TRHG+ND C I +DGRYRC APD+PCQ+PWC+GSLG+LESNGWVYS HCSF+++S+ SAW
Sbjct: 400 TRHGKNDDCTIGNDGRYRCLAPDFPCQAPWCDGSLGILESNGWVYSTHCSFKLYSAESAW 459

Query: 361 DCLKDRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKDPSQTVRITSIF 420
           +CLK+RWIFFWGDSNHVDTIRNLLNFVLDLPEIP+VPRRFD NFSNP+DPSQTVRITSIF
Sbjct: 460 NCLKNRWIFFWGDSNHVDTIRNLLNFVLDLPEIPSVPRRFDMNFSNPRDPSQTVRITSIF 519

Query: 421 NGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEETVPDTIIMNSGLHDGVHWLNIRAFSVGA 480
           NGHWN+TQNY GL+SLR+EGF++LL KYFSE+T+PDT+IMNSGLHDGVHW NIRAFSVGA
Sbjct: 520 NGHWNETQNYLGLDSLRDEGFQDLLKKYFSEDTIPDTVIMNSGLHDGVHWRNIRAFSVGA 579

Query: 481 TYAASFWKQVLESIKQRGLPVPKVFYRTTVATGGYARTLAFNPNKMEMFNWVVLEKLKEA 540
            YAASFW  V++++KQRGL  P+VF+R TVATGGYAR+LAFNPNKME+FN V+LEKLK++
Sbjct: 580 DYAASFWGDVMKTVKQRGLAWPRVFFRNTVATGGYARSLAFNPNKMEVFNGVLLEKLKQS 639

Query: 541 GIVHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLNAL 600
           G+V GVIDNFDMTFPWHFDNRCNDGVHYGRAPAK+KWRDG+IGHQYF+DLMLAH+LLNAL
Sbjct: 640 GVVSGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKMKWRDGQIGHQYFVDLMLAHVLLNAL 666

Query: 601 C 602
           C
Sbjct: 700 C 666

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A061EKY8_THECC5.7e-24964.26F28L1.9 protein OS=Theobroma cacao GN=TCM_020507 PE=4 SV=1[more]
A0A0B2P0Z3_GLYSO6.3e-24866.56Uncharacterized protein OS=Glycine soja GN=glysoja_009912 PE=4 SV=1[more]
K7L3U9_SOYBN6.3e-24866.56Uncharacterized protein OS=Glycine max GN=GLYMA_07G256900 PE=4 SV=1[more]
B9RTT4_RICCO5.4e-24763.74Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0912600 PE=4 SV=1[more]
V7C9U1_PHAVU7.8e-24665.72Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_003G092800g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|659076520|ref|XP_008438725.1|0.0e+0090.89PREDICTED: uncharacterized protein LOC103483748 [Cucumis melo][more]
gi|778678593|ref|XP_011650994.1|0.0e+0090.70PREDICTED: uncharacterized protein LOC101204185 [Cucumis sativus][more]
gi|1009141588|ref|XP_015888274.1|1.7e-25466.88PREDICTED: uncharacterized protein LOC107423261 [Ziziphus jujuba][more]
gi|590657568|ref|XP_007034600.1|8.2e-24964.26F28L1.9 protein [Theobroma cacao][more]
gi|571467864|ref|XP_003528671.2|9.1e-24866.56PREDICTED: uncharacterized protein LOC100783018 [Glycine max][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
WMU29637watermelon EST collection version 2.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla021666Cla021666.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
WMU29637WMU29637transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR35124FAMILY NOT NAMEDcoord: 1..602
score:

The following gene(s) are paralogous to this gene:

None