Cla007909 (gene) Watermelon (97103) v1

NameCla007909
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionCoatomer (AHRD V1 ***- Q6K992_ORYSJ)
LocationChr11 : 9407856 .. 9412671 (-)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGGCACAAAATCAGAAGTCTCTGGAAAGAATTGTTTCACAAAAAGCTCTACAATTAGGGAGTTCATTTCCTTGTCAAATTTGTGTTGTGGGTTTTCTCTGTGGAGTTTGTATTGCCTCTCTGTTTCTTGGTGCTTTTACTTCATTTGGCAGCCCTTTGGGATTTGGTTGGAGTTCTTTCTCACCTAATGCAATGCCTGCTTCATTATTGAACTCCTCTTCTGAAAATATTAGTAAGTCGTCATTCTCCTCCTCCTCATCTCTCTCTCTCTCTCTCTTTGAAATATAATCATCTTAAATTGCTTTCTTTAACTGAATGTTTGCTGTTTGATATGAAGAAATTTCTGAAAGTGAGTGGGTAGACGTAGAATATCAGGACATTTTAATAACATGGTAAATACAATTTGGATTGAATTCTAAAACATTTGGGAGTTAGTCAAAGATAACCTCCACGAGTTTAACCAATTTGTTTTAAAAAAGTTGTGAATTCAGATATTTCGAATACAAACTAGACTATTTTCTTATAAAATTAGTTGAGGTGGCAACAGAAGTTAGCATAAAGATGAACTTAGTACCTAATTAAGATGTTAAATTTCTTTTTGAGTTTTATAATACTTAACATTGTAGGGTCAAACTATTAGCTTATAAGAATAGTTGAGATGTGCATAAGATGGAAAAGGGAAATCTAAAGGTGGCCTTGGATTAAAAGAAAAAATGGGTTATTCTGAAAGCTTGTTTTGTAGCTTTTGTTTGGATTGTGGAAAAATTCATTAGTATAAAGCTGTATAGGAATGATTAAAAGGTAGAGGATAAGATACTTAAAGTTGGAATAACTTCTACTTTACCCTAATGCAATTTGACTTCAATAGGGCAATTTTTTCAAGACTAGTAGATTCTTCACAATTCTCTTGTTTACAATCTTCTTGAAGAAGCTATTGGTTCCCCTTTTTTTCTTTTTTAAATTTTGTTTATTTAGTTGGCAATTCATATTATCATCATTCATTTTTTATTCAAATATTAAAAGATAATGTAATCATTTAAGCAATTTCAATGGGAAGTCATCTTTAAGGAACTGTATAAAACCTAATCTTTCCTTAGAAATAAGGAAAATCTTTTCAGGCATTTGAAATTGCTTCCAAGTAGTGAAATTATTTGTTGTTCATCATTGACTAATCTCAATATTTTGACTCCTTTTTCTGACCCAAACCAATGTAAAATCTGTCAGACTGTAACTTTAGACCAAAGGAGATTGAGAAACTGAAGGATTTTCAGAAAATTAAGGTAAATATCCATGACGAGAAAAAGTCCTTATTATATTCAGCTTGGAATTCTTTGTTGACTGAACCGATCAGCGGAAGAAATGCATTTTTGCGGGATCTTGGATTGGACAAAGCCACAGTCCCAAATCCACCCCATTTGGAAGACTGCAAGTCGAAAGCAAAGACAAATAAGCGATTCGATGAGCGGTCTGCAACCGATGCATTCCCTCCTTGGACAAGTTGGAAAGGGTTTTTAGACATGCACCCGACTGCTAACACTGAGGCATCGAGCTACCTTCGGCGTCAAGAAATGTTTGAAGGTTCCTACCCTCCCTGGGTATGTTTATAAAAACTTCTTAACCTTCGCTATGTAATGTCATTTATGACCTCAAATGTTGAATTGTGTTCAAGTTTCTTCATATCTTCTACAGGTTGAATAACTCACATATATTTTTTCTTTAAAAAAAATTAATTTTTGAAATCAATTTCGTACACATTGTCATGTTGCTTCTTATAATTCTTTGTTTTATTTAGTTCCTGCTTGATGCAGTAGGTTAAAACAAGCTGGCCCCCCATGTTTGATTAATTTAAAACATATTCCCATATTGTTAAACACAGAATTAATAATTGGAGAATTTAATGAACCATGATCAAATTGATAACATACAATAGCTTCCAGGTTTCGGGAGCAGATGAAGAAAATTACCCCTTGACTAGAAAAGCACAGCGGGATCTATGGATCCATCAGCATCCGTCGAACTGTAGAGATCCAAATGTTCGTTTTCTTGTAGCTGATTGGGAGCGGCTACCTGGATTCGGTATTGGAGCTCAGATAGTGGGAATGTGTGGACTTCTTGCTATAGCGATCAACGAGAAGAGGGTTCTCGTAACGAATTATTATAATCGAGCTGACCATGATGGATGTCAAGGTACAAGTAATGGCAAATTGATATGGATGGTTAAGTGATAAATTAACTGTATTTAATATAAACCATCTTTTTTTTTTTTTTTTCTCTTTCTTCTGGATTCAAGCTTGGAAAGGACTTGTTCAAGGAAAACCTAGCTAGGAAAGATTATTATTTTAAAGCATGCACTTCTCTTCCTGCACATTCAATCTTTTAAGGCTTTTAATCTTGGTTTGTTTATAGGTTTGTCTAGGTCCAGTTGGTCATGCTATTTCTTACCCGAAACATCCCAAGAATGTCGGGACCGTGCATTTGAACTTCTGGGGAACAATGAAGCATGGAAGAGTGGAATCATAACAGCAAAAGAAAATTACAGCACCAAAGAAATCTGGACTGGTCGGATTCCTAGGTGAGTTATTGCTTAGTGAATAAAATACTATTTATAAATTGAAGAGCTTAAATCTCTACTGTCACACCAAACTCATGTTGAACTGAAGAAAACATAGATATTTTCATCGATATTTACAATATAACATAAGAATATCAATGAAATACATCATTTGTCAATAAGAATATCGTATCACCTATATAAATGTGAATATGTGAATGCTAATAACGAAATTTTAAAGCTATTTAAGATGTGTAAAGTACTTATTTAACAATTTTATAAGTTTCACGATTTCCATTATTTTTTAAAAGGAAAGAATATTTGTTGATTTTCATTGATATTAACAGTTTAATTCTATTGCTCTATTTGTTGAAGGGTAAATTCATTGCTAATTAATGCAGGGCATGGGGAAACCCTTGGAGTTATTTGCAACCTACAACAGAAGTAAATGGAAGTTTACTTTCTAATCATCGCAAGATGGATAGAAGGTGGTGGAGGGCACAGGTAATCTTTTATATGCTTCAGATAACTGATGATCTAAACTCTCCCTCCCAGTCCTATCTCTCGCACATACATGCACGCACCACGAACTAAACACATTCTGTAAATTTCAGTGGTTTTGGACTTGAATGCTCCATAAATTGGCAGATTTCAGACAATGTTTCTCCTTCCAATCCAGGAGGAGATGAATCTCAAAACACTGACACTCTTATAATAATCCCTTCCCCTATGTTGGCTTACTGAAAGATGGTGTCTGATTGTAATGGTGAACATGAGCTACATAGTACATACTAAAACCCTGAGGAACATAAAGACCTGCGCAAAATGTGCTGAAGTAAAGATTTGGGCTAAATCGGAAACTGGAAAGTAGGAAACTAGTACTGAAAAATTGTACAGCAATGGCAAAGAGAAAATATACTGGTTATAGGTAGCTTTTCAAGTGGATTCTCTCAAGTTTGCCTAATCTCCCAACCCTCTAATACATGTTAAAAGTACAAATATGCTACTTTTAATCTTCCTCACAAATACTTTCTGGCTTTCCTCACCATATGCATTGTTAATAGCATCCAGAGTGGCTAGCTTTCTTCCAGAATACTAATCTACAAAAACCGCCTATGACCAAACCACCCAACCTTATTCACACAGTAACAACATTTCAAGTTATTACAAGCATTCATACACCCATCCCTATCATCTTAAATAGCCAAGTATCCAACGAAGATACATCAGAATACTGATACTGCAAATCTGATCCTCAGGCAGTGCGCTACTTAATGAGATTCCAGACAGAATACACGTGTGGCTTAATGAATGCTGCTCGCCATGCCGCATTTGGGGAGGAAGCTGCAGAAATGGTTCTCAAAAGTCTCGATGGAAAATGGCCAAAGGTATGTTGACATTAGATTTACTATTTTGGTTTATATAGGATGGAACAGAAAATTCAGGTTGCTGGATTTAAAATTCAAATGCTTTAATGAAATTCAACAATTGAACCTATATTCTTACTCTAATATCAAATGTTCTTAAATAGAAGTTAATCTCACTATCATAATTTCTATAATAAGATGAAATGCAGCATGGCACTGATAGAAACGGGTTTATTCAAGTCTGTAGATGTATATACCTTACTAGACTAGACTGGTCGATGGTGATTCTAATATTGAGCACGTTATGTCAGAAAGATTCAATGACATTAAAACATGATATAGAAGATTTTGTATGGTCGAATCACAAAGCATGGATACCTAGGCCACTCTTAAGCATGCATGTAAGAATGGGAGATAAAGCCTGTGAAATGAAGGTTGTTGAATTTGAAGAATACATGGCCCTTGCCACACGCATTAGAAAACGGTTTCCAAATCTTGACAACATTTGGCTTTCGACTGAAATGCAGGTGAGTTCAACCCCTCAATGAAATGTTTAACGATAAATCAACATCTATAAGCTCTTCTCTCATTATGTGCAATCTATATTATATCCCAGGAAGTGATTGATAAAACGATAAGTTACCCATCCTGGAAATTTTACTACACGAATGTGAAGCGACAAGTAGGAAACCTTACTATGGCCACCTACGAAGCACAGCTTGGTAGAATAACCAGTACAAACTATCCCCTTGTGAACTTCTTGATGGCAACTGAAGCTGATTTTTTCGTTGGAGCATTGGGTTCAACATGGTGCTTTCTTATAGATGGAATGAGAAATACAGGGGGCAAAGTAATGGCCGGATACTTGAGTGTAAACAAGGATCGGTTTTGGTGA

mRNA sequence

ATGGAGGCACAAAATCAGAAGTCTCTGGAAAGAATTGTTTCACAAAAAGCTCTACAATTAGGGAGTTCATTTCCTTGTCAAATTTGTGTTGTGGGTTTTCTCTGTGGAGTTTGTATTGCCTCTCTGTTTCTTGGTGCTTTTACTTCATTTGGCAGCCCTTTGGGATTTGGTTGGAGTTCTTTCTCACCTAATGCAATGCCTGCTTCATTATTGAACTCCTCTTCTGAAAATATTAACTGTAACTTTAGACCAAAGGAGATTGAGAAACTGAAGGATTTTCAGAAAATTAAGGTAAATATCCATGACGAGAAAAAGTCCTTATTATATTCAGCTTGGAATTCTTTGTTGACTGAACCGATCAGCGGAAGAAATGCATTTTTGCGGGATCTTGGATTGGACAAAGCCACAGTCCCAAATCCACCCCATTTGGAAGACTGCAAGTCGAAAGCAAAGACAAATAAGCGATTCGATGAGCGGTCTGCAACCGATGCATTCCCTCCTTGGACAAGTTGGAAAGGGTTTTTAGACATGCACCCGACTGCTAACACTGAGGCATCGAGCTACCTTCGGCGTCAAGAAATGTTTGAAGGTTCCTACCCTCCCTGGGTTTCGGGAGCAGATGAAGAAAATTACCCCTTGACTAGAAAAGCACAGCGGGATCTATGGATCCATCAGCATCCGTCGAACTGTAGAGATCCAAATGTTCGTTTTCTTGTAGCTGATTGGGAGCGGCTACCTGGATTCGGTATTGGAGCTCAGATAGTGGGAATGTGTGGACTTCTTGCTATAGCGATCAACGAGAAGAGGGTTCTCGTAACGAATTATTATAATCGAGCTGACCATGATGGATGTCAAGGTTTGTCTAGGTCCAGTTGGTCATGCTATTTCTTACCCGAAACATCCCAAGAATGTCGGGACCGTGCATTTGAACTTCTGGGGAACAATGAAGCATGGAAGAGTGGAATCATAACAGCAAAAGAAAATTACAGCACCAAAGAAATCTGGACTGGTCGGATTCCTAGGGCATGGGGAAACCCTTGGAGTTATTTGCAACCTACAACAGAAGTAAATGGAAGTTTACTTTCTAATCATCGCAAGATGGATAGAAGGTGGTGGAGGGCACAGGCAGTGCGCTACTTAATGAGATTCCAGACAGAATACACGTGTGGCTTAATGAATGCTGCTCGCCATGCCGCATTTGGGGAGGAAGCTGCAGAAATGGTTCTCAAAAGTCTCGATGGAAAATGGCCAAAGGTTGTTGAATTTGAAGAATACATGGCCCTTGCCACACGCATTAGAAAACGGTTTCCAAATCTTGACAACATTTGGCTTTCGACTGAAATGCAGGAAGTGATTGATAAAACGATAAGTTACCCATCCTGGAAATTTTACTACACGAATGTGAAGCGACAAGTAGGAAACCTTACTATGGCCACCTACGAAGCACAGCTTGGTAGAATAACCAGTACAAACTATCCCCTTGTGAACTTCTTGATGGCAACTGAAGCTGATTTTTTCGTTGGAGCATTGGGTTCAACATGGTGCTTTCTTATAGATGGAATGAGAAATACAGGGGGCAAAGTAATGGCCGGATACTTGAGTGTAAACAAGGATCGGTTTTGGTGA

Coding sequence (CDS)

ATGGAGGCACAAAATCAGAAGTCTCTGGAAAGAATTGTTTCACAAAAAGCTCTACAATTAGGGAGTTCATTTCCTTGTCAAATTTGTGTTGTGGGTTTTCTCTGTGGAGTTTGTATTGCCTCTCTGTTTCTTGGTGCTTTTACTTCATTTGGCAGCCCTTTGGGATTTGGTTGGAGTTCTTTCTCACCTAATGCAATGCCTGCTTCATTATTGAACTCCTCTTCTGAAAATATTAACTGTAACTTTAGACCAAAGGAGATTGAGAAACTGAAGGATTTTCAGAAAATTAAGGTAAATATCCATGACGAGAAAAAGTCCTTATTATATTCAGCTTGGAATTCTTTGTTGACTGAACCGATCAGCGGAAGAAATGCATTTTTGCGGGATCTTGGATTGGACAAAGCCACAGTCCCAAATCCACCCCATTTGGAAGACTGCAAGTCGAAAGCAAAGACAAATAAGCGATTCGATGAGCGGTCTGCAACCGATGCATTCCCTCCTTGGACAAGTTGGAAAGGGTTTTTAGACATGCACCCGACTGCTAACACTGAGGCATCGAGCTACCTTCGGCGTCAAGAAATGTTTGAAGGTTCCTACCCTCCCTGGGTTTCGGGAGCAGATGAAGAAAATTACCCCTTGACTAGAAAAGCACAGCGGGATCTATGGATCCATCAGCATCCGTCGAACTGTAGAGATCCAAATGTTCGTTTTCTTGTAGCTGATTGGGAGCGGCTACCTGGATTCGGTATTGGAGCTCAGATAGTGGGAATGTGTGGACTTCTTGCTATAGCGATCAACGAGAAGAGGGTTCTCGTAACGAATTATTATAATCGAGCTGACCATGATGGATGTCAAGGTTTGTCTAGGTCCAGTTGGTCATGCTATTTCTTACCCGAAACATCCCAAGAATGTCGGGACCGTGCATTTGAACTTCTGGGGAACAATGAAGCATGGAAGAGTGGAATCATAACAGCAAAAGAAAATTACAGCACCAAAGAAATCTGGACTGGTCGGATTCCTAGGGCATGGGGAAACCCTTGGAGTTATTTGCAACCTACAACAGAAGTAAATGGAAGTTTACTTTCTAATCATCGCAAGATGGATAGAAGGTGGTGGAGGGCACAGGCAGTGCGCTACTTAATGAGATTCCAGACAGAATACACGTGTGGCTTAATGAATGCTGCTCGCCATGCCGCATTTGGGGAGGAAGCTGCAGAAATGGTTCTCAAAAGTCTCGATGGAAAATGGCCAAAGGTTGTTGAATTTGAAGAATACATGGCCCTTGCCACACGCATTAGAAAACGGTTTCCAAATCTTGACAACATTTGGCTTTCGACTGAAATGCAGGAAGTGATTGATAAAACGATAAGTTACCCATCCTGGAAATTTTACTACACGAATGTGAAGCGACAAGTAGGAAACCTTACTATGGCCACCTACGAAGCACAGCTTGGTAGAATAACCAGTACAAACTATCCCCTTGTGAACTTCTTGATGGCAACTGAAGCTGATTTTTTCGTTGGAGCATTGGGTTCAACATGGTGCTTTCTTATAGATGGAATGAGAAATACAGGGGGCAAAGTAATGGCCGGATACTTGAGTGTAAACAAGGATCGGTTTTGGTGA

Protein sequence

MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSSFSPNAMPASLLNSSSENINCNFRPKEIEKLKDFQKIKVNIHDEKKSLLYSAWNSLLTEPISGRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWKGFLDMHPTANTEASSYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCRDPNVRFLVADWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSSWSCYFLPETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVNGSLLSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGEEAAEMVLKSLDGKWPKVVEFEEYMALATRIRKRFPNLDNIWLSTEMQEVIDKTISYPSWKFYYTNVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
BLAST of Cla007909 vs. TrEMBL
Match: A0A0A0LKK1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G075410 PE=4 SV=1)

HSP 1 Score: 845.1 bits (2182), Expect = 4.7e-242
Identity = 427/584 (73.12%), Postives = 464/584 (79.45%), Query Frame = 1

Query: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSS 60
           MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTS GSPLGFGWSS
Sbjct: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSLGSPLGFGWSS 60

Query: 61  FSPNAMPASLLNSSSENINCNFRPKEIEKLKDFQKIKVNIHDEKKSLLYSAWNSLLTEPI 120
           FSPN+ PASL NS+SENINCNFRPKEIE+L+DFQ+IKVN  DEK SLLYSAW+SL+TEPI
Sbjct: 61  FSPNSQPASLCNSTSENINCNFRPKEIEELRDFQRIKVNNDDEKTSLLYSAWSSLMTEPI 120

Query: 121 SGRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWKGFLDMHPT 180
           S RNAFLRDLGLDKAT+PN PHLE+CK KA+TNKRFDER  TD FPPWTSWKG LD HPT
Sbjct: 121 SSRNAFLRDLGLDKATIPNAPHLENCKLKAETNKRFDERLQTDGFPPWTSWKGILDTHPT 180

Query: 181 ANTEASSYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCRDPNVRFLVA 240
           A TE SSYLRRQEMF GS+PPWVSG+DEENYPLTRK QRDLWIHQHP NC D NVRFLVA
Sbjct: 181 AMTEESSYLRRQEMFGGSFPPWVSGSDEENYPLTRKVQRDLWIHQHPLNCSDSNVRFLVA 240

Query: 241 DWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSSWSCYFLPET 300
           DWERLPGFGIGAQI GMCGLLAIAINEKRVLVTNYYNRADHDGCQG SRSSWSCYFLPET
Sbjct: 241 DWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFLPET 300

Query: 301 SQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVNGSL 360
           SQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPR WGNPWSYLQPTTEVNGSL
Sbjct: 301 SQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSL 360

Query: 361 LSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGEEAAEMVLKSLDGKWPK-- 420
           LS HRKMDRRWWRAQAVRYLMRF+TEYTCGLMNAARHAAFG+EAAEM LKSLDGKWPK  
Sbjct: 361 LSKHRKMDRRWWRAQAVRYLMRFKTEYTCGLMNAARHAAFGKEAAEMALKSLDGKWPKKD 420

Query: 421 ----VVEFEEYMALATRIRKRFPNLD-NIWLSTEMQEVIDKTISYPSWKFYYTNVKRQVG 480
                 + E+++    +     P L  ++ +  +  E+  K + +  +      ++R+  
Sbjct: 421 STTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEM--KVVEFAEYMALAKRIRRRFP 480

Query: 481 NLTMATYEAQLGRI--TSTNYP------------LVNFLMAT-EADF------------F 540
           NL       ++  +   + +YP            + N  MAT EA              F
Sbjct: 481 NLDNIWLSTEMQEVIDKTVSYPSWKFYYTNVKRQVGNLTMATYEAQLGRITSTNYPLVNF 540

Query: 541 VGAL---------GSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 542
           + A          GSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 LMATEADFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 582

BLAST of Cla007909 vs. TrEMBL
Match: A0A0D2SS03_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G275200 PE=4 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 1.6e-202
Identity = 357/590 (60.51%), Postives = 424/590 (71.86%), Query Frame = 1

Query: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSS 60
           ME  NQK LER+VSQKALQ+GSSFPCQICVVGFLCGVC+ SLFL   TS G+  GF   S
Sbjct: 1   MEPTNQKFLERVVSQKALQMGSSFPCQICVVGFLCGVCLTSLFLAVLTSVGT-FGFTGIS 60

Query: 61  FSPNAMPASLLNSSSENINC------NFRPKEIEKLKDFQ-KIKVNIHDEKKSLLYSAWN 120
           FS  +M  S LNSSSE  N         + KE  +  D + +   +  DE+ SLL  AW 
Sbjct: 61  FSSLSMGNSPLNSSSEISNVITSSDYQSKVKETARWVDSKGREPESDDDERVSLLTEAWG 120

Query: 121 SLLTEPISGRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWKG 180
           +LL +     + F +  GL K+++PN PHLE+CK  A+ NKR D RS    FPPWT+WKG
Sbjct: 121 ALLADREPEESEFSKRFGLSKSSLPNTPHLENCKLSAQVNKRLDTRSGAGRFPPWTTWKG 180

Query: 181 FLDMHPTANTEAS-SYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCRD 240
            L+M+P   T+ +    + Q + +G+YPPW+ G+DEENYPLTRK Q D+WIHQHP NC D
Sbjct: 181 SLNMYPATETDENLRSFKNQPVSDGAYPPWIVGSDEENYPLTRKVQSDIWIHQHPVNCHD 240

Query: 241 PNVRFLVADWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSSW 300
           PN++FLVADWE+LPGFGIGAQ+ GM GLLAIAINEKRVLVT Y+NRADHDGC+  SR SW
Sbjct: 241 PNIKFLVADWEKLPGFGIGAQLAGMAGLLAIAINEKRVLVTGYFNRADHDGCKA-SRGSW 300

Query: 301 SCYFLPETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQP 360
           SCYF  ETSQECRDRAFEL+ N EAW+ G I  K++Y +KEIWT ++PR WG+PWSYLQP
Sbjct: 301 SCYFFLETSQECRDRAFELITNKEAWEKGTIKGKDSYKSKEIWTAKVPRVWGDPWSYLQP 360

Query: 361 TTEVNGSLLSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGEEAAEMVLKSL 420
           TT++NGSL++ H KMDRRWWRAQAVRYLMRFQTEYTCGL+N ARHAAFG+EAA+MVL S+
Sbjct: 361 TTDINGSLIAVHHKMDRRWWRAQAVRYLMRFQTEYTCGLLNVARHAAFGKEAAKMVLASI 420

Query: 421 DGKWPKVV------EFEEY-------------MALATRIRKRFPNLDNIWL--------- 480
           D  WPKV+      E EE+             +++  R+  +   +  +           
Sbjct: 421 DRDWPKVITNQPKSEIEEFVWSNHRPWVPRPLLSMHVRMGDKACEMKVVKFEEYMELAHR 480

Query: 481 -------------STEMQEVIDKTISYPSWKFYYTNVKRQVGNLTMATYEAQLGRITSTN 540
                        STEMQEVIDKT SYP W FYYTNV RQVGN++MATYEA LGR TSTN
Sbjct: 481 IQMHFPHLKNVWLSTEMQEVIDKTRSYPHWNFYYTNVTRQVGNVSMATYEASLGRKTSTN 540

Query: 541 YPLVNFLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 542
           YPLVNFLMA E+DFF+GALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 YPLVNFLMAVESDFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 588

BLAST of Cla007909 vs. TrEMBL
Match: M1CPQ9_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400028063 PE=4 SV=1)

HSP 1 Score: 690.6 bits (1781), Expect = 1.5e-195
Identity = 348/592 (58.78%), Postives = 415/592 (70.10%), Query Frame = 1

Query: 6   QKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSSFSPNA 65
           Q+SLER+VSQ+ALQ+GSSFPCQICV+GFL GVC+ SLFL +F       G   SS     
Sbjct: 3   QRSLERVVSQRALQIGSSFPCQICVLGFLSGVCLTSLFLASFG------GLSLSS----- 62

Query: 66  MPASLLNSSSENI---NCNFRPKEIEK-LKDFQKIKVNIHDEKKSLLYSAWNSLLTEPIS 125
            P S   SSS  +   +CN + K IE+     +K+ + +H E+ SLLYS+W +L+ +  +
Sbjct: 63  -PISTFTSSSTTLISGDCNVKQKNIERRFFSHEKLALGVH-ERVSLLYSSWGNLVNQSAN 122

Query: 126 GRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWKGFLDMHP-T 185
                       K+ +P  PHLEDCK  A+TN+R D R   D+FPPWT WKG LD  P T
Sbjct: 123 E----------GKSNLPKAPHLEDCKLSAETNERLDTRLQNDSFPPWTIWKGQLDNFPWT 182

Query: 186 ANTEASSYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCRDPNVRFLVA 245
           A  E   Y R Q + EG+YPPW+ G+DEENYPLTRK QRD+W+HQHP NC D NV+FL+A
Sbjct: 183 AEGEQLRYYRHQTVSEGAYPPWIKGSDEENYPLTRKVQRDIWLHQHPLNCSDGNVKFLIA 242

Query: 246 DWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSSWSCYFLPET 305
           DWER+PGFGIGAQI GMCGLLAIAINEKRVLVT+YYNRADH+GC+G SRSSWSCYF PET
Sbjct: 243 DWERIPGFGIGAQIAGMCGLLAIAINEKRVLVTSYYNRADHNGCEGSSRSSWSCYFFPET 302

Query: 306 SQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVNGSL 365
           S ECRDRAFEL+ + EAW+ GIIT KENY++KEIW+GR PR+WG PWSYLQPTT++NGSL
Sbjct: 303 SPECRDRAFELMKHKEAWEKGIITIKENYTSKEIWSGRTPRSWGKPWSYLQPTTDINGSL 362

Query: 366 LSNHRKMDRRWWRAQAVRYLMRFQT-----------------------------EYTCGL 425
           ++ HRKMDRRWWRAQAVRYLMRFQT                             ++  G+
Sbjct: 363 IAYHRKMDRRWWRAQAVRYLMRFQTEYTCNLLNHARHAAFGWEAARMVLESQFRDFPKGV 422

Query: 426 MNAARH----------------------AAFGEEAAEMVLKSLDGKWPKVVEFEEYMALA 485
             AA+H                         G++A EM+          VV F+EYM LA
Sbjct: 423 DKAAKHDIESFVWSSHTPWSPRPMLSMHVRMGDKACEMI----------VVGFKEYMHLA 482

Query: 486 TRIRKRFPNLDNIWLSTEMQEVIDKTISYPSWKFYYTNVKRQVGNLTMATYEAQLGRITS 542
            RIRK FPNL +IWLSTEMQEV+D++  YP W FYYTNV RQ+GN+TMATYEA LGR TS
Sbjct: 483 ERIRKHFPNLKSIWLSTEMQEVVDQSRLYPHWTFYYTNVTRQMGNMTMATYEASLGRETS 542

BLAST of Cla007909 vs. TrEMBL
Match: A0A0R0EFX5_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_20G223600 PE=4 SV=1)

HSP 1 Score: 675.6 bits (1742), Expect = 4.9e-191
Identity = 341/559 (61.00%), Postives = 399/559 (71.38%), Query Frame = 1

Query: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSS 60
           ME  NQKS ER+VSQKALQ+G+SFPCQICVVGFLCGVC+ SLFL A TSFGS   FG   
Sbjct: 1   MEPPNQKSFERVVSQKALQMGNSFPCQICVVGFLCGVCLTSLFLAALTSFGS-FQFGPIL 60

Query: 61  FSPNAMPASLLNSSSEN-IN------CNFRPKEIEKLKDFQKIKVNIHDEKKSLLYSAWN 120
           FS  +M  S   S+  N IN      C+F+ KE E+L D +  +   +DE+ SLLYSAW+
Sbjct: 61  FSTMSMANSSGYSTFPNDINMVTRSDCHFKFKETERLGDSKSSRERNNDERVSLLYSAWS 120

Query: 121 SLLTEPISGRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWKG 180
           S+L EP SG    L+  G+  +++PN PHLE+CK K       D+R   + FPPWT+WKG
Sbjct: 121 SVLNEPTSGGKEHLQKHGISGSSLPNAPHLENCKVKTHLYDYLDKRKGYEVFPPWTTWKG 180

Query: 181 FLDMHPTAN-TEASSYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCRD 240
            L   P A   E    LR + + EG+YPPW++G+DEENYPLTRK QRD+W+HQHP NC  
Sbjct: 181 SLQTFPVAAFNEQIQNLRHEAVSEGAYPPWIAGSDEENYPLTRKVQRDIWMHQHPLNCSS 240

Query: 241 PNVRFLVADWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSSW 300
           P+V+FLV DWERLPGFGIGAQI GMCGLL IAINE RVLVTNYYNRADH  C+G SRSSW
Sbjct: 241 PDVKFLVTDWERLPGFGIGAQIAGMCGLLGIAINEGRVLVTNYYNRADHGSCKGSSRSSW 300

Query: 301 SCYFLPETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQP 360
           SCYF PETS ECR RAFEL+ + EAW  GI+T KENY+TK IW G  PR WG PW+YLQP
Sbjct: 301 SCYFFPETSLECRQRAFELMKSEEAWSKGIVTTKENYTTKHIWAGPTPRKWGLPWNYLQP 360

Query: 361 TTEVNGSLLSNHRKMDRRWWRAQ--AVRYLMRFQTEYTCGLMN-------AARHAAFGEE 420
           TT++NG+LL++HRKMDRRWWRAQ  + R       EY                H   G++
Sbjct: 361 TTDINGTLLASHRKMDRRWWRAQESSARPRPDDIDEYVWSNHKPWVPRPLLCMHVRMGDK 420

Query: 421 AAEMVLKSLDGKWPKVVEFEEYMALATRIRKRFPNLDNIWLSTEMQEVIDKTISYPSWKF 480
           A EM          KVV FEEYM LA R R+ FP+L+NIWLSTEMQEVIDKT  Y  W F
Sbjct: 421 ACEM----------KVVGFEEYMQLADRTRRHFPHLNNIWLSTEMQEVIDKTREYSHWNF 480

Query: 481 YYTNVKRQVG-NLTMATYEAQLGRITSTNYPLVNFLMATEADFFVGALGSTWCFLIDGMR 540
           YYT V+RQ   N++MA YEA LGR TSTNYPLVNFLMA ++DF+VGALGS+W FLIDGMR
Sbjct: 481 YYTKVRRQARINMSMAVYEASLGRETSTNYPLVNFLMAADSDFYVGALGSSWSFLIDGMR 540

Query: 541 NTGGKVMAGYLSVNKDRFW 542
           NTGGKVMAGYLSVNKDRFW
Sbjct: 541 NTGGKVMAGYLSVNKDRFW 548

BLAST of Cla007909 vs. TrEMBL
Match: A0A061F2G2_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_026174 PE=4 SV=1)

HSP 1 Score: 657.9 bits (1696), Expect = 1.1e-185
Identity = 338/591 (57.19%), Postives = 409/591 (69.20%), Query Frame = 1

Query: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSS 60
           ME  NQK LER+VSQ+ALQ+GSSFPCQICVVGFLCGVC+ SLFL A TS G+  GFG  S
Sbjct: 1   MEPTNQKFLERVVSQRALQMGSSFPCQICVVGFLCGVCLTSLFLAALTSLGT-YGFGGIS 60

Query: 61  FSPNAMPASLLNSSSENIN------CNFRPKEIEKLKDFQKIKVNIHDEKKSLLYSAWNS 120
           FS  +M  S LNSSSE IN      C F+ KE EK    Q+ K    DE+ SLL  AW +
Sbjct: 61  FSSISMGISPLNSSSEIINVGTSIDCKFKLKETEKWVVSQQRKTESDDERVSLLTEAWGA 120

Query: 121 LLTEPISGRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWKGF 180
           LLT+     + FL+  GL K+++PN PHL++CK  A+  KR D R+  + FPPWT+WKG 
Sbjct: 121 LLTDKADEESEFLQRFGLSKSSIPNAPHLDNCKLSARVKKRLDTRAGAERFPPWTTWKGS 180

Query: 181 LDMHP-TANTEASSYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCRDP 240
           LDM P TA  E   + R Q + EG+YPPW+ G+DEENYPLTRK QRD+WIHQHP NCRDP
Sbjct: 181 LDMFPATAANEQIRHFRHQAISEGAYPPWIVGSDEENYPLTRKVQRDIWIHQHPVNCRDP 240

Query: 241 NVRFLVADWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSSWS 300
            V+FLVADWE LPGFGIGAQ  GMCGLLAIAINEKRVLVTNYYNRADHDGC+G SRSSWS
Sbjct: 241 TVKFLVADWETLPGFGIGAQFAGMCGLLAIAINEKRVLVTNYYNRADHDGCKGSSRSSWS 300

Query: 301 CYFLPETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPT 360
           CYF PETSQECRDRAFEL+   EAW+ GII  KENY++KEIWTGRIPR WG+PWSYLQPT
Sbjct: 301 CYFFPETSQECRDRAFELMQTKEAWEKGIIKGKENYNSKEIWTGRIPRVWGDPWSYLQPT 360

Query: 361 TEVNGSLLSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGEEAAEMVLKSLD 420
           TE+NG+L++ HRKMDRRWWRAQAVRYLMRFQTEYTCGL+N ARHAAFG+EAA+MVL ++D
Sbjct: 361 TEINGTLIAFHRKMDRRWWRAQAVRYLMRFQTEYTCGLLNIARHAAFGKEAAKMVLATID 420

Query: 421 GKWPKVV------EFEEYMALATRIRKRFPNLD-NIWLSTEMQEVIDKTISYPSWKFYYT 480
            +WPKV+      + EE++    +     P L  ++ +  +  E+  K + +  +     
Sbjct: 421 REWPKVITNKPKTDIEEFVWSNHKPWVPRPLLSMHVRMGDKACEM--KVVEFEGYMELAD 480

Query: 481 NVKRQVGNLTMATYEAQLGRI--TSTNYPLVNF------------LMATE---------- 540
           +++++  +L       ++  +   + +YP  NF             MAT           
Sbjct: 481 HIRKRFPHLNNIWLSTEMQEVIDKTKSYPHWNFYYTNVTRQVRNIAMATYEASLGRKTST 540

Query: 541 ----ADFFVGAL--------GSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 542
                +F + A         GSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 NYPLVNFLMAAESDFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 588

BLAST of Cla007909 vs. NCBI nr
Match: gi|659082412|ref|XP_008441826.1| (PREDICTED: uncharacterized protein LOC103485874 isoform X2 [Cucumis melo])

HSP 1 Score: 944.1 bits (2439), Expect = 1.1e-271
Identity = 460/550 (83.64%), Postives = 479/550 (87.09%), Query Frame = 1

Query: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSS 60
           MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTS GSPLGFGWSS
Sbjct: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSLGSPLGFGWSS 60

Query: 61  FSPNAMPASLLNSSSENINCNFRPKEIEKLKDFQKIKVNIHD-EKKSLLYSAWNSLLTEP 120
           FSPN+ PAS  NS+SEN NCNFRPKEIE  KDFQ++KVNI D EK SLLYSAW+SLLTEP
Sbjct: 61  FSPNSQPASSWNSTSENTNCNFRPKEIENPKDFQRVKVNIDDDEKTSLLYSAWSSLLTEP 120

Query: 121 ISGRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWKGFLDMHP 180
           +S RN FLRDLGLDK T+PN PHLE+C  KA+ NKRFDERSATD FP WTSWKGFLD HP
Sbjct: 121 VSRRNTFLRDLGLDKGTIPNAPHLENCMLKAEANKRFDERSATDGFPSWTSWKGFLDTHP 180

Query: 181 TANTEASSYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCRDPNVRFLV 240
           TA TE SS LRRQE FEGSYPPWVSG+DEENYPLTRK QRDLWIHQHP NC D N+RFLV
Sbjct: 181 TAMTEESSNLRRQEKFEGSYPPWVSGSDEENYPLTRKVQRDLWIHQHPLNCSDSNIRFLV 240

Query: 241 ADWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSSWSCYFLPE 300
           ADWERLPGFGIGAQI GMCGLLAIAINEKRVLVTNYYNRADHDGCQG SRSSWSCYFLPE
Sbjct: 241 ADWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFLPE 300

Query: 301 TSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVNGS 360
           TSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVNGS
Sbjct: 301 TSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVNGS 360

Query: 361 LLSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAA--------RHAAFGEEAAEMVLKS 420
           LLS HRKMDRRWWRAQ      +   E      + A         H   G++A EM    
Sbjct: 361 LLSKHRKMDRRWWRAQKDSMTSKRDIEDFVWSDHKAWIPRPLLSMHVRMGDKACEM---- 420

Query: 421 LDGKWPKVVEFEEYMALATRIRKRFPNLDNIWLSTEMQEVIDKTISYPSWKFYYTNVKRQ 480
                 KVVEFEEYMALA RIR+RFPNLDNIWLSTEMQEVIDKT+SYPSWKFYYTNVKRQ
Sbjct: 421 ------KVVEFEEYMALAKRIRRRFPNLDNIWLSTEMQEVIDKTVSYPSWKFYYTNVKRQ 480

Query: 481 VGNLTMATYEAQLGRITSTNYPLVNFLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAG 540
           +GNLTMATYEAQLGRITSTNYPLVNFLMATEADFF+GALGSTWCFLIDGMRNTGGKVMAG
Sbjct: 481 IGNLTMATYEAQLGRITSTNYPLVNFLMATEADFFIGALGSTWCFLIDGMRNTGGKVMAG 540

Query: 541 YLSVNKDRFW 542
           YLSVNKDRFW
Sbjct: 541 YLSVNKDRFW 540

BLAST of Cla007909 vs. NCBI nr
Match: gi|778667929|ref|XP_011649010.1| (PREDICTED: uncharacterized protein LOC101206485 [Cucumis sativus])

HSP 1 Score: 845.1 bits (2182), Expect = 6.7e-242
Identity = 427/584 (73.12%), Postives = 464/584 (79.45%), Query Frame = 1

Query: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSS 60
           MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTS GSPLGFGWSS
Sbjct: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSLGSPLGFGWSS 60

Query: 61  FSPNAMPASLLNSSSENINCNFRPKEIEKLKDFQKIKVNIHDEKKSLLYSAWNSLLTEPI 120
           FSPN+ PASL NS+SENINCNFRPKEIE+L+DFQ+IKVN  DEK SLLYSAW+SL+TEPI
Sbjct: 61  FSPNSQPASLCNSTSENINCNFRPKEIEELRDFQRIKVNNDDEKTSLLYSAWSSLMTEPI 120

Query: 121 SGRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWKGFLDMHPT 180
           S RNAFLRDLGLDKAT+PN PHLE+CK KA+TNKRFDER  TD FPPWTSWKG LD HPT
Sbjct: 121 SSRNAFLRDLGLDKATIPNAPHLENCKLKAETNKRFDERLQTDGFPPWTSWKGILDTHPT 180

Query: 181 ANTEASSYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCRDPNVRFLVA 240
           A TE SSYLRRQEMF GS+PPWVSG+DEENYPLTRK QRDLWIHQHP NC D NVRFLVA
Sbjct: 181 AMTEESSYLRRQEMFGGSFPPWVSGSDEENYPLTRKVQRDLWIHQHPLNCSDSNVRFLVA 240

Query: 241 DWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSSWSCYFLPET 300
           DWERLPGFGIGAQI GMCGLLAIAINEKRVLVTNYYNRADHDGCQG SRSSWSCYFLPET
Sbjct: 241 DWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFLPET 300

Query: 301 SQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVNGSL 360
           SQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPR WGNPWSYLQPTTEVNGSL
Sbjct: 301 SQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSL 360

Query: 361 LSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGEEAAEMVLKSLDGKWPK-- 420
           LS HRKMDRRWWRAQAVRYLMRF+TEYTCGLMNAARHAAFG+EAAEM LKSLDGKWPK  
Sbjct: 361 LSKHRKMDRRWWRAQAVRYLMRFKTEYTCGLMNAARHAAFGKEAAEMALKSLDGKWPKKD 420

Query: 421 ----VVEFEEYMALATRIRKRFPNLD-NIWLSTEMQEVIDKTISYPSWKFYYTNVKRQVG 480
                 + E+++    +     P L  ++ +  +  E+  K + +  +      ++R+  
Sbjct: 421 STTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEM--KVVEFAEYMALAKRIRRRFP 480

Query: 481 NLTMATYEAQLGRI--TSTNYP------------LVNFLMAT-EADF------------F 540
           NL       ++  +   + +YP            + N  MAT EA              F
Sbjct: 481 NLDNIWLSTEMQEVIDKTVSYPSWKFYYTNVKRQVGNLTMATYEAQLGRITSTNYPLVNF 540

Query: 541 VGAL---------GSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 542
           + A          GSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 LMATEADFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 582

BLAST of Cla007909 vs. NCBI nr
Match: gi|659082408|ref|XP_008441824.1| (PREDICTED: uncharacterized protein LOC103485874 isoform X1 [Cucumis melo])

HSP 1 Score: 831.2 bits (2146), Expect = 1.0e-237
Identity = 422/587 (71.89%), Postives = 454/587 (77.34%), Query Frame = 1

Query: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSS 60
           MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTS GSPLGFGWSS
Sbjct: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSLGSPLGFGWSS 60

Query: 61  FSPNAMPASLLNSSSENINCNFRPKEIEKLKDFQKIKVNIHD-EKKSLLYSAWNSLLTEP 120
           FSPN+ PAS  NS+SEN NCNFRPKEIE  KDFQ++KVNI D EK SLLYSAW+SLLTEP
Sbjct: 61  FSPNSQPASSWNSTSENTNCNFRPKEIENPKDFQRVKVNIDDDEKTSLLYSAWSSLLTEP 120

Query: 121 ISGRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWKGFLDMHP 180
           +S RN FLRDLGLDK T+PN PHLE+C  KA+ NKRFDERSATD FP WTSWKGFLD HP
Sbjct: 121 VSRRNTFLRDLGLDKGTIPNAPHLENCMLKAEANKRFDERSATDGFPSWTSWKGFLDTHP 180

Query: 181 TANTEASSYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCRDPNVRFLV 240
           TA TE SS LRRQE FEGSYPPWVSG+DEENYPLTRK QRDLWIHQHP NC D N+RFLV
Sbjct: 181 TAMTEESSNLRRQEKFEGSYPPWVSGSDEENYPLTRKVQRDLWIHQHPLNCSDSNIRFLV 240

Query: 241 ADWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSSWSCYFLPE 300
           ADWERLPGFGIGAQI GMCGLLAIAINEKRVLVTNYYNRADHDGCQG SRSSWSCYFLPE
Sbjct: 241 ADWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFLPE 300

Query: 301 TSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVNGS 360
           TSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVNGS
Sbjct: 301 TSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVNGS 360

Query: 361 LLSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGEEAAEMVLKSLDGKWPKV 420
           LLS HRKMDRRWWRAQAVRYLMRF+TEY CGLMNAARHAAFG+EAAEMVLKSLDGKWPK 
Sbjct: 361 LLSKHRKMDRRWWRAQAVRYLMRFKTEYMCGLMNAARHAAFGKEAAEMVLKSLDGKWPK- 420

Query: 421 VEFEEYMALATRIRKRFPNLDNIWLSTEM---------QEVIDKTISYPSWKFYYTNVKR 480
              ++ M     I     +    W+   +         +    K + +  +      ++R
Sbjct: 421 ---KDSMTSKRDIEDFVWSDHKAWIPRPLLSMHVRMGDKACEMKVVEFEEYMALAKRIRR 480

Query: 481 QVGNLTMATYEAQLGRI--TSTNYP------------LVNFLMAT-EADF---------- 540
           +  NL       ++  +   + +YP            + N  MAT EA            
Sbjct: 481 RFPNLDNIWLSTEMQEVIDKTVSYPSWKFYYTNVKRQIGNLTMATYEAQLGRITSTNYPL 540

Query: 541 --FVGAL---------GSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 542
             F+ A          GSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 VNFLMATEADFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 583

BLAST of Cla007909 vs. NCBI nr
Match: gi|702271996|ref|XP_010043633.1| (PREDICTED: uncharacterized protein LOC104432792 [Eucalyptus grandis])

HSP 1 Score: 731.1 bits (1886), Expect = 1.4e-207
Identity = 369/578 (63.84%), Postives = 432/578 (74.74%), Query Frame = 1

Query: 1   MEAQ-NQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWS 60
           MEAQ NQ ++ER+VS +ALQ+GSSFPCQ+CVVGFL GVC+A+LFL A TSFG+  GF   
Sbjct: 11  MEAQSNQTAIERVVSGRALQMGSSFPCQVCVVGFLWGVCLATLFLAALTSFGA-FGFSRV 70

Query: 61  SFSPNAMPASLLNSSSENINC------NFRPKEIEKLKDFQKIKVNIHD-EKKSLLYSAW 120
           S SP +M     NSSS  IN       +F PK      D    +    D E+ SLLYSAW
Sbjct: 71  SLSPISMGIPQGNSSSNIINIFARSDQDFDPKRPPGNVDSVPSEEQHQDRERASLLYSAW 130

Query: 121 NSLLTEPISGRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWK 180
           ++LL EP  G    LR LGL ++ VP+ PHLEDCKS A  NKR D     + FPPWT WK
Sbjct: 131 SALLKEPTDGSEE-LRKLGLSRSMVPSAPHLEDCKSHAGVNKRLDSYVKNEIFPPWTGWK 190

Query: 181 GFLDMHPTANT-EASSYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCR 240
           G L MH  + T E    LRRQE+ EGS PPW++G+D+ENYPLTRK Q DLWIHQHPSNC 
Sbjct: 191 GILKMHSVSVTDEQPENLRRQEISEGSSPPWITGSDQENYPLTRKVQSDLWIHQHPSNCS 250

Query: 241 DPNVRFLVADWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSS 300
           DP +RFLVADWER PGFGIGAQI  MCGLLAIA+ EKRVLVTNYYNRADHDGC+G +RSS
Sbjct: 251 DPRLRFLVADWERSPGFGIGAQIAAMCGLLAIALREKRVLVTNYYNRADHDGCKGSARSS 310

Query: 301 WSCYFLPETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQ 360
           WSCYFLPETSQEC+DRA EL+ +++A +SGIITAK+NYS+K+IW G+IP  WG+PW+YL+
Sbjct: 311 WSCYFLPETSQECQDRALELIRSHDACESGIITAKQNYSSKQIWHGKIPNFWGDPWTYLK 370

Query: 361 PTTEVNGSLLSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGEEAAEMVLKS 420
           PTTE+NG L+S HR+MD RWWRAQ VRYL RFQT+YTCGLMNAARHAAFG EAA++VL +
Sbjct: 371 PTTEINGRLISRHREMDIRWWRAQGVRYLTRFQTKYTCGLMNAARHAAFGAEAAKIVLPA 430

Query: 421 LDGK----WP------------------------KVVEFEEYMALATRIRKRFPNLDNIW 480
             G     W                         KVV FEEY+ LA R+R RFP+L +IW
Sbjct: 431 CVGHRKAVWSDRKPWVPRPLLSMHVRMGDKAGEMKVVGFEEYIRLANRMRMRFPHLRSIW 490

Query: 481 LSTEMQEVIDKTISYPSWKFYYTNVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMATEA 540
           LSTEMQEV+DK+ SY SW FY T+V+RQ+G+ +MA YEA LGR TSTNYPLVNFLMA++A
Sbjct: 491 LSTEMQEVVDKSRSYASWNFYNTDVRRQIGDTSMAEYEASLGRETSTNYPLVNFLMASDA 550

Query: 541 DFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 542
           DFF+GALGS WCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 551 DFFIGALGSNWCFLIDGMRNTGGKVMAGYLSVNKDRFW 586

BLAST of Cla007909 vs. NCBI nr
Match: gi|743888617|ref|XP_011038497.1| (PREDICTED: uncharacterized protein LOC105135349 isoform X2 [Populus euphratica])

HSP 1 Score: 718.4 bits (1853), Expect = 9.5e-204
Identity = 355/558 (63.62%), Postives = 419/558 (75.09%), Query Frame = 1

Query: 2   EAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSSF 61
           E  NQKSLER+VSQKALQ+GSSF  QICVVGFL GVC+ SLFL A TS G+   FG  SF
Sbjct: 129 EPLNQKSLERVVSQKALQIGSSFSFQICVVGFLSGVCLTSLFLAALTSLGT-FEFGGISF 188

Query: 62  SPNAMPASLLNSSSENI-------NCNFRPKEI--EKLKDFQKIKVNIHDEKKSLLYSAW 121
           S  ++  S LNSSS          +C F+ +EI  E+  D ++ +  + DE+ SLL+SAW
Sbjct: 189 SSISLGNSPLNSSSSGFFNTVTSADCKFKREEILTERWVDSKRSENGVDDERVSLLHSAW 248

Query: 122 NSLLTEPISGRNAFLRDLGLDKATVPNPPHLEDCKSKAKTNKRFDERSATDAFPPWTSWK 181
           ++LL+E + G  AF +  GL K+ VPN PHLE+CK   + N+  D+R+  +  PPWT+WK
Sbjct: 249 SALLSESVDGEIAFWKSSGLRKSAVPNAPHLENCKLSEQINEHLDKRAENERLPPWTTWK 308

Query: 182 GFLDMHPTAN-TEASSYLRRQEMFEGSYPPWVSGADEENYPLTRKAQRDLWIHQHPSNCR 241
           G L+ HP +  TE   YLR Q + EG+YPPW++G+DEENYPLTRK QRD+W+HQHP NCR
Sbjct: 309 GLLNAHPASMPTEQLRYLRHQAIPEGAYPPWITGSDEENYPLTRKVQRDIWLHQHPENCR 368

Query: 242 DPNVRFLVADWERLPGFGIGAQIVGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLSRSS 301
           DPN+RFLVA+WERLPGFGIGAQ+ GMCGLLAIAINEKRVLVT+YYNRADHDGC+G  RSS
Sbjct: 369 DPNIRFLVAEWERLPGFGIGAQLAGMCGLLAIAINEKRVLVTSYYNRADHDGCKGSLRSS 428

Query: 302 WSCYFLPETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQ 361
           WSCYF PETSQECRDRAFELLGN EA + GI+T K+NY++KEIWTGR PR WG PW +LQ
Sbjct: 429 WSCYFFPETSQECRDRAFELLGNKEALERGIVTTKDNYTSKEIWTGRTPRVWGEPWRFLQ 488

Query: 362 PTTEVNGSLLSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMN--------AARHAAFGEE 421
           PTTE+NGSL+ +HRKMDRRWWRAQ      R   E      +         + H   G++
Sbjct: 489 PTTEINGSLVVSHRKMDRRWWRAQDFGNKRRSDIEEFVWSNHRPWTPRPLLSMHVRMGDK 548

Query: 422 AAEMVLKSLDGKWPKVVEFEEYMALATRIRKRFPNLDNIWLSTEMQEVIDKTISYPSWKF 481
           A EM          KVVEFE YM LA RIR+ FP+L ++WLSTEMQEVI+K+  Y +W F
Sbjct: 549 ACEM----------KVVEFEGYMHLADRIRQHFPHLKSVWLSTEMQEVINKSKLYTNWNF 608

Query: 482 YYTNVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMATEADFFVGALGSTWCFLIDGMRN 541
           YYTNV RQVGN+TMATYEA LGR TSTNYPLVNFLMA EADFFVGALGSTWCFLIDGMRN
Sbjct: 609 YYTNVTRQVGNMTMATYEASLGRKTSTNYPLVNFLMAAEADFFVGALGSTWCFLIDGMRN 668

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0LKK1_CUCSA4.7e-24273.12Uncharacterized protein OS=Cucumis sativus GN=Csa_2G075410 PE=4 SV=1[more]
A0A0D2SS03_GOSRA1.6e-20260.51Uncharacterized protein OS=Gossypium raimondii GN=B456_007G275200 PE=4 SV=1[more]
M1CPQ9_SOLTU1.5e-19558.78Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400028063 PE=4 SV=1[more]
A0A0R0EFX5_SOYBN4.9e-19161.00Uncharacterized protein OS=Glycine max GN=GLYMA_20G223600 PE=4 SV=1[more]
A0A061F2G2_THECC1.1e-18557.19Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_026174 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|659082412|ref|XP_008441826.1|1.1e-27183.64PREDICTED: uncharacterized protein LOC103485874 isoform X2 [Cucumis melo][more]
gi|778667929|ref|XP_011649010.1|6.7e-24273.12PREDICTED: uncharacterized protein LOC101206485 [Cucumis sativus][more]
gi|659082408|ref|XP_008441824.1|1.0e-23771.89PREDICTED: uncharacterized protein LOC103485874 isoform X1 [Cucumis melo][more]
gi|702271996|ref|XP_010043633.1|1.4e-20763.84PREDICTED: uncharacterized protein LOC104432792 [Eucalyptus grandis][more]
gi|743888617|ref|XP_011038497.1|9.5e-20463.62PREDICTED: uncharacterized protein LOC105135349 isoform X2 [Populus euphratica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0003674 molecular_function
molecular_function GO:0016740 transferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
WMU16090watermelon EST collection version 2.0transcribed_cluster
WMU23212watermelon EST collection version 2.0transcribed_cluster
WMU39603watermelon EST collection version 2.0transcribed_cluster
WMU44721watermelon EST collection version 2.0transcribed_cluster
WMU48957watermelon EST collection version 2.0transcribed_cluster
WMU51163watermelon EST collection version 2.0transcribed_cluster
WMU53601watermelon EST collection version 2.0transcribed_cluster
WMU62310watermelon EST collection version 2.0transcribed_cluster
WMU78007watermelon EST collection version 2.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla007909Cla007909.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
WMU39603WMU39603transcribed_cluster
WMU16090WMU16090transcribed_cluster
WMU23212WMU23212transcribed_cluster
WMU78007WMU78007transcribed_cluster
WMU44721WMU44721transcribed_cluster
WMU51163WMU51163transcribed_cluster
WMU62310WMU62310transcribed_cluster
WMU48957WMU48957transcribed_cluster
WMU53601WMU53601transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR13132ALPHA- 1,6 -FUCOSYLTRANSFERASEcoord: 5..541
score: 8.2E
NoneNo IPR availablePANTHERPTHR13132:SF29ALPHA-(1,6)-FUCOSYLTRANSFERASEcoord: 5..541
score: 8.2E