ClCG06G001420 (gene) Watermelon (Charleston Gray)

Name: ClCG06G001420
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Pre-mRNA processing protein PRP39, putative, expressed
Location: CG_Chr06 : 1482232 .. 1486495 (-)
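
The location string above can be split into its parts programmatically. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("CG_Chr06 : 1482232 .. 1486495 (-)")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
length = end - start + 1
print(seqid, strand, length)
```

The "(-)" marks the feature as lying on the reverse strand of CG_Chr06.
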
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TTCGATTCTTAGCTTTTCTCCAATTTTTCCTCATTCCCGAATGATTTGACCTATACGATGCCCGTCCATACCTGAAAATGCCGCCCATAAGGCTCGTTTCCATCTTCAAGCGCTCTCCAATGCCACATACTTCCAGGTGCGACGTTCTTGTAATTGGGATGATATCTTCTTCCCCTTTGCTTTTGCCTGTTTTTGTTTGTGATCCTATTTCTGGGTTTTATTTACCAAACATTGAATTGATCGCTAGCTCTTGGGTTTTGTCTCTTTGACGGGTGAATGATTCCTGGGGCTGTAAAGCACTCTCGTTATGCATATTATTCGAGAAACTATTTGAAGAGCCCGTTGATGGTTATTTCAACAGTAACTTTTGTTTTAAAAAGATTACATCCTTCAAGAATTTCAAGAATTAGTATGATGGTTATTTCAAAATACATCAGGCTGCCACAATTTGGCCAACAGTTTAAAACTGTTAGATTAGTGAGAGAAATACTGCAAAAGTATCTCAAATGTATATCTAGTAGTAGTTATTCGAGAACATACGGATTTTTGAATCTACATTCCATAGTTGTTATCTCATTCCATTTTATAAAAACAATCTGAAAAAGTAGGATTTAAATACACTTCTTAATTGAATTTTGAGGATAAAATTAGTAAAAGGAGGTAATAAATTGTAGGACTAATTTCTGGTATTCTAGAATTTAAGCATGTATGGCATTTTGGTAACTTAATTTTGTGAAATTTTGTTGGAGAAGAAGGTTTAGAACTTTGGAATAGAAGAAAATAATTTCGTTATTGACATTGGTGAACTATAACTTGTTAAAAAAAAATAGAAAAAGAGAAAAAAAAAATGGAAATGGGAGTTTGCTATGTTGGAAAAGAAAAGGGAAAGGAATTGGTTATTGACCTCACAATGTATCATAGCTGTAGCAAGCATGTCTGAATTGTTAAGGCTCAGGGCTTTTCAAATTCTGGATAAGCAATAAAGTTTGATCAAATGAAAAGTATGAATTTGGCCTAATAAACTGGTGTATCTTCGAATTGTTTGTTATGCATTTGTTTGCTACCATGTTTTACTTATTTGTGCTTCGAAAGTCAAGGAAACTCTCCCATTAAACCATGTTGTCGTCTTCATAAAGCCATTTAACTTAAAAAGTCAAGGGGGGCATTGAAGTTGTTGAGTCTTGGTAGTTATCAGCCTTTAAAATTGGCTGTGGTTTTATATTACTCAGGTCATTGTGTTTTTCGAACCATGTTTTTTCTTCACTTATGAACTAATCAATTTGATTGGATGAAGTTTCTTATAAGTACGCAATGGTTCTTTCTTTTTTTCTTGTGGGGATATTTTCCTTGTATAACTTTTCCTTGTTCCTCCAGATGGCATCCGGATGCAACTGGTTCCACTGGAACAATGTTCACTATCTTTTGCCCTCAGATGAGCCTGACCATTTCTCGTTGCCTTCCCCAACTCCTGAATGGCCTCAAGGTATATGATACTCTCAAATAAGAATGAGTTGTAAACTATGGCATTTTGATCATTTTTTGTTTTATTTGTTTCTTTTTTACTTCTCATTGAATTTGAGGCCATTCTTTGTTTGTTTCTGAGTCTTTGACTCAATGTGTGGCTTTTGTTGATAAGGAGTATTGGACTAACTACTTGATCTCGCTTTTTCCCCATTAAGCTAATTGGGTTTGTTGTCGTATAAGATTTCTAATGTTGAAGTAGGAGAGAAATATAATGTTGATCTCCTGTTGTTGAAAACTATGAGGCAATTGTTTTACATTTCCCGAAAATCCCACTTCAAGCTAAAAGCGATGTGACCACACAATGTAAAAGGATGCATGTGGAGCAGAGCATCCATTTACATTTGAGGTCTCCGTCTCTATGGCCTGGGAAGTTTTTGGATAAATTTTCACCAGGGTAAAATCTTGAAGAAATTGTAAATCTGTGCGGTAATCCTTTAATGTTGTCATATTTGTTAATGATTTAATGCTTGAGTATT
GTTTTTGTGTCGTTACATTGTATTTGGACTCCCCTGCATGCACACACGCACACCAACCAATAATAATATTCATTGATATAATTGAATCATGTAGGAAAAAAGGTATTCTCTTCATATCATCTTTGTTGATGCTCCTACTGAGATTATTGTCAGCCAACATTCTCCTGATACTATAAGCCGATTTTGTTTTCATTTCAAGAAAACAAAATAGTTAACATGTCTCTAGTATCTTTAAATATAGGTGTCTAAACGTGTTGAGTTTGCCAAGTTCTCTAACTTGTGAATAAGTAATTAAAAAATTGAGAGCTTTTACTTGTTTAGTTGCAGTTATATATACAAATATTGATCCTCCTTTAATGGAACTTGATGCATGTCATCCTAGGTGGAGGTTTTGCTTCTGGAATAGCAAGTCTTGGGGAGATTGAGGTTCGCAAAATCACCCAATTTGTTTCCATATGGGGTTGCAATCTAACACGCCGAGGAAACAATGGTGTCACATTTTACAGACCATTAAGGATACCCGAAGGATTTCACTGCCTTGGTCACTATTGCCAACCTAATGACCGGCCACTGCACGGTTATCTTCTCACGGCGAGGGAAGTAGATGGTTATTTTCAGGAAAGTGATCATATTAGCAACATTGTTAAATTGCCAGCCCTTGTGGAACCCCTTGATTATACATTGATATGGAGTCCAGATGATGGGAGTGAGGAGAAGTACAGTGAATGTGCCTACATTTGGCTACCTCAACCACCTGATGGTTATAAATCCATGGGTTATTTTGTCACTAACAAGCTAGAAAAGCCTGAAGTGGGTGAAGTAAGGTGCGTTCGAGCTGATTTAACCGATAGATGCGAAACTTATCGCTTAATGTTTAATATCAGTTCTAAATGTAAAAACTTTCTAGTACAGATTTGGAGTACAAGAGCATGTCACAGAGGGATGCTTGGTAGGGGAGTTCCTGTAGGAACATTTCATTGTGGTAGTTACGAAGACACTGAAAAAGAGCTTCCTATTGCATGCTTAAAAAACCTGGATTCTACACTTTCTACAATGCCCAACCTTAATCAGATTCATGCTCTAATCAACCACTACGGACCCACTGTTTTCTTCCATCCCAAAGAGATCTACTTGCCATCTTCCGTTTCATGGTTTTTCGAAAATGGGGTGCTATTACACAGAGATGGCATATCATCTGGGGAAACCATACATGTTTGTGGCAAAAATTTGCCTGGTGGTGGGAGAAATGATAGGGGATGTTGGATGGATTTGCCAACTGATGGCTGTAGAGACAAGATCATATATGGAAATCTGGAAAGTGCGAAGCTTTACGTTCATGTAAAGCCAGCTCTGGGCGGGACATTCACGGATATTGCTATGTGGGTCTTTTGTCCCTTCAATGGACCATCCACTCTCAAACTTGGAATTATGAATATTAGTCTAGGGAAAATTGGACAACATGTGGGGGATTGGGAGCATATCACTCTTAGGATTTGCAACTTTACAGGAGAACTTTGTAGCATTTACTTCTCCCAGCACAGTGGTGGTGAATGGGTGGATGCTTACAATTTGGAATTCATAGAAGGAAACAAAGCCATAGTTTACTCCTCAAAGAGTGGACATGCTAGCTATCCTCATCCTGGAGTCTACATCCAAGGCTCTGCAATGCTCGGGATCGGAATAAGGAATGACTGCGCACGTAGTCATCTTTTTGTTGATTCAAGCATCCATTATGAAATAGTTGCAGCAGAGTATCTGAGATGCAATGGCATTGTGGAGCCTTGTTGGTTGCAATTCATGAGAGAATGGGGTCCAACTATTGTCTACAGCTCGAGAACGAAGCTTGACAACATCATCGATCGCCTTCCGTTGAAGATTCGGTGTACAGTTGCAAATATATTTAGAATGTTACCAGGGGAATTGTTTGGAGAGGGTGGTCCAACTGGGCCAAAGGAGAAGAACAATTGGGAAGGAGATGAGAGAGGCTAAAATTTTGTTGT
GCATTCTAAATTCATTGCAGCTTGAGATATTCATATGACAAAGAGGGATGTGCTTTATTGAAGCAAACTTTGTGCTTGCTTGGAGTTTTGATTCAATATTTAGAAAACCCAAACTCAATTTCTTATGTATTGATTCTTGGTTGAGACTATAGGTAGTACATAGATGGAATCTCGACCTCTTTCTATTGTAATGTAGCTTGAACTGACAAACATTCTATATCAAATAGAGAAAGAAGGGCATACCAAAAATAAGTAGTTAGGGAA

mRNA sequence

TTCGATTCTTAGCTTTTCTCCAATTTTTCCTCATTCCCGAATGATTTGACCTATACGATGCCCGTCCATACCTGAAAATGCCGCCCATAAGGCTCGTTTCCATCTTCAAGCGCTCTCCAATGCCACATACTTCCAGATGGCATCCGGATGCAACTGGTTCCACTGGAACAATGTTCACTATCTTTTGCCCTCAGATGAGCCTGACCATTTCTCGTTGCCTTCCCCAACTCCTGAATGGCCTCAAGGTGGAGGTTTTGCTTCTGGAATAGCAAGTCTTGGGGAGATTGAGGTTCGCAAAATCACCCAATTTGTTTCCATATGGGGTTGCAATCTAACACGCCGAGGAAACAATGGTGTCACATTTTACAGACCATTAAGGATACCCGAAGGATTTCACTGCCTTGGTCACTATTGCCAACCTAATGACCGGCCACTGCACGGTTATCTTCTCACGGCGAGGGAAGTAGATGGTTATTTTCAGGAAAGTGATCATATTAGCAACATTGTTAAATTGCCAGCCCTTGTGGAACCCCTTGATTATACATTGATATGGAGTCCAGATGATGGGAGTGAGGAGAAGTACAGTGAATGTGCCTACATTTGGCTACCTCAACCACCTGATGGTTATAAATCCATGGGTTATTTTGTCACTAACAAGCTAGAAAAGCCTGAAGTGGGTGAAGTAAGGTGCGTTCGAGCTGATTTAACCGATAGATGCGAAACTTATCGCTTAATGTTTAATATCAGTTCTAAATGTAAAAACTTTCTAGTACAGATTTGGAGTACAAGAGCATGTCACAGAGGGATGCTTGGTAGGGGAGTTCCTGTAGGAACATTTCATTGTGGTAGTTACGAAGACACTGAAAAAGAGCTTCCTATTGCATGCTTAAAAAACCTGGATTCTACACTTTCTACAATGCCCAACCTTAATCAGATTCATGCTCTAATCAACCACTACGGACCCACTGTTTTCTTCCATCCCAAAGAGATCTACTTGCCATCTTCCGTTTCATGGTTTTTCGAAAATGGGGTGCTATTACACAGAGATGGCATATCATCTGGGGAAACCATACATGTTTGTGGCAAAAATTTGCCTGGTGGTGGGAGAAATGATAGGGGATGTTGGATGGATTTGCCAACTGATGGCTGTAGAGACAAGATCATATATGGAAATCTGGAAAGTGCGAAGCTTTACGTTCATGTAAAGCCAGCTCTGGGCGGGACATTCACGGATATTGCTATGTGGGTCTTTTGTCCCTTCAATGGACCATCCACTCTCAAACTTGGAATTATGAATATTAGTCTAGGGAAAATTGGACAACATGTGGGGGATTGGGAGCATATCACTCTTAGGATTTGCAACTTTACAGGAGAACTTTGTAGCATTTACTTCTCCCAGCACAGTGGTGGTGAATGGGTGGATGCTTACAATTTGGAATTCATAGAAGGAAACAAAGCCATAGTTTACTCCTCAAAGAGTGGACATGCTAGCTATCCTCATCCTGGAGTCTACATCCAAGGCTCTGCAATGCTCGGGATCGGAATAAGGAATGACTGCGCACGTAGTCATCTTTTTGTTGATTCAAGCATCCATTATGAAATAGTTGCAGCAGAGTATCTGAGATGCAATGGCATTGTGGAGCCTTGTTGGTTGCAATTCATGAGAGAATGGGGTCCAACTATTGTCTACAGCTCGAGAACGAAGCTTGACAACATCATCGATCGCCTTCCGTTGAAGATTCGGTGTACAGTTGCAAATATATTTAGAATGTTACCAGGGGAATTGTTTGGAGAGGGTGGTCCAACTGGGCCAAAGGAGAAGAACAATTGGGAAGGAGATGAGAGAGGCTAAAATTTTGTTGTGCATTCTAAATTCATTGCAGCTTGAGATATTCATATGACAAAGAGGGATGTGCTTTATTGAAGCAAACTTTGTGCTTGCTTGGAGTTTTGATTCAATATTTAGAAAACCCAAACTCAATTTCTTATGTATTGATTCT
TGGTTGAGACTATAGGTAGTACATAGATGGAATCTCGACCTCTTTCTATTGTAATGTAGCTTGAACTGACAAACATTCTATATCAAATAGAGAAAGAAGGGCATACCAAAAATAAGTAGTTAGGGAA

Coding sequence (CDS)

ATGGCATCCGGATGCAACTGGTTCCACTGGAACAATGTTCACTATCTTTTGCCCTCAGATGAGCCTGACCATTTCTCGTTGCCTTCCCCAACTCCTGAATGGCCTCAAGGTGGAGGTTTTGCTTCTGGAATAGCAAGTCTTGGGGAGATTGAGGTTCGCAAAATCACCCAATTTGTTTCCATATGGGGTTGCAATCTAACACGCCGAGGAAACAATGGTGTCACATTTTACAGACCATTAAGGATACCCGAAGGATTTCACTGCCTTGGTCACTATTGCCAACCTAATGACCGGCCACTGCACGGTTATCTTCTCACGGCGAGGGAAGTAGATGGTTATTTTCAGGAAAGTGATCATATTAGCAACATTGTTAAATTGCCAGCCCTTGTGGAACCCCTTGATTATACATTGATATGGAGTCCAGATGATGGGAGTGAGGAGAAGTACAGTGAATGTGCCTACATTTGGCTACCTCAACCACCTGATGGTTATAAATCCATGGGTTATTTTGTCACTAACAAGCTAGAAAAGCCTGAAGTGGGTGAAGTAAGGTGCGTTCGAGCTGATTTAACCGATAGATGCGAAACTTATCGCTTAATGTTTAATATCAGTTCTAAATGTAAAAACTTTCTAGTACAGATTTGGAGTACAAGAGCATGTCACAGAGGGATGCTTGGTAGGGGAGTTCCTGTAGGAACATTTCATTGTGGTAGTTACGAAGACACTGAAAAAGAGCTTCCTATTGCATGCTTAAAAAACCTGGATTCTACACTTTCTACAATGCCCAACCTTAATCAGATTCATGCTCTAATCAACCACTACGGACCCACTGTTTTCTTCCATCCCAAAGAGATCTACTTGCCATCTTCCGTTTCATGGTTTTTCGAAAATGGGGTGCTATTACACAGAGATGGCATATCATCTGGGGAAACCATACATGTTTGTGGCAAAAATTTGCCTGGTGGTGGGAGAAATGATAGGGGATGTTGGATGGATTTGCCAACTGATGGCTGTAGAGACAAGATCATATATGGAAATCTGGAAAGTGCGAAGCTTTACGTTCATGTAAAGCCAGCTCTGGGCGGGACATTCACGGATATTGCTATGTGGGTCTTTTGTCCCTTCAATGGACCATCCACTCTCAAACTTGGAATTATGAATATTAGTCTAGGGAAAATTGGACAACATGTGGGGGATTGGGAGCATATCACTCTTAGGATTTGCAACTTTACAGGAGAACTTTGTAGCATTTACTTCTCCCAGCACAGTGGTGGTGAATGGGTGGATGCTTACAATTTGGAATTCATAGAAGGAAACAAAGCCATAGTTTACTCCTCAAAGAGTGGACATGCTAGCTATCCTCATCCTGGAGTCTACATCCAAGGCTCTGCAATGCTCGGGATCGGAATAAGGAATGACTGCGCACGTAGTCATCTTTTTGTTGATTCAAGCATCCATTATGAAATAGTTGCAGCAGAGTATCTGAGATGCAATGGCATTGTGGAGCCTTGTTGGTTGCAATTCATGAGAGAATGGGGTCCAACTATTGTCTACAGCTCGAGAACGAAGCTTGACAACATCATCGATCGCCTTCCGTTGAAGATTCGGTGTACAGTTGCAAATATATTTAGAATGTTACCAGGGGAATTGTTTGGAGAGGGTGGTCCAACTGGGCCAAAGGAGAAGAACAATTGGGAAGGAGATGAGAGAGGCTAA

Protein sequence

MASGCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIWGCNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISNIVKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEVGEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYEDTEKELPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVLLHRDGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDGCRDKIIYGNLESAKLYVHVKPALGGTFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIYFSQHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARSHLFVDSSIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKIRCTVANIFRMLPGELFGEGGPTGPKEKNNWEGDERG
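
As a consistency check, the protein sequence can be reproduced from the CDS by codon translation. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for nuclear plant genes but is not stated on this page. The function name `translate` is illustrative, and only the first and last codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGGCATCCGGATGCAACTGGTTCCACTGG"))
print(translate("GATGAGAGAGGCTAA"))
```

The two fragments should correspond to the start (MASGCNWFHW) and end (DERG) of the protein sequence shown above.
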
BLAST of ClCG06G001420 vs. TrEMBL
Match: A0A0A0KDB1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G088000 PE=4 SV=1)

HSP 1 Score: 1042.3 bits (2694), Expect = 2.1e-301
Identity = 491/535 (91.78%), Positives = 507/535 (94.77%), Query Frame = 1

Query: 37  GGGFASGIASLGEIEVRKITQFVSIWGCNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPN 96
           GGGFASGIASLGEIEV KITQFVSIWGCNL+RRGNNGVTFYRPLR+PEG+HCLGHYCQPN
Sbjct: 64  GGGFASGIASLGEIEVLKITQFVSIWGCNLSRRGNNGVTFYRPLRMPEGYHCLGHYCQPN 123

Query: 97  DRPLHGYLLTAREVDGYFQESDHISNIVKLPALVEPLDYTLIWSPDDGSEEKYSECAYIW 156
           DRPLHGYLL AREVDGYFQESDHISNIVKLPALVEP+D+TLIWSPDDGSEEKY ECAYIW
Sbjct: 124 DRPLHGYLLVAREVDGYFQESDHISNIVKLPALVEPIDFTLIWSPDDGSEEKYGECAYIW 183

Query: 157 LPQPPDGYKSMGYFVTNKLEKPEVGEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWS 216
           LPQPPDGYKSMGYFVTNKLEKP VGEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWS
Sbjct: 184 LPQPPDGYKSMGYFVTNKLEKPVVGEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWS 243

Query: 217 TRACHRGMLGRGVPVGTFHCGSYEDTEKELPIACLKNLDSTLSTMPNLNQIHALINHYGP 276
           TRACHRGMLGRGVPVGTFHCGSY+ TEKELPIACLKNL+STL TMPN++QIH+LINHYGP
Sbjct: 244 TRACHRGMLGRGVPVGTFHCGSYKGTEKELPIACLKNLNSTLPTMPNIDQIHSLINHYGP 303

Query: 277 TVFFHPKEIYLPSSVSWFFENGVLLHRDGISSGETIHVCGKNLPGGGRNDRGCWMDLPTD 336
           TVFFHPKEIYLPSSVSWFFENGVLLHRDG+SSGE I VCG NLP  GRND  CWMDLPTD
Sbjct: 304 TVFFHPKEIYLPSSVSWFFENGVLLHRDGMSSGEAILVCGTNLPTDGRNDTVCWMDLPTD 363

Query: 337 GCRDKIIYGNLESAKLYVHVKPALGGTFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQH 396
           GCRDKII GNLESAKLY HVKPALGGTFTDIAMWVFCPFNGPSTLKLGI+NISLGKIGQH
Sbjct: 364 GCRDKIINGNLESAKLYAHVKPALGGTFTDIAMWVFCPFNGPSTLKLGIVNISLGKIGQH 423

Query: 397 VGDWEHITLRICNFTGELCSIYFSQHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHP 456
           VGDWEHITLRICNFTGEL SIYFSQHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYP P
Sbjct: 424 VGDWEHITLRICNFTGELFSIYFSQHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPRP 483

Query: 457 GVYIQGSAMLGIGIRNDCARSHLFVDSSIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTI 516
           G+YIQGS+ LGIGIRNDCARSHLF+DSS HYEIVAAE+LR N IVEP WLQFMREWGPTI
Sbjct: 484 GLYIQGSSKLGIGIRNDCARSHLFIDSSTHYEIVAAEHLRRNDIVEPGWLQFMREWGPTI 543

Query: 517 VYSSRTKLDNIIDRLPLKIRCTVANIFRMLPGELFGEGGPTGPKEKNNWEGDERG 572
           VYSSRTKLDN IDRLPLKIR  VANIFR LP ELFGE GPTGPKEKNNWEGDERG
Sbjct: 544 VYSSRTKLDNFIDRLPLKIRFPVANIFRKLPAELFGEVGPTGPKEKNNWEGDERG 598
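
The percentages in an HSP summary line are the matched-column counts divided by the alignment length. The sketch below recomputes them from the summary line of the alignment above; the function name `hsp_percentages` is hypothetical, and the pattern accepts both the "Postives" spelling found in raw reports of this kind and "Positives".

```python
import re

# Matches e.g. "Identity = 491/535" and "Positives = 507/535" (or "Postives").
STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return identity and positives as percentages of the alignment length."""
    out = {}
    for label, matched, length in STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        out[key] = round(100 * int(matched) / int(length), 2)
    return out

stats = hsp_percentages(
    "Identity = 491/535 (91.78%), Postives = 507/535 (94.77%), Query Frame = 1"
)
print(stats)
```

Here 491 of 535 aligned columns are identical residues, and 507 of 535 are either identical or conservative substitutions (the "+" marks in the midline).
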

BLAST of ClCG06G001420 vs. TrEMBL
Match: W9QPL3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_015871 PE=4 SV=1)

HSP 1 Score: 806.2 bits (2081), Expect = 2.5e-230
Identity = 376/569 (66.08%), Positives = 441/569 (77.50%), Query Frame = 1

Query: 3   SGCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIW 62
           S CN   WN  H      EP+ FSLP+P P+WPQG GFASG  S+GE+EV K+T+F  IW
Sbjct: 4   SSCNCLCWNK-HTDFSLSEPETFSLPAPLPKWPQGKGFASGRISIGELEVFKVTRFEFIW 63

Query: 63  GCNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISN 122
             NL++  N G +FY+P  IP+GFH LGH+CQPN++PL G+LL AREV  +  ES H SN
Sbjct: 64  CYNLSQDKNKGFSFYKPAEIPDGFHSLGHFCQPNNQPLRGFLLVAREVASHMPESCHASN 123

Query: 123 IVKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEVGE 182
             KLP L EPLDY L+WSPDD SEEK   C Y WLPQ P+GYK +G+ VTNK  KP + E
Sbjct: 124 TAKLPVLCEPLDYVLVWSPDDWSEEKCGGCGYFWLPQAPEGYKPVGFLVTNKPVKPRLDE 183

Query: 183 VRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYEDT 242
           VRCVR+DL D C+ YRL+   +S+  NF  ++WSTR  HRG+ G+GVPVGTF C S    
Sbjct: 184 VRCVRSDLMDECDAYRLLLTCNSRYMNFSFRVWSTRPRHRGVTGKGVPVGTFFCSSSWSA 243

Query: 243 EKELPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVLLH 302
            ++L I CLKNLD TL  MPNL+QIHALINHYGPTVFFHP+E+YLPSSVSWFFENG LL+
Sbjct: 244 GEDLCIGCLKNLDPTLPAMPNLDQIHALINHYGPTVFFHPEEVYLPSSVSWFFENGALLY 303

Query: 303 RDGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDGCRDKIIYGNLESAKLYVHVKPALGG 362
           R GI+ GE I  CG NLPGGG ND   W+DLP D  R+ +  GNLESAKLY H KPALGG
Sbjct: 304 RAGITVGEPIDACGSNLPGGGTNDGEFWIDLPRDKRRESVKRGNLESAKLYAHAKPALGG 363

Query: 363 TFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIYFSQH 422
           TFTDIAMWVFCPFNGP+TLK+G+MNI+L KIG+H+GDWEH TLRICNFTGEL S+YFSQH
Sbjct: 364 TFTDIAMWVFCPFNGPATLKVGLMNIALSKIGEHIGDWEHFTLRICNFTGELWSMYFSQH 423

Query: 423 SGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARSHLFVD 482
           SGGEWVDAYNLEFIEGNKAIVYSSKSGHAS+PHPG YIQGS+ LG+GIRND ARS+L VD
Sbjct: 424 SGGEWVDAYNLEFIEGNKAIVYSSKSGHASFPHPGTYIQGSSTLGVGIRNDAARSNLLVD 483

Query: 483 SSIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKIRCTVANI 542
           SS  YE+VAAEYL    + EPCWLQ+MR WGPTIVY SRT+LD +I+RLP+ ++ +V N 
Sbjct: 484 SSKQYELVAAEYLGDGVVTEPCWLQYMRLWGPTIVYDSRTELDKMINRLPMMVKYSVVNW 543

Query: 543 FRMLPGELFGEGGPTGPKEKNNWEGDERG 572
           F  +P EL  + GPTGPKEK+ W GDERG
Sbjct: 544 FAKVPVELSRQEGPTGPKEKDYWLGDERG 571

BLAST of ClCG06G001420 vs. TrEMBL
Match: A0A061DQE8_THECC (Vacuolar protein sorting-associated protein YPR157W OS=Theobroma cacao GN=TCM_003907 PE=4 SV=1)

HSP 1 Score: 789.6 bits (2038), Expect = 2.5e-225
Identity = 377/568 (66.37%), Positives = 438/568 (77.11%), Query Frame = 1

Query: 4   GCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIWG 63
           GC  F+WN +  LLP  EP+ FSLP+P P+WPQG GFASG  +LGE+EV KI++F  IW 
Sbjct: 3   GCKCFYWNKMDQLLPC-EPETFSLPAPLPQWPQGQGFASGKINLGELEVVKISRFEFIWS 62

Query: 64  CNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISNI 123
            NL R    GVTFY P+ IP+GF+ LGHYCQ ND+PL GY+L ARE   +  E+ H S  
Sbjct: 63  SNLLRDKKKGVTFYEPVGIPDGFYSLGHYCQSNDQPLRGYVLVAREKP-FKSEAAHFSAC 122

Query: 124 VKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEVGEV 183
           V  PAL EPLD +L+WS +  SEE    C + WLPQPP+GYKSMGY VTN  +KP++ +V
Sbjct: 123 VSSPALREPLDCSLVWSSNGRSEESLEGCGFFWLPQPPEGYKSMGYLVTNTPKKPKLDKV 182

Query: 184 RCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYEDTE 243
           RCVRADLTDRCE Y+++ N   +   F  Q+WSTR  HRGMLGRGV VGTF CGS+    
Sbjct: 183 RCVRADLTDRCENYQVVHNGHMRFSEFPFQVWSTRPSHRGMLGRGVSVGTFSCGSFWTPG 242

Query: 244 KELPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVLLHR 303
           +EL IACLKN D TL  MPN +QIHALINHYGPTVFFHP EIYLPSSVSWFF+NG LL +
Sbjct: 243 QELSIACLKNSDPTLHAMPNCDQIHALINHYGPTVFFHPDEIYLPSSVSWFFKNGALLFK 302

Query: 304 DGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDGCRDKIIYGNLESAKLYVHVKPALGGT 363
            G   GE I V G NLP GGRND   W+DLP+   ++ +  GNL SAKLYVHVKPALGGT
Sbjct: 303 KGDLEGECIDVNGSNLPSGGRNDGEFWIDLPSGDRKNNVKLGNLGSAKLYVHVKPALGGT 362

Query: 364 FTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIYFSQHS 423
           FTDIAMW+FCPFNGP+TLK+GIM+I+L KIGQHVGDWEH TLR+CNFTGEL SIYFSQHS
Sbjct: 363 FTDIAMWIFCPFNGPATLKIGIMDIALSKIGQHVGDWEHFTLRLCNFTGELWSIYFSQHS 422

Query: 424 GGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARSHLFVDS 483
           GG WV+AY+LE+++GN AIVYSSKSGHASYPHPG YIQGS+ LGIGIRND   S+ +VDS
Sbjct: 423 GGVWVNAYDLEYVQGNTAIVYSSKSGHASYPHPGAYIQGSSKLGIGIRNDAVSSNFYVDS 482

Query: 484 SIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKIRCTVANIF 543
           S HYE+VAAEYL    I EP WLQFMREWGPTIVY SRT+LD  I+ LP+ +R +V NIF
Sbjct: 483 STHYELVAAEYLGDGVIAEPGWLQFMREWGPTIVYDSRTELDKFINILPVMLRYSVENIF 542

Query: 544 RMLPGELFGEGGPTGPKEKNNWEGDERG 572
             LP EL+GE GPTGPKEKNNW GDERG
Sbjct: 543 YKLPVELYGEEGPTGPKEKNNWVGDERG 568

BLAST of ClCG06G001420 vs. TrEMBL
Match: M5VIR5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003532mg PE=4 SV=1)

HSP 1 Score: 789.6 bits (2038), Expect = 2.5e-225
Identity = 370/567 (65.26%), Positives = 437/567 (77.07%), Query Frame = 1

Query: 4   GCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIWG 63
           GC  F+W  +  L P  EP+ FSLP P P+WP G GFASG  SLGEIEV KI +F  IW 
Sbjct: 3   GCKCFYWKKLSDLFPP-EPEPFSLPDPIPQWPPGEGFASGKVSLGEIEVFKINRFEFIWT 62

Query: 64  CNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISNI 123
           C+L       VTFY+P  IP+GFH +GHYCQ ND+PLHG++L  RE D    E+  +   
Sbjct: 63  CSLPEDKKKCVTFYKPAGIPDGFHSIGHYCQSNDKPLHGFVLVVREAD--MPETADVLER 122

Query: 124 VKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEVGEV 183
           VK PAL +PLDYTL+WSPDDG+EE Y  C Y WLPQPP+GYK+MG+ VTNK +KP + EV
Sbjct: 123 VKSPALSKPLDYTLVWSPDDGNEEIYGACGYFWLPQPPEGYKAMGFLVTNKPDKPGLDEV 182

Query: 184 RCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYEDTE 243
           RCVRADLTDRCETY L+ N  +   N   Q+W+TR  HRGM+G+GV VGTF C +     
Sbjct: 183 RCVRADLTDRCETYTLILNAITTSLNLPFQVWTTRPHHRGMMGKGVSVGTFFCSNDLGIV 242

Query: 244 KELPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVLLHR 303
           K+L I CLKNL+  LS MPNL+QIH+LINHYGPTVFFHP+E+YLPSSVSWFF++G LL++
Sbjct: 243 KDLHIRCLKNLNPKLSGMPNLDQIHSLINHYGPTVFFHPEEVYLPSSVSWFFKSGALLYK 302

Query: 304 DGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDGCRDKIIYGNLESAKLYVHVKPALGGT 363
            G S GE I   G NLP GG ND   W+DLP D  R+ I +GNLESAKLYVHVKPALGG 
Sbjct: 303 SGTSVGEAIDGSGSNLPSGGANDGQFWIDLPNDDRREIITHGNLESAKLYVHVKPALGGI 362

Query: 364 FTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIYFSQHS 423
           F+DIAMWVFCPFNGP+T+K+G ++I L KIGQHVGDWEH TLRICNF+GEL SIYFSQHS
Sbjct: 363 FSDIAMWVFCPFNGPATIKVGPLDIPLSKIGQHVGDWEHFTLRICNFSGELWSIYFSQHS 422

Query: 424 GGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARSHLFVDS 483
           GG+WVDAY+LE+IEGN+AIVYSSKSGHASYPHPG Y+QGS  LG+GIRND A S+L VDS
Sbjct: 423 GGKWVDAYDLEYIEGNRAIVYSSKSGHASYPHPGTYLQGSDKLGVGIRNDAACSNLSVDS 482

Query: 484 SIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKIRCTVANIF 543
           S+HYE+V+AEYL    + EPCWL FMREWGPTIVY+SRT+LD +I  LP+  + +V NIF
Sbjct: 483 SVHYELVSAEYLGDGVVTEPCWLNFMREWGPTIVYNSRTELDKVISLLPVMFKYSVENIF 542

Query: 544 RMLPGELFGEGGPTGPKEKNNWEGDER 571
              P EL+GE GPTGPKEKNNW GDER
Sbjct: 543 SKFPVELYGEEGPTGPKEKNNWVGDER 566

BLAST of ClCG06G001420 vs. TrEMBL
Match: A0A067FAX8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g045952mg PE=4 SV=1)

HSP 1 Score: 781.9 bits (2018), Expect = 5.1e-223
Identity = 369/568 (64.96%), Positives = 434/568 (76.41%), Query Frame = 1

Query: 4   GCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIWG 63
           GC  F+WN V+ + P+ EP  FSLP+P P WPQG GFASG  +LGEIEV +I++F  IW 
Sbjct: 3   GCKCFYWNEVNNMSPT-EPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWS 62

Query: 64  CNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISNI 123
           CNL +      TFY P  IP+GF+ LGHYCQ + RPL G++L AR++     E  H SN+
Sbjct: 63  CNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNL 122

Query: 124 VKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEVGEV 183
            K PAL +PLDYTL+W  D+G +  Y  CA+ WLPQPPDGYKSMG+ VT    KPE+ EV
Sbjct: 123 FKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEV 182

Query: 184 RCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYEDTE 243
           RCVR DLTD+CE + L+F+  SK  +    +WSTR C+RGMLGRGV VGTF C S   + 
Sbjct: 183 RCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISG 242

Query: 244 KELPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVLLHR 303
           +EL IACLKNLD  L  MPN +QIHALI +YGPTVFFHP E+YLPSSVSWFF NG LL++
Sbjct: 243 QELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYK 302

Query: 304 DGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDGCRDKIIYGNLESAKLYVHVKPALGGT 363
            G   GE I   G NLP GGRND   W+DLP+DG R  + +GN+ESAKLYVHVKPA+GGT
Sbjct: 303 AGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGT 362

Query: 364 FTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIYFSQHS 423
           FTDI MWVFCPFNGP TLK+GIMN++  KIGQHVGDWEH TLRICNFTGEL SIYFSQHS
Sbjct: 363 FTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS 422

Query: 424 GGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARSHLFVDS 483
           GG+WV AY+LE+IEGNKAIVYSSK+GHAS+PHPG Y+QGS +LGIG+RND ARS+L+VDS
Sbjct: 423 GGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDS 482

Query: 484 SIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKIRCTVANIF 543
           SI YE+VAAEYL    + EP WLQFMR+WGPTIVY S+T+LD II  LPL IR +V N  
Sbjct: 483 SIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAV 542

Query: 544 RMLPGELFGEGGPTGPKEKNNWEGDERG 572
             LP EL+GE GPTGPKEKNNW GDERG
Sbjct: 543 SKLPLELYGEEGPTGPKEKNNWVGDERG 569

BLAST of ClCG06G001420 vs. TAIR10
Match: AT5G43950.1 (AT5G43950.1 Plant protein of unknown function (DUF946))

HSP 1 Score: 671.4 bits (1731), Expect = 4.9e-193
Identity = 332/575 (57.74%), Positives = 406/575 (70.61%), Query Frame = 1

Query: 4   GCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIWG 63
           GC   +WNN+    P  EP+ FSLP+  P+WP G GF  G  +LGE+EV +IT F  +W 
Sbjct: 3   GCKCLYWNNLKEYPPLKEPETFSLPASLPQWPSGQGFGLGRINLGELEVAEITSFEFVWR 62

Query: 64  CNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISNI 123
               R     V+FY+P ++PE FHCLGHYCQ +   L G+LL AR+V           N 
Sbjct: 63  YCSRRDNKKSVSFYKPDKLPEDFHCLGHYCQSDSHLLRGFLLVARQV-----------NK 122

Query: 124 VKLPALVEPLDYTLIWSPDDGSEEKYSEC-AYIWLPQPPDGYKSMGYFVTNKLEKPEVGE 183
              PALV+PLDYTL+WS +D SEE+ SE   Y WLPQPP GYK +GY VT    KPE+ +
Sbjct: 123 SSEPALVQPLDYTLVWSSNDLSEERQSESYGYFWLPQPPQGYKPIGYLVTTSPAKPELDQ 182

Query: 184 VRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYEDT 243
           VRCVRADLTD+CE ++++    S   +  + IW TR   RGM G+GV  GTF C +    
Sbjct: 183 VRCVRADLTDKCEAHKVIITAISDSLSIPMFIWKTRPSDRGMRGKGVSTGTFFCTTQSPE 242

Query: 244 EKELP-IACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVLL 303
           E  L  IACLKNLDS+L  MPN+ QIHA+I HYGP V+FHP E+YLPSSVSWFF+NG LL
Sbjct: 243 EDHLSTIACLKNLDSSLHAMPNIEQIHAMIQHYGPRVYFHPNEVYLPSSVSWFFKNGALL 302

Query: 304 HRDGISS---GETIHVCGKNLPGGGRNDRGCWMDLPTDGC--RDKIIYGNLESAKLYVHV 363
             +  SS    E I   G NLP GG ND+  W+DLP +    R+ I  G+LES+KLYVHV
Sbjct: 303 CSNSNSSVINNEPIDETGSNLPHGGTNDKRYWIDLPINDQQRREFIKRGDLESSKLYVHV 362

Query: 364 KPALGGTFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCS 423
           KPA GGTFTD+A W+FCPFNGP+TLKLG+M++SL K GQHV DWEH T+RI NF+GEL S
Sbjct: 363 KPAFGGTFTDLAFWIFCPFNGPATLKLGLMDLSLAKTGQHVCDWEHFTVRISNFSGELYS 422

Query: 424 IYFSQHSGGEWVDAYNLEFIEG-NKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCA 483
           IYFSQHSGGEW+   NLEF+EG NKA+VYSSK+GHAS+   G+Y+QGSA+LGIGIRND A
Sbjct: 423 IYFSQHSGGEWIKPENLEFVEGSNKAVVYSSKNGHASFSKSGMYLQGSALLGIGIRNDSA 482

Query: 484 RSHLFVDSSIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKI 543
           +S LFVDSS+ YEIVAAEYLR   +VEP WL +MREWGP IVY+SR++++ + +RLP ++
Sbjct: 483 KSDLFVDSSLKYEIVAAEYLR-GAVVEPPWLGYMREWGPKIVYNSRSEIEKLNERLPWRL 542

Query: 544 RCTVANIFRMLPGELFGEGGPTGPKEKNNWEGDER 571
           R  V  + R +P EL GE GPTGPKEKNNW GDER
Sbjct: 543 RSWVDAVLRKIPVELSGEEGPTGPKEKNNWFGDER 565

BLAST of ClCG06G001420 vs. TAIR10
Match: AT3G04350.1 (AT3G04350.1 Plant protein of unknown function (DUF946))

HSP 1 Score: 665.2 bits (1715), Expect = 3.5e-191
Identity = 319/575 (55.48%), Positives = 406/575 (70.61%), Query Frame = 1

Query: 4   GCNWFHWNNVHYLLPSD--EPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSI 63
           GC+ F+W+     L S+  EP  FSLP+P P WPQG GFA+G  SLGEIEV KIT+F  +
Sbjct: 3   GCDCFYWSRGISELDSESSEPKPFSLPAPLPSWPQGKGFATGRISLGEIEVVKITKFHRV 62

Query: 64  WGCNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHIS 123
           W  + +   +   TFYR   IPEGFHCLGHYCQP D+PL GY+L AR        +    
Sbjct: 63  WSSDSSHDKSKRATFYRADDIPEGFHCLGHYCQPTDQPLRGYVLAAR--------TSKAV 122

Query: 124 NIVKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEVG 183
           N    P L +P+ Y+L+WS D  SE+      Y WLP PP GY++MG  VT++  +PE  
Sbjct: 123 NADDFPPLKKPVSYSLVWSAD--SEKNGG--GYFWLPNPPVGYRAMGVIVTHEPGEPETE 182

Query: 184 EVRCVRADLTDRCETYRLMFNISSKCKN----FLVQIWSTRACHRGMLGRGVPVGTFHCG 243
           EVRCVR DLT+ CET  ++  + S  K+        +WSTR C RGML +GV VG+F C 
Sbjct: 183 EVRCVREDLTESCETSEMILEVGSSKKSNGSSSPFSVWSTRPCERGMLSQGVAVGSFFCC 242

Query: 244 SYE-DTEKELP-IACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFF 303
           +Y+  +E+ +P I CLKNLD TL  MPNL+Q+HA+I H+GPTV+FHP+E Y+PSSV WFF
Sbjct: 243 TYDLSSERTVPDIGCLKNLDPTLHAMPNLDQVHAVIEHFGPTVYFHPEEAYMPSSVQWFF 302

Query: 304 ENGVLLHRDGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDG-CRDKIIYGNLESAKLYV 363
           +NG LL+R G S G+ I+  G NLP GG ND   W+DLP D   +  +  GNLES++LYV
Sbjct: 303 KNGALLYRSGKSEGQPINSTGSNLPAGGCNDMDFWIDLPEDEEAKSNLKKGNLESSELYV 362

Query: 364 HVKPALGGTFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGEL 423
           HVKPALGGTFTDI MW+FCPFNGP+TLK+G+  + + +IG+HVGDWEH T RICNF+GEL
Sbjct: 363 HVKPALGGTFTDIVMWIFCPFNGPATLKIGLFTLPMTRIGEHVGDWEHFTFRICNFSGEL 422

Query: 424 CSIYFSQHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDC 483
             ++FSQHSGG WVDA ++EF++ NK  VYSSK GHAS+PHPG+Y+QGS+ LGIG+RND 
Sbjct: 423 WQMFFSQHSGGGWVDASDIEFVKDNKPAVYSSKHGHASFPHPGMYLQGSSKLGIGVRNDV 482

Query: 484 ARSHLFVDSSIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLK 543
           A+S   VDSS  Y IVAAEYL    ++EPCWLQ+MREWGPTI Y S ++++ I++ LPL 
Sbjct: 483 AKSKYIVDSSQRYVIVAAEYLGKGAVIEPCWLQYMREWGPTIAYDSGSEINKIMNLLPLV 542

Query: 544 IRCTVANIFRMLPGELFGEGGPTGPKEKNNWEGDE 570
           +R ++ NI  + P  L+GE GPTGPKEK+NWEGDE
Sbjct: 543 VRFSIENIVDLFPIALYGEEGPTGPKEKDNWEGDE 565

BLAST of ClCG06G001420 vs. TAIR10
Match: AT1G04090.1 (AT1G04090.1 Plant protein of unknown function (DUF946))

HSP 1 Score: 659.1 bits (1699), Expect = 2.5e-189
Identity = 326/577 (56.50%), Positives = 404/577 (70.02%), Query Frame = 1

Query: 4   GCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIWG 63
           G    HWNN+  L P  +P+ FSLPS  P WP G GF SG  +LG+++V KIT F  IW 
Sbjct: 3   GYKCLHWNNLIDLPPLKDPETFSLPSSIPHWPPGQGFGSGTINLGKLQVIKITDFEFIWR 62

Query: 64  CNLTRRGNNGVTFYRPLRI-PEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISN 123
              T +  N ++FY+P  + P+ FHCLGHYCQ +  PL GY+L AR++    ++      
Sbjct: 63  YRSTEKKKN-ISFYKPKGLLPKDFHCLGHYCQSDSHPLRGYVLAARDLVDSLEQ------ 122

Query: 124 IVKLPALVEPLDYTLIWSPDDGSEEKYS---ECAYIWLPQPPDGYKSMGYFVTNKLEKPE 183
            V+ PALVEP+D+TL+WS +D +E + S   EC Y WLPQPP+GY+S+G+ VT    KPE
Sbjct: 123 -VEKPALVEPVDFTLVWSSNDSAENECSSKSECGYFWLPQPPEGYRSIGFVVTKTSVKPE 182

Query: 184 VGEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSY 243
           + EVRCVRADLTD CE + ++    S+     + IW TR   RGM G+GV  GTF C + 
Sbjct: 183 LNEVRCVRADLTDICEPHNVIVTAVSESLGVPLFIWRTRPSDRGMWGKGVSAGTFFCRTR 242

Query: 244 EDTEKE---LPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFE 303
               +E   + IACLKNLD +L  MPN++QI ALI HYGPT+ FHP E YLPSSVSWFF+
Sbjct: 243 LVAAREDLGIGIACLKNLDLSLHAMPNVDQIQALIQHYGPTLVFHPGETYLPSSVSWFFK 302

Query: 304 NGVLLHRDGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDGC-RDKIIYGNLESAKLYVH 363
           NG +L   G    E I   G NLP GG ND+  W+DLP D   RD +  GNLES+KLY+H
Sbjct: 303 NGAVLCEKGNPIEEPIDENGSNLPQGGSNDKQFWIDLPCDDQQRDFVKRGNLESSKLYIH 362

Query: 364 VKPALGGTFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELC 423
           +KPALGGTFTD+  W+FCPFNGP+TLKLG+++ISL  IGQHV DWEH TLRI NF+GEL 
Sbjct: 363 IKPALGGTFTDLVFWIFCPFNGPATLKLGLVDISLISIGQHVCDWEHFTLRISNFSGELY 422

Query: 424 SIYFSQHSGGEWVDAYNLEFIEG-NKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDC 483
           SIY SQHSGGEW++AY+LE I G NKA+VYSSK GHAS+P  G Y+QGS MLGIGIRND 
Sbjct: 423 SIYLSQHSGGEWIEAYDLEIIPGSNKAVVYSSKHGHASFPRAGTYLQGSTMLGIGIRNDT 482

Query: 484 ARSHLFVDSSIHYEIVAAEYLRCNGIV-EPCWLQFMREWGPTIVYSSRTKLDNIIDRLPL 543
           ARS L VDSS  YEI+AAEYL  N ++ EP WLQ+MREWGP +VY SR +++ +++R P 
Sbjct: 483 ARSELLVDSSSRYEIIAAEYLSGNSVLAEPPWLQYMREWGPKVVYDSREEIERLVNRFPR 542

Query: 544 KIRCTVANIFRMLPGELFGEGGPTGPKEKNNWEGDER 571
            +R ++A + R LP EL GE GPTGPKEKNNW GDER
Sbjct: 543 TVRVSLATVLRKLPVELSGEEGPTGPKEKNNWYGDER 571

BLAST of ClCG06G001420 vs. TAIR10
Match: AT5G18490.1 (AT5G18490.1 Plant protein of unknown function (DUF946))

HSP 1 Score: 653.7 bits (1685), Expect = 1.1e-187
Identity = 316/572 (55.24%), Positives = 400/572 (69.93%), Query Frame = 1

Query: 5   CNWFHWNNVHYLLPSD--EPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIW 64
           C+ F+WN     L S+  E   FSLPSP P+WPQG GFA+G  SLGEI+V K+T+F  +W
Sbjct: 3   CDCFYWNKGFSELESESSESKPFSLPSPLPQWPQGRGFATGRISLGEIQVVKVTEFDRVW 62

Query: 65  GCNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISN 124
            C  +R      +FY+P+ IPEGFHCLGHYCQPN++PL G++L AR         DH   
Sbjct: 63  KCGTSRGKLRCASFYKPVGIPEGFHCLGHYCQPNNQPLRGFVLAARANKPGHLADDH--- 122

Query: 125 IVKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEVGE 184
               P L +PL+Y+L+WS D       S+C Y WLP PP GY+++G  VT+  E+PEV E
Sbjct: 123 ---RPPLKKPLNYSLVWSSD-------SDC-YFWLPNPPVGYRAVGVIVTDGSEEPEVDE 182

Query: 185 VRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYE-- 244
           VRCVR DLT+ CET   +  + S        +WST+ C RG+  RGV VG+F C + +  
Sbjct: 183 VRCVREDLTESCETGEKVLGVGS------FNVWSTKPCERGIWSRGVEVGSFVCSTNDLS 242

Query: 245 -DTEKELPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGV 304
            D +  + IACLKNLD +L  MPNL+Q+HALI+HYGP V+FHP+E Y+PSSV WFF+NG 
Sbjct: 243 SDNKAAMNIACLKNLDPSLQGMPNLDQVHALIHHYGPMVYFHPEETYMPSSVPWFFKNGA 302

Query: 305 LLHRDGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDG-CRDKIIYGNLESAKLYVHVKP 364
           LLHR G S GE I+  G NLP GG ND   W+DLP D   R  +  GN+ES++LYVHVKP
Sbjct: 303 LLHRFGKSQGEPINSAGSNLPAGGENDGSFWIDLPEDEEVRSNLKKGNIESSELYVHVKP 362

Query: 365 ALGGTFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIY 424
           ALGG FTD+ MW+FCPFNGP+TLK+G++ + + ++G+HVGDWEH T RI NF G+L  ++
Sbjct: 363 ALGGIFTDVVMWIFCPFNGPATLKIGLLTVPMNRLGEHVGDWEHFTFRISNFNGDLTQMF 422

Query: 425 FSQHSGGEWVDAYNLEFIEG-NKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARS 484
           FSQHSGG WVD  +LEF++G NK +VYSSK GHAS+PHPG+Y+QG + LGIG+RND A+S
Sbjct: 423 FSQHSGGGWVDVSDLEFVKGSNKPVVYSSKHGHASFPHPGMYLQGPSKLGIGVRNDVAKS 482

Query: 485 HLFVDSSIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKIRC 544
              VDSS  Y IVAAEYL    + EP WLQFMREWGPTIVY S  +++ IID LPL +R 
Sbjct: 483 KYMVDSSQRYRIVAAEYLGEGAVSEPYWLQFMREWGPTIVYDSAAEINKIIDLLPLILRN 542

Query: 545 TVANIFRMLPGELFGEGGPTGPKEKNNWEGDE 570
           +  ++F   P EL+GE GPTGPKEK+NWEGDE
Sbjct: 543 SFESLF---PIELYGEEGPTGPKEKDNWEGDE 551

BLAST of ClCG06G001420 vs. TAIR10
Match: AT2G44230.1 (AT2G44230.1 Plant protein of unknown function (DUF946))

HSP 1 Score: 453.0 bits (1164), Expect = 2.8e-127
Identity = 240/559 (42.93%), Positives = 324/559 (57.96%), Query Frame = 1

Query: 17  LPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIWGCNLTRRGNNGVTF 76
           LP D    F+LPSP P WP G GFA G   LG +EV ++  F  +W      + N G TF
Sbjct: 14  LPIDST--FNLPSPLPSWPSGEGFAKGRIDLGGLEVSQVDTFNKVWTVYEGGQDNLGATF 73

Query: 77  YRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISNIVKLPALVEPLDYT 136
           + P  +PEGF  LG Y QPN+R L G+ L  +++ G               +L  P+DY 
Sbjct: 74  FEPSSVPEGFSILGFYAQPNNRKLFGWTLVGKDLSG--------------DSLRPPVDYL 133

Query: 137 LIWSPDDGSEEKYS-ECAYIWLPQPPDGYKSMGYFVTNKLEKPEVGEVRCVRADLTDRCE 196
           L+WS      E    E  Y W P PPDGY ++G  VT   EKP + ++RCVR+DLTD+ E
Sbjct: 134 LLWSGKSTKVENNKVETGYFWQPVPPDGYNAVGLIVTTSDEKPPLDKIRCVRSDLTDQSE 193

Query: 197 TYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYEDTEKELPIACLKNLD 256
              L++  +         + S++  +RG    GV VGTF   S         + CLKN +
Sbjct: 194 PDALIWETNG------FSVSSSKPVNRGTQASGVSVGTFFSNSPNPA-----LPCLKNNN 253

Query: 257 STLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVLLHRDGISSGET-IHV 316
              S MP+  QI AL   Y P ++FH  E YLPSSV+WFF NG LL++ G  S    +  
Sbjct: 254 FDFSCMPSKPQIDALFQTYAPWIYFHKDEKYLPSSVNWFFSNGALLYKKGDESNPVPVEP 313

Query: 317 CGKNLPGGGRNDRGCWMDLPT-DGCRDKIIYGNLESAKLYVHVKPALGGTFTDIAMWVFC 376
            G NLP G  ND   W+DLP     R ++  G+L+S ++Y+H+KP  GGTFTDIA+W+F 
Sbjct: 314 NGLNLPQGEFNDGLYWLDLPVASDARKRVQCGDLQSMEVYLHIKPVFGGTFTDIAVWMFY 373

Query: 377 PFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIYFSQHSGGEWVDAYNL 436
           PFNGPS  KL   +I LG+IG+H+GDWEH TLRI NF+G+L  +Y SQHSGG W DA  +
Sbjct: 374 PFNGPSRAKLKAASIPLGRIGEHIGDWEHFTLRISNFSGKLHRMYLSQHSGGSWADASEI 433

Query: 437 EFI-EGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARSHLFVDSSIHYEIVAA 496
           EF   GNK + Y+S +GHA Y  PG+ +QG     +GIRND  +S   +D+++ + +VAA
Sbjct: 434 EFQGGGNKPVAYASLNGHAMYSKPGLVLQGKD--NVGIRNDTGKSEKVIDTAVRFRVVAA 493

Query: 497 EYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPL--KIRCTVANIFRMLPGEL 556
           EY+R   + EP WL +MR WGP I Y    ++   ++++ +   ++ T  +  + LP E+
Sbjct: 494 EYMR-GELEEPAWLNYMRHWGPKIDYGHENEIRG-VEKIMVGESLKTTFRSAIKGLPNEV 541

Query: 557 FGEGGPTGPKEKNNWEGDE 570
           FGE GPTGPK K NW GDE
Sbjct: 554 FGEEGPTGPKLKRNWLGDE 541

BLAST of ClCG06G001420 vs. NCBI nr
Match: gi|659120015|ref|XP_008459966.1| (PREDICTED: uncharacterized protein LOC103498924 [Cucumis melo])

HSP 1 Score: 1148.7 bits (2970), Expect = 0.0e+00
Identity = 532/571 (93.17%), Positives = 549/571 (96.15%), Query Frame = 1

Query: 1   MASGCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVS 60
           MASGC+WF+W+N HYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEV KITQFVS
Sbjct: 1   MASGCSWFNWSNAHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVLKITQFVS 60

Query: 61  IWGCNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHI 120
           IWGCNL+RRGNNG TFYRPLRIPEGFHCLGHYCQPNDRPLHGYLL AREVDGYFQESDHI
Sbjct: 61  IWGCNLSRRGNNGFTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLVAREVDGYFQESDHI 120

Query: 121 SNIVKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEV 180
           SNIVKLPALVEP+D+TLIWSPDDGSEEKY EC YIWLPQPPDGYKSMGYFVTNKLEKPEV
Sbjct: 121 SNIVKLPALVEPIDFTLIWSPDDGSEEKYGECVYIWLPQPPDGYKSMGYFVTNKLEKPEV 180

Query: 181 GEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYE 240
           GEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHC SY+
Sbjct: 181 GEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCCSYK 240

Query: 241 DTEKELPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVL 300
            TEKELPIACLKNLDSTL TMPN+NQIH+LINHYGPTVFFHP+EIYLPSSVSWFFENGVL
Sbjct: 241 GTEKELPIACLKNLDSTLPTMPNINQIHSLINHYGPTVFFHPEEIYLPSSVSWFFENGVL 300

Query: 301 LHRDGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDGCRDKIIYGNLESAKLYVHVKPAL 360
           LHRDG+SSGE IHVCG NLP GGRND  CWMDLPTDGCRDKIIYGNLESAKLYVHVKPAL
Sbjct: 301 LHRDGMSSGEAIHVCGTNLPAGGRNDTVCWMDLPTDGCRDKIIYGNLESAKLYVHVKPAL 360

Query: 361 GGTFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIYFS 420
           GGTFTDIAMWVFCPFNGPSTLKLGI+NISLGKIGQHVGDWEHITLRICNF+GEL SIYFS
Sbjct: 361 GGTFTDIAMWVFCPFNGPSTLKLGIVNISLGKIGQHVGDWEHITLRICNFSGELFSIYFS 420

Query: 421 QHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARSHLF 480
           QHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPG+YIQGS+ LGIGIRNDCARSHLF
Sbjct: 421 QHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGLYIQGSSKLGIGIRNDCARSHLF 480

Query: 481 VDSSIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKIRCTVA 540
           +DSSIHYEIVAAE+LRCN IVEPCWLQFMREWGPTIVYSSRTKLDN IDRLPLKIR TVA
Sbjct: 481 IDSSIHYEIVAAEHLRCNEIVEPCWLQFMREWGPTIVYSSRTKLDNFIDRLPLKIRLTVA 540

Query: 541 NIFRMLPGELFGEGGPTGPKEKNNWEGDERG 572
           NIFR LP ELFGE GPTGPKEKNNWEGDERG
Sbjct: 541 NIFRKLPAELFGEVGPTGPKEKNNWEGDERG 571

BLAST of ClCG06G001420 vs. NCBI nr
Match: gi|449445816|ref|XP_004140668.1| (PREDICTED: uncharacterized protein LOC101209282 [Cucumis sativus])

HSP 1 Score: 1122.5 bits (2902), Expect = 0.0e+00
Identity = 524/571 (91.77%), Positives = 542/571 (94.92%), Query Frame = 1

Query: 1   MASGCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVS 60
           MASGCNWF+W+N HYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEV KITQFVS
Sbjct: 1   MASGCNWFNWSNAHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVLKITQFVS 60

Query: 61  IWGCNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHI 120
           IWGCNL+RRGNNGVTFYRPLR+PEG+HCLGHYCQPNDRPLHGYLL AREVDGYFQESDHI
Sbjct: 61  IWGCNLSRRGNNGVTFYRPLRMPEGYHCLGHYCQPNDRPLHGYLLVAREVDGYFQESDHI 120

Query: 121 SNIVKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEV 180
           SNIVKLPALVEP+D+TLIWSPDDGSEEKY ECAYIWLPQPPDGYKSMGYFVTNKLEKP V
Sbjct: 121 SNIVKLPALVEPIDFTLIWSPDDGSEEKYGECAYIWLPQPPDGYKSMGYFVTNKLEKPVV 180

Query: 181 GEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYE 240
           GEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSY+
Sbjct: 181 GEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYK 240

Query: 241 DTEKELPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVL 300
            TEKELPIACLKNL+STL TMPN++QIH+LINHYGPTVFFHPKEIYLPSSVSWFFENGVL
Sbjct: 241 GTEKELPIACLKNLNSTLPTMPNIDQIHSLINHYGPTVFFHPKEIYLPSSVSWFFENGVL 300

Query: 301 LHRDGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDGCRDKIIYGNLESAKLYVHVKPAL 360
           LHRDG+SSGE I VCG NLP  GRND  CWMDLPTDGCRDKII GNLESAKLY HVKPAL
Sbjct: 301 LHRDGMSSGEAILVCGTNLPTDGRNDTVCWMDLPTDGCRDKIINGNLESAKLYAHVKPAL 360

Query: 361 GGTFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIYFS 420
           GGTFTDIAMWVFCPFNGPSTLKLGI+NISLGKIGQHVGDWEHITLRICNFTGEL SIYFS
Sbjct: 361 GGTFTDIAMWVFCPFNGPSTLKLGIVNISLGKIGQHVGDWEHITLRICNFTGELFSIYFS 420

Query: 421 QHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARSHLF 480
           QHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYP PG+YIQGS+ LGIGIRNDCARSHLF
Sbjct: 421 QHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPRPGLYIQGSSKLGIGIRNDCARSHLF 480

Query: 481 VDSSIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKIRCTVA 540
           +DSS HYEIVAAE+LR N IVEP WLQFMREWGPTIVYSSRTKLDN IDRLPLKIR  VA
Sbjct: 481 IDSSTHYEIVAAEHLRRNDIVEPGWLQFMREWGPTIVYSSRTKLDNFIDRLPLKIRFPVA 540

Query: 541 NIFRMLPGELFGEGGPTGPKEKNNWEGDERG 572
           NIFR LP ELFGE GPTGPKEKNNWEGDERG
Sbjct: 541 NIFRKLPAELFGEVGPTGPKEKNNWEGDERG 571

BLAST of ClCG06G001420 vs. NCBI nr
Match: gi|700191168|gb|KGN46372.1| (hypothetical protein Csa_6G088000 [Cucumis sativus])

HSP 1 Score: 1042.3 bits (2694), Expect = 3.0e-301
Identity = 491/535 (91.78%), Positives = 507/535 (94.77%), Query Frame = 1

Query: 37  GGGFASGIASLGEIEVRKITQFVSIWGCNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPN 96
           GGGFASGIASLGEIEV KITQFVSIWGCNL+RRGNNGVTFYRPLR+PEG+HCLGHYCQPN
Sbjct: 64  GGGFASGIASLGEIEVLKITQFVSIWGCNLSRRGNNGVTFYRPLRMPEGYHCLGHYCQPN 123

Query: 97  DRPLHGYLLTAREVDGYFQESDHISNIVKLPALVEPLDYTLIWSPDDGSEEKYSECAYIW 156
           DRPLHGYLL AREVDGYFQESDHISNIVKLPALVEP+D+TLIWSPDDGSEEKY ECAYIW
Sbjct: 124 DRPLHGYLLVAREVDGYFQESDHISNIVKLPALVEPIDFTLIWSPDDGSEEKYGECAYIW 183

Query: 157 LPQPPDGYKSMGYFVTNKLEKPEVGEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWS 216
           LPQPPDGYKSMGYFVTNKLEKP VGEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWS
Sbjct: 184 LPQPPDGYKSMGYFVTNKLEKPVVGEVRCVRADLTDRCETYRLMFNISSKCKNFLVQIWS 243

Query: 217 TRACHRGMLGRGVPVGTFHCGSYEDTEKELPIACLKNLDSTLSTMPNLNQIHALINHYGP 276
           TRACHRGMLGRGVPVGTFHCGSY+ TEKELPIACLKNL+STL TMPN++QIH+LINHYGP
Sbjct: 244 TRACHRGMLGRGVPVGTFHCGSYKGTEKELPIACLKNLNSTLPTMPNIDQIHSLINHYGP 303

Query: 277 TVFFHPKEIYLPSSVSWFFENGVLLHRDGISSGETIHVCGKNLPGGGRNDRGCWMDLPTD 336
           TVFFHPKEIYLPSSVSWFFENGVLLHRDG+SSGE I VCG NLP  GRND  CWMDLPTD
Sbjct: 304 TVFFHPKEIYLPSSVSWFFENGVLLHRDGMSSGEAILVCGTNLPTDGRNDTVCWMDLPTD 363

Query: 337 GCRDKIIYGNLESAKLYVHVKPALGGTFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQH 396
           GCRDKII GNLESAKLY HVKPALGGTFTDIAMWVFCPFNGPSTLKLGI+NISLGKIGQH
Sbjct: 364 GCRDKIINGNLESAKLYAHVKPALGGTFTDIAMWVFCPFNGPSTLKLGIVNISLGKIGQH 423

Query: 397 VGDWEHITLRICNFTGELCSIYFSQHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHP 456
           VGDWEHITLRICNFTGEL SIYFSQHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYP P
Sbjct: 424 VGDWEHITLRICNFTGELFSIYFSQHSGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPRP 483

Query: 457 GVYIQGSAMLGIGIRNDCARSHLFVDSSIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTI 516
           G+YIQGS+ LGIGIRNDCARSHLF+DSS HYEIVAAE+LR N IVEP WLQFMREWGPTI
Sbjct: 484 GLYIQGSSKLGIGIRNDCARSHLFIDSSTHYEIVAAEHLRRNDIVEPGWLQFMREWGPTI 543

Query: 517 VYSSRTKLDNIIDRLPLKIRCTVANIFRMLPGELFGEGGPTGPKEKNNWEGDERG 572
           VYSSRTKLDN IDRLPLKIR  VANIFR LP ELFGE GPTGPKEKNNWEGDERG
Sbjct: 544 VYSSRTKLDNFIDRLPLKIRFPVANIFRKLPAELFGEVGPTGPKEKNNWEGDERG 598

BLAST of ClCG06G001420 vs. NCBI nr
Match: gi|1009165701|ref|XP_015901183.1| (PREDICTED: uncharacterized protein LOC107434248 [Ziziphus jujuba])

HSP 1 Score: 828.9 bits (2140), Expect = 5.3e-237
Identity = 379/568 (66.73%), Positives = 454/568 (79.93%), Query Frame = 1

Query: 4   GCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIWG 63
           GC  F+WN +  LLP  EPD FSLP+P PEWPQG GFASG  SLGE+EV KIT+F  +WG
Sbjct: 3   GCKCFYWNRLTDLLPM-EPDIFSLPAPLPEWPQGQGFASGKMSLGELEVVKITRFEFVWG 62

Query: 64  CNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISNI 123
           C +++  N GV+FYRP+ IP+GFHCLGHYCQPND+PL GYLL AR +D Y  ++DH+   
Sbjct: 63  CGMSKDKNKGVSFYRPVGIPDGFHCLGHYCQPNDQPLRGYLLAARVIDSYLPDTDHVCAH 122

Query: 124 VKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEVGEV 183
           VK PAL +PLDYTL+WSPDDG+E KYS   Y WLP+  +GYK +G+ VT+K  KP +  +
Sbjct: 123 VKSPALRKPLDYTLVWSPDDGTEGKYSGFGYFWLPRAAEGYKPVGFLVTDKPGKPSLDAI 182

Query: 184 RCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYEDTE 243
           RCVR DLTD+CE YRL+ N+ SK  +F  QIWSTR CHRGM+G+GVPVGTF C S+ +  
Sbjct: 183 RCVRDDLTDKCENYRLILNVGSKFSDFPFQIWSTRPCHRGMMGKGVPVGTFFCSSFWNAG 242

Query: 244 KELPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVLLHR 303
           ++L   CLKNL+ TLS MPNL+QIHALINHYGPTVFFHP+E+Y PSSVSWFF++G LL++
Sbjct: 243 EDLSTRCLKNLNPTLSAMPNLDQIHALINHYGPTVFFHPEEVYFPSSVSWFFKSGALLYK 302

Query: 304 DGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDGCRDKIIYGNLESAKLYVHVKPALGGT 363
            G++ GE I   G NLPGGG ND   W+DLP+D  RDKII+G+L SAKLYVHVKPALGG+
Sbjct: 303 SGMTVGEAIDASGSNLPGGGTNDGQFWIDLPSDDHRDKIIHGHLGSAKLYVHVKPALGGS 362

Query: 364 FTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIYFSQHS 423
           FTDI MWVFCPFNGP+ LK+G M+I+LGKIG+H+GDWEH TLRI NFTGEL SIYFSQHS
Sbjct: 363 FTDIVMWVFCPFNGPANLKIGPMSIALGKIGEHIGDWEHFTLRIWNFTGELWSIYFSQHS 422

Query: 424 GGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARSHLFVDS 483
           GGEWVDAY LE+IEGNKAIVYSSK+GHAS+PHPG YIQGS+  GIGI+ND A S  +VDS
Sbjct: 423 GGEWVDAYKLEYIEGNKAIVYSSKAGHASFPHPGTYIQGSSKFGIGIKNDAAHSSSYVDS 482

Query: 484 SIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKIRCTVANIF 543
           S  YE++AAEYL    + EPCWLQ+MREWGPTIVY SR ++D II +LPL +R +V N+F
Sbjct: 483 SSQYELIAAEYLGEGVVTEPCWLQYMREWGPTIVYDSRAEIDKIIKKLPLMVRYSVENMF 542

Query: 544 RMLPGELFGEGGPTGPKEKNNWEGDERG 572
             LP EL+ E GPTGPKEKNNW GDERG
Sbjct: 543 NKLPVELYKEEGPTGPKEKNNWVGDERG 569

BLAST of ClCG06G001420 vs. NCBI nr
Match: gi|703081868|ref|XP_010091803.1| (hypothetical protein L484_015871 [Morus notabilis])

HSP 1 Score: 806.2 bits (2081), Expect = 3.6e-230
Identity = 376/569 (66.08%), Positives = 441/569 (77.50%), Query Frame = 1

Query: 3   SGCNWFHWNNVHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVRKITQFVSIW 62
           S CN   WN  H      EP+ FSLP+P P+WPQG GFASG  S+GE+EV K+T+F  IW
Sbjct: 4   SSCNCLCWNK-HTDFSLSEPETFSLPAPLPKWPQGKGFASGRISIGELEVFKVTRFEFIW 63

Query: 63  GCNLTRRGNNGVTFYRPLRIPEGFHCLGHYCQPNDRPLHGYLLTAREVDGYFQESDHISN 122
             NL++  N G +FY+P  IP+GFH LGH+CQPN++PL G+LL AREV  +  ES H SN
Sbjct: 64  CYNLSQDKNKGFSFYKPAEIPDGFHSLGHFCQPNNQPLRGFLLVAREVASHMPESCHASN 123

Query: 123 IVKLPALVEPLDYTLIWSPDDGSEEKYSECAYIWLPQPPDGYKSMGYFVTNKLEKPEVGE 182
             KLP L EPLDY L+WSPDD SEEK   C Y WLPQ P+GYK +G+ VTNK  KP + E
Sbjct: 124 TAKLPVLCEPLDYVLVWSPDDWSEEKCGGCGYFWLPQAPEGYKPVGFLVTNKPVKPRLDE 183

Query: 183 VRCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYEDT 242
           VRCVR+DL D C+ YRL+   +S+  NF  ++WSTR  HRG+ G+GVPVGTF C S    
Sbjct: 184 VRCVRSDLMDECDAYRLLLTCNSRYMNFSFRVWSTRPRHRGVTGKGVPVGTFFCSSSWSA 243

Query: 243 EKELPIACLKNLDSTLSTMPNLNQIHALINHYGPTVFFHPKEIYLPSSVSWFFENGVLLH 302
            ++L I CLKNLD TL  MPNL+QIHALINHYGPTVFFHP+E+YLPSSVSWFFENG LL+
Sbjct: 244 GEDLCIGCLKNLDPTLPAMPNLDQIHALINHYGPTVFFHPEEVYLPSSVSWFFENGALLY 303

Query: 303 RDGISSGETIHVCGKNLPGGGRNDRGCWMDLPTDGCRDKIIYGNLESAKLYVHVKPALGG 362
           R GI+ GE I  CG NLPGGG ND   W+DLP D  R+ +  GNLESAKLY H KPALGG
Sbjct: 304 RAGITVGEPIDACGSNLPGGGTNDGEFWIDLPRDKRRESVKRGNLESAKLYAHAKPALGG 363

Query: 363 TFTDIAMWVFCPFNGPSTLKLGIMNISLGKIGQHVGDWEHITLRICNFTGELCSIYFSQH 422
           TFTDIAMWVFCPFNGP+TLK+G+MNI+L KIG+H+GDWEH TLRICNFTGEL S+YFSQH
Sbjct: 364 TFTDIAMWVFCPFNGPATLKVGLMNIALSKIGEHIGDWEHFTLRICNFTGELWSMYFSQH 423

Query: 423 SGGEWVDAYNLEFIEGNKAIVYSSKSGHASYPHPGVYIQGSAMLGIGIRNDCARSHLFVD 482
           SGGEWVDAYNLEFIEGNKAIVYSSKSGHAS+PHPG YIQGS+ LG+GIRND ARS+L VD
Sbjct: 424 SGGEWVDAYNLEFIEGNKAIVYSSKSGHASFPHPGTYIQGSSTLGVGIRNDAARSNLLVD 483

Query: 483 SSIHYEIVAAEYLRCNGIVEPCWLQFMREWGPTIVYSSRTKLDNIIDRLPLKIRCTVANI 542
           SS  YE+VAAEYL    + EPCWLQ+MR WGPTIVY SRT+LD +I+RLP+ ++ +V N 
Sbjct: 484 SSKQYELVAAEYLGDGVVTEPCWLQYMRLWGPTIVYDSRTELDKMINRLPMMVKYSVVNW 543

Query: 543 FRMLPGELFGEGGPTGPKEKNNWEGDERG 572
           F  +P EL  + GPTGPKEK+ W GDERG
Sbjct: 544 FAKVPVELSRQEGPTGPKEKDYWLGDERG 571
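
The percentages on each HSP statistics line follow directly from the residue counts printed beside them (matches divided by alignment length). A minimal Python sketch for re-deriving them is shown here; the function name `check_hsp_stats` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Pattern for an HSP statistics line as printed in this record.
# "Posi?tives" also accepts the "Postives" spelling found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp_stats(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    id_n, id_len, _, pos_n, pos_len, _ = m.groups()
    # Percentage = matching positions / alignment length, to two decimals.
    identity = round(100 * int(id_n) / int(id_len), 2)
    positives = round(100 * int(pos_n) / int(pos_len), 2)
    return identity, positives

# Statistics line of the AT2G44230.1 hit (TAIR10) from this record.
line = "Identity = 240/559 (42.93%), Positives = 324/559 (57.96%), Query Frame = 1"
print(check_hsp_stats(line))
```

The same check can be run on any other hit in this record, for example the Cucumis melo hit with counts 532/571 and 549/571.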

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0KDB1_CUCSA  2.1e-301  91.78     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G088000 PE=4 SV=1
W9QPL3_9ROSA      2.5e-230  66.08     Uncharacterized protein OS=Morus notabilis GN=L484_015871 PE=4 SV=1
A0A061DQE8_THECC  2.5e-225  66.37     Vacuolar protein sorting-associated protein YPR157W OS=Theobroma cacao GN=TCM_00...
M5VIR5_PRUPE      2.5e-225  65.26     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003532mg PE=4 SV=1
A0A067FAX8_CITSI  5.1e-223  64.96     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g045952mg PE=4 SV=1
Match Name   E-value   Identity  Description
AT5G43950.1  4.9e-193  57.74     Plant protein of unknown function (DUF946)
AT3G04350.1  3.5e-191  55.48     Plant protein of unknown function (DUF946)
AT1G04090.1  2.5e-189  56.50     Plant protein of unknown function (DUF946)
AT5G18490.1  1.1e-187  55.24     Plant protein of unknown function (DUF946)
AT2G44230.1  2.8e-127  42.93     Plant protein of unknown function (DUF946)
Match Name                            E-value   Identity  Description
gi|659120015|ref|XP_008459966.1|      0.0e+00   93.17     PREDICTED: uncharacterized protein LOC103498924 [Cucumis melo]
gi|449445816|ref|XP_004140668.1|      0.0e+00   91.77     PREDICTED: uncharacterized protein LOC101209282 [Cucumis sativus]
gi|700191168|gb|KGN46372.1|           3.0e-301  91.78     hypothetical protein Csa_6G088000 [Cucumis sativus]
gi|1009165701|ref|XP_015901183.1|     5.3e-237  66.73     PREDICTED: uncharacterized protein LOC107434248 [Ziziphus jujuba]
gi|703081868|ref|XP_010091803.1|      3.6e-230  66.08     hypothetical protein L484_015871 [Morus notabilis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR009291  Vps62
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006810      transport
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005622      intracellular
cellular_component  GO:0016020      membrane
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0000166      nucleotide binding
molecular_function  GO:0008466      glycogenin glucosyltransferase activity
molecular_function  GO:0016740      transferase activity
molecular_function  GO:0016757      transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG06G001420.1  ClCG06G001420.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description                                 Source   Source Term     Source Description                         Alignment
IPR009291  Vacuolar protein sorting-associated protein 62  PFAM     PF06101         DUF946                                     coord: 22..569; score: 6.4E
None       No IPR available                                PANTHER  PTHR17204       PRE-MRNA PROCESSING PROTEIN PRP39-RELATED  coord: 72..570; score: 1.7E
None       No IPR available                                PANTHER  PTHR17204:SF30  SUBFAMILY NOT NAMED                        coord: 72..570; score: 1.7E