Csa1G470450 (gene) Cucumber (Chinese Long) v2

Name: Csa1G470450
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: F28N24.8 protein; contains IPR007789 (Protein of unknown function DUF688)
Location: Chr1 : 17018531 .. 17021883 (-)
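
The location string can be split into chromosome, coordinates and strand with a few lines of Python. This is only a sketch: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a 'Chr : start .. end (strand)' string into its parts (hypothetical helper)."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 17018531 .. 17021883 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 3353 bases on the minus strand of Chr1.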
   



The following sequences are available for this feature:

Gene sequence (with intron)

AATAATGATTTAATTAATACAATTGTATAATAGATATAGTTTGTTTTCGAATGTATAAATTATGATATCCAACGGCGAAAAAGGAATAAACTGGTGGAGAGTGATGGAAGAGAAAAGTGAAATGGAGCAAAGGGCCAAGAAAAGTGCACTATCAGTATCAATATCAACTCTGTTCCTCCCTCAACTGTATTTCTCGTGACCACAAGGCGAGCAATGAGCTTTTACGAAAAAGAAAAGTCCAAAACAGAGAGCGCGTGAAAAATTGAATTGAAACCTCTATCATCGGCGTAAAGCAACGAAAATCCATGCTCACAAATTTCCCCTAACTACACGCCATCATTTATGCACACTCTCTCTTTCTTATCTCTTCCACTTACTGGTAAAACCCATCATCTCTTTCTCTTTTGTTCTCTTCTTTTTTCTCTGTTAAATGTGTTCATTCTCTGTCTTTCTTTTCTTAAAATTTGTTTTCTTATGCTCTGTGCACATCGCTTCAATTTCTCTCTCTTTTTTCCCGTTTCTTTTCGCATTTTGATTGACTTTTAGTGAGTTCTGCACCAAATTCTCATCTGGGGTTCGCTTAATTTTGGGGATTTTGATGATTAATTGGTGGCTGACGAATTTTCATTTCTGAATTTCATTACTTGATTACTGATTGTTGTGGTGTCTGATCGTACAAGGGTTAACTTACAACCAACCCAGAACGTACAAGGGTTTATTCAGTTTGGCTTTATTTTCCGGTCGAAAATTGGCGGTGTTCGTGACTTTTATTTACTCTCTTCTTTCTTGCAGCCTTAATTTGGTAGTTAGATTTTCTTCTCTTAGTTCTTGTTTCTTGGGCAAGCAATGGAGGAAAGAAAACTCAATTTTAATGCTCCACTCATGTCTGTGAGGCGATTTTCCAAGGCAGCTAGTTCTATATCTAAAGCGAATGAGAAAAAATCTGAAAATTCCCACTTTAGTAGACGGAGTACCTTTCCGGTTTCTAGACCACAATTCAATTTAGATCAAGTTACAGAACCAGTTGCAGTTCCTTTCTACTGGGAGCAGATTCCAGGAAGAGCTAAGAATGATAGCGGCTCGGCCTCACCTGAGGTTCACCTGCCTCACCCGCCTGAGAGGACTTGTTCTACTCCGAGGCTTTCCTTTGGGACGGCTTTGGATGCTAACAAATATAGCTCAGAAATGGAAGCTTGTCATCAAGATGGGTGTGAATCATCTTCTTCTAATGCCATTGTTGTTAGATTAGAGTCAGCAAAAGCTAGCGGTGGGAGGAGCTTGGCATCTGAGAATGATGATGACGATGACGATGATGATTTTTCTGATGCACGTGAGACATTGTCCCTCACTGGTTCATTCTCTGTTAACAATTGTAGTGTTAGTGGTATAAGTGGATACAATGGCCCCATGGTGAAACCGTCGGGAACTTTCCGAACGGATCCTCAAACTCGAGATTTCATGATGAGTCGCTTCTTGCCTGCAGCCAAGGCAATGGTTTTGGAGCCTGCTAAATATTCCTTAAAGAAGAAACTTGTAGCAGTTGAGCAACCTAGACAAGTTAAGAAGGCTGAGAATAGGAGGATTTCTCCTATTAAACGACTTGAGTCTACCTTGTTACTACAGTATGGCAAAGATGAAGTACATGGAGTAGATGAAGTAGATGAAGAAAGTGACTCTGTGGATGATGAATATGATAATTCAGGTCATATATCAGCTAGAGGTTGTGGTCTAATACCCAACATATGCTTCAAAAACTCTTTGGGCCTTCTTAATCCTGTGCCTGGAATGAGAATCAGGACCGAGGCACCCATGTCTGTCACTAAGAAAGTTGGGGGATCAAGTAGAACACTGCACCATTCATACGGCCAGAAGATGAACAAGGTTTCTGTTTAATGAATTATTTTTCTTCCCATTCACTTGCTATGTATGAAAGCATAATTGCTGATAAATTTTACATCTTACAGCATGCTTGGGATGCTACTTACAAGCAAAAATCAGAA
GCTGCTGTCGGTTCACCCAGGCTGTTGGAGGTGAAAGATAAGTGGACTGGTGAATCGAAACATTTTTCTTCCTCCACCGACTTGCAAATGAAAGGTAGGTCTTCTCCTTTCAGGCATTCGAGGGCTGCTTCTCCCTTCCGGAATGAAGCATCACGGTCTCCTTGCAGAAGGCAGCCATTTGTAGTTCCTAAAGAAGTTGACATTATCTCCAAATCTAAAGGTGATATAGACTGTCACGATACACCGTCCATTCAAGCAACTAATAAAGATGGGGTTGACATGGCAAATATCCTTATGGAGAAGACACTTTACATAGATACCGCTAGTGTCGCTGGAACAAATCCTCCATTTAATTCAGCCATTTTCGACGACAAGAAGAAATCGGAATGTCCTAATGGGAAGAATGAGACAGCATGTGAAATGAGAGTGATGGAAGAAAGTACCACTGAGGAACCTTCATTCCTAGAAATAAAATGTTTGACTATGGTTGAAGAAGGGAGACTGGAGCGTGAAGCTGCAGAATCTAAAATCAAAGATGCTATTGATGATTGCCTCAAAGTAGGACATGGACTTTATGAAGAAGATCACACTGAATATACTAATTTGGGCTCTGCTGATGAAGAAGATTACTCCAAGGCCAATTATCAGCTGGTCAAAGTTGAAGATCCAGCAAGTGTCAAAGTAACTTCTGCAATATCTTCTCAACCTCCGCCTCTACCAAAGTCTCCTTCCGAGTCTTGGCTCTGGCGTACCCTGCCTTCAGTTTCCTCGAAAAAGTTACTGGCAGGATCAAATTTTGGAAACAAGTTGTATCAAAAACCGCAGAGCCCTAGAACATCAGCCAGTACCAAATGGGAAACCATTGTAAAATCTTCGAATTTGTGTCACGATCATGTTCGCTACTCTGAGGTAATGTCATGAACTCTCAAACACACTCTGTGCATCTAGTTGGAGACGGAGAAGAGACTAATCCTTTTCTTTCTTGGTTTGTTTATAGGAATTACTTCCTCGTGTTTCTCAGCACTCAACAACAGAAAATTTCAAGTAGTTCTCTGCAACACTCAGGTGGGTTGATTGGCTTTGTGTGAAAGTTTATACATCAGAATGGTTTTTAAATTTCTTATAGAATCTTTTTCCCCTTCCTTTCCTCTAAAAATTTTGAATAACTTGAGAGGATTTGAGCTGCCCTTTCCTTTTATGTCTTCTGATATTTATGTGAAGACATGCACGGGGTAGCTGAGGAAATACTCATATTCATATTCATAGAATGAATTCTGTATAATGCAGCCTAGGAATCAATATAGCTGTGAGTGCTGTGAGATAATTGCACTGTATATTTGAAGTAATACAAGT

mRNA sequence

ATGGAGGAAAGAAAACTCAATTTTAATGCTCCACTCATGTCTGTGAGGCGATTTTCCAAGGCAGCTAGTTCTATATCTAAAGCGAATGAGAAAAAATCTGAAAATTCCCACTTTAGTAGACGGAGTACCTTTCCGGTTTCTAGACCACAATTCAATTTAGATCAAGTTACAGAACCAGTTGCAGTTCCTTTCTACTGGGAGCAGATTCCAGGAAGAGCTAAGAATGATAGCGGCTCGGCCTCACCTGAGGTTCACCTGCCTCACCCGCCTGAGAGGACTTGTTCTACTCCGAGGCTTTCCTTTGGGACGGCTTTGGATGCTAACAAATATAGCTCAGAAATGGAAGCTTGTCATCAAGATGGGTGTGAATCATCTTCTTCTAATGCCATTGTTGTTAGATTAGAGTCAGCAAAAGCTAGCGGTGGGAGGAGCTTGGCATCTGAGAATGATGATGACGATGACGATGATGATTTTTCTGATGCACGTGAGACATTGTCCCTCACTGGTTCATTCTCTGTTAACAATTGTAGTGTTAGTGGTATAAGTGGATACAATGGCCCCATGGTGAAACCGTCGGGAACTTTCCGAACGGATCCTCAAACTCGAGATTTCATGATGAGTCGCTTCTTGCCTGCAGCCAAGGCAATGGTTTTGGAGCCTGCTAAATATTCCTTAAAGAAGAAACTTGTAGCAGTTGAGCAACCTAGACAAGTTAAGAAGGCTGAGAATAGGAGGATTTCTCCTATTAAACGACTTGAGTCTACCTTGTTACTACAGTATGGCAAAGATGAAGTACATGGAGTAGATGAAGTAGATGAAGAAAGTGACTCTGTGGATGATGAATATGATAATTCAGGTCATATATCAGCTAGAGGTTGTGGTCTAATACCCAACATATGCTTCAAAAACTCTTTGGGCCTTCTTAATCCTGTGCCTGGAATGAGAATCAGGACCGAGGCACCCATGTCTGTCACTAAGAAAGTTGGGGGATCAAGTAGAACACTGCACCATTCATACGGCCAGAAGATGAACAAGCATGCTTGGGATGCTACTTACAAGCAAAAATCAGAAGCTGCTGTCGGTTCACCCAGGCTGTTGGAGGTGAAAGATAAGTGGACTGGTGAATCGAAACATTTTTCTTCCTCCACCGACTTGCAAATGAAAGGTAGGTCTTCTCCTTTCAGGCATTCGAGGGCTGCTTCTCCCTTCCGGAATGAAGCATCACGGTCTCCTTGCAGAAGGCAGCCATTTGTAGTTCCTAAAGAAGTTGACATTATCTCCAAATCTAAAGGTGATATAGACTGTCACGATACACCGTCCATTCAAGCAACTAATAAAGATGGGGTTGACATGGCAAATATCCTTATGGAGAAGACACTTTACATAGATACCGCTAGTGTCGCTGGAACAAATCCTCCATTTAATTCAGCCATTTTCGACGACAAGAAGAAATCGGAATGTCCTAATGGGAAGAATGAGACAGCATGTGAAATGAGAGTGATGGAAGAAAGTACCACTGAGGAACCTTCATTCCTAGAAATAAAATGTTTGACTATGGTTGAAGAAGGGAGACTGGAGCGTGAAGCTGCAGAATCTAAAATCAAAGATGCTATTGATGATTGCCTCAAAGTAGGACATGGACTTTATGAAGAAGATCACACTGAATATACTAATTTGGGCTCTGCTGATGAAGAAGATTACTCCAAGGCCAATTATCAGCTGGTCAAAGTTGAAGATCCAGCAAGTGTCAAAGTAACTTCTGCAATATCTTCTCAACCTCCGCCTCTACCAAAGTCTCCTTCCGAGTCTTGGCTCTGGCGTACCCTGCCTTCAGTTTCCTCGAAAAAGTTACTGGCAGGATCAAATTTTGGAAACAAGTTGTATCAAAAACCGCAGAGCCCTAGAACATCAGCCAGTACCAAATGGGAAACCATTGTAAAATCTTCGAATTTGTGTCACGATCATGTTCGCTACTCTGAGGAATTACTTCCTCGTGTTTC
TCAGCACTCAACAACAGAAAATTTCAAGTAG

Coding sequence (CDS)

ATGGAGGAAAGAAAACTCAATTTTAATGCTCCACTCATGTCTGTGAGGCGATTTTCCAAGGCAGCTAGTTCTATATCTAAAGCGAATGAGAAAAAATCTGAAAATTCCCACTTTAGTAGACGGAGTACCTTTCCGGTTTCTAGACCACAATTCAATTTAGATCAAGTTACAGAACCAGTTGCAGTTCCTTTCTACTGGGAGCAGATTCCAGGAAGAGCTAAGAATGATAGCGGCTCGGCCTCACCTGAGGTTCACCTGCCTCACCCGCCTGAGAGGACTTGTTCTACTCCGAGGCTTTCCTTTGGGACGGCTTTGGATGCTAACAAATATAGCTCAGAAATGGAAGCTTGTCATCAAGATGGGTGTGAATCATCTTCTTCTAATGCCATTGTTGTTAGATTAGAGTCAGCAAAAGCTAGCGGTGGGAGGAGCTTGGCATCTGAGAATGATGATGACGATGACGATGATGATTTTTCTGATGCACGTGAGACATTGTCCCTCACTGGTTCATTCTCTGTTAACAATTGTAGTGTTAGTGGTATAAGTGGATACAATGGCCCCATGGTGAAACCGTCGGGAACTTTCCGAACGGATCCTCAAACTCGAGATTTCATGATGAGTCGCTTCTTGCCTGCAGCCAAGGCAATGGTTTTGGAGCCTGCTAAATATTCCTTAAAGAAGAAACTTGTAGCAGTTGAGCAACCTAGACAAGTTAAGAAGGCTGAGAATAGGAGGATTTCTCCTATTAAACGACTTGAGTCTACCTTGTTACTACAGTATGGCAAAGATGAAGTACATGGAGTAGATGAAGTAGATGAAGAAAGTGACTCTGTGGATGATGAATATGATAATTCAGGTCATATATCAGCTAGAGGTTGTGGTCTAATACCCAACATATGCTTCAAAAACTCTTTGGGCCTTCTTAATCCTGTGCCTGGAATGAGAATCAGGACCGAGGCACCCATGTCTGTCACTAAGAAAGTTGGGGGATCAAGTAGAACACTGCACCATTCATACGGCCAGAAGATGAACAAGCATGCTTGGGATGCTACTTACAAGCAAAAATCAGAAGCTGCTGTCGGTTCACCCAGGCTGTTGGAGGTGAAAGATAAGTGGACTGGTGAATCGAAACATTTTTCTTCCTCCACCGACTTGCAAATGAAAGGTAGGTCTTCTCCTTTCAGGCATTCGAGGGCTGCTTCTCCCTTCCGGAATGAAGCATCACGGTCTCCTTGCAGAAGGCAGCCATTTGTAGTTCCTAAAGAAGTTGACATTATCTCCAAATCTAAAGGTGATATAGACTGTCACGATACACCGTCCATTCAAGCAACTAATAAAGATGGGGTTGACATGGCAAATATCCTTATGGAGAAGACACTTTACATAGATACCGCTAGTGTCGCTGGAACAAATCCTCCATTTAATTCAGCCATTTTCGACGACAAGAAGAAATCGGAATGTCCTAATGGGAAGAATGAGACAGCATGTGAAATGAGAGTGATGGAAGAAAGTACCACTGAGGAACCTTCATTCCTAGAAATAAAATGTTTGACTATGGTTGAAGAAGGGAGACTGGAGCGTGAAGCTGCAGAATCTAAAATCAAAGATGCTATTGATGATTGCCTCAAAGTAGGACATGGACTTTATGAAGAAGATCACACTGAATATACTAATTTGGGCTCTGCTGATGAAGAAGATTACTCCAAGGCCAATTATCAGCTGGTCAAAGTTGAAGATCCAGCAAGTGTCAAAGTAACTTCTGCAATATCTTCTCAACCTCCGCCTCTACCAAAGTCTCCTTCCGAGTCTTGGCTCTGGCGTACCCTGCCTTCAGTTTCCTCGAAAAAGTTACTGGCAGGATCAAATTTTGGAAACAAGTTGTATCAAAAACCGCAGAGCCCTAGAACATCAGCCAGTACCAAATGGGAAACCATTGTAAAATCTTCGAATTTGTGTCACGATCATGTTCGCTACTCTGAGGAATTACTTCCTCGTGTTTC
TCAGCACTCAACAACAGAAAATTTCAAGTAG

Protein sequence

MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPVAVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQDGCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSGISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKKAENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPNICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEAAVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFVVPKEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAIFDDKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAIDDCLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRYSEELLPRVSQHSTTENFK*
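
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first and last few codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping after the first stop codon ('*')."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an unrecognised codon
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)

# First 11 codons and last 5 codons of the CDS shown above.
cds_start = translate("ATGGAGGAAAGAAAACTCAATTTTAATGCTCCA")
cds_end = translate("GAAAATTTCAAGTAG")
print(cds_start, cds_end)
```

The two fragments translate to `MEERKLNFNAP` and `ENFK*`, which match the start and end of the protein sequence listed above.
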
BLAST of Csa1G470450 vs. TrEMBL
Match: A0A0A0LX77_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G470450 PE=4 SV=1)

HSP 1 Score: 1362.4 bits (3525), Expect = 0.0e+00
Identity = 676/676 (100.00%), Positives = 676/676 (100.00%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV
Sbjct: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60

Query: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD 120
           AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD
Sbjct: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD 120

Query: 121 GCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG 180
           GCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG
Sbjct: 121 GCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG 180

Query: 181 ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240
           ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK
Sbjct: 181 ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240

Query: 241 AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPNIC 300
           AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPNIC
Sbjct: 241 AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPNIC 300

Query: 301 FKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEAAV 360
           FKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEAAV
Sbjct: 301 FKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEAAV 360

Query: 361 GSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFVVP 420
           GSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFVVP
Sbjct: 361 GSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFVVP 420

Query: 421 KEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAIFD 480
           KEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAIFD
Sbjct: 421 KEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAIFD 480

Query: 481 DKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAIDD 540
           DKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAIDD
Sbjct: 481 DKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAIDD 540

Query: 541 CLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPKSP 600
           CLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPKSP
Sbjct: 541 CLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPKSP 600

Query: 601 SESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRYSE 660
           SESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRYSE
Sbjct: 601 SESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRYSE 660

Query: 661 ELLPRVSQHSTTENFK 677
           ELLPRVSQHSTTENFK
Sbjct: 661 ELLPRVSQHSTTENFK 676
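
The percentages on each HSP summary line are the match counts divided by the alignment length. The sketch below parses such a line and recomputes them; `HSP_RE` and `parse_hsp` are hypothetical names, and the pattern is only an assumption about how these summary lines are laid out in this report.

```python
import re

# Assumed layout of the summary line; accepts "Positives" or the misspelling "Postives".
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp(line):
    """Recompute identity and positives percentages from an HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported": (float(ident_pct), float(pos_pct)),
    }

hsp = parse_hsp("Identity = 288/685 (42.04%), Positives = 393/685 (57.37%), Query Frame = 1")
print(hsp)
```

For the example line, the recomputed values agree with the reported 42.04% identity and 57.37% positives.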

BLAST of Csa1G470450 vs. TrEMBL
Match: A0A067JBF4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21573 PE=4 SV=1)

HSP 1 Score: 436.0 bits (1120), Expect = 8.2e-119
Identity = 288/685 (42.04%), Positives = 393/685 (57.37%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRR S A    +    KK EN+   +R+T P  +  FNLDQVTEPV
Sbjct: 1   MEERKLNFNAPLMSVRRSSTATKPSNVTKGKKFENAQLVKRNTLPSYKSDFNLDQVTEPV 60

Query: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKY----SSEMEA 120
           AVPF+WEQIPGR K+ S    P+   P   E    TPR +   ALD  K+      E + 
Sbjct: 61  AVPFHWEQIPGRRKDGS---KPD---PRGCEEASVTPRFTPRRALDVVKHIEDKKPEDQV 120

Query: 121 CHQDGCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNC 180
             +   +S+S N I   L+ +K          +++DDDDD +SDAR+TLS   SFSV +C
Sbjct: 121 AFRPQIQSNSFNDIANGLDCSKEGVNEKSDFNSENDDDDDLYSDARDTLSGMDSFSV-DC 180

Query: 181 SVSGISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPR 240
           SVSG+SG++   VKPSGTF  DPQTRDFMMSRFLPAAKAM LE  +Y+ +K+ V+ EQPR
Sbjct: 181 SVSGVSGFDSLAVKPSGTFNADPQTRDFMMSRFLPAAKAMTLEAPQYASRKQPVSGEQPR 240

Query: 241 QVKKAENR-RISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGL 300
           Q+ +   R R  P+ R ES  +  Y +      D VDEES+   D+Y N G I  +GCGL
Sbjct: 241 QIVQVVQRDRTPPVNRKESFNVPSYHQ------DLVDEESEDECDQYVNYGKIMTKGCGL 300

Query: 301 IPNICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLH-HSYGQKMNKHAWDATYKQ 360
           +P +C KNSL L+NPVPGM++R ++PMS  + +   +++++  S    +NK A D  +K+
Sbjct: 301 LPLLCVKNSLRLVNPVPGMKVRNQSPMSAARDIKRMTKSVYSRSQSPTINKPAKDPVHKK 360

Query: 361 KSEAAVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRR 420
           + +  V SPRL+ V +K TG S  F+ + D QM  R+SPFR S A SP+RNEA +SP   
Sbjct: 361 EPDNEVQSPRLVGVDNKLTGGSNRFTYARDRQMISRTSPFRRSGAISPYRNEAPQSPFPI 420

Query: 421 QPFV-VPKEVDIISKSKGDI--DCHDTPSIQATN---KDGVDMANILMEKTLYIDTASVA 480
             F+ VPK+++    +K ++   C+            + G    +   EKTLY+DT +VA
Sbjct: 421 GGFLGVPKDLENFKANKLNLYGKCYSKSQELVPYHGLRHGSRPLSPTTEKTLYVDTVNVA 480

Query: 481 GTNPPFNSAIFDDKKKSECPNGKN-ETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLER 540
           G     N+   D KK    P  K+ ++    R ++E+ T E +  ++  L   E+   + 
Sbjct: 481 GLLCS-NAGSSDIKKGGMGPAEKDIKSLLSSREIQETYTIESTSKDVTSLNFPEQKSGDA 540

Query: 541 EAAESKIKDAIDDCLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTS 600
           + +         D    G  L +E       + +  E + +  N Q+  + D  + K   
Sbjct: 541 DLSLLSDMSTHRDQWDTGEDLSQES-LALVCVSTTTEGNLNIENDQISNM-DIGNAKTGF 600

Query: 601 AISSQPPPLPKSPSESWLWRTLPSVSSKK----LLAGSNFGNKLYQKPQSPRTSASTKWE 660
           A  S PP LPK+PSESWL RTLP+VSS+     L  G+NF +K   +  S  TS STKWE
Sbjct: 601 AQCSLPPSLPKTPSESWLSRTLPTVSSQNPSSHLYRGTNFRSK---RQDSKTTSTSTKWE 660

Query: 661 TIVKSSNLCHDHVRYSEELLPRVSQ 669
            IVKSS L +DHVRYSEEL P  SQ
Sbjct: 661 NIVKSSYLHNDHVRYSEELFPHASQ 666

BLAST of Csa1G470450 vs. TrEMBL
Match: A0A061GLK7_THECC (Transcription initiation factor TFIID subunit 11, putative OS=Theobroma cacao GN=TCM_029685 PE=4 SV=1)

HSP 1 Score: 423.7 bits (1088), Expect = 4.2e-115
Identity = 293/730 (40.14%), Positives = 399/730 (54.66%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60
           MEERKLNFNAPL+SVRRFS  ++   +  +K  EN   +RR T P      +LDQVTEPV
Sbjct: 1   MEERKLNFNAPLLSVRRFSATSAFSDRDKQKIVENPCPNRRHTLPFYNSDVSLDQVTEPV 60

Query: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD 120
           AVPF WEQIPG+AK          H   P +    TPRL  G  LD  KY+ E E  +Q+
Sbjct: 61  AVPFVWEQIPGKAKGGIE------HESQPNKEASGTPRLPPGRVLDIMKYTVEKEFENQN 120

Query: 121 ----GCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNC 180
                 E  S N  V +L+S+         SE++ DDD   +SDA +TLS T S S+N C
Sbjct: 121 VVRPQSEIYSLNDNVTKLDSSNKGINEKCISESETDDDA--YSDALDTLSPTDSLSMN-C 180

Query: 181 SVSGISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPR 240
           S+SG+SG +G + KPSGTF +DPQTRDFMMSRFLPAAKAM LE  +Y+ +K+ VA   PR
Sbjct: 181 SISGLSGSSGLVAKPSGTFSSDPQTRDFMMSRFLPAAKAMTLEMPQYASRKQSVAPALPR 240

Query: 241 QVKK-AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGL 300
           + KK     R  P+ + ES ++  Y +D    VD   EE++   D+Y++SG++S + CGL
Sbjct: 241 EDKKVVVGDRKPPVNQYESVIIPHYNQD----VD--GEETEDEYDDYEDSGNLSRKACGL 300

Query: 301 IPNICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSR-TLHHSYGQKMNKHAWDATYKQ 360
           +P + FKNSL LLNPVPG+++RT + M  T++V   S+ T   S+ Q + KHAWDA +K 
Sbjct: 301 LPRLSFKNSLCLLNPVPGLKVRTHSSMPSTREVAKPSKATYMKSHSQIIEKHAWDAVHKN 360

Query: 361 KSEA----------------------------AVGSPRLLEVKDKWTGESKHFSSSTDLQ 420
           KS++                             V SPRL E+  K T  S  F++S D Q
Sbjct: 361 KSDSGVQSPQPQENKSDTGVQSPRLPENKLSGGVQSPRLPEIGKKMTCGSNQFTNSGDQQ 420

Query: 421 MKGRSSPFR--HSRAASPFRNEASRSPCRRQPFV-VPKEVD------IISKSKGDIDCHD 480
           +  RS P R   S   SP+R E  +SP R   F+ +PKE +      +I  +K + +  +
Sbjct: 421 IVNRSPPKRLPGSARISPYRRERPQSPFRGGGFLGMPKEAEKFNANMLIKYTKSNNNSQE 480

Query: 481 TPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAIFDDKKKSECPNGKNETACE 540
               Q+T + G    +  +EKTLY+DT + A      NS   D K   +     ++T   
Sbjct: 481 LVPYQST-RQGSGALSPAVEKTLYVDTVNFAEI-ASSNSDSSDTKAPMDSMGKHSDTLLV 540

Query: 541 MRVMEESTTEEPSFLEIKCLTMVEEGRLER-------EAAESKIKDAIDDCLKVGHGL-- 600
            R++EES T E S  +IKCL +++   + +        ++ S   D  D  LK    +  
Sbjct: 541 NRMLEESATVESSLQDIKCLNLLDGKDISKYEITGSVYSSRSSFSDKPD--LKGQAEMMD 600

Query: 601 -YEEDHTEYTNLG----SADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPKSPSES 660
            + ++     +LG     AD      AN   V+  D       S  S  PPPLPK+PSES
Sbjct: 601 CFRQNGGLNKSLGRIKVRADRSLTLSANGD-VREADQEENNAGSDCSPLPPPLPKTPSES 660

Query: 661 WLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSAS-TKWETIVKSSNLCHDHVRYSEEL 673
           WLW  LPSV+S+   + S  G + Y K + P+ SA+ TKWETIVK+S L HDHVRYSEEL
Sbjct: 661 WLWCALPSVTSRNSFSQSYNGTRFYPKKEEPKVSATDTKWETIVKTSYLHHDHVRYSEEL 710

BLAST of Csa1G470450 vs. TrEMBL
Match: A0A067GDT4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006045mg PE=4 SV=1)

HSP 1 Score: 416.0 bits (1068), Expect = 8.8e-113
Identity = 277/682 (40.62%), Positives = 381/682 (55.87%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKA-NEKKSENSHFSRRSTFPVSRPQFNLDQVTEP 60
           M+ERKLNFNAPL+SVRR+S  A + S   N K  E S  SRR + P  R   NL+QVTEP
Sbjct: 1   MDERKLNFNAPLLSVRRYSTTAVASSDGENGKMVEISASSRRYSIPFYRTDLNLEQVTEP 60

Query: 61  VAVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQ 120
            AVPF WEQIPGR K+      PE  L H  +    TPRL    ALD  KY    E    
Sbjct: 61  AAVPFMWEQIPGRPKD----GGPE--LQHSEDAPV-TPRLPPLKALDIIKYPLAKEFDDS 120

Query: 121 DGCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVS 180
              ES S N  +  L+S   +       + D D+DDD +SDA +TLS T S+S+N CS+S
Sbjct: 121 PRVESRSLNENMCTLDSPNEAN--DWKQQLDTDNDDDVYSDALDTLSSTDSYSIN-CSLS 180

Query: 181 GISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVK 240
           G+SG +G +VK SGTF TDPQTRDFMM RFLPAAKAM LEP +Y+ +K+ V +EQPRQV 
Sbjct: 181 GLSGSDGQVVKRSGTFSTDPQTRDFMMRRFLPAAKAMALEPPQYASRKQPVTIEQPRQVI 240

Query: 241 KAENRRISPIKRLESTLLLQYGKD-EVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPN 300
           K  +    P+   +S  +  YG+D E    +E +EE++   DEYD+S ++S + CGL+P 
Sbjct: 241 KVVSEDRRPLVN-KSIFIPHYGEDVEEEEEEEEEEETEDEVDEYDDSDNLSGKACGLLPR 300

Query: 301 ICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLH-HSYGQKMNKHAWDATYKQKSE 360
           +C   SL LLNP+PG++ RT + +S +  V    +  +  S  Q + KH  DA YK ++E
Sbjct: 301 LCLNKSLCLLNPMPGLKARTHSSVSSSSDVRNLGKAAYTESRNQTVKKHVRDAVYKHQAE 360

Query: 361 AAVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPF 420
           + V SP+LL +++K T  S  F+  +D QM GRSSP+R  R  SP+RNE  +SP R   F
Sbjct: 361 SGVQSPKLLGIENKMTCGSNRFACLSDQQMAGRSSPYR--RGISPYRNERPQSPFRGGGF 420

Query: 421 V-VPKEVDIISKSKGDIDCHDTPSIQATN-------KDGVDMANILMEKTLYIDTASVAG 480
           + VPKE + +  +K  ++ ++    ++         K      +  +EKTLY+DT + + 
Sbjct: 421 LGVPKEAENVRANK--LNPYNRAGSKSQELFPHHSFKKRFGSLSPAVEKTLYVDTVNFSK 480

Query: 481 TNPPFNSAIFDDKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREA 540
            +          + KSE   G   TA    V +ES +E      I+        ++   A
Sbjct: 481 ISDTMG------QMKSE---GIERTASVDTVKDESRSETKVSASIEASRSSSFEKIMHPA 540

Query: 541 AESKIKDAIDDCLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAI 600
            +      ++ CL +   L +E  +      +ADE   S   ++  + +D   +   S  
Sbjct: 541 GQGD----MEQCLGLDGELNQECKSLVCTNVTADETLNSICQHK-SEADDLGCINSGSEQ 600

Query: 601 SSQPPPLPKSPSESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTS-ASTKWETIVKS 660
           S  P PLPK P+ESWLWRTLPSVSS+   +  N G +   K Q P+T   +TKWETIVK+
Sbjct: 601 SLLPLPLPKKPTESWLWRTLPSVSSRNSFSNPNVGTRFNPKKQDPKTPLTTTKWETIVKT 653

Query: 661 SNLCHDHVRYSEELLPRVSQHS 671
           S   HDH+RYSEEL    SQ S
Sbjct: 661 SYAHHDHIRYSEELTSHFSQQS 653

BLAST of Csa1G470450 vs. TrEMBL
Match: V4UF07_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014532mg PE=4 SV=1)

HSP 1 Score: 413.3 bits (1061), Expect = 5.7e-112
Identity = 277/693 (39.97%), Positives = 384/693 (55.41%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKA-NEKKSENSHFSRRSTFPVSRPQFNLDQVTEP 60
           M+ERKLNFNAPL+SVRR+S  A + S   N K  E+S  SRR + P  R   NL+QVTEP
Sbjct: 1   MDERKLNFNAPLLSVRRYSTTAVASSDGENGKMVESSASSRRYSIPFYRTDLNLEQVTEP 60

Query: 61  VAVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQ 120
            AVPF WEQIPGR K+      PE  L H  +    TPRL+   AL+  KY    E    
Sbjct: 61  AAVPFMWEQIPGRPKD----GGPE--LEHSEDAPV-TPRLTPLKALNIIKYPLAKEFDDL 120

Query: 121 DGCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVS 180
              ES S N  +  L+S   +       +   D DDD +SDA +TLS T S+S+N CS+S
Sbjct: 121 PRVESRSLNENMCTLDSPNEANDW----KQQLDTDDDVYSDALDTLSSTDSYSIN-CSLS 180

Query: 181 GISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVK 240
           G+SG +G +VK SGTF TDPQTRDFMM RFLPAAKAM LEP +Y+ +K+ V +EQPRQV 
Sbjct: 181 GLSGSDGQVVKRSGTFSTDPQTRDFMMRRFLPAAKAMALEPPQYASRKQPVTIEQPRQVI 240

Query: 241 KAENRRISPIKRLESTLLLQYGKD-------EVHGVDEVDEESDSVDDEYDNSGHISARG 300
           K  +    P+   +S  +  YG+D       E    +E +EE++   DEYD+SG++S + 
Sbjct: 241 KVVSEDRRPLVN-KSIFIPHYGEDVEEEEEEEEEEEEEEEEETEDEVDEYDDSGNLSRKA 300

Query: 301 CGLIPNICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLH-HSYGQKMNKHAWDAT 360
           CGL+P +C   SL LLNP+PG++ RT + +S +  V    +  +  S  Q + KH  DA 
Sbjct: 301 CGLLPRLCLNKSLCLLNPMPGLKARTHSSVSSSSDVRNLGKAAYTESRNQTVKKHVRDAV 360

Query: 361 YKQKSEAAVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSP 420
           YK ++E+ V SP+LL +++K T  SK F+  +D QM GRSSP+R  R  SP+RNE  +SP
Sbjct: 361 YKHQAESGVQSPKLLGIENKMTCGSKQFACLSDQQMAGRSSPYR--RGISPYRNERPQSP 420

Query: 421 CRRQPFV-VPKEVDIISKSKGDIDCHDTPSIQATN-------KDGVDMANILMEKTLYID 480
            R   F+ VPKE + +  +K  ++ ++    ++         K      +  +EKTLY+D
Sbjct: 421 FRGGGFLGVPKEAENVRANK--LNPYNRAGSKSQELFPHHSFKKRFGSLSPAVEKTLYVD 480

Query: 481 TASVAGTNPPFNSAIFDDKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEG 540
           T + +               K     G+ E+    R+    T ++ S  E K    +E  
Sbjct: 481 TVNFS---------------KISDTMGQMESEGRERIASVDTAKDESRSETKVSVSIE-- 540

Query: 541 RLEREAAESKI-----KDAIDDCLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVE 600
              R ++  KI     +  ++ CL +   L +E  +      +ADE   S   ++  + +
Sbjct: 541 -ASRSSSSEKIMHPAGQGDMEHCLGLHGELNQECKSLVCTNVTADETLNSICQHK-SEAD 600

Query: 601 DPASVKVTSAISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTS- 660
           D   +   S  S  P PLPK P+ESWLWRTLPSVSS+   +  N G +   K Q P+T  
Sbjct: 601 DLGCINSGSEQSPLPLPLPKKPTESWLWRTLPSVSSRNSFSNPNVGTRFNPKKQDPKTPL 657

Query: 661 ASTKWETIVKSSNLCHDHVRYSEELLPRVSQHS 671
            +TKWETIVK+S   HDH+RYSEEL    SQ S
Sbjct: 661 TTTKWETIVKTSYAHHDHIRYSEELTSHFSQQS 657

BLAST of Csa1G470450 vs. TAIR10
Match: AT1G29240.1 (AT1G29240.1 Protein of unknown function (DUF688))

HSP 1 Score: 222.2 bits (565), Expect = 9.4e-58
Identity = 174/535 (32.52%), Positives = 259/535 (48.41%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRS---TFPVSRPQFNL--DQ 60
           MEERKLNF+ PL+S RR  K A    + N+  +   + S+ S   + PV  P      D+
Sbjct: 1   MEERKLNFSVPLLSTRRMQKTAGVSVRRNKSNNFTDYDSKTSECPSVPVLVPYMMGLDDE 60

Query: 61  VTEPVAVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEME 120
           VTEP +VPF WEQ PGR K +     P+V +    E    TP L  G A+DAN       
Sbjct: 61  VTEPASVPFTWEQAPGRLKGND--FKPQVCVLMKEEEQVFTPCLPPGKAVDAN------- 120

Query: 121 ACHQDGCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNN 180
                          + RL+S+K   G+ +  E  DDD+DD FSDA +TLS   SFS NN
Sbjct: 121 ---------------MTRLQSSK---GKQV--EESDDDEDDVFSDALDTLSPKDSFSFNN 180

Query: 181 CSVSGISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKK--LVAVE 240
            S+SG+S Y G   K       D Q+RDFMMSRFLPAAKAM +E + Y+  +K      E
Sbjct: 181 -SISGVSEYGGVETKKP----LDAQSRDFMMSRFLPAAKAMTVEQSHYASNRKPSTFMAE 240

Query: 241 QPRQVKK-AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARG 300
              Q+++     +     R + +++  Y   +   +D+ + +    DDE     ++S RG
Sbjct: 241 PTIQIRELVPGEKQQTPNRYDVSIVPSYYYHQ--DIDDEESKVGEEDDEVSEYAYLSKRG 300

Query: 301 CGLIPNICFKNSLGLLNPVPGMRIRTEAPM-SVTKKVGGSSRTLHHSYG-QKMNKHAWDA 360
           CG++P +CFK+SLG+LN VPG + +  +P+ S +     SS+     Y  Q + K A D+
Sbjct: 301 CGMLPQLCFKDSLGMLNTVPGFKAKHNSPITSPSHDQVKSSKVAQLKYRFQSVKKLALDS 360

Query: 361 TYKQKSEAAVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRS 420
             K K    V SP       K+  ES   S++     K  SSP+RHSR  SPFR+  + S
Sbjct: 361 VSKHKLSGKVHSPVHPSNGKKFNSESNLISAA-----KRSSSPYRHSRCMSPFRSTGNGS 420

Query: 421 PCRRQPFVVPKEVDIISKSKGDIDCHDTPSIQATNKDGV--DMANILMEKTLYIDTASVA 480
           P     F  P+        + +   + T +I  T+++ +       ++EKT+Y+DT +  
Sbjct: 421 PLHHAGF--PETRRETENLRANRLSNHTRNISRTSQELLYPKSNGSILEKTVYVDTENDH 480

Query: 481 GTNPPFNSAIFDDKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEG 524
            TN   NS +    ++++    K +   E+   E  +      ++   L  +  G
Sbjct: 481 MTNDQHNSNLMIFPEEADMTGRKPDANPELEAFENISIRSSEMVKGNELVEISSG 492


HSP 2 Score: 67.4 bits (163), Expect = 3.9e-11
Identity = 44/111 (39.64%), Positives = 59/111 (53.15%), Query Frame = 1

Query: 565 EDYSKANYQLVKVEDPASVKVTSAISSQP--PPLPKSPSESWLWRTLPSVSSKKLLAGSN 624
           E+ S  + ++VK  +   V+++S     P  PP PK PSESWL   LPSV+S+  +    
Sbjct: 471 ENISIRSSEMVKGNE--LVEISSGFDRSPLAPPSPKRPSESWLCHNLPSVTSQ--ITSRR 530

Query: 625 FGNKLYQKPQSPRTSAS-TKWETIVKSSNLCHDHVRYSEELLPRVSQHSTT 673
           +     QK        + TKWETIVK+S +  DH+RYSEEL+   S  S T
Sbjct: 531 YHPFNPQKQDLTENYRNGTKWETIVKTSYMHRDHIRYSEELVAHTSHQSKT 577

BLAST of Csa1G470450 vs. TAIR10
Match: AT2G30990.1 (AT2G30990.1 Protein of unknown function (DUF688))

HSP 1 Score: 190.7 bits (483), Expect = 3.0e-48
Identity = 193/689 (28.01%), Positives = 300/689 (43.54%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLD----QV 60
           MEE++L+FN PL+S+RR ++ + S SK        S  S  +  P S P +  D     V
Sbjct: 8   MEEKQLDFNRPLISIRRPTQTSESESKTR------SFDSVTNMIPPSPPVYKSDIKSGPV 67

Query: 61  TEPVAVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGT--ALDANKYSSEM 120
             P  VPF WE  PG+ K++          PH        P+L  G    ++  +     
Sbjct: 68  RNPGTVPFQWEHKPGKPKDERKPVLQSFVEPH------FVPKLPPGRERVVELGRKPEST 127

Query: 121 EACHQDGCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVN 180
            A HQ    SSS   +V   E AK++  R    ++DDDD D  + DA +TLS T SF  N
Sbjct: 128 GADHQTKTVSSSDKYLV---EDAKSNSSRY---DDDDDDSDGTYLDATDTLSRTESFFFN 187

Query: 181 NCSVSGISGYNGP--MVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAV 240
             +VSG SG +G   +V+P GT  TD QT+D MM RFLPAAKA+  E   +  +K     
Sbjct: 188 CSAVSGASGLDGSGILVEPFGTLSTDRQTQDLMMGRFLPAAKALTSESPPHLARKPPKPE 247

Query: 241 EQPRQVKKAENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARG 300
           E  +Q+ K +  ++      ++    ++  D+       +EE  ++      S  +++  
Sbjct: 248 EPVKQLMKKKQNKVE-----QNPYRFRHSPDQ-------EEEDGNI------SSMMASGV 307

Query: 301 CGLIPNICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATY 360
           CGL+P +C ++SLGLLNPVP +R++ +  +SV +      R+ +         H      
Sbjct: 308 CGLLPQLCLRSSLGLLNPVPSVRMQQQRAVSVRR-----MRSKYQDSAPSNETHNKQNED 367

Query: 361 KQKSEAAVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPC 420
           K+K        +L+E   K + + +  S S+  Q K +                ASRS  
Sbjct: 368 KRKL-------KLIESVAKGSSQGESLSVSSIPQGKEKLENV----------GTASRSK- 427

Query: 421 RRQPFVVPKEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNP 480
                        ISK+ G++   D  + + +++  V       EKTLY+D   +     
Sbjct: 428 -------------ISKNFGELLASDENTWEPSSETPV------AEKTLYVDIVHLV---- 487

Query: 481 PFNSAIFDDKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMV----EEGRLER- 540
                   DKK                 ++E + ++P   E   L +V    EE  + + 
Sbjct: 488 --------DKK-----------------VQEESKKQPILKESPSLDIVPVKDEEADISQP 547

Query: 541 EAAESKIKDAIDDCLKVGHGLYEE--DHTEYTNLGSADEEDYSKANYQLVKVEDPASVKV 600
           ++ E +  +  +D  K      EE  DH     L  +D  + +K   + + +E    V  
Sbjct: 548 KSIEQENGNRDEDFTKFSSQKVEECPDHQAIVALPESDVVEITKE--KKIDLEVQLQVIT 587

Query: 601 TSAISSQ-----------PPPLPKSPSESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSP 660
           T+  SS+           PPPLPK+PS+SWL RTLP++  K                   
Sbjct: 608 TNIESSRLHHRSSYLIVPPPPLPKAPSDSWLKRTLPTIPPKNNSFAWLQSLGTDDDNHFT 587

Query: 661 RTSASTKWETIVKSSNLCHDHVRYSEELL 664
           +T A+ KWET+VK+SN     V +S+E L
Sbjct: 668 KTQANPKWETMVKTSNTQQGFVCFSKETL 587

BLAST of Csa1G470450 vs. TAIR10
Match: AT2G34170.1 (AT2G34170.1 Protein of unknown function (DUF688))

HSP 1 Score: 170.2 bits (430), Expect = 4.2e-42
Identity = 171/550 (31.09%), Positives = 249/550 (45.27%), Query Frame = 1

Query: 143 RSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSGISGYNGPMVKPSGTFRTDPQTR 202
           + +  E ++ +DDD FSDA +TLSL  S       +SG  G     +KPS     DPQ  
Sbjct: 65  KHVEEEAEESEDDDVFSDALDTLSLKQS-------ISGGGGVEA--MKPSMPSE-DPQ-- 124

Query: 203 DFMMSRFLPAAKAMVLE-PAKYSLKKKLVAV-----EQPRQVKKAENRRISPIKRLESTL 262
            FM+ RFLPAAK++ LE P +YS K++ + +      Q R +  AENR      R ES+ 
Sbjct: 125 -FMLDRFLPAAKSLTLEQPPQYSWKRQPLPLMSEPMRQIRDIVPAENRATPT--RYESSF 184

Query: 263 LLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLI-PNICFKNSLGLLNPVPGMR 322
              Y +D    +D+ + E DS DDE   S ++S RGCG++ P ICFKNSLG+L+ V G++
Sbjct: 185 TPSYYQD----IDDEESEEDSDDDEV--SEYLSKRGCGMMSPQICFKNSLGMLSSVNGLK 244

Query: 323 IRTEAPMSVT----KKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEAAVGSPRLLEVKDK 382
              E P S+      +V  S      S  Q + K A D  YKQK  +   SP    V  K
Sbjct: 245 ---EKPYSLRTPSHDQVKSSKVAQLKSRFQSVKKLALD--YKQKLGSIAQSPVHPSVGKK 304

Query: 383 WT-GESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFV-VPKEVDIISKS 442
           +  G  +H S S+       SSP+R +   SP+R+  + SP     F    KE +++  +
Sbjct: 305 FNFGSEQHESKSS---ASRPSSPYRQNGCMSPYRSVGNSSPLHAAGFPGTRKEAEVMRAN 364

Query: 443 ------KGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAIFDDKK 502
                 +     H++   ++T +D     +  MEKTLY+D+                   
Sbjct: 365 RLNKHIRNISKSHESLYPKSTKQD--CSTSSAMEKTLYVDS------------------- 424

Query: 503 KSECPNGKNET-ACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAIDDCL 562
               P   NE  +  ++ + E+ +EEP            EG+  +   E K  + +   +
Sbjct: 425 -ENSPRTSNENRSSNVKKLPETISEEPEM----------EGKKPKAVRELKAVETLS--I 484

Query: 563 KVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPKSPSE 622
             G  + + D     N G                            +S   PP PK PSE
Sbjct: 485 SSGVKMMKADELGKNNSGC--------------------------DLSPLAPPPPKKPSE 523

Query: 623 SWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRYSEEL 673
           SWL+  LPSVSSK  +    +     +K     +++ TKWETIVK+S    DH+RYSEEL
Sbjct: 545 SWLFSNLPSVSSK--IPSPRYLFHPQKKNVEENSTSVTKWETIVKTSYTHRDHIRYSEEL 523


HSP 2 Score: 38.1 bits (87), Expect = 2.5e-02
Identity = 22/59 (37.29%), Positives = 32/59 (54.24%), Query Frame = 1

Query: 1  MEERKLNFNAPLMSVRRFSKAASSISKANEKK-SENSHFSRRSTFPVSRPQFNLDQVTE 59
          M E++LNF+APL+S RR  K+A S+ +   K+ ++ S  S   +  V      LDQV +
Sbjct: 1  MAEKQLNFDAPLLSTRRMKKSAISVRRNMPKQLTDESKTSESLSVSVLVQDMGLDQVPD 59

BLAST of Csa1G470450 vs. TAIR10
Match: AT4G18630.1 (AT4G18630.1 Protein of unknown function (DUF688))

HSP 1 Score: 81.6 bits (200), Expect = 2.0e-15
Identity = 60/180 (33.33%), Positives = 95/180 (52.78%), Query Frame = 1

Query: 151 DDDDDDDFSDARETLSLTGSFSVNNCSVSGISGYNGPMVKPSGTFRTDPQTR---DFMMS 210
           DD+++D+  +  +T+S   SFSVN CS SG+S     + K       D ++R   D MMS
Sbjct: 97  DDEEEDEVDEDLDTVSSNVSFSVN-CSTSGVS----EIEKTGERSDCDEKSRESLDLMMS 156

Query: 211 RFLPAAKAMVLEPAKY------SLKKKLVAVEQPRQVKKAENRRISPIKRLESTLLLQYG 270
           RFLPAAKAM L+  +       S ++KL+   +   V +  ++ ++     E   ++Q  
Sbjct: 157 RFLPAAKAMALQTHQKHQSSYNSSEQKLITQNREALVARQRSQLVA---EHEHFAIVQSL 216

Query: 271 KDEVHGVDEVDEESDSVDDEYDNSGHI---------SARGCGLIPNICFKNSLGLLNPVP 313
            D+++ +D+ D E D  DD+ D+ GH+         + + CG +P +C KNS   LNPVP
Sbjct: 217 YDDLN-IDD-DTEDDENDDDGDDYGHVDHKIYPAEVTKKACGFLPRLCAKNSFKFLNPVP 266


HSP 2 Score: 63.5 bits (153), Expect = 5.6e-10
Identity = 42/94 (44.68%), Positives = 55/94 (58.51%), Query Frame = 1

Query: 582 SVKVTSAISSQPPPLPKSPSESWLWRTL-PSVSSKK--LLAGSNFGNKLYQKPQSPRTSA 641
           S + T A+S  PPPLP++PS SWL RTL P V+ +   ++ G     KL Q+        
Sbjct: 393 SSEYTPALS--PPPLPETPSRSWLGRTLLPPVNPRPYGVVLGQVGIKKLNQE-----VLE 452

Query: 642 STKWETIVKSSNLCHDHVRYSEELLPRVSQHSTT 673
           STKWETIVK+S + +DH RYS+EL+   S+   T
Sbjct: 453 STKWETIVKTSYVHNDHARYSQELIVYPSRQQNT 479


HSP 3 Score: 57.0 bits (136), Expect = 5.2e-08
Identity = 32/90 (35.56%), Positives = 43/90 (47.78%), Query Frame = 1

Query: 1  MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSR-------PQFNL 60
          ME +KLN  APL S+RR        S+   KK+  +    RS     +       P  + 
Sbjct: 1  MEGKKLNLYAPLPSIRRIPSMIERSSELENKKTTITRPELRSCPSTKQETPVFVLPDQSF 60

Query: 61 DQVTEPVAVPFYWEQIPGRAKNDSGSASPE 84
          D +TEP ++PF WEQIPG+ K+D  +   E
Sbjct: 61 DHLTEPASIPFMWEQIPGKPKDDMATLIQE 90

BLAST of Csa1G470450 vs. TAIR10
Match: AT5G45850.1 (AT5G45850.1 Protein of unknown function (DUF688))

HSP 1 Score: 69.3 bits (168), Expect = 1.0e-11
Identity = 39/82 (47.56%), Positives = 47/82 (57.32%), Query Frame = 1

Query: 589 ISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGSNFGNKL------YQKPQSPRT-SASTK 648
           +S   PPLPK+PSESWL RTLP  S+   +    F   +      ++K    +T S S K
Sbjct: 354 LSVSQPPLPKTPSESWLCRTLPRSSTTSSVVSGQFAVVVSGQAARFKKNMEKKTDSQSKK 413

Query: 649 WETIVKSSNLCHDHVRYSEELL 664
           WETIVK+S   HDHVRYSE L+
Sbjct: 414 WETIVKTSYSHHDHVRYSEGLI 435


HSP 2 Score: 64.7 bits (156), Expect = 2.5e-10
Identity = 76/250 (30.40%), Positives = 107/250 (42.80%), Query Frame = 1

Query: 154 DDDDFSDARETLSLTGSFSVNNCSVSGISGY--NGPMVKPSGTFRTDP--QTRDFMMSRF 213
           ++ D   A E +S T SFSVN CS SG+S +  NG   + S   R D   + RD +MSRF
Sbjct: 92  EESDLIKAMEMVSSTASFSVN-CSSSGVSEFENNGDGDRSSNVSRDDVILEYRDLIMSRF 151

Query: 214 LPAAKAMVLEPAKYSLKKKLVAVEQPRQVKKAENRRISPIKRLESTLLLQYGKDEVH--- 273
           LPAA+A+       SLK K  A     + KK ++  +  +    +  L     DE H   
Sbjct: 152 LPAAEAI-------SLKMKKEASRVKAEKKKKQSIALQRVSMAINQDLNNDVDDEEHCDH 211

Query: 274 ----GVDEVDEESDSVDDEYDNSGHISARGCGLIPNICFKNSLGLLNPVPGMRIRTEAPM 333
               G+D +   +DS   +            G +P  C KNS+ +L PV   RI+T    
Sbjct: 212 NHVDGIDALVYSNDSKKGQ-----------LGFLPWFCSKNSVDVLTPVLS-RIKT---- 271

Query: 334 SVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEAAVGSPRLLEVKDKWTGESKHFSSS 393
              + VG  S  +       +N  + D+ YK KS     SPR+ +        SK  S S
Sbjct: 272 --CQDVGVKSENI-------INPKSIDSVYKTKST----SPRIFK-------SSKVMSKS 297


HSP 3 Score: 48.9 bits (115), Expect = 1.4e-05
Identity = 42/139 (30.22%), Positives = 54/139 (38.85%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFN-------- 60
           MEE+KLN +AP +SVRR        +++   K +     RR      +   N        
Sbjct: 1   MEEKKLNLDAPFLSVRRVPTKPDDPNESENTKKKTMTTRRREIKDSCQETENEILIRLLQ 60

Query: 61  ---LDQVTEPVAVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANK 120
               + V EP +VPF WEQ PG+ K D  +   E  L    E   ST   S       N 
Sbjct: 61  DQSFEHVMEPSSVPFKWEQTPGKPK-DKKNLVEESDLIKAMEMVSSTASFS------VNC 120

Query: 121 YSSEMEACHQDGCESSSSN 129
            SS +     +G    SSN
Sbjct: 121 SSSGVSEFENNGDGDRSSN 132

BLAST of Csa1G470450 vs. NCBI nr
Match: gi|449465006|ref|XP_004150220.1| (PREDICTED: uncharacterized protein LOC101207534 [Cucumis sativus])

HSP 1 Score: 1362.4 bits (3525), Expect = 0.0e+00
Identity = 676/676 (100.00%), Positives = 676/676 (100.00%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV
Sbjct: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60

Query: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD 120
           AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD
Sbjct: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD 120

Query: 121 GCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG 180
           GCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG
Sbjct: 121 GCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG 180

Query: 181 ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240
           ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK
Sbjct: 181 ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240

Query: 241 AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPNIC 300
           AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPNIC
Sbjct: 241 AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPNIC 300

Query: 301 FKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEAAV 360
           FKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEAAV
Sbjct: 301 FKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEAAV 360

Query: 361 GSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFVVP 420
           GSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFVVP
Sbjct: 361 GSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFVVP 420

Query: 421 KEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAIFD 480
           KEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAIFD
Sbjct: 421 KEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAIFD 480

Query: 481 DKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAIDD 540
           DKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAIDD
Sbjct: 481 DKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAIDD 540

Query: 541 CLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPKSP 600
           CLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPKSP
Sbjct: 541 CLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPKSP 600

Query: 601 SESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRYSE 660
           SESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRYSE
Sbjct: 601 SESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRYSE 660

Query: 661 ELLPRVSQHSTTENFK 677
           ELLPRVSQHSTTENFK
Sbjct: 661 ELLPRVSQHSTTENFK 676

BLAST of Csa1G470450 vs. NCBI nr
Match: gi|659068207|ref|XP_008443305.1| (PREDICTED: uncharacterized protein LOC103486924 [Cucumis melo])

HSP 1 Score: 1265.0 bits (3272), Expect = 0.0e+00
Identity = 636/676 (94.08%), Positives = 646/676 (95.56%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRRFSKAASSI KANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV
Sbjct: 1   MEERKLNFNAPLMSVRRFSKAASSIYKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60

Query: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD 120
           AVPFYWEQIPGRAKNDSGSASPEV LPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD
Sbjct: 61  AVPFYWEQIPGRAKNDSGSASPEVQLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD 120

Query: 121 GCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG 180
           GCESSSSNAIVVRLESAKASG RSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG
Sbjct: 121 GCESSSSNAIVVRLESAKASGARSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG 180

Query: 181 ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240
           ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK
Sbjct: 181 ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240

Query: 241 AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPNIC 300
            ENRR+SPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSG+ISARGCGLIPNIC
Sbjct: 241 VENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCGLIPNIC 300

Query: 301 FKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEAAV 360
           FKNSLGLLNPVPGMRIRTEAPMSVTKKVG SSRTLHH YGQK NKHAWDATYKQKSEAAV
Sbjct: 301 FKNSLGLLNPVPGMRIRTEAPMSVTKKVGESSRTLHHPYGQKTNKHAWDATYKQKSEAAV 360

Query: 361 GSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFVVP 420
           GS +LLEVKDKWTGESKHFS STDLQMKGRSSPFRHSRAASPFRNEAS+SPCRRQPFVVP
Sbjct: 361 GSHKLLEVKDKWTGESKHFSFSTDLQMKGRSSPFRHSRAASPFRNEASQSPCRRQPFVVP 420

Query: 421 KEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAIFD 480
           KEVD ISKSKGD+D HDTPSIQA NKDGVDMA+ LMEKTLYIDTASVA TNPPFN AI D
Sbjct: 421 KEVDTISKSKGDVDFHDTPSIQA-NKDGVDMASFLMEKTLYIDTASVAETNPPFNPAISD 480

Query: 481 DKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAIDD 540
           DKKK E  NGK+ETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESK KD  D 
Sbjct: 481 DKKKLEYHNGKDETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKSKDVTDY 540

Query: 541 CLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPKSP 600
           C  VGHGLYEEDHTEYTN G+ADEEDYSKANYQLVKVEDPA VKVTS+ISSQPPPLPKSP
Sbjct: 541 CPIVGHGLYEEDHTEYTNSGTADEEDYSKANYQLVKVEDPAIVKVTSSISSQPPPLPKSP 600

Query: 601 SESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRYSE 660
           SESWLWRTLPSVSSKKLLAGSN GNKLYQKPQSPR SASTKWETIVKSSNL HDHVRYSE
Sbjct: 601 SESWLWRTLPSVSSKKLLAGSNLGNKLYQKPQSPRISASTKWETIVKSSNLRHDHVRYSE 660

Query: 661 ELLPRVSQHSTTENFK 677
           EL+PRVSQHSTTENFK
Sbjct: 661 ELIPRVSQHSTTENFK 675

BLAST of Csa1G470450 vs. NCBI nr
Match: gi|1000938093|ref|XP_015583729.1| (PREDICTED: uncharacterized protein LOC8258854 [Ricinus communis])

HSP 1 Score: 442.2 bits (1136), Expect = 1.6e-120
Identity = 298/703 (42.39%), Positives = 404/703 (57.47%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFS----KAASSISKANEK--KSENSHFSRRSTFPVSRPQFNLD 60
           MEERKLNFN PL+SVRR S     +A + S + EK  K++N H  RR T P  +P + LD
Sbjct: 1   MEERKLNFNIPLLSVRRSSTPTRSSAPTKSSSGEKGKKNDNFHPDRRRTLPSCKPAYILD 60

Query: 61  QVTEPVAVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEM 120
           QVTEPVAVPF WEQIPGR K D     P+ H     E    TPR+     LD  K+    
Sbjct: 61  QVTEPVAVPFQWEQIPGRPK-DGAVPDPQGH-----EEVSVTPRIPPRRVLDVVKHIDNK 120

Query: 121 EACHQDGC----ESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGS 180
           +   QD      E+ S   IV RL+ +K           ++DDD+D +SDA +TLS T S
Sbjct: 121 KPEDQDALTPQIEAKSFTNIVGRLDCSKEGVDEKAIIILENDDDEDVYSDALDTLSPTDS 180

Query: 181 FSVNNCSVSGISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLV 240
           FSVN CS+SG+SG++   VKPSGTF  D Q +DFMMSRFLPAAKAM LEP +Y+ +K+ V
Sbjct: 181 FSVN-CSLSGVSGFDNLAVKPSGTFSIDQQAQDFMMSRFLPAAKAMTLEPPQYASRKQPV 240

Query: 241 AVEQPRQVKKAENR-RISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHIS 300
           + EQPRQ  KA NR R  P+ R  S  +  Y +      D+ DEES+   D+Y +SG+I+
Sbjct: 241 SGEQPRQTTKAVNRDRTPPVIRNRSCNIPPYHQ------DKEDEESEDECDDYSDSGNIT 300

Query: 301 ARGCGLIPNICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTL-HHSYGQKMNKHAW 360
           A+GCG +P +C KNSL LLNPVPGM+IRT+  MS TK +   ++ +   S    + K A 
Sbjct: 301 AKGCGFLPRLCIKNSLCLLNPVPGMKIRTQTSMSSTKDIKKLTKAVFSRSQSPTVKKPAR 360

Query: 361 DATYKQKSEAAVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEAS 420
           +A  KQK ++ V SPR++ V++K TG S  F+ +TD QM  R+SPFR S   SP RNEA 
Sbjct: 361 NAVSKQKQDSEVPSPRMVGVENKLTGGSNRFTYATDRQMISRTSPFRRSGCISPHRNEAP 420

Query: 421 RSPCR-RQPFVVPKEVDII------SKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLY 480
           +SP R R    +PK+++ +      S ++G     +  S     + G   A+  +EKTLY
Sbjct: 421 QSPFRGRGSQGIPKQLENLKSNQFNSFNRGYSKSQELVSYNGIRR-GSRPASPTVEKTLY 480

Query: 481 IDTASVAGTNPPFNSAIFDDKKKSECPNGKNETACE--MRVMEESTTEEPSFLEIKCLTM 540
           +DT + AG        +  +   S+   G  ++A +    + +E    + SF ++KCL +
Sbjct: 481 VDTVNAAG-------ILCSNSGSSDIKKGFVDSAEKDLKSLFQEIAVVKSSFRDMKCLNV 540

Query: 541 V-EEGRLEREAAES---KIKDAIDDCLKVGHGLYEEDHTEYTN----LGSADEEDYSKAN 600
              EG+LE +   S   ++    D     G     ED ++ +     + +A E + +  +
Sbjct: 541 AGGEGKLETKGLRSGGPELPLLSDGSPDKGQAEMTEDLSKESMALVCISAATEGNVNIES 600

Query: 601 YQLVKVEDPASVKVTSAISSQ---PPPLPKSPSESWLWRTLPSVSSKKLLAGSNFGNKLY 660
            Q+ K +D  S K TS +  Q   PP LPK+PSESWLWRTLPS+SS+   + S   N   
Sbjct: 601 DQISKRDDTGSEK-TSLVLVQPPIPPLLPKTPSESWLWRTLPSISSQNQSSNSYRNNSFL 660

Query: 661 QKPQSPRT-SASTKWETIVKSSNLCHDHVRYSEELLPRVSQHS 671
            K Q  +T SA+TKWE IVKSS L HDHVRYSEEL P  SQ S
Sbjct: 661 SKRQDTKTFSATTKWENIVKSSYLHHDHVRYSEELFPHASQQS 681

BLAST of Csa1G470450 vs. NCBI nr
Match: gi|643704038|gb|KDP21102.1| (hypothetical protein JCGZ_21573 [Jatropha curcas])

HSP 1 Score: 436.0 bits (1120), Expect = 1.2e-118
Identity = 288/685 (42.04%), Positives = 393/685 (57.37%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRR S A    +    KK EN+   +R+T P  +  FNLDQVTEPV
Sbjct: 1   MEERKLNFNAPLMSVRRSSTATKPSNVTKGKKFENAQLVKRNTLPSYKSDFNLDQVTEPV 60

Query: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKY----SSEMEA 120
           AVPF+WEQIPGR K+ S    P+   P   E    TPR +   ALD  K+      E + 
Sbjct: 61  AVPFHWEQIPGRRKDGS---KPD---PRGCEEASVTPRFTPRRALDVVKHIEDKKPEDQV 120

Query: 121 CHQDGCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNC 180
             +   +S+S N I   L+ +K          +++DDDDD +SDAR+TLS   SFSV +C
Sbjct: 121 AFRPQIQSNSFNDIANGLDCSKEGVNEKSDFNSENDDDDDLYSDARDTLSGMDSFSV-DC 180

Query: 181 SVSGISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPR 240
           SVSG+SG++   VKPSGTF  DPQTRDFMMSRFLPAAKAM LE  +Y+ +K+ V+ EQPR
Sbjct: 181 SVSGVSGFDSLAVKPSGTFNADPQTRDFMMSRFLPAAKAMTLEAPQYASRKQPVSGEQPR 240

Query: 241 QVKKAENR-RISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGL 300
           Q+ +   R R  P+ R ES  +  Y +      D VDEES+   D+Y N G I  +GCGL
Sbjct: 241 QIVQVVQRDRTPPVNRKESFNVPSYHQ------DLVDEESEDECDQYVNYGKIMTKGCGL 300

Query: 301 IPNICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLH-HSYGQKMNKHAWDATYKQ 360
           +P +C KNSL L+NPVPGM++R ++PMS  + +   +++++  S    +NK A D  +K+
Sbjct: 301 LPLLCVKNSLRLVNPVPGMKVRNQSPMSAARDIKRMTKSVYSRSQSPTINKPAKDPVHKK 360

Query: 361 KSEAAVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRR 420
           + +  V SPRL+ V +K TG S  F+ + D QM  R+SPFR S A SP+RNEA +SP   
Sbjct: 361 EPDNEVQSPRLVGVDNKLTGGSNRFTYARDRQMISRTSPFRRSGAISPYRNEAPQSPFPI 420

Query: 421 QPFV-VPKEVDIISKSKGDI--DCHDTPSIQATN---KDGVDMANILMEKTLYIDTASVA 480
             F+ VPK+++    +K ++   C+            + G    +   EKTLY+DT +VA
Sbjct: 421 GGFLGVPKDLENFKANKLNLYGKCYSKSQELVPYHGLRHGSRPLSPTTEKTLYVDTVNVA 480

Query: 481 GTNPPFNSAIFDDKKKSECPNGKN-ETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLER 540
           G     N+   D KK    P  K+ ++    R ++E+ T E +  ++  L   E+   + 
Sbjct: 481 GLLCS-NAGSSDIKKGGMGPAEKDIKSLLSSREIQETYTIESTSKDVTSLNFPEQKSGDA 540

Query: 541 EAAESKIKDAIDDCLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTS 600
           + +         D    G  L +E       + +  E + +  N Q+  + D  + K   
Sbjct: 541 DLSLLSDMSTHRDQWDTGEDLSQES-LALVCVSTTTEGNLNIENDQISNM-DIGNAKTGF 600

Query: 601 AISSQPPPLPKSPSESWLWRTLPSVSSKK----LLAGSNFGNKLYQKPQSPRTSASTKWE 660
           A  S PP LPK+PSESWL RTLP+VSS+     L  G+NF +K   +  S  TS STKWE
Sbjct: 601 AQCSLPPSLPKTPSESWLSRTLPTVSSQNPSSHLYRGTNFRSK---RQDSKTTSTSTKWE 660

Query: 661 TIVKSSNLCHDHVRYSEELLPRVSQ 669
            IVKSS L +DHVRYSEEL P  SQ
Sbjct: 661 NIVKSSYLHNDHVRYSEELFPHASQ 666

BLAST of Csa1G470450 vs. NCBI nr
Match: gi|802786880|ref|XP_012091781.1| (PREDICTED: uncharacterized protein LOC105649674 [Jatropha curcas])

HSP 1 Score: 436.0 bits (1120), Expect = 1.2e-118
Identity = 288/685 (42.04%), Positives = 393/685 (57.37%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRR S A    +    KK EN+   +R+T P  +  FNLDQVTEPV
Sbjct: 2   MEERKLNFNAPLMSVRRSSTATKPSNVTKGKKFENAQLVKRNTLPSYKSDFNLDQVTEPV 61

Query: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKY----SSEMEA 120
           AVPF+WEQIPGR K+ S    P+   P   E    TPR +   ALD  K+      E + 
Sbjct: 62  AVPFHWEQIPGRRKDGS---KPD---PRGCEEASVTPRFTPRRALDVVKHIEDKKPEDQV 121

Query: 121 CHQDGCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNC 180
             +   +S+S N I   L+ +K          +++DDDDD +SDAR+TLS   SFSV +C
Sbjct: 122 AFRPQIQSNSFNDIANGLDCSKEGVNEKSDFNSENDDDDDLYSDARDTLSGMDSFSV-DC 181

Query: 181 SVSGISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPR 240
           SVSG+SG++   VKPSGTF  DPQTRDFMMSRFLPAAKAM LE  +Y+ +K+ V+ EQPR
Sbjct: 182 SVSGVSGFDSLAVKPSGTFNADPQTRDFMMSRFLPAAKAMTLEAPQYASRKQPVSGEQPR 241

Query: 241 QVKKAENR-RISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGL 300
           Q+ +   R R  P+ R ES  +  Y +      D VDEES+   D+Y N G I  +GCGL
Sbjct: 242 QIVQVVQRDRTPPVNRKESFNVPSYHQ------DLVDEESEDECDQYVNYGKIMTKGCGL 301

Query: 301 IPNICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLH-HSYGQKMNKHAWDATYKQ 360
           +P +C KNSL L+NPVPGM++R ++PMS  + +   +++++  S    +NK A D  +K+
Sbjct: 302 LPLLCVKNSLRLVNPVPGMKVRNQSPMSAARDIKRMTKSVYSRSQSPTINKPAKDPVHKK 361

Query: 361 KSEAAVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRR 420
           + +  V SPRL+ V +K TG S  F+ + D QM  R+SPFR S A SP+RNEA +SP   
Sbjct: 362 EPDNEVQSPRLVGVDNKLTGGSNRFTYARDRQMISRTSPFRRSGAISPYRNEAPQSPFPI 421

Query: 421 QPFV-VPKEVDIISKSKGDI--DCHDTPSIQATN---KDGVDMANILMEKTLYIDTASVA 480
             F+ VPK+++    +K ++   C+            + G    +   EKTLY+DT +VA
Sbjct: 422 GGFLGVPKDLENFKANKLNLYGKCYSKSQELVPYHGLRHGSRPLSPTTEKTLYVDTVNVA 481

Query: 481 GTNPPFNSAIFDDKKKSECPNGKN-ETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLER 540
           G     N+   D KK    P  K+ ++    R ++E+ T E +  ++  L   E+   + 
Sbjct: 482 GLLCS-NAGSSDIKKGGMGPAEKDIKSLLSSREIQETYTIESTSKDVTSLNFPEQKSGDA 541

Query: 541 EAAESKIKDAIDDCLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTS 600
           + +         D    G  L +E       + +  E + +  N Q+  + D  + K   
Sbjct: 542 DLSLLSDMSTHRDQWDTGEDLSQES-LALVCVSTTTEGNLNIENDQISNM-DIGNAKTGF 601

Query: 601 AISSQPPPLPKSPSESWLWRTLPSVSSKK----LLAGSNFGNKLYQKPQSPRTSASTKWE 660
           A  S PP LPK+PSESWL RTLP+VSS+     L  G+NF +K   +  S  TS STKWE
Sbjct: 602 AQCSLPPSLPKTPSESWLSRTLPTVSSQNPSSHLYRGTNFRSK---RQDSKTTSTSTKWE 661

Query: 661 TIVKSSNLCHDHVRYSEELLPRVSQ 669
            IVKSS L +DHVRYSEEL P  SQ
Sbjct: 662 NIVKSSYLHNDHVRYSEELFPHASQ 667

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0LX77_CUCSA  0.0e+00   100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_1G470450 PE=4 SV=1
A0A067JBF4_JATCU  8.2e-119  42.04     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21573 PE=4 SV=1
A0A061GLK7_THECC  4.2e-115  40.14     Transcription initiation factor TFIID subunit 11, putative OS=Theobroma cacao GN...
A0A067GDT4_CITSI  8.8e-113  40.62     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006045mg PE=4 SV=1
V4UF07_9ROSI      5.7e-112  39.97     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014532mg PE=4 SV=1
Match Name   E-value  Identity  Description
AT1G29240.1  9.4e-58  32.52     Protein of unknown function (DUF688)
AT2G30990.1  3.0e-48  28.01     Protein of unknown function (DUF688)
AT2G34170.1  4.2e-42  31.09     Protein of unknown function (DUF688)
AT4G18630.1  2.0e-15  33.33     Protein of unknown function (DUF688)
AT5G45850.1  1.0e-11  47.56     Protein of unknown function (DUF688)
Match Name                         E-value   Identity  Description
gi|449465006|ref|XP_004150220.1|   0.0e+00   100.00    PREDICTED: uncharacterized protein LOC101207534 [Cucumis sativus]
gi|659068207|ref|XP_008443305.1|   0.0e+00   94.08     PREDICTED: uncharacterized protein LOC103486924 [Cucumis melo]
gi|1000938093|ref|XP_015583729.1|  1.6e-120  42.39     PREDICTED: uncharacterized protein LOC8258854 [Ricinus communis]
gi|643704038|gb|KDP21102.1|        1.2e-118  42.04     hypothetical protein JCGZ_21573 [Jatropha curcas]
gi|802786880|ref|XP_012091781.1|   1.2e-118  42.04     PREDICTED: uncharacterized protein LOC105649674 [Jatropha curcas]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR007789  DUF688
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU115044      cucumber EST collection version 3.0  transcribed_cluster
CU123958      cucumber EST collection version 3.0  transcribed_cluster
CU125941      cucumber EST collection version 3.0  transcribed_cluster
CU146984      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa1G470450.1  Csa1G470450.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU115044      CU115044     transcribed_cluster
CU146984      CU146984     transcribed_cluster
CU123958      CU123958     transcribed_cluster
CU125941      CU125941     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                     Source   Source Term    Source Description  Alignment
IPR007789  Protein of unknown function DUF688  PFAM     PF05097        DUF688              coord: 1..471; score: 5.9
None       No IPR available                    unknown  Coil           Coil                coord: 520..540; scor
None       No IPR available                    PANTHER  PTHR33671      FAMILY NOT NAMED    coord: 1..672; score: 2.2E
None       No IPR available                    PANTHER  PTHR33671:SF3  F28N24.8 PROTEIN    coord: 1..672; score: 2.2E

The following gene(s) are paralogous to this gene:
Gene         Paralogue    Organism                    Block
Csa1G470450  Csa7G048630  Cucumber (Chinese Long) v2  cucuB048