Csa1G222880 (gene) Cucumber (Chinese Long) v2

Name: Csa1G222880
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Putative uncharacterized protein GmTDF-5; contains IPR016024 (Armadillo-type fold)
Location: Chr1 : 11753457 .. 11754886 (-)
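The location string above can be turned into structured coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

# Pattern for strings such as "Chr1 : 11753457 .. 11754886 (-)".
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Return (seqid, start, end, strand) from a location string."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Chr1 : 11753457 .. 11754886 (-)")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, start, end, strand, span)
```

Under that assumption the feature spans 1430 bp on the minus strand of Chr1.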
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TTTGTATGTAGAACCGAGAGCGCCACATCGTTCTCTCTCGAACAACCCCTGCTCCAGAATTCAAACTCCTTCCCCTCCCCCTCCCCCATGGATGATTCTTCCAAGAACGAAGATAGCCCACGAGTTGTTAGAGTTCTCGAAGCTCTCAAGCAAGCCTCTCATGAATTACAATCAAATCCAAATCCTCGTTCCCACGAGTTCAATTCTCCCGCCATTAAAGCTCTGCTTGAACTCGAGGTTGAATCCGACAAAAACCTTTCCACCGATCCAAATCTCTCCACTCTCTCTCACCATCTCGCCAACCTAAAATCCTTAGTGGACACGCTTCAGAAATCCAGAGGTTACAGTCCCAGGTCATTCTTAACCCGCCGTTTCGCCACGAATTCCGTATCGCAAGTCGCCGGTTCAATCGAATCGGAGATTCAAGCTTGGATTGACCGTAAAAGCTTGGACACTTTGGTTCGAGCTCTTAAAGAGCCTTCAATCGATGAGAACGAATTGATTAAGTTACTAACTCAGTTTGAGAATCGACTCGCTCAAGGTTTCAACCGTGAATTACAGGATCTCATGCTGAAATCGAAGGTGTTTTCGCTGCTTGAATCGATCGTTTGTAACCCTAATTTCTCGAAAACGATCCGAGAGCATTCCGCGTATGTGATTGGGGGAATGGTCCGTTTCAATAAAGATGTATTCGTCGGTCAAATATTAATGGGACCGACAATTCATGCCCTAGTTGAAATGGCTTCTTCTCACTCATTAAAAATCCTCTGCTCTTTAATCAGATTAATTAGGTCTCCGTTGGTGGAGGAAATCGAATCAAACGGCGATATACCAAAAATAATAAGCTTATTGAATTGTGCAGATCTGCAAATTAGGGTTTTAGCCATGGATTGCATTGTCGAAATTGGGTATTTTGGGCGTAAAGATACCGTAGATGCTATGCTGGAACAGGATTTGATAGACAGGCTTGTGGAGTTGCAGAGGTCGGAATTGGGCGGGGATTTGATCGGACTAGGAAAACACACGGCAGAGGAGAGTAGAGAGGTCACTGGGAGCGCCGGAGAGAAGAGGTTCTTGGAGAAGCATCCATTTGCGAGCTGTGTAGCGAAATTTGGAGTGCAATTGGAGGTTGGAGAAGGGTTGAGGAAGAGGGAAAAGAGGGCGATTAAGGGTGAAATTTTGAAGAGAGTTCGAAAAGCTTGTGTTTCGGATGCTGAAGCAGCCACCATAATTGCTGAGGTTTTGTGGGGTTCAAGTCCTTGATTGATAGGAAACTGTGAGGGGATTTTGGTTTGATTGTGTTCATGAATATGGTGGATCGAAGGAATCGATGAGGTGCGTAAGTTCCAAATTTTTGTTTCTGGGATGATGAACATTCATTCATCAATCCTTTGTCTCTGTTCATATGTTGGTGTCATAGTTATATGTTG

mRNA sequence

ATGGATGATTCTTCCAAGAACGAAGATAGCCCACGAGTTGTTAGAGTTCTCGAAGCTCTCAAGCAAGCCTCTCATGAATTACAATCAAATCCAAATCCTCGTTCCCACGAGTTCAATTCTCCCGCCATTAAAGCTCTGCTTGAACTCGAGGTTGAATCCGACAAAAACCTTTCCACCGATCCAAATCTCTCCACTCTCTCTCACCATCTCGCCAACCTAAAATCCTTAGTGGACACGCTTCAGAAATCCAGAGGTTACAGTCCCAGGTCATTCTTAACCCGCCGTTTCGCCACGAATTCCGTATCGCAAGTCGCCGGTTCAATCGAATCGGAGATTCAAGCTTGGATTGACCGTAAAAGCTTGGACACTTTGGTTCGAGCTCTTAAAGAGCCTTCAATCGATGAGAACGAATTGATTAAGTTACTAACTCAGTTTGAGAATCGACTCGCTCAAGGTTTCAACCGTGAATTACAGGATCTCATGCTGAAATCGAAGGTGTTTTCGCTGCTTGAATCGATCGTTTGTAACCCTAATTTCTCGAAAACGATCCGAGAGCATTCCGCGTATGTGATTGGGGGAATGGTCCGTTTCAATAAAGATGTATTCGTCGGTCAAATATTAATGGGACCGACAATTCATGCCCTAGTTGAAATGGCTTCTTCTCACTCATTAAAAATCCTCTGCTCTTTAATCAGATTAATTAGGTCTCCGTTGGTGGAGGAAATCGAATCAAACGGCGATATACCAAAAATAATAAGCTTATTGAATTGTGCAGATCTGCAAATTAGGGTTTTAGCCATGGATTGCATTGTCGAAATTGGGTATTTTGGGCGTAAAGATACCGTAGATGCTATGCTGGAACAGGATTTGATAGACAGGCTTGTGGAGTTGCAGAGGTCGGAATTGGGCGGGGATTTGATCGGACTAGGAAAACACACGGCAGAGGAGAGTAGAGAGGTCACTGGGAGCGCCGGAGAGAAGAGGTTCTTGGAGAAGCATCCATTTGCGAGCTGTGTAGCGAAATTTGGAGTGCAATTGGAGGTTGGAGAAGGGTTGAGGAAGAGGGAAAAGAGGGCGATTAAGGGTGAAATTTTGAAGAGAGTTCGAAAAGCTTGTGTTTCGGATGCTGAAGCAGCCACCATAATTGCTGAGGTTTTGTGGGGTTCAAGTCCTTGA

Coding sequence (CDS)

ATGGATGATTCTTCCAAGAACGAAGATAGCCCACGAGTTGTTAGAGTTCTCGAAGCTCTCAAGCAAGCCTCTCATGAATTACAATCAAATCCAAATCCTCGTTCCCACGAGTTCAATTCTCCCGCCATTAAAGCTCTGCTTGAACTCGAGGTTGAATCCGACAAAAACCTTTCCACCGATCCAAATCTCTCCACTCTCTCTCACCATCTCGCCAACCTAAAATCCTTAGTGGACACGCTTCAGAAATCCAGAGGTTACAGTCCCAGGTCATTCTTAACCCGCCGTTTCGCCACGAATTCCGTATCGCAAGTCGCCGGTTCAATCGAATCGGAGATTCAAGCTTGGATTGACCGTAAAAGCTTGGACACTTTGGTTCGAGCTCTTAAAGAGCCTTCAATCGATGAGAACGAATTGATTAAGTTACTAACTCAGTTTGAGAATCGACTCGCTCAAGGTTTCAACCGTGAATTACAGGATCTCATGCTGAAATCGAAGGTGTTTTCGCTGCTTGAATCGATCGTTTGTAACCCTAATTTCTCGAAAACGATCCGAGAGCATTCCGCGTATGTGATTGGGGGAATGGTCCGTTTCAATAAAGATGTATTCGTCGGTCAAATATTAATGGGACCGACAATTCATGCCCTAGTTGAAATGGCTTCTTCTCACTCATTAAAAATCCTCTGCTCTTTAATCAGATTAATTAGGTCTCCGTTGGTGGAGGAAATCGAATCAAACGGCGATATACCAAAAATAATAAGCTTATTGAATTGTGCAGATCTGCAAATTAGGGTTTTAGCCATGGATTGCATTGTCGAAATTGGGTATTTTGGGCGTAAAGATACCGTAGATGCTATGCTGGAACAGGATTTGATAGACAGGCTTGTGGAGTTGCAGAGGTCGGAATTGGGCGGGGATTTGATCGGACTAGGAAAACACACGGCAGAGGAGAGTAGAGAGGTCACTGGGAGCGCCGGAGAGAAGAGGTTCTTGGAGAAGCATCCATTTGCGAGCTGTGTAGCGAAATTTGGAGTGCAATTGGAGGTTGGAGAAGGGTTGAGGAAGAGGGAAAAGAGGGCGATTAAGGGTGAAATTTTGAAGAGAGTTCGAAAAGCTTGTGTTTCGGATGCTGAAGCAGCCACCATAATTGCTGAGGTTTTGTGGGGTTCAAGTCCTTGA

Protein sequence

MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTDPNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKSLDTLVRALKEPSIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNFSKTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLVEEIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQRSELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKRAIKGEILKRVRKACVSDAEAATIIAEVLWGSSP*
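The protein sequence corresponds to a translation of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly; `translate` is a hypothetical helper, and only the first 36 and last 9 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 12 codons and last 3 codons of the CDS listed above.
cds_start = "ATGGATGATTCTTCCAAGAACGAAGATAGCCCACGA"
cds_end = "AGTCCTTGA"

print(translate(cds_start))  # MDDSSKNEDSPR
print(translate(cds_end))    # SP*
```

Both fragments agree with the start (MDDSSKNEDSPR) and end (SP*) of the protein sequence shown above.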
BLAST of Csa1G222880 vs. TrEMBL
Match: A0A0A0LT75_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G222880 PE=4 SV=1)

HSP 1 Score: 756.1 bits (1951), Expect = 2.1e-215
Identity = 391/391 (100.00%), Positives = 391/391 (100.00%), Query Frame = 1

Query: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60
           MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD
Sbjct: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60

Query: 61  PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS 120
           PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS
Sbjct: 61  PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS 120

Query: 121 LDTLVRALKEPSIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNFS 180
           LDTLVRALKEPSIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNFS
Sbjct: 121 LDTLVRALKEPSIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNFS 180

Query: 181 KTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLVE 240
           KTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLVE
Sbjct: 181 KTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLVE 240

Query: 241 EIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQRS 300
           EIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQRS
Sbjct: 241 EIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQRS 300

Query: 301 ELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKRAI 360
           ELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKRAI
Sbjct: 301 ELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKRAI 360

Query: 361 KGEILKRVRKACVSDAEAATIIAEVLWGSSP 392
           KGEILKRVRKACVSDAEAATIIAEVLWGSSP
Sbjct: 361 KGEILKRVRKACVSDAEAATIIAEVLWGSSP 391
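The percentages in each HSP summary line are ratios of matched positions to alignment length. The following is a rough sketch of how such a line could be parsed and the percentages recomputed; `hsp_percentages` is a hypothetical helper, and the pattern accepts both "Positives" and the "Postives" spelling in case the line was exported with that typo.

```python
import re

# Captures the identity and positives counts from an HSP summary line.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )

self_hit = "Identity = 391/391 (100.00%), Positives = 391/391 (100.00%), Query Frame = 1"
cacao_hit = "Identity = 257/393 (65.39%), Positives = 324/393 (82.44%), Query Frame = 1"

print(hsp_percentages(self_hit))   # (100.0, 100.0)
print(hsp_percentages(cacao_hit))  # (65.39, 82.44)
```

The second line uses the values reported for the Theobroma cacao match on this page; the recomputed percentages agree with the ones printed in parentheses.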

BLAST of Csa1G222880 vs. TrEMBL
Match: A0A061GVT8_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_038176 PE=4 SV=1)

HSP 1 Score: 504.2 bits (1297), Expect = 1.4e-139
Identity = 257/393 (65.39%), Positives = 324/393 (82.44%), Query Frame = 1

Query: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60
           M+D S  ED+PRV++VLEALKQASHELQ++P  +S   NS AIKALLELE ESD  LS D
Sbjct: 1   MEDPSSKEDNPRVLKVLEALKQASHELQAHPTYKSANSNSSAIKALLELETESDSILSND 60

Query: 61  PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS 120
           P+L TLS HLA+LK+ V+TL+++RGY  RSFLTRR +T+S+S+VAGSIESEIQAWIDR+S
Sbjct: 61  PHLFTLSQHLADLKTHVETLKRTRGYGLRSFLTRRVSTHSISRVAGSIESEIQAWIDRES 120

Query: 121 LDTLVRALKEPSIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNFS 180
           +++L++ LKEP  +E+EL++LLTQFE+R++QGFN ELQ+L+LKSK FS+L+S++C+PN S
Sbjct: 121 IESLIKGLKEPGKNEDELVRLLTQFEDRVSQGFNCELQNLVLKSKAFSVLQSVLCDPNCS 180

Query: 181 KTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLVE 240
           K IRE +A+ +  ++RFNKDVFVGQ+ MG TIHAL++M S+HSLK+LC LI+  +SP V+
Sbjct: 181 KRIRESAAFCVAALIRFNKDVFVGQVNMGGTIHALLDMRSTHSLKVLCELIKFTKSPFVD 240

Query: 241 EIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQRS 300
           EI  NG+IP+I++LL   DL ++VLA DCI+EIGYFGRK+  +A+L+  LI  LVELQRS
Sbjct: 241 EIVCNGEIPEILTLLETKDLDMKVLAFDCILEIGYFGRKEAGEALLKGGLIKNLVELQRS 300

Query: 301 ELGGDLIGLGKHTAE--ESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKR 360
           ELGGDLI +GK  AE  E  E      EKRFLE HPFASCVA+F VQLEVGEGLR+REKR
Sbjct: 301 ELGGDLIEMGKFEAENKEIEEKKREKREKRFLENHPFASCVARFAVQLEVGEGLRQREKR 360

Query: 361 AIKGEILKRVRKACVSDAEAATIIAEVLWGSSP 392
           A K EIL+RVR A +SDAEAATIIAEVLWGSSP
Sbjct: 361 AFKLEILERVRGASISDAEAATIIAEVLWGSSP 393

BLAST of Csa1G222880 vs. TrEMBL
Match: A0A067G4V8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g035696mg PE=4 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 3.2e-139
Identity = 265/405 (65.43%), Positives = 331/405 (81.73%), Query Frame = 1

Query: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60
           M+D S    +PRV+ VLEALKQAS +LQ++P+  S EFNS AIKALLELE ESD  LS D
Sbjct: 1   MEDPSVQVQAPRVLNVLEALKQASVDLQAHPSSNSAEFNSSAIKALLELETESDALLSED 60

Query: 61  PNLSTLSHHLANLKSLVDTLQKSRGY-SPRSFLTRRFATNSVSQVAGSIESEIQAWIDRK 120
           P+LSTLS HLA+LK+LV TL KSRG  S RSFL RR +T+S+S+VAGSIE+EIQAWIDR+
Sbjct: 61  PHLSTLSQHLADLKTLVQTLHKSRGRNSLRSFLARRVSTHSISRVAGSIETEIQAWIDRE 120

Query: 121 SLDTLVRALKEPSIDE---NELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCN 180
            ++ L +ALK+   D    +EL+KLLTQFE+R++QGF+RELQDL+LKSKVFSLLE+I+CN
Sbjct: 121 HIERLTKALKDLDGDGGNVDELVKLLTQFEDRVSQGFSRELQDLVLKSKVFSLLETILCN 180

Query: 181 PNFSKTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRS 240
           P+ SK++RE SAY I  ++RFNKDVFVGQ+LMGPT+ AL+ M+S+HS K+LC LI+ I+S
Sbjct: 181 PSCSKSLREQSAYSIASLIRFNKDVFVGQVLMGPTVQALLTMSSTHSSKVLCELIKSIKS 240

Query: 241 PLVEEIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVE 300
           PLV+EIESNG+IPKIISLL+  DLQ+++LAMDCI+EIGYFGRK+ +DAMLEQ L+ +LVE
Sbjct: 241 PLVDEIESNGEIPKIISLLDMKDLQMKLLAMDCILEIGYFGRKEAIDAMLEQGLVKKLVE 300

Query: 301 LQRSELGGDLIGLGKHTAEESREVTGSAG----------EKRFLEKHPFASCVAKFGVQL 360
           LQRSELGGDLI + +   +E  +    AG          E++FL++HPFASCVA+F VQL
Sbjct: 301 LQRSELGGDLIEMERFEEKEKNDRGVGAGGVVESKRESRERKFLKRHPFASCVARFAVQL 360

Query: 361 EVGEGLRKREKRAIKGEILKRVRKACVSDAEAATIIAEVLWGSSP 392
           EVGEGLR+REKRA+K EIL RVR+A  SDAEAATI+AEVLWGSSP
Sbjct: 361 EVGEGLRQREKRALKQEILLRVREASASDAEAATIVAEVLWGSSP 405

BLAST of Csa1G222880 vs. TrEMBL
Match: U5G650_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s06240g PE=4 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 3.2e-139
Identity = 259/392 (66.07%), Positives = 319/392 (81.38%), Query Frame = 1

Query: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60
           M+D+   E+  RV++VLEALKQASH+LQ++P P S E NSPA+KALLELE ESD  LS D
Sbjct: 2   MEDTPSKEEITRVLKVLEALKQASHDLQTHPGPNSAESNSPALKALLELETESDTILSKD 61

Query: 61  PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS 120
           P LSTLS HLA+LKSL DTLQ+S G+  RSFLTRR +T S+S+VAGSIESEIQAWIDR+S
Sbjct: 62  PLLSTLSEHLASLKSLFDTLQRSCGHGLRSFLTRRVSTQSISRVAGSIESEIQAWIDRES 121

Query: 121 LDTLVRALKEP-SIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNF 180
           +D L++ LK+P  I+E EL+ LL+QFE+R+ QGFNRELQDL+LKSK+F LLE I C+P+ 
Sbjct: 122 IDRLMKGLKDPLQIEEGELVGLLSQFEDRVLQGFNRELQDLVLKSKIFCLLERISCDPSC 181

Query: 181 SKTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLV 240
           S+ +RE  A+V+  ++RFNKDVFVGQ+LMG  IH +V MAS  S+K+L SLI+ I+SPLV
Sbjct: 182 SRKVREQCAFVVSALIRFNKDVFVGQVLMGRLIHGVVSMASWKSMKVLSSLIKSIKSPLV 241

Query: 241 EEIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQR 300
           +EIESNG+IPKIIS L+  DL +RV+ MDCI+EIGYFGRK+ ++AML + LI +LVELQR
Sbjct: 242 DEIESNGEIPKIISFLDYKDLHLRVVTMDCILEIGYFGRKEAIEAMLREALIKKLVELQR 301

Query: 301 SELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKRA 360
           S+LGGDLI +G    E+ R      GE RFLE HPFASCVA+F VQLEVGEGLR+RE+RA
Sbjct: 302 SKLGGDLIDIGMFDDEKER----GKGETRFLENHPFASCVARFAVQLEVGEGLRQRERRA 361

Query: 361 IKGEILKRVRKACVSDAEAATIIAEVLWGSSP 392
            K EILK VR ACVS+AEAATI+AEVLWGSSP
Sbjct: 362 FKQEILKTVRNACVSNAEAATIVAEVLWGSSP 389

BLAST of Csa1G222880 vs. TrEMBL
Match: A0A067K215_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17013 PE=4 SV=1)

HSP 1 Score: 502.3 bits (1292), Expect = 5.4e-139
Identity = 258/396 (65.15%), Positives = 326/396 (82.32%), Query Frame = 1

Query: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60
           M++ S  E+  RV++VLEALKQASH+LQ +P P+S + NSPAIKALLELE ESD  LS D
Sbjct: 1   MENPSSKEEISRVLKVLEALKQASHDLQLHPTPKSTDSNSPAIKALLELETESDTILSKD 60

Query: 61  PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS 120
           PNLSTLS HL +L++L+++LQ+SRGYS R+FL RR +T+S+S+VAGSIESEIQAWIDR+S
Sbjct: 61  PNLSTLSQHLTSLRTLIESLQESRGYSFRNFLNRRVSTHSISRVAGSIESEIQAWIDRES 120

Query: 121 LDTLVRALKEPSIDENE--LIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPN 180
           +++L  ALKEP  +ENE  L+ LLTQFENR++QGFNRELQDL+LKSK+  +LE+I+C+PN
Sbjct: 121 IESLTAALKEPGKNENEEVLVSLLTQFENRVSQGFNRELQDLVLKSKILYILENIICDPN 180

Query: 181 ---FSKTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIR 240
              +SK I+E   +V+  ++RFNKDVFVGQ+LMGP IHALV MAS  S+++LCSLI+LI+
Sbjct: 181 NNNWSKRIKEQCGFVVAALIRFNKDVFVGQVLMGPLIHALVSMASWKSIRVLCSLIKLIK 240

Query: 241 SPLVEEIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLV 300
           SPLV+EIESNG+IPKII  L+  DLQIRVLAM+CI+EIGYFGRK+ ++AML++ LI +LV
Sbjct: 241 SPLVDEIESNGEIPKIICFLDYKDLQIRVLAMECILEIGYFGRKEAIEAMLKEGLIKKLV 300

Query: 301 ELQRSELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKR 360
           ELQR +LG DLI +     E+  E      EK+ LE+ PFA CVAKF VQLEVGEGLR+R
Sbjct: 301 ELQRLDLGSDLIDISNFDEEKESE---KIKEKKVLERFPFAGCVAKFAVQLEVGEGLRQR 360

Query: 361 EKRAIKGEILKRVRKACVSDAEAATIIAEVLWGSSP 392
           EKRA K EIL+RV++A VSDAEAATI+AEVLWGSSP
Sbjct: 361 EKRAFKQEILERVKEASVSDAEAATIVAEVLWGSSP 393

BLAST of Csa1G222880 vs. NCBI nr
Match: gi|700210009|gb|KGN65105.1| (hypothetical protein Csa_1G222880 [Cucumis sativus])

HSP 1 Score: 756.1 bits (1951), Expect = 3.0e-215
Identity = 391/391 (100.00%), Positives = 391/391 (100.00%), Query Frame = 1

Query: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60
           MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD
Sbjct: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60

Query: 61  PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS 120
           PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS
Sbjct: 61  PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS 120

Query: 121 LDTLVRALKEPSIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNFS 180
           LDTLVRALKEPSIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNFS
Sbjct: 121 LDTLVRALKEPSIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNFS 180

Query: 181 KTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLVE 240
           KTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLVE
Sbjct: 181 KTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLVE 240

Query: 241 EIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQRS 300
           EIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQRS
Sbjct: 241 EIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQRS 300

Query: 301 ELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKRAI 360
           ELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKRAI
Sbjct: 301 ELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKRAI 360

Query: 361 KGEILKRVRKACVSDAEAATIIAEVLWGSSP 392
           KGEILKRVRKACVSDAEAATIIAEVLWGSSP
Sbjct: 361 KGEILKRVRKACVSDAEAATIIAEVLWGSSP 391

BLAST of Csa1G222880 vs. NCBI nr
Match: gi|590578712|ref|XP_007013585.1| (Uncharacterized protein TCM_038176 [Theobroma cacao])

HSP 1 Score: 504.2 bits (1297), Expect = 2.0e-139
Identity = 257/393 (65.39%), Positives = 324/393 (82.44%), Query Frame = 1

Query: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60
           M+D S  ED+PRV++VLEALKQASHELQ++P  +S   NS AIKALLELE ESD  LS D
Sbjct: 1   MEDPSSKEDNPRVLKVLEALKQASHELQAHPTYKSANSNSSAIKALLELETESDSILSND 60

Query: 61  PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS 120
           P+L TLS HLA+LK+ V+TL+++RGY  RSFLTRR +T+S+S+VAGSIESEIQAWIDR+S
Sbjct: 61  PHLFTLSQHLADLKTHVETLKRTRGYGLRSFLTRRVSTHSISRVAGSIESEIQAWIDRES 120

Query: 121 LDTLVRALKEPSIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNFS 180
           +++L++ LKEP  +E+EL++LLTQFE+R++QGFN ELQ+L+LKSK FS+L+S++C+PN S
Sbjct: 121 IESLIKGLKEPGKNEDELVRLLTQFEDRVSQGFNCELQNLVLKSKAFSVLQSVLCDPNCS 180

Query: 181 KTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLVE 240
           K IRE +A+ +  ++RFNKDVFVGQ+ MG TIHAL++M S+HSLK+LC LI+  +SP V+
Sbjct: 181 KRIRESAAFCVAALIRFNKDVFVGQVNMGGTIHALLDMRSTHSLKVLCELIKFTKSPFVD 240

Query: 241 EIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQRS 300
           EI  NG+IP+I++LL   DL ++VLA DCI+EIGYFGRK+  +A+L+  LI  LVELQRS
Sbjct: 241 EIVCNGEIPEILTLLETKDLDMKVLAFDCILEIGYFGRKEAGEALLKGGLIKNLVELQRS 300

Query: 301 ELGGDLIGLGKHTAE--ESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKR 360
           ELGGDLI +GK  AE  E  E      EKRFLE HPFASCVA+F VQLEVGEGLR+REKR
Sbjct: 301 ELGGDLIEMGKFEAENKEIEEKKREKREKRFLENHPFASCVARFAVQLEVGEGLRQREKR 360

Query: 361 AIKGEILKRVRKACVSDAEAATIIAEVLWGSSP 392
           A K EIL+RVR A +SDAEAATIIAEVLWGSSP
Sbjct: 361 AFKLEILERVRGASISDAEAATIIAEVLWGSSP 393

BLAST of Csa1G222880 vs. NCBI nr
Match: gi|641854661|gb|KDO73455.1| (hypothetical protein CISIN_1g035696mg [Citrus sinensis])

HSP 1 Score: 503.1 bits (1294), Expect = 4.5e-139
Identity = 265/405 (65.43%), Positives = 331/405 (81.73%), Query Frame = 1

Query: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60
           M+D S    +PRV+ VLEALKQAS +LQ++P+  S EFNS AIKALLELE ESD  LS D
Sbjct: 1   MEDPSVQVQAPRVLNVLEALKQASVDLQAHPSSNSAEFNSSAIKALLELETESDALLSED 60

Query: 61  PNLSTLSHHLANLKSLVDTLQKSRGY-SPRSFLTRRFATNSVSQVAGSIESEIQAWIDRK 120
           P+LSTLS HLA+LK+LV TL KSRG  S RSFL RR +T+S+S+VAGSIE+EIQAWIDR+
Sbjct: 61  PHLSTLSQHLADLKTLVQTLHKSRGRNSLRSFLARRVSTHSISRVAGSIETEIQAWIDRE 120

Query: 121 SLDTLVRALKEPSIDE---NELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCN 180
            ++ L +ALK+   D    +EL+KLLTQFE+R++QGF+RELQDL+LKSKVFSLLE+I+CN
Sbjct: 121 HIERLTKALKDLDGDGGNVDELVKLLTQFEDRVSQGFSRELQDLVLKSKVFSLLETILCN 180

Query: 181 PNFSKTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRS 240
           P+ SK++RE SAY I  ++RFNKDVFVGQ+LMGPT+ AL+ M+S+HS K+LC LI+ I+S
Sbjct: 181 PSCSKSLREQSAYSIASLIRFNKDVFVGQVLMGPTVQALLTMSSTHSSKVLCELIKSIKS 240

Query: 241 PLVEEIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVE 300
           PLV+EIESNG+IPKIISLL+  DLQ+++LAMDCI+EIGYFGRK+ +DAMLEQ L+ +LVE
Sbjct: 241 PLVDEIESNGEIPKIISLLDMKDLQMKLLAMDCILEIGYFGRKEAIDAMLEQGLVKKLVE 300

Query: 301 LQRSELGGDLIGLGKHTAEESREVTGSAG----------EKRFLEKHPFASCVAKFGVQL 360
           LQRSELGGDLI + +   +E  +    AG          E++FL++HPFASCVA+F VQL
Sbjct: 301 LQRSELGGDLIEMERFEEKEKNDRGVGAGGVVESKRESRERKFLKRHPFASCVARFAVQL 360

Query: 361 EVGEGLRKREKRAIKGEILKRVRKACVSDAEAATIIAEVLWGSSP 392
           EVGEGLR+REKRA+K EIL RVR+A  SDAEAATI+AEVLWGSSP
Sbjct: 361 EVGEGLRQREKRALKQEILLRVREASASDAEAATIVAEVLWGSSP 405

BLAST of Csa1G222880 vs. NCBI nr
Match: gi|566174784|ref|XP_006381098.1| (hypothetical protein POPTR_0006s06240g [Populus trichocarpa])

HSP 1 Score: 503.1 bits (1294), Expect = 4.5e-139
Identity = 259/392 (66.07%), Positives = 319/392 (81.38%), Query Frame = 1

Query: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60
           M+D+   E+  RV++VLEALKQASH+LQ++P P S E NSPA+KALLELE ESD  LS D
Sbjct: 2   MEDTPSKEEITRVLKVLEALKQASHDLQTHPGPNSAESNSPALKALLELETESDTILSKD 61

Query: 61  PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS 120
           P LSTLS HLA+LKSL DTLQ+S G+  RSFLTRR +T S+S+VAGSIESEIQAWIDR+S
Sbjct: 62  PLLSTLSEHLASLKSLFDTLQRSCGHGLRSFLTRRVSTQSISRVAGSIESEIQAWIDRES 121

Query: 121 LDTLVRALKEP-SIDENELIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPNF 180
           +D L++ LK+P  I+E EL+ LL+QFE+R+ QGFNRELQDL+LKSK+F LLE I C+P+ 
Sbjct: 122 IDRLMKGLKDPLQIEEGELVGLLSQFEDRVLQGFNRELQDLVLKSKIFCLLERISCDPSC 181

Query: 181 SKTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIRSPLV 240
           S+ +RE  A+V+  ++RFNKDVFVGQ+LMG  IH +V MAS  S+K+L SLI+ I+SPLV
Sbjct: 182 SRKVREQCAFVVSALIRFNKDVFVGQVLMGRLIHGVVSMASWKSMKVLSSLIKSIKSPLV 241

Query: 241 EEIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLVELQR 300
           +EIESNG+IPKIIS L+  DL +RV+ MDCI+EIGYFGRK+ ++AML + LI +LVELQR
Sbjct: 242 DEIESNGEIPKIISFLDYKDLHLRVVTMDCILEIGYFGRKEAIEAMLREALIKKLVELQR 301

Query: 301 SELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKREKRA 360
           S+LGGDLI +G    E+ R      GE RFLE HPFASCVA+F VQLEVGEGLR+RE+RA
Sbjct: 302 SKLGGDLIDIGMFDDEKER----GKGETRFLENHPFASCVARFAVQLEVGEGLRQRERRA 361

Query: 361 IKGEILKRVRKACVSDAEAATIIAEVLWGSSP 392
            K EILK VR ACVS+AEAATI+AEVLWGSSP
Sbjct: 362 FKQEILKTVRNACVSNAEAATIVAEVLWGSSP 389

BLAST of Csa1G222880 vs. NCBI nr
Match: gi|802666098|ref|XP_012081168.1| (PREDICTED: uncharacterized protein LOC105641270 [Jatropha curcas])

HSP 1 Score: 502.3 bits (1292), Expect = 7.7e-139
Identity = 258/396 (65.15%), Positives = 326/396 (82.32%), Query Frame = 1

Query: 1   MDDSSKNEDSPRVVRVLEALKQASHELQSNPNPRSHEFNSPAIKALLELEVESDKNLSTD 60
           M++ S  E+  RV++VLEALKQASH+LQ +P P+S + NSPAIKALLELE ESD  LS D
Sbjct: 1   MENPSSKEEISRVLKVLEALKQASHDLQLHPTPKSTDSNSPAIKALLELETESDTILSKD 60

Query: 61  PNLSTLSHHLANLKSLVDTLQKSRGYSPRSFLTRRFATNSVSQVAGSIESEIQAWIDRKS 120
           PNLSTLS HL +L++L+++LQ+SRGYS R+FL RR +T+S+S+VAGSIESEIQAWIDR+S
Sbjct: 61  PNLSTLSQHLTSLRTLIESLQESRGYSFRNFLNRRVSTHSISRVAGSIESEIQAWIDRES 120

Query: 121 LDTLVRALKEPSIDENE--LIKLLTQFENRLAQGFNRELQDLMLKSKVFSLLESIVCNPN 180
           +++L  ALKEP  +ENE  L+ LLTQFENR++QGFNRELQDL+LKSK+  +LE+I+C+PN
Sbjct: 121 IESLTAALKEPGKNENEEVLVSLLTQFENRVSQGFNRELQDLVLKSKILYILENIICDPN 180

Query: 181 ---FSKTIREHSAYVIGGMVRFNKDVFVGQILMGPTIHALVEMASSHSLKILCSLIRLIR 240
              +SK I+E   +V+  ++RFNKDVFVGQ+LMGP IHALV MAS  S+++LCSLI+LI+
Sbjct: 181 NNNWSKRIKEQCGFVVAALIRFNKDVFVGQVLMGPLIHALVSMASWKSIRVLCSLIKLIK 240

Query: 241 SPLVEEIESNGDIPKIISLLNCADLQIRVLAMDCIVEIGYFGRKDTVDAMLEQDLIDRLV 300
           SPLV+EIESNG+IPKII  L+  DLQIRVLAM+CI+EIGYFGRK+ ++AML++ LI +LV
Sbjct: 241 SPLVDEIESNGEIPKIICFLDYKDLQIRVLAMECILEIGYFGRKEAIEAMLKEGLIKKLV 300

Query: 301 ELQRSELGGDLIGLGKHTAEESREVTGSAGEKRFLEKHPFASCVAKFGVQLEVGEGLRKR 360
           ELQR +LG DLI +     E+  E      EK+ LE+ PFA CVAKF VQLEVGEGLR+R
Sbjct: 301 ELQRLDLGSDLIDISNFDEEKESE---KIKEKKVLERFPFAGCVAKFAVQLEVGEGLRQR 360

Query: 361 EKRAIKGEILKRVRKACVSDAEAATIIAEVLWGSSP 392
           EKRA K EIL+RV++A VSDAEAATI+AEVLWGSSP
Sbjct: 361 EKRAFKQEILERVKEASVSDAEAATIVAEVLWGSSP 393

The following BLAST results are available for this feature:

Match Name        E-value   Identity (%)  Description
A0A0A0LT75_CUCSA  2.1e-215  100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_1G222880 PE=4 SV=1
A0A061GVT8_THECC  1.4e-139  65.39         Uncharacterized protein OS=Theobroma cacao GN=TCM_038176 PE=4 SV=1
A0A067G4V8_CITSI  3.2e-139  65.43         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g035696mg PE=4 SV=1
U5G650_POPTR      3.2e-139  66.07         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s06240g PE=4 SV=1
A0A067K215_JATCU  5.4e-139  65.15         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17013 PE=4 SV=1

Match Name                        E-value   Identity (%)  Description
gi|700210009|gb|KGN65105.1|       3.0e-215  100.00        hypothetical protein Csa_1G222880 [Cucumis sativus]
gi|590578712|ref|XP_007013585.1|  2.0e-139  65.39         Uncharacterized protein TCM_038176 [Theobroma cacao]
gi|641854661|gb|KDO73455.1|       4.5e-139  65.43         hypothetical protein CISIN_1g035696mg [Citrus sinensis]
gi|566174784|ref|XP_006381098.1|  4.5e-139  66.07         hypothetical protein POPTR_0006s06240g [Populus trichocarpa]
gi|802666098|ref|XP_012081168.1|  7.7e-139  65.15         PREDICTED: uncharacterized protein LOC105641270 [Jatropha curcas]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term        Definition
IPR011989   ARM-like
IPR016024   ARM-type_fold

Vocabulary: Molecular Function
Term        Definition
GO:0005488  binding

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005488      binding

This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU123061      cucumber EST collection version 3.0  transcribed_cluster
CU138485      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa1G222880.1  Csa1G222880.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU123061      CU123061     transcribed_cluster
CU138485      CU138485     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description         Source   Source Term       Source Description   Alignment
IPR011989  Armadillo-like helical  GENE3D   G3DSA:1.25.10.10  -                    coord: 94..301, score: 1.3E-10; coord: 32..57, score: 1.3
IPR016024  Armadillo-type fold     unknown  SSF48371          ARM repeat           coord: 33..54, score: 2.42E-14; coord: 88..301, score: 2.42
None       No IPR available        PANTHER  PTHR35834         FAMILY NOT NAMED     coord: 3..391, score: 3.7E
None       No IPR available        PANTHER  PTHR35834:SF2     SUBFAMILY NOT NAMED  coord: 3..391, score: 3.7E
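The `coord:` entries give the residue ranges covered by each annotation. As an illustrative sketch, assuming the ranges are 1-based and inclusive protein coordinates (not stated on the page), the number of residues covered by the GENE3D match can be computed as follows; `parse_coords` and `covered_residues` are hypothetical helpers.

```python
import re

def parse_coords(text):
    """Extract (start, end) pairs from 'coord: A..B' entries."""
    return [(int(a), int(b)) for a, b in re.findall(r"coord:\s*(\d+)\.\.(\d+)", text)]

def covered_residues(segments):
    """Count distinct residues covered by inclusive segments (overlaps counted once)."""
    covered = set()
    for start, end in segments:
        covered.update(range(start, end + 1))
    return len(covered)

# GENE3D G3DSA:1.25.10.10 segments as listed in the table above.
gene3d_segments = parse_coords("coord: 94..301 score: 1.3E-10 coord: 32..57")
gene3d_covered = covered_residues(gene3d_segments)

print(gene3d_segments)  # [(94, 301), (32, 57)]
print(gene3d_covered)   # 234
```

Under these assumptions the two GENE3D segments cover 234 of the 391 residues in the protein.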