Csa1G004180.1 (mRNA) Cucumber (Chinese Long) v2

NameCsa1G004180.1
TypemRNA
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionChromosome undetermined scaffold_34, whole genome shotgun sequence
LocationChr1 : 678390 .. 679993 (+)
Sequence length657
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGTTACCAAATAATATGAGAGTGATGATTTAGTTTTTGCAAAACTTTCTCGTGTGAGTTAACATATATCAGATTCTCAATTCGAATGGGGTCACAGGTCAAATCTGTCTGTATTTTCTTGTATAAAATGACAGCCATTCACGAGCCTCATAGACGTTTCCACTAATCATAACTCTGAGAGAGAGCTAGTCAAATAGAGCAAGGGATAAGCACAGCCAATATAGGTACCCCAAATGCAGCTCTTTTATGGAGGAGCTTTTGTTCAGAGTTCACATCAAACACATCTCTCTTTCTTATTTCCTGTCTATAAGACCTTCTTGAAAAAAACTCACTCAGTAACCCAGAAAGGCTAAAGAAGCAAAGCCCTTCAGTAAAATGGGTGTATGCTCAAGCAAAGATGGGAGTGAGATTTTTTATGATGATGTTAAAGGATTGAAAGAAAAGATAAGGCTACTTAGAGAGGAAGTGAGGGGAGTAATCTGCGAAATGGATAAAGAGACTAAAGCACATGAGAAAGACATGGTGGTTTTTGCATTTAAGGAGGCAGGTTGGAAAACAGAGAAAAAGAGACTCAAAGAGGAGGTGAAAATGTTGAGGAAGAAGGTAAAAGAGAGTTTTACAGAGATTGAAGAAGGAAATTTTGGAGAGAAGATTGCAACAGAATGGGAAATGGAGGGAACTCCAAATACAATCTTTGAGCAGATACAACAGGAAAGAGCACGAAGAGATGAAGCCATTGAGAAGTGGAAACAACTTTATCACGCTATCAAGATTGAACTTGATGACCTAATTCAGAGGACACATAATGGTATGTGTATGAACTTGTCTTATCTTGTCTTATCTTATCTTATCTTATCTTCCCTTTGTTTCTGAATTTTCTATCGATTGTCTTTCAATTTAGCTCAAGATTAGAGATGAGTGAAGGACGACCTTACTTGAACAAGATCTGTCTTGTGGGTTGTACAACCGTGTCTTTGAGTTCTTAAGATTAGAGATTAGTGAAGGGTAGCAAAAAACTAAGACTAAAGAATTTCAATATCACTAACAGACACTTATCATTTACTTTGATAGGAGATGGATTACATTGGGGAGTGACTGAGAGGACGGAAGCACTAAAAACACAATTACAAGCAAAGGAGGAGACCATAAAAGCCCTCAAAGAACAAGTAGTTTCAATGGAGCAAGACAAATATAAGAGGAACAGAGAAATCGACATACTGAGGCAAAGCTTGAGAATCATGACCAGTAAGAAGGAACAGCAGATAGAAACTTTCCATAAATGCGTCTGTAAGTAACTTGCCAATGAAATAGGCATGAAAACAGAATTTTCAACACTACATTAGAATTAAGAAGAAATAATGAAATGCCAATTGAAATGTTACTAGTTGTAAATGTGTCCAGGAAGGTTGTGTTACCTTTGCACACCCACCCTTCATTTCAACCCAAATGTTGTTATTCTTATTATCATTACGTTGGCATGGTCCTAAAGATCAACATTTTTGTTCAAGACTGGGATTTTGACAAGTGTTACAAAAGATGAAAAAGAAAAGGTTAATCCTCCCTCAGTTAACACAGACAATTGAGGCTTAATGATAGATAGATAGG

mRNA sequence

ATGGGTGTATGCTCAAGCAAAGATGGGAGTGAGATTTTTTATGATGATGTTAAAGGATTGAAAGAAAAGATAAGGCTACTTAGAGAGGAAGTGAGGGGAGTAATCTGCGAAATGGATAAAGAGACTAAAGCACATGAGAAAGACATGGTGGTTTTTGCATTTAAGGAGGCAGGTTGGAAAACAGAGAAAAAGAGACTCAAAGAGGAGGTGAAAATGTTGAGGAAGAAGGTAAAAGAGAGTTTTACAGAGATTGAAGAAGGAAATTTTGGAGAGAAGATTGCAACAGAATGGGAAATGGAGGGAACTCCAAATACAATCTTTGAGCAGATACAACAGGAAAGAGCACGAAGAGATGAAGCCATTGAGAAGTGGAAACAACTTTATCACGCTATCAAGATTGAACTTGATGACCTAATTCAGAGGACACATAATGGAGATGGATTACATTGGGGAGTGACTGAGAGGACGGAAGCACTAAAAACACAATTACAAGCAAAGGAGGAGACCATAAAAGCCCTCAAAGAACAAGTAGTTTCAATGGAGCAAGACAAATATAAGAGGAACAGAGAAATCGACATACTGAGGCAAAGCTTGAGAATCATGACCAGTAAGAAGGAACAGCAGATAGAAACTTTCCATAAATGCGTCTGTAAGTAA

Coding sequence (CDS)

ATGGGTGTATGCTCAAGCAAAGATGGGAGTGAGATTTTTTATGATGATGTTAAAGGATTGAAAGAAAAGATAAGGCTACTTAGAGAGGAAGTGAGGGGAGTAATCTGCGAAATGGATAAAGAGACTAAAGCACATGAGAAAGACATGGTGGTTTTTGCATTTAAGGAGGCAGGTTGGAAAACAGAGAAAAAGAGACTCAAAGAGGAGGTGAAAATGTTGAGGAAGAAGGTAAAAGAGAGTTTTACAGAGATTGAAGAAGGAAATTTTGGAGAGAAGATTGCAACAGAATGGGAAATGGAGGGAACTCCAAATACAATCTTTGAGCAGATACAACAGGAAAGAGCACGAAGAGATGAAGCCATTGAGAAGTGGAAACAACTTTATCACGCTATCAAGATTGAACTTGATGACCTAATTCAGAGGACACATAATGGAGATGGATTACATTGGGGAGTGACTGAGAGGACGGAAGCACTAAAAACACAATTACAAGCAAAGGAGGAGACCATAAAAGCCCTCAAAGAACAAGTAGTTTCAATGGAGCAAGACAAATATAAGAGGAACAGAGAAATCGACATACTGAGGCAAAGCTTGAGAATCATGACCAGTAAGAAGGAACAGCAGATAGAAACTTTCCATAAATGCGTCTGTAAGTAA

Protein sequence

MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWKTEKKRLKEEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEAIEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSMEQDKYKRNREIDILRQSLRIMTSKKEQQIETFHKCVCK*
BLAST of Csa1G004180.1 vs. Swiss-Prot
Match: YPF05_PLAF7 (Uncharacterized protein PF11_0207 OS=Plasmodium falciparum (isolate 3D7) GN=PF11_0207 PE=1 SV=2)

HSP 1 Score: 62.4 bits (150), Expect = 7.2e-09
Identity = 53/203 (26.11%), Postives = 106/203 (52.22%), Query Frame = 1

Query: 15  DDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWKTEKKRLKEEVKMLR 74
           +++K +KE+I+ ++EE++  I E  KE K   K+ +    KE   K E K +KEE+K ++
Sbjct: 502 EEIKEIKEEIKEVKEEIKEEIKEEIKEVKEEIKEEIKEEIKEV--KEEIKEVKEEIKEVK 561

Query: 75  KKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQE-RARRDEAIEKWKQLYHAIKI 134
           +++KE   EI+E           E++     I E+I++E +  ++E  E+ K+    +K 
Sbjct: 562 EEIKEVKEEIKE-----------EIKEVKEEIKEEIKEEIKEVKEEIKEEVKEEIKEVKE 621

Query: 135 ELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSMEQDKYKRNREIDI 194
           E+ ++ +     + +   + E  E +K   +  +E IK +KE++   E+ K +   EI  
Sbjct: 622 EIKEVKEEIK--EEVKEEIKEVKEEIKEVKEEIKEEIKEVKEEI--KEEVKEEIKEEIKE 681

Query: 195 LRQSLR--IMTSKKEQQIETFHK 215
           +++ L+  I +   +++  T HK
Sbjct: 682 IKEELKNDISSETTKEEKNTEHK 687


HSP 2 Score: 57.8 bits (138), Expect = 1.8e-07
Identity = 55/209 (26.32%), Postives = 104/209 (49.76%), Query Frame = 1

Query: 15  DDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWKTEKKRLKEEVKMLR 74
           +++K +KE+I+ ++EE++  I E+ +E K   K+ +    +E      K+ +KEE+K ++
Sbjct: 553 EEIKEVKEEIKEVKEEIKEEIKEVKEEIKEEIKEEIKEVKEEI-----KEEVKEEIKEVK 612

Query: 75  KKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERAR-RDEAIEKWKQLYHAIKI 134
           +++KE   EI+E    E    + E++     I E+I++ +   ++E  E+ K+    IK 
Sbjct: 613 EEIKEVKEEIKEEVKEEIKEVKEEIKEVKEEIKEEIKEVKEEIKEEVKEEIKEEIKEIKE 672

Query: 135 ELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKA--LKEQVVSMEQD-KYKRNRE 194
           EL + I             +E T+  K     KEET K   + ++V+  +Q+ K K  R 
Sbjct: 673 ELKNDIS------------SETTKEEKNTEHKKEETEKKKFIPKRVIMYQQELKEKEERN 732

Query: 195 IDILRQS-------LRIMTSKKEQQIETF 213
           + +L Q        L+++ SK +    TF
Sbjct: 733 LKLLEQQRKEREMRLQLIRSKTQGTSSTF 744


HSP 3 Score: 45.8 bits (107), Expect = 6.9e-04
Identity = 37/176 (21.02%), Postives = 93/176 (52.84%), Query Frame = 1

Query: 39  DKETKAHEKDMV--VFAFKEAGWKTEKKRLKEEVKMLRKKVKESFTEIEEGNFGEKIATE 98
           D   K HEK+ +  +   KE   +  K+ +KEE+K +++++KE   EI+E    E    +
Sbjct: 471 DTAMKMHEKEQIDDIQERKEEIKEEFKEEVKEEIKEIKEEIKEVKEEIKEEIKEEIKEVK 530

Query: 99  WEMEGTPNTIFEQIQQERARRDEAIEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTERT 158
            E++       +++++E     E I++ K+    +K E+ + I+     + +   + E  
Sbjct: 531 EEIKEEIKEEIKEVKEEIKEVKEEIKEVKEEIKEVKEEIKEEIKEVK--EEIKEEIKEEI 590

Query: 159 EALKTQLQAK-EETIKALKEQVVSMEQD-KYKRNREIDILRQSLRIMTSKKEQQIE 211
           + +K +++ + +E IK +KE++  ++++ K +   EI  +++ ++ +  + +++I+
Sbjct: 591 KEVKEEIKEEVKEEIKEVKEEIKEVKEEIKEEVKEEIKEVKEEIKEVKEEIKEEIK 644


HSP 4 Score: 36.2 bits (82), Expect = 5.5e-01
Identity = 40/200 (20.00%), Postives = 88/200 (44.00%), Query Frame = 1

Query: 15  DDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWKTEKKRLKEEVKMLR 74
           +++K +KE+I+ ++EE++  I E+ +E K   K+ +         K E K +KEE   L+
Sbjct: 623 EEIKEVKEEIKEVKEEIKEEIKEVKEEIKEEVKEEI---------KEEIKEIKEE---LK 682

Query: 75  KKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEAIEKWKQLYHAIKIE 134
             +    T+ E+    +K  TE + +  P  +    Q+ + + +  ++  +Q     ++ 
Sbjct: 683 NDISSETTKEEKNTEHKKEETE-KKKFIPKRVIMYQQELKEKEERNLKLLEQQRKEREMR 742

Query: 135 LDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSMEQDKYKRNREIDIL 194
           L  LI+    G    +  + + + L++    KEE  K +K  +    +D    N   +  
Sbjct: 743 L-QLIRSKTQGTSSTFIPSAKLKHLES---LKEEKKKEVKTNI--QPKDNNNNNNNNNNN 802

Query: 195 RQSLRIMTSKKEQQIETFHK 215
             ++ ++ + K ++     K
Sbjct: 803 NNNIAVLKNNKNEEQNVIKK 803

BLAST of Csa1G004180.1 vs. Swiss-Prot
Match: CFA57_MOUSE (Cilia- and flagella-associated protein 57 OS=Mus musculus GN=Cfap57 PE=1 SV=3)

HSP 1 Score: 54.3 bits (129), Expect = 1.9e-06
Identity = 60/217 (27.65%), Postives = 102/217 (47.00%), Query Frame = 1

Query: 20   LKEKIRLL---REEVRGVICEMDKETKAHEKD--------MVVFAFKEAGWKTEKKRLKE 79
            L+EK  LL   +E+VR  + E ++  K  E+D           +  K    K    RLK 
Sbjct: 834  LQEKTGLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYERKLRDEKESNLRLKG 893

Query: 80   EVKMLRKKVKESFTEIEEGNFGEKI--ATEWEMEGTPNTIFEQIQ---QERARRDEAIE- 139
            E  ++RKK      EIEE     ++  + + +++G   ++ + IQ   +E   RDE I+ 
Sbjct: 894  ETGIMRKKFSSLQKEIEERTNDIELLKSEQMKLQGIIRSLEKDIQGLKREIQERDETIQD 953

Query: 140  KWKQLYHAIKIELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSMEQ 199
            K K++Y        DL ++    +   + +  + + LK Q++ +E  IK +KEQ+  ME 
Sbjct: 954  KEKRIY--------DLKKKNQELEKFKFVLDYKIKELKKQIEPRENEIKVMKEQIQEMEA 1013

Query: 200  DK---YKRNREIDI----LRQSLRI--MTSKKEQQIE 211
            +    +K+N ++++    L Q LR      +KEQQ E
Sbjct: 1014 ELERFHKQNTQLELNITELLQKLRATDQEMRKEQQKE 1042

BLAST of Csa1G004180.1 vs. TrEMBL
Match: A0A0A0LS07_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G004180 PE=4 SV=1)

HSP 1 Score: 438.7 bits (1127), Expect = 4.1e-120
Identity = 218/218 (100.00%), Postives = 218/218 (100.00%), Query Frame = 1

Query: 1   MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWK 60
           MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWK
Sbjct: 1   MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWK 60

Query: 61  TEKKRLKEEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEA 120
           TEKKRLKEEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEA
Sbjct: 61  TEKKRLKEEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEA 120

Query: 121 IEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSM 180
           IEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSM
Sbjct: 121 IEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSM 180

Query: 181 EQDKYKRNREIDILRQSLRIMTSKKEQQIETFHKCVCK 219
           EQDKYKRNREIDILRQSLRIMTSKKEQQIETFHKCVCK
Sbjct: 181 EQDKYKRNREIDILRQSLRIMTSKKEQQIETFHKCVCK 218

BLAST of Csa1G004180.1 vs. TrEMBL
Match: A0A0B2P7Y4_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_046900 PE=4 SV=1)

HSP 1 Score: 217.6 bits (553), Expect = 1.5e-53
Identity = 116/204 (56.86%), Postives = 152/204 (74.51%), Query Frame = 1

Query: 17  VKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWKTEKKRLKEEVKMLRKK 76
           V  LKEK+RLL+EE++ ++ E +KET+++E+D++VF FKEA WK E KRL+EEVK LR  
Sbjct: 19  VMNLKEKVRLLQEEIKEMMYEREKETRSYERDIMVFTFKEADWKQEGKRLREEVKQLRSL 78

Query: 77  VK---ESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEAIEKWKQLYHAIKI 136
           V+   E   EIE G   +    EWE+ GT   + EQ+++ERARRDEA+EKWKQLY AIKI
Sbjct: 79  VEEKDEKIREIEVGMMEKSSEKEWELMGT-KLLVEQMKEERARRDEAVEKWKQLYLAIKI 138

Query: 137 ELDDLIQRTHNGDGLHWGVTE---RTEALKTQLQAKEETIKALKEQVVSMEQDKYKRNRE 196
           ELD+LIQRT++GDGL+W   E   + E LK +LQ K+ETIKALK Q++SME++KYK+ RE
Sbjct: 139 ELDELIQRTYDGDGLYWKAEENDIQMENLKKELQEKDETIKALKTQLLSMEKEKYKKERE 198

Query: 197 IDILRQSLRIMTSKKEQQIETFHK 215
            D+LRQSLRIM  KK   I+T  K
Sbjct: 199 FDLLRQSLRIMNGKK-NSIQTKEK 220

BLAST of Csa1G004180.1 vs. TrEMBL
Match: I1NEK7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_20G081000 PE=4 SV=1)

HSP 1 Score: 217.6 bits (553), Expect = 1.5e-53
Identity = 116/204 (56.86%), Postives = 152/204 (74.51%), Query Frame = 1

Query: 17  VKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWKTEKKRLKEEVKMLRKK 76
           V  LKEK+RLL+EE++ ++ E +KET+++E+D++VF FKEA WK E KRL+EEVK LR  
Sbjct: 19  VMNLKEKVRLLQEEIKEMMYEREKETRSYERDIMVFTFKEADWKQEGKRLREEVKQLRSL 78

Query: 77  VK---ESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEAIEKWKQLYHAIKI 136
           V+   E   EIE G   +    EWE+ GT   + EQ+++ERARRDEA+EKWKQLY AIKI
Sbjct: 79  VEEKDEKIREIEVGMMEKSSEKEWELMGT-KLLVEQMKEERARRDEAVEKWKQLYLAIKI 138

Query: 137 ELDDLIQRTHNGDGLHWGVTE---RTEALKTQLQAKEETIKALKEQVVSMEQDKYKRNRE 196
           ELD+LIQRT++GDGL+W   E   + E LK +LQ K+ETIKALK Q++SME++KYK+ RE
Sbjct: 139 ELDELIQRTYDGDGLYWKAEENDIQMENLKKELQEKDETIKALKTQLLSMEKEKYKKERE 198

Query: 197 IDILRQSLRIMTSKKEQQIETFHK 215
            D+LRQSLRIM  KK   I+T  K
Sbjct: 199 FDLLRQSLRIMNGKK-NSIQTKEK 220

BLAST of Csa1G004180.1 vs. TrEMBL
Match: A0A151U2S2_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_006214 PE=4 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 1.3e-52
Identity = 117/220 (53.18%), Postives = 155/220 (70.45%), Query Frame = 1

Query: 1   MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWK 60
           MG   SK         V  LKEK+R+L+EE++ ++ E +KE++ +E+D++VF FKEA WK
Sbjct: 1   MGGSGSKSERRCSEKYVMNLKEKVRVLQEEIKEMMYEREKESRGYERDIMVFTFKEADWK 60

Query: 61  TEKKRLKEEVKMLRKKVK---ESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARR 120
            E KRL+EEVK LRK V+   E   E+E G   +K   EWE+ GT   + EQ+++ERARR
Sbjct: 61  QEGKRLREEVKQLRKVVEEKDEKIREMEVGLMEKKSEKEWELMGT-KLLVEQMKEERARR 120

Query: 121 DEAIEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTE---RTEALKTQLQAKEETIKALK 180
           DEA+EKWKQLY AIK ELD+LIQRT++GDGL+W   E   + E L+ +LQ KEET KALK
Sbjct: 121 DEAVEKWKQLYLAIKTELDELIQRTYDGDGLYWKAEENDIQMENLRRELQEKEETTKALK 180

Query: 181 EQVVSMEQDKYKRNREIDILRQSLRIMTSKKEQQIETFHK 215
            Q++SME++KYK+ RE D+LRQSLRIM  KK   I+T  K
Sbjct: 181 TQLLSMEKEKYKKEREFDLLRQSLRIMNGKK-NSIQTREK 218

BLAST of Csa1G004180.1 vs. TrEMBL
Match: I1LAN6_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G130200 PE=4 SV=1)

HSP 1 Score: 210.7 bits (535), Expect = 1.8e-51
Identity = 115/204 (56.37%), Postives = 149/204 (73.04%), Query Frame = 1

Query: 17  VKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWKTEKKRLKEEVKMLRKK 76
           V  LKEK+RLL+EE++ ++ E +KET+ +E+D++VF FKEA  K E KRL+EEVK LR  
Sbjct: 17  VMNLKEKVRLLQEEIKEMMYEREKETRRYERDIMVFTFKEADSKQEGKRLREEVKQLRSL 76

Query: 77  VKES---FTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEAIEKWKQLYHAIKI 136
           V+E      EIE G   +    EWE+ GT   + EQ+++ERARRDEA+EKWKQLY AIK 
Sbjct: 77  VEEKDEKIREIEVGMMEKNSEKEWELMGT-KLLVEQMKEERARRDEAVEKWKQLYLAIKT 136

Query: 137 ELDDLIQRTHNGDGLHWGVTE---RTEALKTQLQAKEETIKALKEQVVSMEQDKYKRNRE 196
           ELD+LIQRT++GDGL+W   E   + E LK +LQ KEETIKALK Q++SME++KYK+ RE
Sbjct: 137 ELDELIQRTYDGDGLYWKAEENGIQMENLKKELQEKEETIKALKTQLLSMEKEKYKKERE 196

Query: 197 IDILRQSLRIMTSKKEQQIETFHK 215
            D+LRQSLRIM  KK   I+T  K
Sbjct: 197 FDLLRQSLRIMNGKK-NSIQTKEK 218

BLAST of Csa1G004180.1 vs. TAIR10
Match: AT3G23930.1 (AT3G23930.1 unknown protein)

HSP 1 Score: 126.3 bits (316), Expect = 2.3e-29
Identity = 88/226 (38.94%), Postives = 127/226 (56.19%), Query Frame = 1

Query: 1   MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAG-- 60
           MG C+SK         V+ +KE  +      R +I E D+  K     ++    KEA   
Sbjct: 1   MGGCTSKQERR----GVRSVKETSKDQSRGRRHLIKERDEREK-----VMFLQLKEAERE 60

Query: 61  WKTEKKRLKEEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRD 120
           W+ E+K+L+EEV+ LRKK++E   E +     E+   +W        + E++  ERA RD
Sbjct: 61  WRKERKKLREEVRRLRKKLEER-EEAKTTTTEEREYWKW--------VVEEMCVERAVRD 120

Query: 121 EAIEKWKQLYHAIKIELDDLIQRTHNGDG-------LHWGVTERTEA-----LKTQLQAK 180
           EA+EKWKQLY AIK ELD LI  T +  G       L     E TEA     L+ +++ K
Sbjct: 121 EAVEKWKQLYLAIKNELDHLISHTTSSSGEAIMQRKLEEQEEEETEAKRVEVLRDEVRVK 180

Query: 181 EETIKALKEQVVSMEQDKYKRNREIDILRQSLRIMTSKKEQQIETF 213
           EET++ L+EQ+V M++ KY++ REID+LRQSLRI+ SKK+++  +F
Sbjct: 181 EETVETLEEQIVLMDRQKYEKEREIDLLRQSLRILGSKKKKKTGSF 208

BLAST of Csa1G004180.1 vs. TAIR10
Match: AT4G13540.1 (AT4G13540.1 unknown protein)

HSP 1 Score: 116.7 bits (291), Expect = 1.8e-26
Identity = 78/199 (39.20%), Postives = 113/199 (56.78%), Query Frame = 1

Query: 9   GSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFK--EAGWKTEKKRL 68
           G     D+    K +I++   E R      +   +  EK+ V+ A K  E  W+ E+KRL
Sbjct: 2   GGSTSKDERNSSKRRIKVKANEQR----RRETRRELDEKERVILALKMAETEWRKERKRL 61

Query: 69  KEEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEAIEKWKQ 128
           +EEVK LR+K++E     EEG   +    EWE       + EQ+  ERA R+EA+E+WKQ
Sbjct: 62  REEVKRLRQKMEEK----EEGKAKQH---EWEW------VVEQMCLERAVREEAVERWKQ 121

Query: 129 LYHAIKIELDDLIQRTHNGDGLHW----GVTERTEALKTQLQAKEETIKALKEQVVSMEQ 188
           LY AIK ELDDLI  T+ G+ L       V +  + L+ +++A+ ETI+ LK ++  ME+
Sbjct: 122 LYFAIKNELDDLIHTTY-GEALRQKPQEEVAKAVQELRKEVKARGETIETLKGRINLMEK 181

Query: 189 DKYKRNREIDILRQSLRIM 202
            +  + REID+LRQSLRI+
Sbjct: 182 QQNGKEREIDLLRQSLRIL 182

BLAST of Csa1G004180.1 vs. NCBI nr
Match: gi|449440646|ref|XP_004138095.1| (PREDICTED: uncharacterized protein LOC101204691 [Cucumis sativus])

HSP 1 Score: 438.7 bits (1127), Expect = 5.9e-120
Identity = 218/218 (100.00%), Postives = 218/218 (100.00%), Query Frame = 1

Query: 1   MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWK 60
           MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWK
Sbjct: 1   MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWK 60

Query: 61  TEKKRLKEEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEA 120
           TEKKRLKEEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEA
Sbjct: 61  TEKKRLKEEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEA 120

Query: 121 IEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSM 180
           IEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSM
Sbjct: 121 IEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSM 180

Query: 181 EQDKYKRNREIDILRQSLRIMTSKKEQQIETFHKCVCK 219
           EQDKYKRNREIDILRQSLRIMTSKKEQQIETFHKCVCK
Sbjct: 181 EQDKYKRNREIDILRQSLRIMTSKKEQQIETFHKCVCK 218

BLAST of Csa1G004180.1 vs. NCBI nr
Match: gi|659129099|ref|XP_008464528.1| (PREDICTED: WD repeat-containing protein 65-like [Cucumis melo])

HSP 1 Score: 425.2 bits (1092), Expect = 6.7e-116
Identity = 211/218 (96.79%), Postives = 214/218 (98.17%), Query Frame = 1

Query: 1   MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWK 60
           MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVI EMDKETKAHEKDMVVFAFKEA WK
Sbjct: 1   MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVITEMDKETKAHEKDMVVFAFKEASWK 60

Query: 61  TEKKRLKEEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEA 120
           TEKKRL+EEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEA
Sbjct: 61  TEKKRLREEVKMLRKKVKESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEA 120

Query: 121 IEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTERTEALKTQLQAKEETIKALKEQVVSM 180
           +EKWKQLYHAIKIELDDLIQRTHNGDGLHWG TERTEALKTQLQAKEETIK+LKEQVVSM
Sbjct: 121 VEKWKQLYHAIKIELDDLIQRTHNGDGLHWGATERTEALKTQLQAKEETIKSLKEQVVSM 180

Query: 181 EQDKYKRNREIDILRQSLRIMTSKKEQQIETFHKCVCK 219
           EQDKYKRNREIDILRQSLRIMTSKKEQQIET HKCVCK
Sbjct: 181 EQDKYKRNREIDILRQSLRIMTSKKEQQIETSHKCVCK 218

BLAST of Csa1G004180.1 vs. NCBI nr
Match: gi|356575194|ref|XP_003555727.1| (PREDICTED: uncharacterized protein PF11_0207-like [Glycine max])

HSP 1 Score: 217.6 bits (553), Expect = 2.1e-53
Identity = 116/204 (56.86%), Postives = 152/204 (74.51%), Query Frame = 1

Query: 17  VKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWKTEKKRLKEEVKMLRKK 76
           V  LKEK+RLL+EE++ ++ E +KET+++E+D++VF FKEA WK E KRL+EEVK LR  
Sbjct: 19  VMNLKEKVRLLQEEIKEMMYEREKETRSYERDIMVFTFKEADWKQEGKRLREEVKQLRSL 78

Query: 77  VK---ESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEAIEKWKQLYHAIKI 136
           V+   E   EIE G   +    EWE+ GT   + EQ+++ERARRDEA+EKWKQLY AIKI
Sbjct: 79  VEEKDEKIREIEVGMMEKSSEKEWELMGT-KLLVEQMKEERARRDEAVEKWKQLYLAIKI 138

Query: 137 ELDDLIQRTHNGDGLHWGVTE---RTEALKTQLQAKEETIKALKEQVVSMEQDKYKRNRE 196
           ELD+LIQRT++GDGL+W   E   + E LK +LQ K+ETIKALK Q++SME++KYK+ RE
Sbjct: 139 ELDELIQRTYDGDGLYWKAEENDIQMENLKKELQEKDETIKALKTQLLSMEKEKYKKERE 198

Query: 197 IDILRQSLRIMTSKKEQQIETFHK 215
            D+LRQSLRIM  KK   I+T  K
Sbjct: 199 FDLLRQSLRIMNGKK-NSIQTKEK 220

BLAST of Csa1G004180.1 vs. NCBI nr
Match: gi|1012362388|gb|KYP73571.1| (hypothetical protein KK1_006214 [Cajanus cajan])

HSP 1 Score: 214.5 bits (545), Expect = 1.8e-52
Identity = 117/220 (53.18%), Postives = 155/220 (70.45%), Query Frame = 1

Query: 1   MGVCSSKDGSEIFYDDVKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWK 60
           MG   SK         V  LKEK+R+L+EE++ ++ E +KE++ +E+D++VF FKEA WK
Sbjct: 1   MGGSGSKSERRCSEKYVMNLKEKVRVLQEEIKEMMYEREKESRGYERDIMVFTFKEADWK 60

Query: 61  TEKKRLKEEVKMLRKKVK---ESFTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARR 120
            E KRL+EEVK LRK V+   E   E+E G   +K   EWE+ GT   + EQ+++ERARR
Sbjct: 61  QEGKRLREEVKQLRKVVEEKDEKIREMEVGLMEKKSEKEWELMGT-KLLVEQMKEERARR 120

Query: 121 DEAIEKWKQLYHAIKIELDDLIQRTHNGDGLHWGVTE---RTEALKTQLQAKEETIKALK 180
           DEA+EKWKQLY AIK ELD+LIQRT++GDGL+W   E   + E L+ +LQ KEET KALK
Sbjct: 121 DEAVEKWKQLYLAIKTELDELIQRTYDGDGLYWKAEENDIQMENLRRELQEKEETTKALK 180

Query: 181 EQVVSMEQDKYKRNREIDILRQSLRIMTSKKEQQIETFHK 215
            Q++SME++KYK+ RE D+LRQSLRIM  KK   I+T  K
Sbjct: 181 TQLLSMEKEKYKKEREFDLLRQSLRIMNGKK-NSIQTREK 218

BLAST of Csa1G004180.1 vs. NCBI nr
Match: gi|947084820|gb|KRH33541.1| (hypothetical protein GLYMA_10G130200 [Glycine max])

HSP 1 Score: 210.7 bits (535), Expect = 2.6e-51
Identity = 115/204 (56.37%), Postives = 149/204 (73.04%), Query Frame = 1

Query: 17  VKGLKEKIRLLREEVRGVICEMDKETKAHEKDMVVFAFKEAGWKTEKKRLKEEVKMLRKK 76
           V  LKEK+RLL+EE++ ++ E +KET+ +E+D++VF FKEA  K E KRL+EEVK LR  
Sbjct: 17  VMNLKEKVRLLQEEIKEMMYEREKETRRYERDIMVFTFKEADSKQEGKRLREEVKQLRSL 76

Query: 77  VKES---FTEIEEGNFGEKIATEWEMEGTPNTIFEQIQQERARRDEAIEKWKQLYHAIKI 136
           V+E      EIE G   +    EWE+ GT   + EQ+++ERARRDEA+EKWKQLY AIK 
Sbjct: 77  VEEKDEKIREIEVGMMEKNSEKEWELMGT-KLLVEQMKEERARRDEAVEKWKQLYLAIKT 136

Query: 137 ELDDLIQRTHNGDGLHWGVTE---RTEALKTQLQAKEETIKALKEQVVSMEQDKYKRNRE 196
           ELD+LIQRT++GDGL+W   E   + E LK +LQ KEETIKALK Q++SME++KYK+ RE
Sbjct: 137 ELDELIQRTYDGDGLYWKAEENGIQMENLKKELQEKEETIKALKTQLLSMEKEKYKKERE 196

Query: 197 IDILRQSLRIMTSKKEQQIETFHK 215
            D+LRQSLRIM  KK   I+T  K
Sbjct: 197 FDLLRQSLRIMNGKK-NSIQTKEK 218

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
YPF05_PLAF77.2e-0926.11Uncharacterized protein PF11_0207 OS=Plasmodium falciparum (isolate 3D7) GN=PF11... [more]
CFA57_MOUSE1.9e-0627.65Cilia- and flagella-associated protein 57 OS=Mus musculus GN=Cfap57 PE=1 SV=3[more]
Match NameE-valueIdentityDescription
A0A0A0LS07_CUCSA4.1e-120100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_1G004180 PE=4 SV=1[more]
A0A0B2P7Y4_GLYSO1.5e-5356.86Uncharacterized protein OS=Glycine soja GN=glysoja_046900 PE=4 SV=1[more]
I1NEK7_SOYBN1.5e-5356.86Uncharacterized protein OS=Glycine max GN=GLYMA_20G081000 PE=4 SV=1[more]
A0A151U2S2_CAJCA1.3e-5253.18Uncharacterized protein OS=Cajanus cajan GN=KK1_006214 PE=4 SV=1[more]
I1LAN6_SOYBN1.8e-5156.37Uncharacterized protein OS=Glycine max GN=GLYMA_10G130200 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G23930.12.3e-2938.94 unknown protein[more]
AT4G13540.11.8e-2639.20 unknown protein[more]
Match NameE-valueIdentityDescription
gi|449440646|ref|XP_004138095.1|5.9e-120100.00PREDICTED: uncharacterized protein LOC101204691 [Cucumis sativus][more]
gi|659129099|ref|XP_008464528.1|6.7e-11696.79PREDICTED: WD repeat-containing protein 65-like [Cucumis melo][more]
gi|356575194|ref|XP_003555727.1|2.1e-5356.86PREDICTED: uncharacterized protein PF11_0207-like [Glycine max][more]
gi|1012362388|gb|KYP73571.1|1.8e-5253.18hypothetical protein KK1_006214 [Cajanus cajan][more]
gi|947084820|gb|KRH33541.1|2.6e-5156.37hypothetical protein GLYMA_10G130200 [Glycine max][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Csa1G004180Csa1G004180gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Csa1G004180.1Csa1G004180.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa1G004180.1.utr5p1Csa1G004180.1.utr5p1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa1G004180.1.cds1Csa1G004180.1.cds1CDS
Csa1G004180.1.cds2Csa1G004180.1.cds2CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa1G004180.1.utr3p1Csa1G004180.1.utr3p1three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableunknownCoilCoilcoord: 17..37
score: -coord: 156..183
score: -coord: 121..141
score: -coord: 56..83
scor
NoneNo IPR availablePANTHERPTHR37226FAMILY NOT NAMEDcoord: 2..206
score: 3.6
NoneNo IPR availablePANTHERPTHR37226:SF1GENOMIC DNA, CHROMOSOME 3, BAC CLONE:F14O13coord: 2..206
score: 3.6