CSPI04G02030.1 (mRNA) Wild cucumber (PI 183967)

Name: CSPI04G02030.1
Type: mRNA
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: U11/U12 small nuclear ribonucleoprotein 25 kDa protein, putative
Location: Chr4 : 1162147 .. 1164284 (-)
Sequence length: 732
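
The location field can be turned into a genomic span with a few lines of code. The sketch below is illustrative only: the helper name parse_location is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr4 : 1162147 .. 1164284 (-)'.

    Returns (seqid, start, end, strand). Hypothetical helper, not part of
    any database API.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Chr4 : 1162147 .. 1164284 (-)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

The (-) strand means the feature is read from the reverse strand of Chr4, so the sequences listed for it are presumably reverse-complemented relative to the chromosome.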
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGCCAACCACTTCTCCTCGCTCCGACCGGACCAAGATCGCTAACTTATGGCGTTTCACCCGCTTCATTGATTACCGGTGTTTTGCTCCGCCGACGCTCGACTTTGATCCTGACTCCGACGCTCTCAGTCGGAGGCCTTATCGCTTGCTTCCTCGTCAGGATCATATCGATCTTTTCGTCCTGAAGCTCGACGGTTCTGTTTTTGGTGATTCCTACTCCATTTTCTCCATTTTCTTTTTCTTCTGTTGTTTCTTGGAAGACGTGAGTAGAGAATCTGTGTTTCGGTGAAGTTTCTGTCGGTTTTCTGGTGGTTGAGAATCGATATGGAAAATGAATGATGAAAGATGGAAGGAAACTGTACTGGAAACTGTTTTAATATAACTTGCTTAAGAGAACAGAACGGAGTATGGTTTCTTGATAACGCTGTAGCGGTAAAATGCGTTTAGAAAAGAGTTGGTAATTAAGGTGTTTGGCGCTTATGGCTACAATGGCTAAGAAAAATGCAACGGCGATATGTTTAAATATCTCGATTCTTTAGTTTATGCAGTTTTTCTCTGCTTTAATTTGTAGGTGTGGAGATCATGTGAAAATAACTAAATATTTGATTTGAAGCTACTTGTTCTTTATATTGCGGGAATATGGTTACAGTGTCAACTACAAGTTAAACTGATTTTTGATTCTTTGGCTATCCAGGCATTCGAGTATCGCGGAACAGTACAGTAGCAGATCTTAAACGAGCAATAGAGAAGGTTTTTGATTCCCCCGGGGGGAGTGAACACTACAAGATTACATGGTAGTACTGACTATTTACGAGACTTATGGCTAATTTTACACTGGAGATTTTGTTTTTGTTCCTTTTAGGAGCTTTGAGATGCTATTGGGTTTTCTATGAACGTCGTTTTGTTTATTTTCACCACATTCAACATTGCTGAATGTACATGTTCTTAGGTCACTTATCTGGGGACATTTCTGCTTATGTTATGAAGGCGAAAAGCTTATTGATGACAAGACATGCATTAAAGGTTATGGGATTAAGGATGGTGATCAGGTGTGAGAATATTGTGATGTGTCTCTTGCTTGTTCGTTGAGTTAAACGTTAGGTGTTCTCTAGCTTTTAATGTTTTCCCCTGTTGTGATAATGCAGCTTCAGTTTATTAGGCACATGTCTATCAACTGTTTGTCGATGAAGAAAGACAGGAAGAACCAAACTGTTCCTTGCAAAGCAATGCTGTTGTGAGTTTCTAAGTTGTTCTTTCTTATAGCTGATGTTGCACATGTGTCCGTTTCTTGCAAAACAGTGTCATTTCTTATCTGTTCTCTTTATAGGCTGCACTAGCACATGCATAAACATGCCAATGCACGGGCACATACCGTTATCTTTTCTGAACAAAATTGCTTGATTTGATGGCTTGTAAGAAGCAAAAGATGAATACAAGAGATGGATACAGCACGACACTGATATTGAGACACGTTCTTTTCTAAAAATTTAGGATATGATATGGCAATAACACCTTTATTAAAATATTTATCATTTTTATATAAATTTTTTTTTAGTAAATGAGATTTATATGCTTAAAAAGTTGATGTATTTTACGCTCAAAATTTATTAAAATTGTCACATGTGTTTTCTTAATCTAACTCAACAAGTGATTTATGCATGTACAATTCATTTGTTGTACAGATACACTAGCCAAACTAGCCAAACTAAAGTGTCCATAGATTTTAGATGGCTTGTTTTTTAGTCCCATTGTACAGTCTACAAACTATGCATACTTTCCCAATATTTTCTTTGTTCTGCAATGAACAGTTGATGGTTTATTCTTGAACTGTTCTATTAGACAGTAAACTGATATTTCCCATAAAAGTTTCTCGCCAGAATCAAAAGCTGTTGAAGAGAATCAAGCAGATGGTCAAGACGATTTCAAGGATTATCAAGTTCACAGGGACGATTCGAACCGAGAGGAGGTTGCTTCTGTGGCGAGAGCTGGATTTCAGCTGGC
AAATTTGTTCAAAGGACGGGTATTGTATTCCAGGATATGGGGTTTCTGTAAGAGTGCATCAGAAGGCAGGAATAGGGCATCCTCTGCTTTTCGCATTGCTTGAGATTGGAAATGGACATGGAATCACACAAGCTGCAA

mRNA sequence

ATGCCAACCACTTCTCCTCGCTCCGACCGGACCAAGATCGCTAACTTATGGCGTTTCACCCGCTTCATTGATTACCGGTGTTTTGCTCCGCCGACGCTCGACTTTGATCCTGACTCCGACGCTCTCAGTCGGAGGCCTTATCGCTTGCTTCCTCGTCAGGATCATATCGATCTTTTCGTCCTGAAGCTCGACGGTTCTGTTTTTGGCATTCGAGTATCGCGGAACAGTACAGTAGCAGATCTTAAACGAGCAATAGAGAAGGTTTTTGATTCCCCCGGGGGGAGTGAACACTACAAGATTACATGGTCACTTATCTGGGGACATTTCTGCTTATGTTATGAAGGCGAAAAGCTTATTGATGACAAGACATGCATTAAAGGTTATGGGATTAAGGATGGTGATCAGCTTCAGTTTATTAGGCACATGTCTATCAACTGTTTGTCGATGAAGAAAGACAGGAAGAACCAAACTGTTCCTTGCAAAGCAATGCTGTTTTTCTCGCCAGAATCAAAAGCTGTTGAAGAGAATCAAGCAGATGGTCAAGACGATTTCAAGGATTATCAAGTTCACAGGGACGATTCGAACCGAGAGGAGGTTGCTTCTGTGGCGAGAGCTGGATTTCAGCTGGCAAATTTGTTCAAAGGACGGGTATTGTATTCCAGGATATGGGGTTTCTGTAAGAGTGCATCAGAAGGCAGGAATAGGGCATCCTCTGCTTTTCGCATTGCTTGA

Coding sequence (CDS)

ATGCCAACCACTTCTCCTCGCTCCGACCGGACCAAGATCGCTAACTTATGGCGTTTCACCCGCTTCATTGATTACCGGTGTTTTGCTCCGCCGACGCTCGACTTTGATCCTGACTCCGACGCTCTCAGTCGGAGGCCTTATCGCTTGCTTCCTCGTCAGGATCATATCGATCTTTTCGTCCTGAAGCTCGACGGTTCTGTTTTTGGCATTCGAGTATCGCGGAACAGTACAGTAGCAGATCTTAAACGAGCAATAGAGAAGGTTTTTGATTCCCCCGGGGGGAGTGAACACTACAAGATTACATGGTCACTTATCTGGGGACATTTCTGCTTATGTTATGAAGGCGAAAAGCTTATTGATGACAAGACATGCATTAAAGGTTATGGGATTAAGGATGGTGATCAGCTTCAGTTTATTAGGCACATGTCTATCAACTGTTTGTCGATGAAGAAAGACAGGAAGAACCAAACTGTTCCTTGCAAAGCAATGCTGTTTTTCTCGCCAGAATCAAAAGCTGTTGAAGAGAATCAAGCAGATGGTCAAGACGATTTCAAGGATTATCAAGTTCACAGGGACGATTCGAACCGAGAGGAGGTTGCTTCTGTGGCGAGAGCTGGATTTCAGCTGGCAAATTTGTTCAAAGGACGGGTATTGTATTCCAGGATATGGGGTTTCTGTAAGAGTGCATCAGAAGGCAGGAATAGGGCATCCTCTGCTTTTCGCATTGCTTGA
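
The CDS can be checked against the protein used as the BLAST query by translating it codon by codon. The following is a minimal sketch using the standard genetic code; the names CODON_TABLE and translate are illustrative, and only the first 57 and last 12 nucleotides of the CDS above are embedded to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame nucleotide sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 57 nt and last 12 nt of the CDS listed above.
cds_start = "ATGCCAACCACTTCTCCTCGCTCCGACCGGACCAAGATCGCTAACTTATGGCGTTTC"
cds_end = "CGCATTGCTTGA"

print(translate(cds_start))  # N-terminal residues of the protein
print(translate(cds_end))    # C-terminal residues followed by the stop codon

# 732 nt is 244 codons: 243 amino acids plus one stop codon.
n_residues = 732 // 3 - 1
```

The 243 residues obtained this way agree with the query coordinates (1 to 243) reported in the alignments that cover the full length of the protein.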
BLAST of CSPI04G02030.1 vs. Swiss-Prot
Match: SNR25_BOVIN (U11/U12 small nuclear ribonucleoprotein 25 kDa protein OS=Bos taurus GN=SNRNP25 PE=2 SV=2)

HSP 1 Score: 69.7 bits (169), Expect = 5.0e-11
Identity = 33/87 (37.93%), Positives = 55/87 (63.22%), Query Frame = 1

Query: 60  VLKLDGSVFGIRVSRNSTVADLKRAIEKVF----DSPGGSEHYKITWSLIWGHFCLCYEG 119
           V K+DG V  + V +N+TV DLK+AI++      +  GG +H  I+WS +W  + L   G
Sbjct: 36  VCKMDGEVMPVVVVQNATVLDLKKAIQRYVQLRQEREGGIQH--ISWSYVWRTYHLTSAG 95

Query: 120 EKLIDDKTCIKGYGIKDGDQLQFIRHM 143
           EKL +D+  ++ YGI++ D++ FI+ +
Sbjct: 96  EKLTEDRKKLRDYGIRNRDEVSFIKKL 120
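
The percentages on each HSP statistics line are the matched positions divided by the alignment length. A small sketch that recomputes them from the raw counts is shown here; the function name hsp_percentages is hypothetical and the regular expression is only an assumption about how such a line is laid out.

```python
import re

def hsp_percentages(stats_line):
    """Recompute percentages from 'Label = matched/length' pairs.

    Fields without a matched/length fraction (such as 'Query Frame = 1')
    are ignored.
    """
    pairs = re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", stats_line)
    return {
        label: round(100 * int(matched) / int(length), 2)
        for label, matched, length in pairs
    }

line = "Identity = 33/87 (37.93%), Positives = 55/87 (63.22%), Query Frame = 1"
stats = hsp_percentages(line)
print(stats)
```

Identity counts exact residue matches, while the second figure also counts conservative substitutions (the "+" positions in the middle line of the alignment), which is why it is never smaller than the identity.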

BLAST of CSPI04G02030.1 vs. Swiss-Prot
Match: SNR25_MOUSE (U11/U12 small nuclear ribonucleoprotein 25 kDa protein OS=Mus musculus GN=Snrnp25 PE=1 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 5.0e-11
Identity = 33/87 (37.93%), Positives = 55/87 (63.22%), Query Frame = 1

Query: 60  VLKLDGSVFGIRVSRNSTVADLKRAIEKVF----DSPGGSEHYKITWSLIWGHFCLCYEG 119
           V K+DG V  + V +N+TV DLK+AI++      +  GG +H  I+WS +W  + L   G
Sbjct: 36  VCKMDGEVMPVVVVQNATVLDLKKAIQRYVQLKQEREGGVQH--ISWSYVWRTYHLTSAG 95

Query: 120 EKLIDDKTCIKGYGIKDGDQLQFIRHM 143
           EKL +D+  ++ YGI++ D++ FI+ +
Sbjct: 96  EKLTEDRKKLRDYGIRNRDEVSFIKKL 120

BLAST of CSPI04G02030.1 vs. Swiss-Prot
Match: U1125_ARATH (U11/U12 small nuclear ribonucleoprotein 25 kDa protein OS=Arabidopsis thaliana GN=SNRNP25 PE=2 SV=1)

HSP 1 Score: 68.2 bits (165), Expect = 1.5e-10
Identity = 38/112 (33.93%), Positives = 58/112 (51.79%), Query Frame = 1

Query: 58  LFVLKLDGSVFGIRVSRNSTVADLKRAIEKVFDS--PGGSEHYKITWSLIWGHFCLCYEG 117
           L V+KLDGS   + V  ++T+ DLK  I+K  +        H  I+W  +W +FCL    
Sbjct: 54  LSVVKLDGSSLDVAVMNSATLKDLKLLIKKKVNEMEQANMGHRHISWKHVWSNFCLSCNN 113

Query: 118 EKLIDDKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFFS 168
           EKL+DD   ++  GI++  Q+ F+ ++      MKK R   +   K  LF S
Sbjct: 114 EKLLDDNAVLQDVGIRNNSQVTFMPYV------MKKGRGRHSKRKKHRLFRS 159

BLAST of CSPI04G02030.1 vs. Swiss-Prot
Match: SNR25_HUMAN (U11/U12 small nuclear ribonucleoprotein 25 kDa protein OS=Homo sapiens GN=SNRNP25 PE=1 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 1.9e-10
Identity = 32/87 (36.78%), Positives = 55/87 (63.22%), Query Frame = 1

Query: 60  VLKLDGSVFGIRVSRNSTVADLKRAIEKVF----DSPGGSEHYKITWSLIWGHFCLCYEG 119
           V K+DG V  + V +++TV DLK+AI++      +  GG +H  I+WS +W  + L   G
Sbjct: 45  VCKMDGEVMPVVVVQSATVLDLKKAIQRYVQLKQEREGGIQH--ISWSYVWRTYHLTSAG 104

Query: 120 EKLIDDKTCIKGYGIKDGDQLQFIRHM 143
           EKL +D+  ++ YGI++ D++ FI+ +
Sbjct: 105 EKLTEDRKKLRDYGIRNRDEVSFIKKL 129

BLAST of CSPI04G02030.1 vs. TrEMBL
Match: A0A0A0KTJ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G006470 PE=4 SV=1)

HSP 1 Score: 501.5 bits (1290), Expect = 5.7e-139
Identity = 240/243 (98.77%), Positives = 241/243 (99.18%), Query Frame = 1

Query: 1   MPTTSPRSDRTKIANLWRFTRFIDYRCFAPPTLDFDPDSDALSRRPYRLLPRQDHIDLFV 60
           MPT+SPR DRTKIANLWRFTRFIDYRCFAPPTLDFDPDSDALSRRPYRLLPRQDHIDLFV
Sbjct: 1   MPTSSPRPDRTKIANLWRFTRFIDYRCFAPPTLDFDPDSDALSRRPYRLLPRQDHIDLFV 60

Query: 61  LKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLID 120
           LKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLID
Sbjct: 61  LKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLID 120

Query: 121 DKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFFSPESKAVEENQADG 180
           DKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLF SPESKAVEENQADG
Sbjct: 121 DKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFLSPESKAVEENQADG 180

Query: 181 QDDFKDYQVHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFCKSASEGRNRASSAF 240
           QDDFKDYQVHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFCKSASEGRNRASSAF
Sbjct: 181 QDDFKDYQVHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFCKSASEGRNRASSAF 240

Query: 241 RIA 244
           RIA
Sbjct: 241 RIA 243

BLAST of CSPI04G02030.1 vs. TrEMBL
Match: W9REE9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_025757 PE=4 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 2.5e-41
Identity = 99/223 (44.39%), Positives = 139/223 (62.33%), Query Frame = 1

Query: 23  IDYRCFAPPTLDFDPDSDALSRRPYRLLPRQDHIDLFVLKLDGSVFGIRVSRNSTVADLK 82
           I+ RC AP TL+     D +SR  Y+ LP+  H+ L VLKLDGSVF ++V +++TVA+LK
Sbjct: 22  IERRCLAPLTLN-----DDVSR--YQKLPQLQHLKLSVLKLDGSVFEVQVVKSATVAELK 81

Query: 83  RAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLIDDKTCIKGYGIKDGDQLQFIRHM 142
            AIE+ F S       KI+WSL+WGHFCLCYEG+KLI+DK+ I+ YGIK+GDQLQFIRHM
Sbjct: 82  SAIEEFFSSLPKEGQDKISWSLVWGHFCLCYEGQKLINDKSNIRNYGIKEGDQLQFIRHM 141

Query: 143 SINCLSMKKDRKNQTVPCKA-MLFFSPESKAVEENQADGQDDFKD-YQVHRDDSNREEVA 202
           SIN    KK  K +   CK  +L  S  S A EE++ +G +  K+  ++ +++ N     
Sbjct: 142 SINYSPFKKRPKKENTSCKKDLLMLSSGSNACEESEQNGMNVGKESKEIDKEEEN----- 201

Query: 203 SVARAGFQLANLFKGRVLYSRIWGFCKSASEGRNRASSAFRIA 244
               A F++A+  +G + YS++ GF +       R S   R A
Sbjct: 202 --GAAQFKMAHFLRGLLSYSKLRGFSRGGGSSDCRTSRPTRFA 230

BLAST of CSPI04G02030.1 vs. TrEMBL
Match: A0A061G7L5_THECC (U11/U12 small nuclear ribonucleoprotein 25 kDa protein, putative OS=Theobroma cacao GN=TCM_027215 PE=4 SV=1)

HSP 1 Score: 166.8 bits (421), Expect = 3.3e-38
Identity = 89/224 (39.73%), Positives = 133/224 (59.38%), Query Frame = 1

Query: 18  RFTRFIDYRCFAPPTLDFDP-------DSDALSRRP--YRLLPRQDHIDLFVLKLDGSVF 77
           R    +D R  AP  L+F+        D+D +  R   YR LP+Q +  L VLKLDGS+F
Sbjct: 17  RVFNLLDRRRLAP--LNFNSRGGGDGDDNDVIVARKLLYRKLPQQRNFKLSVLKLDGSLF 76

Query: 78  GIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLIDDKTCIKGY 137
            + V RN+TVA+LK AIE++F +  G  H  I+WS +WGHFCL YEG+KL+++K CI+ +
Sbjct: 77  DVNVGRNATVAELKVAIEELFATLPGDTHGSISWSHVWGHFCLSYEGQKLVNNKACIRNF 136

Query: 138 GIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFFSPESKAVEENQADGQDDFKDYQ 197
           GIKDGDQLQFIRHMS+N L +++  K+  VPCK +   S   +  + N  +  +  ++ +
Sbjct: 137 GIKDGDQLQFIRHMSVNQLPLRRRLKHHNVPCKWLSSGSSYHQEKQHNSVNFNNKDENQE 196

Query: 198 VHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFCKSASEG 233
                 + EE   +     +L +L +G +  +R+WG  +   EG
Sbjct: 197 DSSTSDHYEEEEEIPLPEVKLGHLLRGWLSCTRLWGASRKGPEG 238

BLAST of CSPI04G02030.1 vs. TrEMBL
Match: A0A0B0MCU7_GOSAR (U11/U12 small nuclear ribonucleoprotein 25 kDa OS=Gossypium arboreum GN=F383_18090 PE=4 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 3.7e-37
Identity = 94/235 (40.00%), Positives = 137/235 (58.30%), Query Frame = 1

Query: 5   SPRSDRTKIANLWRFTRFIDYRCFAPPTLDFDPDSDALSRRP--YRLLPRQDHIDLFVLK 64
           S RS   ++ NL    R    +     +   D + DAL  R   YR LP Q  ++L VLK
Sbjct: 10  STRSYNLRVLNLRERGRLCSLKFNRVRSGGGDDEDDALVERQLLYRKLPDQHLLNLSVLK 69

Query: 65  LDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLIDDK 124
           LDGS+F + + RN+TVA+LK AIE++F    G     I+W+ +WGHFCL YEG+KL+++K
Sbjct: 70  LDGSLFDVNIGRNATVAELKVAIEELFTEMAGETQGCISWAHVWGHFCLAYEGQKLVNNK 129

Query: 125 TCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFFSPESKAVEENQADGQD 184
            CIK +GIKDGDQL+FIRHMS+N   +K+  K+  VPCK    FSP S   +E Q +  +
Sbjct: 130 ACIKNFGIKDGDQLEFIRHMSMNQSPIKRRVKHHGVPCKC---FSPRSSHDQERQENPVN 189

Query: 185 DFKDYQVHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFCKSASEGRNRAS 238
             K+    +D  + EE A ++   F+ A+  +  + ++R     +   EGRN +S
Sbjct: 190 HHKEEDEDQDYYHDEEHA-MSLPEFKFAHFLRRWLSHTRSQSASRRRLEGRNHSS 240

BLAST of CSPI04G02030.1 vs. TrEMBL
Match: B9RV86_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0900820 PE=4 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 1.4e-36
Identity = 100/232 (43.10%), Positives = 143/232 (61.64%), Query Frame = 1

Query: 23  IDYRC-FAPPTLDFDPDSDALS--------RRPYRLLPRQDHIDLFVLKLDGSVFGIRVS 82
           I+ RC  +P +L  D D DA++        R  Y  LP+Q  + L VLKLDGS F + + 
Sbjct: 34  IERRCGLSPLSLKVDED-DAVNGNGGLVVRRYSYLKLPQQ-LLKLTVLKLDGSSFDVNIG 93

Query: 83  RNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLIDDKTCIKGYGIKDG 142
           RN+TVA+LK+A+E++F S     H KI+WSL+WGHFCL YE +KLI+DK CI+ +GIKDG
Sbjct: 94  RNATVAELKQAVEEIFSSSPEEGHDKISWSLVWGHFCLSYENQKLINDKVCIRNFGIKDG 153

Query: 143 DQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFFSPESK--AVEENQADGQD-----DFKD 202
           DQLQFIRHMSIN  S  +   +Q    K++    P SK   VE+ Q   +D     + +D
Sbjct: 154 DQLQFIRHMSIN-YSNSRQSTSQNAVRKSL---GPPSKDANVEKEQKTTEDHTSMNENQD 213

Query: 203 YQVHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFC-KSASEGRNRAS 238
           Y ++    ++EE+     + F+LA+  +G + YSR+WG   +  S+G+NR S
Sbjct: 214 YNLYDYCEDQEEIPI---SEFKLAHFLRGWLSYSRLWGTASRKGSQGQNRPS 256

BLAST of CSPI04G02030.1 vs. TAIR10
Match: AT1G80060.1 (AT1G80060.1 Ubiquitin-like superfamily protein)

HSP 1 Score: 130.2 bits (326), Expect = 1.8e-30
Identity = 76/200 (38.00%), Positives = 111/200 (55.50%), Query Frame = 1

Query: 44  RRPYRLLPRQDHIDLFVLKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWS 103
           R  Y  LP Q  I L V+KL+GS+F + V+++ +VA+LKRA+E+VF       H  I+WS
Sbjct: 41  RSSYLKLPPQGRIKLSVVKLNGSLFDVEVAKDCSVAELKRAVEQVFTISPLEGHGMISWS 100

Query: 104 LIWGHFCLCYEGEKLIDDKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAM 163
            +WGHFCLCY  ++L++DKT I+  G+ DGDQL F+RH+SI+   M K  K+ +  CK  
Sbjct: 101 HVWGHFCLCYRDQRLVNDKTSIRYLGLNDGDQLHFVRHLSIDHSPMNKRSKSPS--CKRY 160

Query: 164 LFFSPESKA----VEENQADGQDDF--KDYQVHRDDSNREEVASVARAGFQLANLFKGRV 223
           L     S      ++    +G DD   K Y   +D+        +  A  +L NL KG +
Sbjct: 161 LELDVSSIVNEIQIQNQNQNGVDDVAEKCYPGAQDE--------LPAAESRLVNLIKGWL 220

Query: 224 LYSRIWGFCKSASEGRNRAS 238
            Y+  WG  +   E R+  S
Sbjct: 221 PYAGRWGVSRKGPECRSGPS 230

BLAST of CSPI04G02030.1 vs. TAIR10
Match: AT4G32270.1 (AT4G32270.1 Ubiquitin-like superfamily protein)

HSP 1 Score: 106.3 bits (264), Expect = 2.7e-23
Identity = 64/154 (41.56%), Positives = 90/154 (58.44%), Query Frame = 1

Query: 40  DALSRRP---YRLLPRQDHIDLFVLKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSE 99
           D L RR    Y  +P ++ I L VLKLDGS FGI+V + +TV +LK A+E  F     S 
Sbjct: 20  DGLPRRRSFNYNQMP-EEPIKLTVLKLDGSSFGIQVLKTATVGELKMAVEAAFSHLPISG 79

Query: 100 HYKITWSLIWGHFCLCYEGEKLIDDKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKN- 159
             KI+W  +WG FCL YE ++LI++   +  +GIKDGDQL+FIRH+S  C+ M K +   
Sbjct: 80  PGKISWPHVWGQFCLSYEDKRLINESEYLTEFGIKDGDQLRFIRHISNYCMLMVKHKSKT 139

Query: 160 -QTVPCKAMLFFS--PESK---AVEENQADGQDD 184
            +    K +  FS  PE++    + E + DG  D
Sbjct: 140 PRVSSFKQLKLFSTTPETRKKNVIREVEEDGVVD 172

BLAST of CSPI04G02030.1 vs. TAIR10
Match: AT5G25340.1 (AT5G25340.1 Ubiquitin-like superfamily protein)

HSP 1 Score: 105.5 bits (262), Expect = 4.6e-23
Identity = 55/114 (48.25%), Positives = 70/114 (61.40%), Query Frame = 1

Query: 47  YRLLPRQDHIDLFVLKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIW 106
           Y  LP +  I L VLKLDGS F + V  ++TV DLK AIE  F         KI+WS +W
Sbjct: 11  YDKLPNEP-IRLSVLKLDGSSFDVYVLTSATVGDLKVAIETAFSHVPKKGPSKISWSHVW 70

Query: 107 GHFCLCYEGEKLIDDKTCIKGYGIKDGDQLQFIRHMSINCL-----SMKKDRKN 156
           GHFCLC+ G+KLI D  CI  YG+KDGD+++F  H+S N +     S K  +KN
Sbjct: 71  GHFCLCFGGQKLITDTDCIGNYGMKDGDEVRFKNHVSGNAVLSKGYSRKSKQKN 123

BLAST of CSPI04G02030.1 vs. TAIR10
Match: AT3G07860.1 (AT3G07860.1 Ubiquitin-like superfamily protein)

HSP 1 Score: 68.2 bits (165), Expect = 8.2e-12
Identity = 38/112 (33.93%), Positives = 58/112 (51.79%), Query Frame = 1

Query: 58  LFVLKLDGSVFGIRVSRNSTVADLKRAIEKVFDS--PGGSEHYKITWSLIWGHFCLCYEG 117
           L V+KLDGS   + V  ++T+ DLK  I+K  +        H  I+W  +W +FCL    
Sbjct: 54  LSVVKLDGSSLDVAVMNSATLKDLKLLIKKKVNEMEQANMGHRHISWKHVWSNFCLSCNN 113

Query: 118 EKLIDDKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFFS 168
           EKL+DD   ++  GI++  Q+ F+ ++      MKK R   +   K  LF S
Sbjct: 114 EKLLDDNAVLQDVGIRNNSQVTFMPYV------MKKGRGRHSKRKKHRLFRS 159

BLAST of CSPI04G02030.1 vs. NCBI nr
Match: gi|449469242|ref|XP_004152330.1| (PREDICTED: uncharacterized protein LOC101206096 [Cucumis sativus])

HSP 1 Score: 501.5 bits (1290), Expect = 8.2e-139
Identity = 240/243 (98.77%), Positives = 241/243 (99.18%), Query Frame = 1

Query: 1   MPTTSPRSDRTKIANLWRFTRFIDYRCFAPPTLDFDPDSDALSRRPYRLLPRQDHIDLFV 60
           MPT+SPR DRTKIANLWRFTRFIDYRCFAPPTLDFDPDSDALSRRPYRLLPRQDHIDLFV
Sbjct: 1   MPTSSPRPDRTKIANLWRFTRFIDYRCFAPPTLDFDPDSDALSRRPYRLLPRQDHIDLFV 60

Query: 61  LKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLID 120
           LKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLID
Sbjct: 61  LKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLID 120

Query: 121 DKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFFSPESKAVEENQADG 180
           DKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLF SPESKAVEENQADG
Sbjct: 121 DKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFLSPESKAVEENQADG 180

Query: 181 QDDFKDYQVHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFCKSASEGRNRASSAF 240
           QDDFKDYQVHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFCKSASEGRNRASSAF
Sbjct: 181 QDDFKDYQVHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFCKSASEGRNRASSAF 240

Query: 241 RIA 244
           RIA
Sbjct: 241 RIA 243

BLAST of CSPI04G02030.1 vs. NCBI nr
Match: gi|659108389|ref|XP_008454172.1| (PREDICTED: uncharacterized protein LOC103494657 isoform X1 [Cucumis melo])

HSP 1 Score: 471.5 bits (1212), Expect = 9.1e-130
Identity = 222/242 (91.74%), Positives = 233/242 (96.28%), Query Frame = 1

Query: 1   MPTTSPRSDRTKIANLWRFTRFIDYRCFAPPTLDFDPDSDALSRRPYRLLPRQDHIDLFV 60
           MPT+SP SDRTKIANLWRFTRFIDYRCFAPPTLDF+PDSDALSRRPYRLLPRQD IDLFV
Sbjct: 1   MPTSSPHSDRTKIANLWRFTRFIDYRCFAPPTLDFNPDSDALSRRPYRLLPRQDLIDLFV 60

Query: 61  LKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLID 120
           LKLDGSVFG+RVSRNSTVADLKRAIE+VFDSPGGSEHYKITWSLIWGHFCLCYEGEKL D
Sbjct: 61  LKLDGSVFGVRVSRNSTVADLKRAIEEVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLTD 120

Query: 121 DKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFFSPESKAVEENQADG 180
           DK CI+GY IKDGDQLQFIRHMSINCLSMKKDRKNQTVPCK +LFFSPESKAVEENQADG
Sbjct: 121 DKACIRGYSIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKTILFFSPESKAVEENQADG 180

Query: 181 QDDFKDYQVHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFCKSASEGRNRASSAF 240
           +DD KDYQVHRDDSN+EEVAS+ARA FQL+N FKGRVLYSR+WGFCKSASEGRNRASSAF
Sbjct: 181 RDDIKDYQVHRDDSNQEEVASMARARFQLSNFFKGRVLYSRVWGFCKSASEGRNRASSAF 240

Query: 241 RI 243
           RI
Sbjct: 241 RI 242

BLAST of CSPI04G02030.1 vs. NCBI nr
Match: gi|659108391|ref|XP_008454174.1| (PREDICTED: uncharacterized protein LOC103494657 isoform X2 [Cucumis melo])

HSP 1 Score: 331.6 bits (849), Expect = 1.1e-87
Identity = 152/164 (92.68%), Positives = 158/164 (96.34%), Query Frame = 1

Query: 1   MPTTSPRSDRTKIANLWRFTRFIDYRCFAPPTLDFDPDSDALSRRPYRLLPRQDHIDLFV 60
           MPT+SP SDRTKIANLWRFTRFIDYRCFAPPTLDF+PDSDALSRRPYRLLPRQD IDLFV
Sbjct: 1   MPTSSPHSDRTKIANLWRFTRFIDYRCFAPPTLDFNPDSDALSRRPYRLLPRQDLIDLFV 60

Query: 61  LKLDGSVFGIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLID 120
           LKLDGSVFG+RVSRNSTVADLKRAIE+VFDSPGGSEHYKITWSLIWGHFCLCYEGEKL D
Sbjct: 61  LKLDGSVFGVRVSRNSTVADLKRAIEEVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLTD 120

Query: 121 DKTCIKGYGIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAML 165
           DK CI+GY IKDGDQLQFIRHMSINCLSMKKDRKNQTVPCK +L
Sbjct: 121 DKACIRGYSIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKTIL 164

BLAST of CSPI04G02030.1 vs. NCBI nr
Match: gi|703101520|ref|XP_010097209.1| (hypothetical protein L484_025757 [Morus notabilis])

HSP 1 Score: 177.2 bits (448), Expect = 3.5e-41
Identity = 99/223 (44.39%), Positives = 139/223 (62.33%), Query Frame = 1

Query: 23  IDYRCFAPPTLDFDPDSDALSRRPYRLLPRQDHIDLFVLKLDGSVFGIRVSRNSTVADLK 82
           I+ RC AP TL+     D +SR  Y+ LP+  H+ L VLKLDGSVF ++V +++TVA+LK
Sbjct: 22  IERRCLAPLTLN-----DDVSR--YQKLPQLQHLKLSVLKLDGSVFEVQVVKSATVAELK 81

Query: 83  RAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLIDDKTCIKGYGIKDGDQLQFIRHM 142
            AIE+ F S       KI+WSL+WGHFCLCYEG+KLI+DK+ I+ YGIK+GDQLQFIRHM
Sbjct: 82  SAIEEFFSSLPKEGQDKISWSLVWGHFCLCYEGQKLINDKSNIRNYGIKEGDQLQFIRHM 141

Query: 143 SINCLSMKKDRKNQTVPCKA-MLFFSPESKAVEENQADGQDDFKD-YQVHRDDSNREEVA 202
           SIN    KK  K +   CK  +L  S  S A EE++ +G +  K+  ++ +++ N     
Sbjct: 142 SINYSPFKKRPKKENTSCKKDLLMLSSGSNACEESEQNGMNVGKESKEIDKEEEN----- 201

Query: 203 SVARAGFQLANLFKGRVLYSRIWGFCKSASEGRNRASSAFRIA 244
               A F++A+  +G + YS++ GF +       R S   R A
Sbjct: 202 --GAAQFKMAHFLRGLLSYSKLRGFSRGGGSSDCRTSRPTRFA 230

BLAST of CSPI04G02030.1 vs. NCBI nr
Match: gi|590615459|ref|XP_007023228.1| (U11/U12 small nuclear ribonucleoprotein 25 kDa protein, putative [Theobroma cacao])

HSP 1 Score: 166.8 bits (421), Expect = 4.8e-38
Identity = 89/224 (39.73%), Positives = 133/224 (59.38%), Query Frame = 1

Query: 18  RFTRFIDYRCFAPPTLDFDP-------DSDALSRRP--YRLLPRQDHIDLFVLKLDGSVF 77
           R    +D R  AP  L+F+        D+D +  R   YR LP+Q +  L VLKLDGS+F
Sbjct: 17  RVFNLLDRRRLAP--LNFNSRGGGDGDDNDVIVARKLLYRKLPQQRNFKLSVLKLDGSLF 76

Query: 78  GIRVSRNSTVADLKRAIEKVFDSPGGSEHYKITWSLIWGHFCLCYEGEKLIDDKTCIKGY 137
            + V RN+TVA+LK AIE++F +  G  H  I+WS +WGHFCL YEG+KL+++K CI+ +
Sbjct: 77  DVNVGRNATVAELKVAIEELFATLPGDTHGSISWSHVWGHFCLSYEGQKLVNNKACIRNF 136

Query: 138 GIKDGDQLQFIRHMSINCLSMKKDRKNQTVPCKAMLFFSPESKAVEENQADGQDDFKDYQ 197
           GIKDGDQLQFIRHMS+N L +++  K+  VPCK +   S   +  + N  +  +  ++ +
Sbjct: 137 GIKDGDQLQFIRHMSVNQLPLRRRLKHHNVPCKWLSSGSSYHQEKQHNSVNFNNKDENQE 196

Query: 198 VHRDDSNREEVASVARAGFQLANLFKGRVLYSRIWGFCKSASEG 233
                 + EE   +     +L +L +G +  +R+WG  +   EG
Sbjct: 197 DSSTSDHYEEEEEIPLPEVKLGHLLRGWLSCTRLWGASRKGPEG 238

The following BLAST results are available for this feature:

Match Name	E-value	Identity	Description
SNR25_BOVIN	5.0e-11	37.93	U11/U12 small nuclear ribonucleoprotein 25 kDa protein OS=Bos taurus GN=SNRNP25 ...
SNR25_MOUSE	5.0e-11	37.93	U11/U12 small nuclear ribonucleoprotein 25 kDa protein OS=Mus musculus GN=Snrnp2...
U1125_ARATH	1.5e-10	33.93	U11/U12 small nuclear ribonucleoprotein 25 kDa protein OS=Arabidopsis thaliana G...
SNR25_HUMAN	1.9e-10	36.78	U11/U12 small nuclear ribonucleoprotein 25 kDa protein OS=Homo sapiens GN=SNRNP2...

Match Name	E-value	Identity	Description
A0A0A0KTJ1_CUCSA	5.7e-139	98.77	Uncharacterized protein OS=Cucumis sativus GN=Csa_4G006470 PE=4 SV=1
W9REE9_9ROSA	2.5e-41	44.39	Uncharacterized protein OS=Morus notabilis GN=L484_025757 PE=4 SV=1
A0A061G7L5_THECC	3.3e-38	39.73	U11/U12 small nuclear ribonucleoprotein 25 kDa protein, putative OS=Theobroma ca...
A0A0B0MCU7_GOSAR	3.7e-37	40.00	U11/U12 small nuclear ribonucleoprotein 25 kDa OS=Gossypium arboreum GN=F383_180...
B9RV86_RICCO	1.4e-36	43.10	Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0900820 PE=4 SV=1

Match Name	E-value	Identity	Description
AT1G80060.1	1.8e-30	38.00	Ubiquitin-like superfamily protein
AT4G32270.1	2.7e-23	41.56	Ubiquitin-like superfamily protein
AT5G25340.1	4.6e-23	48.25	Ubiquitin-like superfamily protein
AT3G07860.1	8.2e-12	33.93	Ubiquitin-like superfamily protein

Match Name	E-value	Identity	Description
gi|449469242|ref|XP_004152330.1|	8.2e-139	98.77	PREDICTED: uncharacterized protein LOC101206096 [Cucumis sativus]
gi|659108389|ref|XP_008454172.1|	9.1e-130	91.74	PREDICTED: uncharacterized protein LOC103494657 isoform X1 [Cucumis melo]
gi|659108391|ref|XP_008454174.1|	1.1e-87	92.68	PREDICTED: uncharacterized protein LOC103494657 isoform X2 [Cucumis melo]
gi|703101520|ref|XP_010097209.1|	3.5e-41	44.39	hypothetical protein L484_025757 [Morus notabilis]
gi|590615459|ref|XP_007023228.1|	4.8e-38	39.73	U11/U12 small nuclear ribonucleoprotein 25 kDa protein, putative [Theobroma caca...

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term	Definition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
CSPI04G02030	CSPI04G02030	gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name	Unique Name	Type
CSPI04G02030.1	CSPI04G02030.1-protein	polypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CSPI04G02030.1.utr3p1	CSPI04G02030.1.utr3p1	three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CSPI04G02030.1.cds5	CSPI04G02030.1.cds5	CDS
CSPI04G02030.1.cds4	CSPI04G02030.1.cds4	CDS
CSPI04G02030.1.cds3	CSPI04G02030.1.cds3	CDS
CSPI04G02030.1.cds2	CSPI04G02030.1.cds2	CDS
CSPI04G02030.1.cds1	CSPI04G02030.1.cds1	CDS


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	GENE3D	G3DSA:3.10.20.90		coord: 53..145; score: 4.1
None	No IPR available	PANTHER	PTHR14942	FAMILY NOT NAMED	coord: 47..241; score: 8.9
None	No IPR available	PANTHER	PTHR14942:SF2	SUBFAMILY NOT NAMED	coord: 47..241; score: 8.9