Cp4.1LG04g02020 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g02020
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Exosome complex component CSL4
Location: Cp4.1LG04 : 43986 .. 46960 (-)
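
The location field gives the linkage group, the start and end coordinates, and the strand. The following is a minimal sketch of reading that field, assuming the `SEQID : START .. END (STRAND)` layout shown above and assuming the coordinates are 1-based and inclusive (the page does not state this). The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its parts (hypothetical helper)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG04 : 43986 .. 46960 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 2975 bases on the minus strand of Cp4.1LG04.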
The following sequences are available for this feature:

Gene sequence (with intron)

TCGCACGACAATTTTCTGTATAGTTTGAATTGAAACTTATACTACGATAGGGCTTGGTGACGGCGGCAGTACGAGTTGACACAGAGTTTACGAGAATGACAATGAAAGAAGGCGAGGAAGAGACTGAATTCGTGACTCCAGGTGAAGTTCTCGGAAAATTTTCGGACATGAAACCTGGGAAAGGCGCTTATGTAGCTAACAACACTGTTTATGCTTCTCTTTCTGGTTTCCGCCGCATTATTTCCCCTCCAACTGATTCTTCGGACTCGGTACACTCTCTTTCACTTCTCTTTACTGTTGCCTTGATCTTTTGAGATTTGGGATTTTTTGACATGCCTTGTTTGTTCTTGAGATCATAGGAATTGCTTTCTTTGAATCCCATATGAGTTTTGTTCTGGAATTTAAGTGAGCCGATTCAACATCTAATGTGCTTTCCTTCAAACCCTATTGTAGTTTCTTGGTTAGGTGTTAGTTCTGGGTATAATTTTGATGCCTGTTAGAGCTACGTTTAGGTTGATGATTGTGGTTCTTTTTGTGTTGGTTAAGGGAATGCTTGGGATTCTTTCCTTGTCAATGGTTAAAGTGATAGTGTTGGATGAAACTATACCATAAATGTTATGAGATTGTGTGTCTAAGGCTAATGGTGCTAAGTATGTAATGTTGAGTATTGGATCTAGTGGAGATACTCATAAATAGTTTTATCCTTGATTATAAGAGATTATAGTGGAGAGGTGGACAATTAATCCTAAAAGAATAATTGAAAAACTATATCTCTTGGGTGGTTTTGGTGTTGGCTAAGAAACTATTACCTTGCTGTATTGGTGTTTTTTATTGTCATGTTTTATCGTTAACTCAAAGTGAATATGTGAGATCCCACATCGGTTGGGGAGGAGAACGAAACATTCTTATTAGGGTGTGGAAGTCTCTCCCTAGTAGACGCGTTTTAAAAACCTTGAGGGGAAGTTCAAAGAGGACAATATCTGCTAGCGGTGAGCTTGGGCTGTTACAAATGGTATTAGAGACAGACACTGGGCGATTTGTCAGCGAGGAGGTTGAGCCCTAAAGGGGGTGGACATGAGGCGGTGTGCCAACGAAGACATTGAGCTCTAGGTGGATTGGGGTGGGGGGTCCCACATCGATTAGAGAAGGGAACGAGTGTCAGCGAAGACGTTGGGCCCTGAAGGGGGGTGGATGTGATATCCCACATTGGTTGGGGAGGGGAATGAAGCATTCTTTATAAGGGTGTGGAAACATCTCCCTAGCATACGCGCTTTCTAAACCTTGAGGGGAAGCCCGAAAGGGATAGCTCAAAAAGAATATTTGCTAGCGGTTGGTTTGGACTGTTACATAACAATAAAACAGGGGAGAGTTTTAAACTCATTTATTTGGTTTTCATACCCTTGCTTTTAAGCATGTATGAGATTTTAACCTCTACTTAATACGAGTCATTCGAGTCATAGAAGCACGTGGTAATGGTTATTTACAATGTTTATGTCTTAACTGGCCTCGGACGTTGCTTATGTCATTGCTGATTTGCGTTATTAGAGACCAACTGTGGAAGTGACTGGTCATAAGGCCCATGGTGCCGTTCCAGCTCCTGGATCAATCGTCATTGCTCGAGTAAGTTGCTTACTATGTCGTTTTTATGTGCATGAGATATAAAATGCCGTCGTTCTTCAAGTCGCACCTGCTGATTGTAGGTTACAAAAGTAATGCCTAGAATGGCATCAGCTGATATCATGTGTGTTGGTCCAAAGTCTGTGAAGGAGAAGTTTAGTGGAGTTATAAGGTTTGAAACATTCTTCTGCGAAGCTCTTATTTATTCAGTTCATTTTATTGTATAATCCATCATTTAAATATGTTACTTTCTCTGGTATAGAAAGTTCGTCTACCGTATTCATTTTTCGTTTAATATATGCACCCATTTCGTGCCTTGGCTATGTAAAGCACCTCACATTGTTGACCATGTGCTACTGTCTGTTACTCTGTTCGGTTTTTTT
CCATCTTAAATCGCTTCTGTAGGCAACAAGATGTCCGAGCAACGGAGATTGATAAAGTAGACATGCATTTGTCCTTTCGACCCGGTGACATCGTCAAAGCTCTTGTTGTATCCTTTATGCTTGAATTTCGTCTTGTTTGATGAACTTTGTTTTGTGGATGAGAGAAAGAAGTCTAGTTGGAGGTTTTGATTTGAGCGTATTTCTTACCTCCATTGTGAGATCCCACATCGATCAGTGAGGACGTTGGATCCTGAAGGAGGGTGGATTGTGAGATCTCATCTCGGTTGGAGAGAGGAACGAAACACCCTTTATAAGGGCGTGAAAACCTCTCCCTTCCCCTCAAGGGAAAGTCTAAAGAGGACAATATCTACTAGCGGTAGGCTTGGACCGTTGCAACCATTTTCCTTAACCAAAGTTCTCCTCGAAAGCTTTCTCTAGGAGATGCAAGGGCCTATCATCTGTCAACTGCAAAAAACGAACTCGGGGTCGTCTCTGCAGAAAGTACAGCAGGTAAAGAGTATGGTACTATTTGTGCTCTATAATGCTCTATAAATCATGCCTCTTAGTTGCTGGTTGTAAAGTGTTGAATGTCTTTATAGGTGCAGTAATGGTTCCCATAAGTTGGACGGAAATGCAGTGTCCATTAACAGGCCAAATTCAGCAAAGAAAAGTTGCCAAAGTTGGAGGATCATGACTTCCTTTTTTACTTTCCTTTCTTTTTCTCCTTATCATCTTATTTTTTGTTCCGTTTTCGATAATAGTTCGTTATTTAGATAATAGTTCGTCATTTAGACGATTGTTATTGTAACGTTTATGTTGTATGAATGAAATTATATGAGTAGCTTAGCTATCCGTGTTCGATGGAGAAGAAGATTTTGCACCCTTGTATTGTTTTATGTTCTTCATTAGTAATCCATCGCTACTGGATTTACCTGAAATTTGATGAGTTCGTGTTAGTTTATGAGTTTAAATAAG

mRNA sequence

TCGCACGACAATTTTCTGTATAGTTTGAATTGAAACTTATACTACGATAGGGCTTGGTGACGGCGGCAGTACGAGTTGACACAGAGTTTACGAGAATGACAATGAAAGAAGGCGAGGAAGAGACTGAATTCGTGACTCCAGGTGAAGTTCTCGGAAAATTTTCGGACATGAAACCTGGGAAAGGCGCTTATGTAGCTAACAACACTGTTTATGCTTCTCTTTCTGGTTTCCGCCGCATTATTTCCCCTCCAACTGATTCTTCGGACTCGAGACCAACTGTGGAAGTGACTGGTCATAAGGCCCATGGTGCCGTTCCAGCTCCTGGATCAATCGTCATTGCTCGAGTTACAAAAGTAATGCCTAGAATGGCATCAGCTGATATCATGTGTGTTGGTCCAAAGTCTGTGAAGGAGAAGTTTAGTGGAGTTATAAGGCAACAAGATGTCCGAGCAACGGAGATTGATAAAGTAGACATGCATTTGTCCTTTCGACCCGGTGACATCGTCAAAGCTCTTGTTGTATCCTTTATGCTTGAATTTCGTGCAGTAATGGTTCCCATAAGTTGGACGGAAATGCAGTGTCCATTAACAGGCCAAATTCAGCAAAGAAAAGTTGCCAAAGTTGGAGGATCATGACTTCCTTTTTTACTTTCCTTTCTTTTTCTCCTTATCATCTTATTTTTTGTTCCGTTTTCGATAATAGTTCGTTATTTAGATAATAGTTCGTCATTTAGACGATTGTTATTGTAACGTTTATGTTGTATGAATGAAATTATATGAGTAGCTTAGCTATCCGTGTTCGATGGAGAAGAAGATTTTGCACCCTTGTATTGTTTTATGTTCTTCATTAGTAATCCATCGCTACTGGATTTACCTGAAATTTGATGAGTTCGTGTTAGTTTATGAGTTTAAATAAG

Coding sequence (CDS)

ATGACAATGAAAGAAGGCGAGGAAGAGACTGAATTCGTGACTCCAGGTGAAGTTCTCGGAAAATTTTCGGACATGAAACCTGGGAAAGGCGCTTATGTAGCTAACAACACTGTTTATGCTTCTCTTTCTGGTTTCCGCCGCATTATTTCCCCTCCAACTGATTCTTCGGACTCGAGACCAACTGTGGAAGTGACTGGTCATAAGGCCCATGGTGCCGTTCCAGCTCCTGGATCAATCGTCATTGCTCGAGTTACAAAAGTAATGCCTAGAATGGCATCAGCTGATATCATGTGTGTTGGTCCAAAGTCTGTGAAGGAGAAGTTTAGTGGAGTTATAAGGCAACAAGATGTCCGAGCAACGGAGATTGATAAAGTAGACATGCATTTGTCCTTTCGACCCGGTGACATCGTCAAAGCTCTTGTTGTATCCTTTATGCTTGAATTTCGTGCAGTAATGGTTCCCATAAGTTGGACGGAAATGCAGTGTCCATTAACAGGCCAAATTCAGCAAAGAAAAGTTGCCAAAGTTGGAGGATCATGA

Protein sequence

MTMKEGEEETEFVTPGEVLGKFSDMKPGKGAYVANNTVYASLSGFRRIISPPTDSSDSRPTVEVTGHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDKVDMHLSFRPGDIVKALVVSFMLEFRAVMVPISWTEMQCPLTGQIQQRKVAKVGGS
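
The protein sequence corresponds to reading the CDS three bases at a time. The sketch below illustrates this with the standard genetic code (NCBI translation table 1), which is an assumption since the page does not name the table used. `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGACAATGAAAGAAGGC"))  # first six codons of the CDS
print(translate("GTTGGAGGATCATGA"))     # last five codons, including the stop
```

The two calls give `MTMKEG` and `VGGS`, matching the start and end of the protein sequence listed above.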
BLAST of Cp4.1LG04g02020 vs. Swiss-Prot
Match: EXOS1_MOUSE (Exosome complex component CSL4 OS=Mus musculus GN=Exosc1 PE=1 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 6.0e-22
Identity = 67/186 (36.02%), Positives = 95/186 (51.08%), Query Frame = 1

Query: 12  FVTPGEVLGKFSDMKPGKGAYVANNTVYASLSGFRRIISPPTDSSDSRPTVEVTGHKAHG 71
           +  PGE L    +  PG G Y  +  +++SL+G        T  + + P V V       
Sbjct: 7   YCIPGERLCNLEEGSPGSGTYTRHGYIFSSLAGCLM----KTSENGAVPVVSVMRETESQ 66

Query: 72  AVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDKVDMHLSF 131
            +P  G++V  +V+ +  R A   I+ VG   +K  F G IR++D+RATE DKV+++ SF
Sbjct: 67  LLPDVGAVVTCKVSSINSRFAKVHILYVGSTPLKNAFRGTIRKEDIRATEKDKVEIYKSF 126

Query: 132 RPGDIVKALVVS-------FML--------------EFRAVMVPISWTEMQCPLTGQIQQ 177
           RPGDIV A V+S       ++L              E    MVPISW EMQCP T   + 
Sbjct: 127 RPGDIVLAKVISLGDAQSNYLLTTAENELGVVVAHSESGVQMVPISWCEMQCPKTHTKEF 186
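
The percentages in each HSP header are the number of matching alignment columns divided by the alignment length. The sketch below re-derives them from a header line. It assumes the `Label = n/m (p%)` layout used on this page, and `hsp_fractions` is a hypothetical name.

```python
import re

def hsp_fractions(header):
    """Return {label: (count, length, percent)} for each 'Label = n/m' field."""
    out = {}
    for label, n, m in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        count, length = int(n), int(m)
        out[label] = (count, length, round(100 * count / length, 2))
    return out

line = "Identity = 67/186 (36.02%), Positives = 95/186 (51.08%), Query Frame = 1"
for label, (count, length, pct) in hsp_fractions(line).items():
    print(f"{label}: {count}/{length} = {pct}%")
```

For the mouse EXOS1 hit this reproduces 36.02% identity and 51.08% positives over 186 aligned columns. The `Query Frame` field has no `n/m` part and is skipped.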

BLAST of Cp4.1LG04g02020 vs. Swiss-Prot
Match: EXOS1_HUMAN (Exosome complex component CSL4 OS=Homo sapiens GN=EXOSC1 PE=1 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 6.0e-22
Identity = 68/186 (36.56%), Positives = 95/186 (51.08%), Query Frame = 1

Query: 12  FVTPGEVLGKFSDMKPGKGAYVANNTVYASLSGFRRIISPPTDSSDSRPTVEVTGHKAHG 71
           +  PGE L    +  PG G Y  +  +++SL+G        +  + + P V V       
Sbjct: 7   YCIPGERLCNLEEGSPGSGTYTRHGYIFSSLAGCLM----KSSENGALPVVSVVRETESQ 66

Query: 72  AVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDKVDMHLSF 131
            +P  G+IV  +V+ +  R A   I+ VG   +K  F G IR++DVRATE DKV+++ SF
Sbjct: 67  LLPDVGAIVTCKVSSINSRFAKVHILYVGSMPLKNSFRGTIRKEDVRATEKDKVEIYKSF 126

Query: 132 RPGDIVKALVVS-------FML--------------EFRAVMVPISWTEMQCPLTGQIQQ 177
           RPGDIV A V+S       ++L              E    MVPISW EMQCP T   + 
Sbjct: 127 RPGDIVLAKVISLGDAQSNYLLTTAENELGVVVAHSESGIQMVPISWCEMQCPKTHTKEF 186

BLAST of Cp4.1LG04g02020 vs. Swiss-Prot
Match: CSL4_SCHPO (Exosome complex component csl4 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=csl4 PE=3 SV=2)

HSP 1 Score: 74.3 bits (181), Expect = 1.5e-12
Identity = 50/131 (38.17%), Positives = 75/131 (57.25%), Query Frame = 1

Query: 13  VTPGEVLGKFSDMKPGKGAYVANNTVYASLSGFRRIISPPTDSSDSRPTVEVTGHKAHGA 72
           V PG+V+ + +    G+G     + + ++ +G   I  P  +S      VE T       
Sbjct: 4   VLPGQVVARGAPN--GEGTVKRGDYIISTRTG---IFDPEKNSVTYPRKVEETA-----V 63

Query: 73  VPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDKVDMHLSFR 132
           +P  GSIV+ARV+++  R A+ +I  V     K++F GVI  QD+RATE +KV +  SFR
Sbjct: 64  LPNVGSIVLARVSRINARQATVNISVVDDVCTKDEFQGVIHVQDIRATEKNKVKVQNSFR 123

Query: 133 PGDIVKALVVS 144
           PGDIV+ALV+S
Sbjct: 124 PGDIVRALVIS 124

BLAST of Cp4.1LG04g02020 vs. TrEMBL
Match: A0A0A0L935_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G627150 PE=4 SV=1)

HSP 1 Score: 288.5 bits (737), Expect = 5.6e-75
Identity = 152/196 (77.55%), Positives = 161/196 (82.14%), Query Frame = 1

Query: 3   MKEGEEETEFVTPGEVLGKFSDMKPGKGAYVANNTVYASLSGFRRIISPPTDSSDSRPTV 62
           MKEGE++TEFVTPGEVLG FSD KPG+GAYV +NTVYASLSGFRRII PP+DSSD R TV
Sbjct: 1   MKEGEKDTEFVTPGEVLGNFSDFKPGRGAYVTDNTVYASLSGFRRIIHPPSDSSDLRSTV 60

Query: 63  EVTGHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEI 122
           EVTGHKAHGAVPAPGSIVI RVTKVM +MASADIMCVGPKSVKEKF+G+IRQQDVRATEI
Sbjct: 61  EVTGHKAHGAVPAPGSIVIVRVTKVMTKMASADIMCVGPKSVKEKFTGIIRQQDVRATEI 120

Query: 123 DKVDMHLSFRPGDIVKAL--------------------VVSFMLEFRAVMVPISWTEMQC 179
           DKVDMHLSFRPGDIVKAL                    VVS      A MVPISWTEMQC
Sbjct: 121 DKVDMHLSFRPGDIVKALVLSLGDARAYHLSTAKNELGVVSAESTAGAEMVPISWTEMQC 180

BLAST of Cp4.1LG04g02020 vs. TrEMBL
Match: A0A061F1N6_THECC (Nucleic acid-binding, OB-fold-like protein isoform 1 OS=Theobroma cacao GN=TCM_026126 PE=4 SV=1)

HSP 1 Score: 252.7 bits (644), Expect = 3.4e-64
Identity = 134/191 (70.16%), Positives = 148/191 (77.49%), Query Frame = 1

Query: 8   EETEFVTPGEVLGKFSDMKPGKGAYVA--NNTVYASLSGFRRIISPPTDSSDSRPTVEVT 67
           EE E VTPGE+LG+ +++K GKGAYV   N  +YASL+GFRRI SPP DS D RPTVEVT
Sbjct: 2   EEAEMVTPGEMLGRATELKAGKGAYVVQHNKNIYASLTGFRRIQSPPPDSPDQRPTVEVT 61

Query: 68  GHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDKV 127
           GHKAHG VP PGS+VIARVTKVM R+ASADIMCVGPKSV+EKFSG+IRQQDVRATEIDKV
Sbjct: 62  GHKAHGPVPEPGSVVIARVTKVMARIASADIMCVGPKSVREKFSGIIRQQDVRATEIDKV 121

Query: 128 DMHLSFRPGDIVKAL--------------------VVSFMLEFRAVMVPISWTEMQCPLT 177
           DMHLSFRPGDIV+A+                    VVS      A MVPISWTEMQCPLT
Sbjct: 122 DMHLSFRPGDIVRAVVLSLGDARAYYLSTAKNELGVVSAESSAGAAMVPISWTEMQCPLT 181

BLAST of Cp4.1LG04g02020 vs. TrEMBL
Match: A0A0B0MMJ8_GOSAR (Exosome complex component CSL4 OS=Gossypium arboreum GN=F383_25005 PE=4 SV=1)

HSP 1 Score: 251.9 bits (642), Expect = 5.8e-64
Identity = 135/192 (70.31%), Positives = 148/192 (77.08%), Query Frame = 1

Query: 7   EEETEFVTPGEVLGKFSDMKPGKGAYVA--NNTVYASLSGFRRIISPPTDSSDSRPTVEV 66
           EE  E VTPGEVLGK +D+K GKGAY A  NNT+YASL+GFRRI +PP  S+D RPTVEV
Sbjct: 2   EEADEMVTPGEVLGKATDLKAGKGAYAASHNNTIYASLTGFRRIQAPPPASNDKRPTVEV 61

Query: 67  TGHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDK 126
           TGHKAHG VP PGS+VIARVTKVM R ASADIMCVGPKSV+EKF+G+IRQQDVRATEIDK
Sbjct: 62  TGHKAHGPVPEPGSVVIARVTKVMARTASADIMCVGPKSVREKFTGIIRQQDVRATEIDK 121

Query: 127 VDMHLSFRPGDIVKAL--------------------VVSFMLEFRAVMVPISWTEMQCPL 177
           VDMHLSFRPGDIV+A+                    VVS      A MVPISWTEMQCPL
Sbjct: 122 VDMHLSFRPGDIVRAVVLSLGDARAYYLSTAKNELGVVSAESSAGATMVPISWTEMQCPL 181

BLAST of Cp4.1LG04g02020 vs. TrEMBL
Match: A0A061F8I4_THECC (Nucleic acid-binding, OB-fold-like protein isoform 2 OS=Theobroma cacao GN=TCM_026126 PE=4 SV=1)

HSP 1 Score: 251.1 bits (640), Expect = 9.9e-64
Identity = 133/190 (70.00%), Positives = 147/190 (77.37%), Query Frame = 1

Query: 8   EETEFVTPGEVLGKFSDMKPGKGAYVA--NNTVYASLSGFRRIISPPTDSSDSRPTVEVT 67
           EE E VTPGE+LG+ +++K GKGAYV   N  +YASL+GFRRI SPP DS D RPTVEVT
Sbjct: 2   EEAEMVTPGEMLGRATELKAGKGAYVVQHNKNIYASLTGFRRIQSPPPDSPDQRPTVEVT 61

Query: 68  GHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDKV 127
           GHKAHG VP PGS+VIARVTKVM R+ASADIMCVGPKSV+EKFSG+IRQQDVRATEIDKV
Sbjct: 62  GHKAHGPVPEPGSVVIARVTKVMARIASADIMCVGPKSVREKFSGIIRQQDVRATEIDKV 121

Query: 128 DMHLSFRPGDIVKAL--------------------VVSFMLEFRAVMVPISWTEMQCPLT 176
           DMHLSFRPGDIV+A+                    VVS      A MVPISWTEMQCPLT
Sbjct: 122 DMHLSFRPGDIVRAVVLSLGDARAYYLSTAKNELGVVSAESSAGAAMVPISWTEMQCPLT 181

BLAST of Cp4.1LG04g02020 vs. TrEMBL
Match: A0A0D2U312_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G188600 PE=4 SV=1)

HSP 1 Score: 250.0 bits (637), Expect = 2.2e-63
Identity = 134/192 (69.79%), Positives = 147/192 (76.56%), Query Frame = 1

Query: 7   EEETEFVTPGEVLGKFSDMKPGKGAYVA--NNTVYASLSGFRRIISPPTDSSDSRPTVEV 66
           EE  E VTPGEVLG+ +D+K GKGAY A  NNT+YASL+GFRRI +PP  S D RPTVEV
Sbjct: 2   EEADEMVTPGEVLGRATDLKAGKGAYAASHNNTIYASLTGFRRIQAPPPASHDKRPTVEV 61

Query: 67  TGHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDK 126
           TGHKAHG VP PGS+VIARVTKVM R ASADIMCVGPKSV+EKF+G+IRQQDVRATEIDK
Sbjct: 62  TGHKAHGPVPEPGSVVIARVTKVMARTASADIMCVGPKSVREKFTGIIRQQDVRATEIDK 121

Query: 127 VDMHLSFRPGDIVKALV--------------------VSFMLEFRAVMVPISWTEMQCPL 177
           VDMHLSFRPGDIV+A+V                    VS      A MVPISWTEMQCPL
Sbjct: 122 VDMHLSFRPGDIVRAVVLSLGDARAYYLSTAKNELGIVSAESSAGATMVPISWTEMQCPL 181

BLAST of Cp4.1LG04g02020 vs. TAIR10
Match: AT5G38890.1 (AT5G38890.1 Nucleic acid-binding, OB-fold-like protein)

HSP 1 Score: 213.8 bits (543), Expect = 8.8e-56
Identity = 114/189 (60.32%), Positives = 138/189 (73.02%), Query Frame = 1

Query: 10  TEFVTPGEVLGKFSDMKPGKGAYVANNTVYASLSGFRRIISPPTDSSDSRPTVEVTGHKA 69
           T  VTPG+V+GK ++ K GKGAYV + T+YASL+G  RI+SP  +S D R  VEVTGHKA
Sbjct: 3   TGLVTPGDVIGKATEFKAGKGAYVNDATIYASLTGTCRIVSPLPESIDQRAIVEVTGHKA 62

Query: 70  HGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDKVDMHL 129
           HG +P  GS+VIARVTKVM +MA+ DI+CVG K+V+E F+GVIRQQDVRATEIDKVDMH 
Sbjct: 63  HGPIPETGSVVIARVTKVMTKMAAVDILCVGSKAVRENFAGVIRQQDVRATEIDKVDMHQ 122

Query: 130 SFRPGDIVKALVVSFMLEFRA---------------------VMVPISWTEMQCPLTGQI 178
           SF  GDIV+A+V+S + + RA                      MVPISWTEMQCPL+GQ 
Sbjct: 123 SFHAGDIVRAMVLS-LGDARAYYLSTAKNELGVVSAESAAGETMVPISWTEMQCPLSGQT 182

BLAST of Cp4.1LG04g02020 vs. NCBI nr
Match: gi|659130201|ref|XP_008465048.1| (PREDICTED: exosome complex component CSL4 [Cucumis melo])

HSP 1 Score: 292.4 bits (747), Expect = 5.5e-76
Identity = 154/196 (78.57%), Positives = 163/196 (83.16%), Query Frame = 1

Query: 3   MKEGEEETEFVTPGEVLGKFSDMKPGKGAYVANNTVYASLSGFRRIISPPTDSSDSRPTV 62
           MKEGE+ETEFVTPGE+LG FSD KPG+GAYVA+NTVYASLSGFRRII PP+DSSD R TV
Sbjct: 1   MKEGEKETEFVTPGEILGNFSDFKPGRGAYVADNTVYASLSGFRRIIPPPSDSSDLRSTV 60

Query: 63  EVTGHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEI 122
           EVTGHKAHGAVPAPGSIVIARVTKVM +MASADIMCVGPKSVKEKF+G+IRQQDVRATEI
Sbjct: 61  EVTGHKAHGAVPAPGSIVIARVTKVMTKMASADIMCVGPKSVKEKFTGIIRQQDVRATEI 120

Query: 123 DKVDMHLSFRPGDIVKAL--------------------VVSFMLEFRAVMVPISWTEMQC 179
           DKVDMHLSFRPGDIVKAL                    VVS      A MVPISWTEMQC
Sbjct: 121 DKVDMHLSFRPGDIVKALVLSLGDAKAYHLSTAKNELGVVSAESTAGAEMVPISWTEMQC 180

BLAST of Cp4.1LG04g02020 vs. NCBI nr
Match: gi|449453260|ref|XP_004144376.1| (PREDICTED: exosome complex component CSL4 [Cucumis sativus])

HSP 1 Score: 288.5 bits (737), Expect = 8.0e-75
Identity = 152/196 (77.55%), Positives = 161/196 (82.14%), Query Frame = 1

Query: 3   MKEGEEETEFVTPGEVLGKFSDMKPGKGAYVANNTVYASLSGFRRIISPPTDSSDSRPTV 62
           MKEGE++TEFVTPGEVLG FSD KPG+GAYV +NTVYASLSGFRRII PP+DSSD R TV
Sbjct: 1   MKEGEKDTEFVTPGEVLGNFSDFKPGRGAYVTDNTVYASLSGFRRIIHPPSDSSDLRSTV 60

Query: 63  EVTGHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEI 122
           EVTGHKAHGAVPAPGSIVI RVTKVM +MASADIMCVGPKSVKEKF+G+IRQQDVRATEI
Sbjct: 61  EVTGHKAHGAVPAPGSIVIVRVTKVMTKMASADIMCVGPKSVKEKFTGIIRQQDVRATEI 120

Query: 123 DKVDMHLSFRPGDIVKAL--------------------VVSFMLEFRAVMVPISWTEMQC 179
           DKVDMHLSFRPGDIVKAL                    VVS      A MVPISWTEMQC
Sbjct: 121 DKVDMHLSFRPGDIVKALVLSLGDARAYHLSTAKNELGVVSAESTAGAEMVPISWTEMQC 180

BLAST of Cp4.1LG04g02020 vs. NCBI nr
Match: gi|1012238939|ref|XP_015940370.1| (PREDICTED: exosome complex component CSL4 [Arachis duranensis])

HSP 1 Score: 255.8 bits (652), Expect = 5.8e-65
Identity = 137/200 (68.50%), Positives = 156/200 (78.00%), Query Frame = 1

Query: 1   MTMKEGEEETE--FVTPGEVLGKFSDMKPGKGAYVA--NNTVYASLSGFRRIISPPTDSS 60
           M M+E E++ E   VTPGEVLG+ SD+KPG+GAY A  N TVYASL+GFRR +SPP DSS
Sbjct: 3   MAMEEDEKQQEAVLVTPGEVLGRVSDIKPGRGAYAALHNETVYASLTGFRRTLSPPPDSS 62

Query: 61  DSRPTVEVTGHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQD 120
           D RPTVEVTGHKAHG VP PGSIVIAR+T+VM + ASADIMCVGPKSV+EKF+G+IRQQD
Sbjct: 63  DQRPTVEVTGHKAHGPVPQPGSIVIARITEVMAKTASADIMCVGPKSVREKFTGIIRQQD 122

Query: 121 VRATEIDKVDMHLSFRPGDIVKALVVS-------FMLEFR-------------AVMVPIS 177
           VRATEIDKVDMHLSF PGDIVKALV+S       F+   +             A MVP+S
Sbjct: 123 VRATEIDKVDMHLSFHPGDIVKALVLSLGDSRAYFLSTAKNELGVVSAESIAGACMVPVS 182

BLAST of Cp4.1LG04g02020 vs. NCBI nr
Match: gi|590641743|ref|XP_007030316.1| (Nucleic acid-binding, OB-fold-like protein isoform 1 [Theobroma cacao])

HSP 1 Score: 252.7 bits (644), Expect = 4.9e-64
Identity = 134/191 (70.16%), Positives = 148/191 (77.49%), Query Frame = 1

Query: 8   EETEFVTPGEVLGKFSDMKPGKGAYVA--NNTVYASLSGFRRIISPPTDSSDSRPTVEVT 67
           EE E VTPGE+LG+ +++K GKGAYV   N  +YASL+GFRRI SPP DS D RPTVEVT
Sbjct: 2   EEAEMVTPGEMLGRATELKAGKGAYVVQHNKNIYASLTGFRRIQSPPPDSPDQRPTVEVT 61

Query: 68  GHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDKV 127
           GHKAHG VP PGS+VIARVTKVM R+ASADIMCVGPKSV+EKFSG+IRQQDVRATEIDKV
Sbjct: 62  GHKAHGPVPEPGSVVIARVTKVMARIASADIMCVGPKSVREKFSGIIRQQDVRATEIDKV 121

Query: 128 DMHLSFRPGDIVKAL--------------------VVSFMLEFRAVMVPISWTEMQCPLT 177
           DMHLSFRPGDIV+A+                    VVS      A MVPISWTEMQCPLT
Sbjct: 122 DMHLSFRPGDIVRAVVLSLGDARAYYLSTAKNELGVVSAESSAGAAMVPISWTEMQCPLT 181

BLAST of Cp4.1LG04g02020 vs. NCBI nr
Match: gi|728815916|gb|KHG02005.1| (Exosome complex component CSL4 [Gossypium arboreum])

HSP 1 Score: 251.9 bits (642), Expect = 8.3e-64
Identity = 135/192 (70.31%), Positives = 148/192 (77.08%), Query Frame = 1

Query: 7   EEETEFVTPGEVLGKFSDMKPGKGAYVA--NNTVYASLSGFRRIISPPTDSSDSRPTVEV 66
           EE  E VTPGEVLGK +D+K GKGAY A  NNT+YASL+GFRRI +PP  S+D RPTVEV
Sbjct: 2   EEADEMVTPGEVLGKATDLKAGKGAYAASHNNTIYASLTGFRRIQAPPPASNDKRPTVEV 61

Query: 67  TGHKAHGAVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDK 126
           TGHKAHG VP PGS+VIARVTKVM R ASADIMCVGPKSV+EKF+G+IRQQDVRATEIDK
Sbjct: 62  TGHKAHGPVPEPGSVVIARVTKVMARTASADIMCVGPKSVREKFTGIIRQQDVRATEIDK 121

Query: 127 VDMHLSFRPGDIVKAL--------------------VVSFMLEFRAVMVPISWTEMQCPL 177
           VDMHLSFRPGDIV+A+                    VVS      A MVPISWTEMQCPL
Sbjct: 122 VDMHLSFRPGDIVRAVVLSLGDARAYYLSTAKNELGVVSAESSAGATMVPISWTEMQCPL 181

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value   Identity (%)  Description
EXOS1_MOUSE   6.0e-22   36.02         Exosome complex component CSL4 OS=Mus musculus GN=Exosc1 PE=1 SV=1
EXOS1_HUMAN   6.0e-22   36.56         Exosome complex component CSL4 OS=Homo sapiens GN=EXOSC1 PE=1 SV=1
CSL4_SCHPO    1.5e-12   38.17         Exosome complex component csl4 OS=Schizosaccharomyces pombe (strain 972 / ATCC 2...

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0L935_CUCSA  5.6e-75   77.55         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G627150 PE=4 SV=1
A0A061F1N6_THECC  3.4e-64   70.16         Nucleic acid-binding, OB-fold-like protein isoform 1 OS=Theobroma cacao GN=TCM_0...
A0A0B0MMJ8_GOSAR  5.8e-64   70.31         Exosome complex component CSL4 OS=Gossypium arboreum GN=F383_25005 PE=4 SV=1
A0A061F8I4_THECC  9.9e-64   70.00         Nucleic acid-binding, OB-fold-like protein isoform 2 OS=Theobroma cacao GN=TCM_0...
A0A0D2U312_GOSRA  2.2e-63   69.79         Uncharacterized protein OS=Gossypium raimondii GN=B456_013G188600 PE=4 SV=1

TAIR10
Match Name   E-value   Identity (%)  Description
AT5G38890.1  8.8e-56   60.32         Nucleic acid-binding, OB-fold-like protein

NCBI nr
Match Name                         E-value   Identity (%)  Description
gi|659130201|ref|XP_008465048.1|   5.5e-76   78.57         PREDICTED: exosome complex component CSL4 [Cucumis melo]
gi|449453260|ref|XP_004144376.1|   8.0e-75   77.55         PREDICTED: exosome complex component CSL4 [Cucumis sativus]
gi|1012238939|ref|XP_015940370.1|  5.8e-65   68.50         PREDICTED: exosome complex component CSL4 [Arachis duranensis]
gi|590641743|ref|XP_007030316.1|   4.9e-64   70.16         Nucleic acid-binding, OB-fold-like protein isoform 1 [Theobroma cacao]
gi|728815916|gb|KHG02005.1|        8.3e-64   70.31         Exosome complex component CSL4 [Gossypium arboreum]

The following terms have been associated with this gene:

Vocabulary: Molecular Function
Term        Definition
GO:0003723  RNA binding

Vocabulary: Cellular Component
Term        Definition
GO:0000178  exosome (RNase complex)

Vocabulary: INTERPRO
Term       Definition
IPR025721  Exosome_cplx_N_dom
IPR019495  EXOSC1
IPR012340  NA-bd_OB-fold

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006626 protein targeting to mitochondrion
cellular_component GO:0000178 exosome (RNase complex)
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG04g02020.1  Cp4.1LG04g02020.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                               Source   Source Term    Source Description                  Alignment
IPR012340  Nucleic acid-binding, OB-fold                 unknown  SSF50249       Nucleic acid-binding proteins       coord: 73..173; score: 2.52
IPR019495  Exosome complex component CSL4                PFAM     PF10447        EXOSC1                              coord: 103..143; score: 8.5
IPR025721  Exosome complex component, N-terminal domain  PFAM     PF14382        ECR1_N                              coord: 12..48; score: 4.7
None       No IPR available                              PANTHER  PTHR12686      3'-5' EXORIBONUCLEASE CSL4-RELATED  coord: 4..176; score: 4.8
None       No IPR available                              PANTHER  PTHR12686:SF8  EXOSOME COMPLEX COMPONENT CSL4      coord: 4..176; score: 4.8
None       No IPR available                              unknown  SSF110324      Ribosomal L27 protein-like          coord: 12..48; score: 3.1
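
The `coord` values place each domain on the protein sequence. The sketch below extracts the residues for three of the entries, assuming the coordinates are 1-based and inclusive (not stated on the page). `domain_slice` and the `DOMAINS` dictionary are hypothetical names used only for this illustration.

```python
# Protein sequence of Cp4.1LG04g02020, copied from the record above.
PROTEIN = (
    "MTMKEGEEETEFVTPGEVLGKFSDMKPGKGAYVANNTVYASLSGFRRIISPPTDSSDSRPTVEVTGHKAHG"
    "AVPAPGSIVIARVTKVMPRMASADIMCVGPKSVKEKFSGVIRQQDVRATEIDKVDMHLSF"
    "RPGDIVKALVVSFMLEFRAVMVPISWTEMQCPLTGQIQQRKVAKVGGS"
)

# Domain coordinates taken from the InterPro table above.
DOMAINS = {
    "PF14382 (ECR1_N)": (12, 48),
    "PF10447 (EXOSC1)": (103, 143),
    "SSF50249 (OB-fold)": (73, 173),
}

def domain_slice(protein, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]

for name, (start, end) in DOMAINS.items():
    segment = domain_slice(PROTEIN, start, end)
    print(f"{name}: {len(segment)} aa, begins {segment[:9]}")
```

Under that assumption the ECR1_N match starts at `FVTPGEVLG`, the same residues at which the Swiss-Prot alignments begin (query position 12).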

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                   Block
Cp4.1LG04g02020  CmoCh11G009830  Cucurbita moschata (Rifu)  cmocpeB127
Cp4.1LG04g02020  Lsi06G007630    Bottle gourd (USVL1VR-Ls)  cpelsiB559
Cp4.1LG04g02020  Bhi02G001064    Wax gourd                  cpewgoB0899
Cp4.1LG04g02020  Carg16466       Silver-seed gourd          carcpeB1261

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                    Block
Cp4.1LG04g02020  Cucumber (Gy14) v1          cgycpeB1051
Cp4.1LG04g02020  Cucurbita maxima (Rimu)     cmacpeB095
Cp4.1LG04g02020  Cucurbita maxima (Rimu)     cmacpeB147
Cp4.1LG04g02020  Cucurbita moschata (Rifu)   cmocpeB071
Cp4.1LG04g02020  Cucumber (Chinese Long) v2  cpecuB639
Cp4.1LG04g02020  Cucumber (Gy14) v2          cgybcpeB112
Cp4.1LG04g02020  Melon (DHL92) v3.6.1        cpemedB729
Cp4.1LG04g02020  Cucumber (Chinese Long) v3  cpecucB0799