CmoCh03G006520 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh03G006520
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: heterogeneous nuclear ribonucleoprotein H2 isoform X1
Location: Cmo_Chr03: 5900474 .. 5905759 (+)
RNA-Seq Expression: CmoCh03G006520
Synteny: CmoCh03G006520
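
The Location field can be read programmatically. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page), and the helper names parse_location and span_length are hypothetical.

```python
import re

# Pattern for a location string of the form "SEQID: START .. END (STRAND)".
LOCATION = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    m = LOCATION.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr03: 5900474 .. 5905759 (+)")
print(seqid, strand, span_length(start, end))  # Cmo_Chr03 + 5286
```

Under that assumption the gene spans 5286 bp on the forward strand of Cmo_Chr03.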
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGAAGCAAGGGTATTTAGGTAATCTTGAATCCGCAATCAAAGTTTAACCCTAGTTGTGTCTGTGTTCAATTTCGGTGGTGGAATCGGAAGGCAAATCGAAGTTCAGAATCGTCCGTAATCTTGGTTAGGGTTTCCCTTTTCCCGCTATCTTCCCTTCTATTTCCCTCTGAATCCACTCCCTTATTCTTCATAATCGCTCCAATTTACTCCTTTTCGTTGGATTATCAGACTAGCTTTTGGGGATTCAAGACAATCATGTACGGACCAAGAGGGTAAGTCCAGAATTTCTGACCTTGTTTATGACTCTTCCGTGTCGTTTTTTCTTGTTTCGTTTCTTGGTTGTTGTGTTTATACCGGCGTTCTTCTTTTTGCTGGGTTGCTTGTCTTGTTTTGTTGGAATATCTGGGAATTTTGATGGATTCTGCTTGAAATTGCGTGTTGGGATCGAGGAATGAGGAGCTCCTCTGTTGTTCTCCAAGGCCTCTGTTGATTTGTATGCCAATGTTGTGAAATGTGTTTTTCCAGTAAAAATATTCTTGAAGAACAACATATGTTTTGTCAGTGTAATTTAAAAGTAAAGTAGTTTTTTGTTAAAAGAAAATGGAAGCGCGATTTGTGAAGTTATCCATGCGTTGAATGTTCTTTACATGATTCTCTTCTTTATATTTAAACTTTAAAATGAACTCATCGCTGTGGGTGGCTTTTCTAGGCCTTTCTCGGATAAGATAGATATCAAGATTCTTGTCATAGGATGCCATATATCTGAATCCTGTTAGGATTTGTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGTTTTTCAACCGAAAAGCAAGAATTCGATTACGGTATTCAGTGTCTCATGATCTATCTCTCTTTTTTGTACCACTGTCAAATTTAGAAGAAGCTTTTGCAGCGGTATTTCGGCTATGAAGGCCAGTGATTTCATTGGCTAATTAATGTTCTCATTGGTAGTCTTTACTTTACATCGAGGGCTTACATTGGTATAAACATGGGAAAATCATTTTTTATTGCTTCTTGATTTGATGTTCATTGAGCCTGAAATTTACTTTTTGCTGCTACGTATAGAAATAATTATCCCTTCCATATTTAGTTGAATTGATTTGTTTGTAGGAGTGGTAATGGCATGTAAGTTCATTGACCCAATAGGTTTTAGAGGTGTAGGTTGGGCTTGGTTTTATCAACTTGCTTGAGCGGGGATTTTCCTTTCTTTTTCTGGGTTGGTAATTGAAGGGCAATGTTGGGGAGCGGGGGGGTTTCGGATGGGTACGAAGTCGGCTCAAAAAGACAAAGAATGATGGAACCGAATCCCTACTTCGCAGTTAGCAGCAGCACTGCTGGATTTCAACCTTACGGCTATGGGAGTTTTCCACCTACTCATGCCTTTCCCGTGGTTCGCCTTAGAGGACTTCCCTTCAACTGCACTGACATTGATATTTTCAAGTTCTTTGCTGGACTGGACATTGTGGATGTGCTGCTTGTCAACAAGAATGGGCGATTCATGGGAGAAGCCTTTGTTGTCTTTGCTGGATCTTTGCAGGTTGAGTTCGCGTTGCAACGGGATCGACAGAATATGGGGCGTAGATATGTAGAAGTCTTTAGCTGCAAAAGGCAGGATTATTATAATGCTGTTGCTGCTGAAGTAAATTATGAGGGCATTTATGATAATGACTACAATGGAAGTCCTCCTCCTCGACAAAAGAGGATCAGCGACAAGGACCAGATGGAATACACCGAGATACTGAAGCTGCGTGGTCTCCCCTTCTCTGTGACAAAATCCAACATCATTGAATTTTTCGGAGAGTTCGACCTTGCAGAAGAAAGGATACATATTGCAAGCCGTTCAGATGGGAAGGCTACTGGGGAGGCTTATGTGGAGTTTGCTTCAGCAGAGGATGCAAAGAGAGCAATGAGCAAGGACAAGATGACAATTGGATCGAGGTACGTGGAGTTGTTTCCTTCAACCCCAAATGAAGCTA
GGAGAGCTGAGTCAAGGTCAAGACAGTGATATGTAGGTAACTTGATCATGGAGTCTAAAGTGTTCTCTATTGCTATATTTTGGTGCCTAGGCTTCGCTGTTCTTGTAGTTTTCCAGTTTGTAGGATGACTGTCGTTGTGGTATAGTTCTCTGCAGTTTAGGCTGAATCTGTGGAGTAAATAGTTACTGCATGTTGAGATTGAAGTCCTATGAGTTCATAACTTGGTTAGAACACCGACTTTTTTCCTCAATTTCACCACATTTTTTTAGAGTTACTCTTCTCTGTGCGGTTTACTTGAAAAATGAGATGAGGCCTAGCCGACAGCTCCTGAAATTCAATTATTCAAGGTACTATAAAGGGCCATACCAATAATGGCTGAACAATACCCAATAGCCAGATCTAAAAGCTGTCATGAATGGTGGATTCTGTTCATCTGCCACTTTTCATACATAAGGCGAGGGTCAGTGAATGTTTTCATTTAGAATTCAGGATAAATTGTTAGCTTAACTCTGTTTTGAAAGTACTCTTGTTGGAATCTGAAATGGTGCGACTGTTTTTTGTGAATGATATTTTATTATGTATGAGAATGTTGTGATTGTTCTTGAATAATCATCAATAGGCTTTTTTGTGGAGGTTCTTGAAGTTGAATGATAGGTTTTTTCTGCCTCAAAAGATGAAACTAGCTCACTCATAAGAACTCGACCAAACTGATTAACGCATGAACATTGAAGGTGAGATTAATTTGAAACTTTCAAGATCGATGGAAAGTTTTCATTTACCACGAAACTGCTAAAAGAGCACGGTTTTGCAACAAATTCTAATTTACACAATTTAAGAGCATTTTGTATAATCTTTCTCTAGGAAAAAGTTATTTATTTATGGATTCTTTTAAATATAGTTACGGTTTATGAAATGTTATATATCGTTTTGTTGATCCTAAAGACTAAAGACGTTTGGATATCGAGATCAAAATCTATTTTCATTTACACGCGGGTTGAACTATCTACGATAATATGTAAGTTTATTATTTTACTATATTGCATCATTGAAACCTATAATCTTTTAAAAGAAACTGTCAGGTCATTCATTAATGGTGAATAATATTAGGAAGGGAAAATCCCTAAAAACCAGCCCAGATGGGTTTCAGGCATTTAAAAGATGGGCAACAGAAAATGGGAACTTGTGCGTGATTCTTAAGCCAAATGACGAAGAAGTTCAACATCCAATCACCATTCCATCTCCTGAAGAACCACTTTTGGTTCATTCTTTATTCTCTGCTTGGAGTTTTCTGCTCTTTGAATTTGTCTTCTTCACAGAACAGAATAAGAGCAGCTTCCATGTAAATCATCCATGTAAAGGGCTGTCTAAGTTTTCAGAGCTTTCGTTGTTTCACAGGTAATCTTTCCATCCATTTATGGAAGTTGGTTCTGGCACATGCCATGCTCTGACTCTGCTTGCTTTATCTGCTTCTCCTTCTCTGTTTGTCACTTTCTCCTCTTATTTCTGCTAACAACATCATCTGTTTCATGATTAAAGATGACGAGTTCTGGGTTCAGTACAGAGTTCCCAAATGTTGCAGATCCAATGCATGATTCATATCTATCTTGTTTTCCCAAGCTAAACGATGAGAAGTTTCTTCATCAAGACCTCAACTTCCTTCCTTGCTCTTCTTCAATGGCTGTATCTAAAGGGCCCGAGATTCATCAAATGAAAGAACTCTGTGAACCAGGTTTTGCTTCTTCAACTTCTTTTTACACAGATCATGTCCCTGTTATGAATGTTGGCTAACCAGGTGGGGAATTCCCAGTTCTTGCAAAGAAGAAAAGGAGAGCTACAAGTGAGCATATTGCCGGAATTGCTCTATCAGATCTGGCTAAATACTTTGATGTTCCTATTACAGAAGCTTCAAGAAGTCTAAATGTTGGATTAACAGTGCTGAAAAGAAAATGCAGAGAGTTTGGGATTCATCGATGGCCGCACAGGAAGATCAAGTCCATTGA
TGGTCTAATCCGAGATCTTCAGGTTATCATCCCTTTCTTTCTACTTTGCATTTGCAGCCTTGAAAGGCACACATAGTTTGTTTATGGAAAAGGGTGTTTGTTTGTTTGTTTGTTTGTTTGTAGGAAGAAGCAAAGCATAGAGAGGAAGATCAGAAAGCTTTGATGGCAGTGACAAAGAGGCAAATGATGTTGCAGAATGAAAGAGAGAGCATCGAGAGGACACCATTTAGAGAGCTGGAGAGTGAGACCAAGAGATTTAGGCAAGATGTTTTCAAGAGAAGGCATAAAGCTAGAGCTCTAGTAAGCCACAGTCCAGTTTAGTTTGCAGCAGCCATTATATTCAACAAAATAAATGACTTGCAAAGTAGTTTGTTACTTTTTTTAATCTCGATCTTGTCTCGAGATCAAGGTCTTTCAATGCAATATTGTATCTCATCTTGGTTCCCAATCGACATGGCCCACATGTCTTGATGTTATTCCGGGTCTCAGGATGTCGGCCATTACTAATCACTTGGTCATGTCATCGAAAATGGGTCCACGTGACTCTCATCTCATTAGGACATTCCAAACATCCATCCATCCGGGTTCTATCGAGGCATTGACCCTCGCGTGCCCATGTCAAGTCATTTACGGATACTGTACCGAATAACAAGTGTATAATTCAAACTTTGACACGTGGGCTAGGGCGCAATACTGGCACACCCATGCCATGCCACTCGATACTGTCCTTGAAAAACTCATTTCAAAGGAAGTCGCCCGAGACACTCGTCTCAGCACACTCACCCTCGCTGTGACACTCACGTGCCACGTGATAACTCACTTAGGCAAGAGGGAAATGTGCAACATCGGTTAAAAAATTAAAAAATTATATTTTCAATCCATAATGGGTTTGAATAATTGCTACAATGATATATCTCTAAAATGTATATTGTTCGGAGTCATATTGGATATGACACGACAATACATAACATAAACATCTGAAAAGGGAACTGTGATATCATTCTGGGATATCAGTACTAATGAGTAACTTGGCTCCTCAGATGGAACCAAGAGAAAACTACATAAATGACTGAGAAATATTCGAAAGAACAAATGAATTTCAATGAATAGTTAGATGAATTTGCCAATCTCTGGAACCAAAATCACGATCGAGGTTTGTGCAGACGAGGTCGATTGCAGAAAATTCGACCATTGGCCCGAACAAAACCTGCTATAGCTATGCTCAGGCCTGCCCACAGGATAGAAAAGAACAAGTGTCGAATATCATAGCGAGCTGCAAAAGATAA

mRNA sequence

ATGGGGAAGCAAGGACTAGCTTTTGGGGATTCAAGACAATCATGTACGGACCAAGAGGTTTTTCAACCGAAAAGCAAGAATTCGATTACGAAGAAGCTTTTGCAGCGGAGTGGTAATGGCATGGCAATGTTGGGGAGCGGGGGGGTTTCGGATGGGTACGAAGTCGGCTCAAAAAGACAAAGAATGATGGAACCGAATCCCTACTTCGCAGTTAGCAGCAGCACTGCTGGATTTCAACCTTACGGCTATGGGAGTTTTCCACCTACTCATGCCTTTCCCGTGGTTCGCCTTAGAGGACTTCCCTTCAACTGCACTGACATTGATATTTTCAAGTTCTTTGCTGGACTGGACATTGTGGATGTGCTGCTTGTCAACAAGAATGGGCGATTCATGGGAGAAGCCTTTGTTGTCTTTGCTGGATCTTTGCAGGTTGAGTTCGCGTTGCAACGGGATCGACAGAATATGGGGCGTAGATATGTAGAAGTCTTTAGCTGCAAAAGGCAGGATTATTATAATGCTGTTGCTGCTGAAGTAAATTATGAGGGCATTTATGATAATGACTACAATGGAAGTCCTCCTCCTCGACAAAAGAGGATCAGCGACAAGGACCAGATGGAATACACCGAGATACTGAAGCTGCGTGGTCTCCCCTTCTCTGTGACAAAATCCAACATCATTGAATTTTTCGGAGAGTTCGACCTTGCAGAAGAAAGGATACATATTGCAAGCCGTTCAGATGGGAAGGCTACTGGGGAGGCTTATGTGGAGTTTGCTTCAGCAGAGGATGCAAAGAGAGCAATGAGCAAGGACAAGATGACAATTGGATCGAGGTACGTGGAGTTGTTTCCTTCAACCCCAAATGAAGCTAGGAGAGCTGAGTCAAGAGTTACTCTTCTCTGTGCGGTTTACTTGAAAAATGAGATGAGGCCTAGCCGACAGCTCCTGAAATTCAATTATTCAAGGAAGGGAAAATCCCTAAAAACCAGCCCAGATGGGTTTCAGGCATTTAAAAGATGGGCAACAGAAAATGGGAACTTGTGCGTGATTCTTAAGCCAAATGACGAAGAAGTTCAACATCCAATCACCATTCCATCTCCTGAAGAACCACTTTTGGTTCATTCTTTATTCTCTGCTTGGAGTTTTCTGCTCTTTGAATTTGTCTTCTTCACAGAACAGAATAAGAGCAGCTTCCATATGACGAGTTCTGGGTTCAGTACAGAGTTCCCAAATGTTGCAGATCCAATGCATGATTCATATCTATCTTGTTTTCCCAAGCTAAACGATGAGAAGTTTCTTCATCAAGACCTCAACTTCCTTCCTTGCTCTTCTTCAATGGCTGTATCTAAAGGGCCCGAGATTCATCAAATGAAAGAACTCTGTGAACCAGGTGGGGAATTCCCAGTTCTTGCAAAGAAGAAAAGGAGAGCTACAAGTGAGCATATTGCCGGAATTGCTCTATCAGATCTGGCTAAATACTTTGATGTTCCTATTACAGAAGCTTCAAGAAGTCTAAATGTTGGATTAACAGTGCTGAAAAGAAAATGCAGAGAGTTTGGGATTCATCGATGGCCGCACAGGAAGATCAAGTCCATTGATGGTCTAATCCGAGATCTTCAGGAAGAAGCAAAGCATAGAGAGGAAGATCAGAAAGCTTTGATGGCAGTGACAAAGAGGCAAATGATGTTGCAGAATGAAAGAGAGAGCATCGAGAGGACACCATTTAGAGAGCTGGAGAGTGAGACCAAGAGATTTAGGCAAGATGTTTTCAAGAGAAGGCATAAAGCTAGAGCTCTAACGAGGTCGATTGCAGAAAATTCGACCATTGGCCCGAACAAAACCTGCTATAGCTATGCTCAGGCCTGCCCACAGGATAGAAAAGAACAAGTGTCGAATATCATAGCGAGCTGCAAAAGATAA

Coding sequence (CDS)

ATGGGGAAGCAAGGACTAGCTTTTGGGGATTCAAGACAATCATGTACGGACCAAGAGGTTTTTCAACCGAAAAGCAAGAATTCGATTACGAAGAAGCTTTTGCAGCGGAGTGGTAATGGCATGGCAATGTTGGGGAGCGGGGGGGTTTCGGATGGGTACGAAGTCGGCTCAAAAAGACAAAGAATGATGGAACCGAATCCCTACTTCGCAGTTAGCAGCAGCACTGCTGGATTTCAACCTTACGGCTATGGGAGTTTTCCACCTACTCATGCCTTTCCCGTGGTTCGCCTTAGAGGACTTCCCTTCAACTGCACTGACATTGATATTTTCAAGTTCTTTGCTGGACTGGACATTGTGGATGTGCTGCTTGTCAACAAGAATGGGCGATTCATGGGAGAAGCCTTTGTTGTCTTTGCTGGATCTTTGCAGGTTGAGTTCGCGTTGCAACGGGATCGACAGAATATGGGGCGTAGATATGTAGAAGTCTTTAGCTGCAAAAGGCAGGATTATTATAATGCTGTTGCTGCTGAAGTAAATTATGAGGGCATTTATGATAATGACTACAATGGAAGTCCTCCTCCTCGACAAAAGAGGATCAGCGACAAGGACCAGATGGAATACACCGAGATACTGAAGCTGCGTGGTCTCCCCTTCTCTGTGACAAAATCCAACATCATTGAATTTTTCGGAGAGTTCGACCTTGCAGAAGAAAGGATACATATTGCAAGCCGTTCAGATGGGAAGGCTACTGGGGAGGCTTATGTGGAGTTTGCTTCAGCAGAGGATGCAAAGAGAGCAATGAGCAAGGACAAGATGACAATTGGATCGAGGTACGTGGAGTTGTTTCCTTCAACCCCAAATGAAGCTAGGAGAGCTGAGTCAAGAGTTACTCTTCTCTGTGCGGTTTACTTGAAAAATGAGATGAGGCCTAGCCGACAGCTCCTGAAATTCAATTATTCAAGGAAGGGAAAATCCCTAAAAACCAGCCCAGATGGGTTTCAGGCATTTAAAAGATGGGCAACAGAAAATGGGAACTTGTGCGTGATTCTTAAGCCAAATGACGAAGAAGTTCAACATCCAATCACCATTCCATCTCCTGAAGAACCACTTTTGGTTCATTCTTTATTCTCTGCTTGGAGTTTTCTGCTCTTTGAATTTGTCTTCTTCACAGAACAGAATAAGAGCAGCTTCCATATGACGAGTTCTGGGTTCAGTACAGAGTTCCCAAATGTTGCAGATCCAATGCATGATTCATATCTATCTTGTTTTCCCAAGCTAAACGATGAGAAGTTTCTTCATCAAGACCTCAACTTCCTTCCTTGCTCTTCTTCAATGGCTGTATCTAAAGGGCCCGAGATTCATCAAATGAAAGAACTCTGTGAACCAGGTGGGGAATTCCCAGTTCTTGCAAAGAAGAAAAGGAGAGCTACAAGTGAGCATATTGCCGGAATTGCTCTATCAGATCTGGCTAAATACTTTGATGTTCCTATTACAGAAGCTTCAAGAAGTCTAAATGTTGGATTAACAGTGCTGAAAAGAAAATGCAGAGAGTTTGGGATTCATCGATGGCCGCACAGGAAGATCAAGTCCATTGATGGTCTAATCCGAGATCTTCAGGAAGAAGCAAAGCATAGAGAGGAAGATCAGAAAGCTTTGATGGCAGTGACAAAGAGGCAAATGATGTTGCAGAATGAAAGAGAGAGCATCGAGAGGACACCATTTAGAGAGCTGGAGAGTGAGACCAAGAGATTTAGGCAAGATGTTTTCAAGAGAAGGCATAAAGCTAGAGCTCTAACGAGGTCGATTGCAGAAAATTCGACCATTGGCCCGAACAAAACCTGCTATAGCTATGCTCAGGCCTGCCCACAGGATAGAAAAGAACAAGTGTCGAATATCATAGCGAGCTGCAAAAGATAA

Protein sequence

MGKQGLAFGDSRQSCTDQEVFQPKSKNSITKKLLQRSGNGMAMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGYGSFPPTHAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYVEVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRVTLLCAVYLKNEMRPSRQLLKFNYSRKGKSLKTSPDGFQAFKRWATENGNLCVILKPNDEEVQHPITIPSPEEPLLVHSLFSAWSFLLFEFVFFTEQNKSSFHMTSSGFSTEFPNVADPMHDSYLSCFPKLNDEKFLHQDLNFLPCSSSMAVSKGPEIHQMKELCEPGGEFPVLAKKKRRATSEHIAGIALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEAKHREEDQKALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKARALTRSIAENSTIGPNKTCYSYAQACPQDRKEQVSNIIASCKR
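
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the function name translate is hypothetical, and the two test fragments are the first 30 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGGGAAGCAAGGACTAGCTTTTGGGGAT"))  # MGKQGLAFGD
print(translate("AGCTGCAAAAGATAA"))                  # SCKR
```

Both fragments agree with the start (MGKQGLAFGD) and end (SCKR) of the protein sequence shown above, which is consistent with the protein being the conceptual translation of this CDS.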
Homology
BLAST of CmoCh03G006520 vs. ExPASy Swiss-Prot
Match: O81791 (Protein RKD5 OS=Arabidopsis thaliana OX=3702 GN=RKD5 PE=3 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 3.2e-36
Identity = 89/172 (51.74%), Positives = 120/172 (69.77%), Query Frame = 0

Query: 432 LHQDLNFLPCS---SSMAVSKGPEIHQMK----ELCEPGGEFPVLAKKKRRATSEHIAGI 491
           L QDLN LP S   S  +V++  E  + +    E  E   +  +L KKK+R  S H+A +
Sbjct: 184 LKQDLNCLPDSETESEESVNEKTEHSEFENDKTEQSESDAKTEIL-KKKKRTPSRHVAEL 243

Query: 492 ALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEA-K 551
           +L +L+KYFD+ I EASR+L VGLTVLK+KCREFGI RWPHRKIKS+D LI DLQ EA K
Sbjct: 244 SLEELSKYFDLTIVEASRNLKVGLTVLKKKCREFGIPRWPHRKIKSLDCLIHDLQREAEK 303

Query: 552 HREEDQKALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKA 596
            +E+++ A MAV K+Q  L+ E+ +I + PF E+  ETK+FRQ+ FK+RH+A
Sbjct: 304 QQEKNEAAAMAVAKKQEKLETEKRNIVKRPFMEIGIETKKFRQENFKKRHRA 354

BLAST of CmoCh03G006520 vs. ExPASy Swiss-Prot
Match: Q3US41 (Epithelial splicing regulatory protein 1 OS=Mus musculus OX=10090 GN=Esrp1 PE=1 SV=2)

HSP 1 Score: 117.5 bits (293), Expect = 5.6e-25
Identity = 73/208 (35.10%), Positives = 109/208 (52.40%), Query Frame = 0

Query: 94  VVRLRGLPFNCTDIDIFKFFAGLDIVD---VLLVNKNGRFMGEAFVVFAGSLQVEFALQR 153
           VVR RGLP+  +D DI +FF GL+I      L +N  GR  GEA V F      + ALQR
Sbjct: 225 VVRARGLPWQSSDQDIARFFKGLNIAKGGAALCLNAQGRRNGEALVRFVSEEHRDLALQR 284

Query: 154 DRQNMGRRYVEVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYTEI 213
            + +MG RY+EV+    +D+        N    +              +S ++Q+    I
Sbjct: 285 HKHHMGTRYIEVYKATGEDFLKIAGGTSNEVAQF--------------LSKENQV----I 344

Query: 214 LKLRGLPFSVTKSNIIEFFGE---FDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAM 273
           +++RGLPF+ T   ++ FFG+       +E I   +  DG+ TG+A+V FA  E A+ A+
Sbjct: 345 VRMRGLPFTATAEEVVAFFGQHCPITGGKEGILFVTYPDGRPTGDAFVLFACEEYAQNAL 404

Query: 274 SKDKMTIGSRYVELFPSTPNEARRAESR 296
            K K  +G RY+ELF ST  E ++  +R
Sbjct: 405 RKHKELLGKRYIELFRSTAAEVQQVLNR 414

BLAST of CmoCh03G006520 vs. ExPASy Swiss-Prot
Match: Q6NXG1 (Epithelial splicing regulatory protein 1 OS=Homo sapiens OX=9606 GN=ESRP1 PE=1 SV=2)

HSP 1 Score: 117.1 bits (292), Expect = 7.4e-25
Identity = 73/208 (35.10%), Positives = 109/208 (52.40%), Query Frame = 0

Query: 94  VVRLRGLPFNCTDIDIFKFFAGLDIVD---VLLVNKNGRFMGEAFVVFAGSLQVEFALQR 153
           VVR RGLP+  +D DI +FF GL+I      L +N  GR  GEA V F      + ALQR
Sbjct: 226 VVRARGLPWQSSDQDIARFFKGLNIAKGGAALCLNAQGRRNGEALVRFVSEEHRDLALQR 285

Query: 154 DRQNMGRRYVEVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYTEI 213
            + +MG RY+EV+    +D+        N    +              +S ++Q+    I
Sbjct: 286 HKHHMGTRYIEVYKATGEDFLKIAGGTSNEVAQF--------------LSKENQV----I 345

Query: 214 LKLRGLPFSVTKSNIIEFFGE---FDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAM 273
           +++RGLPF+ T   ++ FFG+       +E I   +  DG+ TG+A+V FA  E A+ A+
Sbjct: 346 VRMRGLPFTATAEEVVAFFGQHCPITGGKEGILFVTYPDGRPTGDAFVLFACEEYAQNAL 405

Query: 274 SKDKMTIGSRYVELFPSTPNEARRAESR 296
            K K  +G RY+ELF ST  E ++  +R
Sbjct: 406 RKHKDLLGKRYIELFRSTAAEVQQVLNR 415

BLAST of CmoCh03G006520 vs. ExPASy Swiss-Prot
Match: B2RYD2 (Epithelial splicing regulatory protein 1 OS=Rattus norvegicus OX=10116 GN=Esrp1 PE=2 SV=2)

HSP 1 Score: 117.1 bits (292), Expect = 7.4e-25
Identity = 73/208 (35.10%), Positives = 109/208 (52.40%), Query Frame = 0

Query: 94  VVRLRGLPFNCTDIDIFKFFAGLDIVD---VLLVNKNGRFMGEAFVVFAGSLQVEFALQR 153
           VVR RGLP+  +D DI +FF GL+I      L +N  GR  GEA V F      + ALQR
Sbjct: 226 VVRARGLPWQSSDQDIARFFKGLNIAKGGAALCLNAQGRRNGEALVRFVSEEHRDLALQR 285

Query: 154 DRQNMGRRYVEVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYTEI 213
            + +MG RY+EV+    +D+        N    +              +S ++Q+    I
Sbjct: 286 HKHHMGTRYIEVYKATGEDFLKIAGGTSNEVAQF--------------LSKENQV----I 345

Query: 214 LKLRGLPFSVTKSNIIEFFGE---FDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAM 273
           +++RGLPF+ T   ++ FFG+       +E I   +  DG+ TG+A+V FA  E A+ A+
Sbjct: 346 VRMRGLPFTATAEEVVAFFGQHCPITGGKEGILFVTYPDGRPTGDAFVLFACEEYAQNAL 405

Query: 274 SKDKMTIGSRYVELFPSTPNEARRAESR 296
            K K  +G RY+ELF ST  E ++  +R
Sbjct: 406 RKHKDLLGKRYIELFRSTAAEVQQVLNR 415

BLAST of CmoCh03G006520 vs. ExPASy Swiss-Prot
Match: Q5E9J1 (Heterogeneous nuclear ribonucleoprotein F OS=Bos taurus OX=9913 GN=HNRNPF PE=2 SV=3)

HSP 1 Score: 115.9 bits (289), Expect = 1.6e-24
Identity = 73/202 (36.14%), Positives = 105/202 (51.98%), Query Frame = 0

Query: 94  VVRLRGLPFNCTDIDIFKFFAGLDIVDVL-----LVNKNGRFMGEAFVVFAGSLQVEFAL 153
           VV+LRGLP++C+  D+  F +   I D +     +  + GR  GEAFV       V+ AL
Sbjct: 12  VVKLRGLPWSCSVEDVQNFLSDCTIHDGVAGVHFIYTREGRQSGEAFVELESEDDVKLAL 71

Query: 154 QRDRQNMGRRYVEVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYT 213
           ++DR++MG RY+EVF   R +               D     S P      +D       
Sbjct: 72  KKDRESMGHRYIEVFKSHRTE--------------MDWVLKHSGPNSADTAND------- 131

Query: 214 EILKLRGLPFSVTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMS 273
             ++LRGLPF  TK  II+FF   ++    I +    +GK TGEA+V+FAS E A++A+ 
Sbjct: 132 GFVRLRGLPFGCTKEEIIQFFSGLEIVPNGITLPVDPEGKITGEAFVQFASQELAEKALG 191

Query: 274 KDKMTIGSRYVELFPSTPNEAR 291
           K K  IG RY+E+F S+  E R
Sbjct: 192 KHKERIGHRYIEVFKSSQEEVR 192

BLAST of CmoCh03G006520 vs. ExPASy TrEMBL
Match: A0A444WWB9 (Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_B10g100373 PE=3 SV=1)

HSP 1 Score: 572.8 bits (1475), Expect = 1.8e-159
Identity = 337/619 (54.44%), Positives = 410/619 (66.24%), Query Frame = 0

Query: 42  AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGY-GSFPPTHAFPVVRLRGL 101
           AMLGSGGVSDGYEVGSKRQRMME NPYFAVSS T  FQPYGY G F P   FPVVRLRGL
Sbjct: 238 AMLGSGGVSDGYEVGSKRQRMMESNPYFAVSSGTGSFQPYGYGGGFQPPPPFPVVRLRGL 297

Query: 102 PFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYV 161
           PFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG++QVEFALQRDRQNMGRRYV
Sbjct: 298 PFNCTDIDILKFFAGLTIVDVLLVNKSGRFSGEAFVVFAGAMQVEFALQRDRQNMGRRYV 357

Query: 162 EVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPP-RQKRISDKDQMEYTEILKLRGLPFS 221
           EVF CK+QDYYNAVA+EVNYEGIYDNDY+GSPPP R KR SDKDQMEYTEILK+RGLPFS
Sbjct: 358 EVFRCKKQDYYNAVASEVNYEGIYDNDYHGSPPPSRTKRFSDKDQMEYTEILKMRGLPFS 417

Query: 222 VTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYV 281
            TK+ II+FF +F L E+R+HIA R DGKATGEAYVEF S ++AKRAM KDKMTIGSRYV
Sbjct: 418 ATKAQIIDFFKDFKLIEDRVHIACRPDGKATGEAYVEFVSPDEAKRAMCKDKMTIGSRYV 477

Query: 282 ELFPSTPNEARRAESRVTLLCAVYLKNEMRPSRQLLKFNYSRKGKSLKTSPDGFQAFKRW 341
           ELFPST +EARRAESR   +  + +KN +  +  LL F  + + + L  S   +Q     
Sbjct: 478 ELFPSTTDEARRAESRSRHIFNLAMKNTLLTT--LLVFKNTIR-EELIRSVHVYQIKDGK 537

Query: 342 ATENGNLCVILKPNDEEVQHPITIPSPEEPLLVHSLFSAWSFLLFEFVFFTEQNKSSFHM 401
             E     V  +           I   ++ L V  +   +   ++  +F       SFH 
Sbjct: 538 VREVEREFVFSESGSYGEMRSTPILRLQKKLCVAEVSEGYQNGVWLCIF-------SFHR 597

Query: 402 TSSGFSTEFPNVADPMHDSYLSCFP----------KLNDEK------------------- 461
                 +  PN+     +  L   P          KL  +K                   
Sbjct: 598 DHKPQFSSIPNLLLLSRNLKLRTIPTLLRDLRVIYKLEKDKDVLSDSTGEERQGNNKDSQ 657

Query: 462 -------FLHQDLNFLPCSSSMAVSKGPEIHQMKELCEPGGEFPVLAKKKRRATSEHIAG 521
                   L QDLNFLP    M  S+ P+ +++     P  +    A+KK+RA+S+ +A 
Sbjct: 658 PLRKVVPVLTQDLNFLPYEDDM--SESPD-NKLDVQALPDAD---SAEKKKRASSDRVAK 717

Query: 522 IALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEAK 581
           I LS+L KYFD+PI EASR LNVGLTVLKRKCREFGI RWPHRKIKS+D LI ++QEEA 
Sbjct: 718 ITLSELVKYFDIPIVEASRRLNVGLTVLKRKCREFGIPRWPHRKIKSLDSLIHEIQEEAN 777

Query: 582 HREEDQK-ALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKARA---LT 619
           ++E D K A +A  +++ ML++E+E+IER PF +++SETK+ RQD+FKRRH+ARA   L 
Sbjct: 778 NQESDDKAAALAAIEKRRMLESEKENIERKPFMDIQSETKKLRQDIFKRRHRARAQKHLG 837

BLAST of CmoCh03G006520 vs. ExPASy TrEMBL
Match: A0A444WWD1 (Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_B10g100373 PE=3 SV=1)

HSP 1 Score: 572.8 bits (1475), Expect = 1.8e-159
Identity = 337/619 (54.44%), Positives = 410/619 (66.24%), Query Frame = 0

Query: 42  AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGY-GSFPPTHAFPVVRLRGL 101
           AMLGSGGVSDGYEVGSKRQRMME NPYFAVSS T  FQPYGY G F P   FPVVRLRGL
Sbjct: 238 AMLGSGGVSDGYEVGSKRQRMMESNPYFAVSSGTGSFQPYGYGGGFQPPPPFPVVRLRGL 297

Query: 102 PFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYV 161
           PFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG++QVEFALQRDRQNMGRRYV
Sbjct: 298 PFNCTDIDILKFFAGLTIVDVLLVNKSGRFSGEAFVVFAGAMQVEFALQRDRQNMGRRYV 357

Query: 162 EVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPP-RQKRISDKDQMEYTEILKLRGLPFS 221
           EVF CK+QDYYNAVA+EVNYEGIYDNDY+GSPPP R KR SDKDQMEYTEILK+RGLPFS
Sbjct: 358 EVFRCKKQDYYNAVASEVNYEGIYDNDYHGSPPPSRTKRFSDKDQMEYTEILKMRGLPFS 417

Query: 222 VTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYV 281
            TK+ II+FF +F L E+R+HIA R DGKATGEAYVEF S ++AKRAM KDKMTIGSRYV
Sbjct: 418 ATKAQIIDFFKDFKLIEDRVHIACRPDGKATGEAYVEFVSPDEAKRAMCKDKMTIGSRYV 477

Query: 282 ELFPSTPNEARRAESRVTLLCAVYLKNEMRPSRQLLKFNYSRKGKSLKTSPDGFQAFKRW 341
           ELFPST +EARRAESR   +  + +KN +  +  LL F  + + + L  S   +Q     
Sbjct: 478 ELFPSTTDEARRAESRSRHIFNLAMKNTLLTT--LLVFKNTIR-EELIRSVHVYQIKDGK 537

Query: 342 ATENGNLCVILKPNDEEVQHPITIPSPEEPLLVHSLFSAWSFLLFEFVFFTEQNKSSFHM 401
             E     V  +           I   ++ L V  +   +   ++  +F       SFH 
Sbjct: 538 VREVEREFVFSESGSYGEMRSTPILRLQKKLCVAEVSEGYQNGVWLCIF-------SFHR 597

Query: 402 TSSGFSTEFPNVADPMHDSYLSCFP----------KLNDEK------------------- 461
                 +  PN+     +  L   P          KL  +K                   
Sbjct: 598 DHKPQFSSIPNLLLLSRNLKLRTIPTLLRDLRVIYKLEKDKDVLSDSTGEERQGNNKDSQ 657

Query: 462 -------FLHQDLNFLPCSSSMAVSKGPEIHQMKELCEPGGEFPVLAKKKRRATSEHIAG 521
                   L QDLNFLP    M  S+ P+ +++     P  +    A+KK+RA+S+ +A 
Sbjct: 658 PLRKVVPVLTQDLNFLPYEDDM--SESPD-NKLDVQALPDAD---SAEKKKRASSDRVAK 717

Query: 522 IALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEAK 581
           I LS+L KYFD+PI EASR LNVGLTVLKRKCREFGI RWPHRKIKS+D LI ++QEEA 
Sbjct: 718 ITLSELVKYFDIPIVEASRRLNVGLTVLKRKCREFGIPRWPHRKIKSLDSLIHEIQEEAN 777

Query: 582 HREEDQK-ALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKARA---LT 619
           ++E D K A +A  +++ ML++E+E+IER PF +++SETK+ RQD+FKRRH+ARA   L 
Sbjct: 778 NQESDDKAAALAAIEKRRMLESEKENIERKPFMDIQSETKKLRQDIFKRRHRARAQKHLG 837

BLAST of CmoCh03G006520 vs. ExPASy TrEMBL
Match: A0A444WWG1 (Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_B10g100373 PE=3 SV=1)

HSP 1 Score: 572.8 bits (1475), Expect = 1.8e-159
Identity = 337/619 (54.44%), Positives = 410/619 (66.24%), Query Frame = 0

Query: 42  AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGY-GSFPPTHAFPVVRLRGL 101
           AMLGSGGVSDGYEVGSKRQRMME NPYFAVSS T  FQPYGY G F P   FPVVRLRGL
Sbjct: 238 AMLGSGGVSDGYEVGSKRQRMMESNPYFAVSSGTGSFQPYGYGGGFQPPPPFPVVRLRGL 297

Query: 102 PFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYV 161
           PFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG++QVEFALQRDRQNMGRRYV
Sbjct: 298 PFNCTDIDILKFFAGLTIVDVLLVNKSGRFSGEAFVVFAGAMQVEFALQRDRQNMGRRYV 357

Query: 162 EVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPP-RQKRISDKDQMEYTEILKLRGLPFS 221
           EVF CK+QDYYNAVA+EVNYEGIYDNDY+GSPPP R KR SDKDQMEYTEILK+RGLPFS
Sbjct: 358 EVFRCKKQDYYNAVASEVNYEGIYDNDYHGSPPPSRTKRFSDKDQMEYTEILKMRGLPFS 417

Query: 222 VTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYV 281
            TK+ II+FF +F L E+R+HIA R DGKATGEAYVEF S ++AKRAM KDKMTIGSRYV
Sbjct: 418 ATKAQIIDFFKDFKLIEDRVHIACRPDGKATGEAYVEFVSPDEAKRAMCKDKMTIGSRYV 477

Query: 282 ELFPSTPNEARRAESRVTLLCAVYLKNEMRPSRQLLKFNYSRKGKSLKTSPDGFQAFKRW 341
           ELFPST +EARRAESR   +  + +KN +  +  LL F  + + + L  S   +Q     
Sbjct: 478 ELFPSTTDEARRAESRSRHIFNLAMKNTLLTT--LLVFKNTIR-EELIRSVHVYQIKDGK 537

Query: 342 ATENGNLCVILKPNDEEVQHPITIPSPEEPLLVHSLFSAWSFLLFEFVFFTEQNKSSFHM 401
             E     V  +           I   ++ L V  +   +   ++  +F       SFH 
Sbjct: 538 VREVEREFVFSESGSYGEMRSTPILRLQKKLCVAEVSEGYQNGVWLCIF-------SFHR 597

Query: 402 TSSGFSTEFPNVADPMHDSYLSCFP----------KLNDEK------------------- 461
                 +  PN+     +  L   P          KL  +K                   
Sbjct: 598 DHKPQFSSIPNLLLLSRNLKLRTIPTLLRDLRVIYKLEKDKDVLSDSTGEERQGNNKDSQ 657

Query: 462 -------FLHQDLNFLPCSSSMAVSKGPEIHQMKELCEPGGEFPVLAKKKRRATSEHIAG 521
                   L QDLNFLP    M  S+ P+ +++     P  +    A+KK+RA+S+ +A 
Sbjct: 658 PLRKVVPVLTQDLNFLPYEDDM--SESPD-NKLDVQALPDAD---SAEKKKRASSDRVAK 717

Query: 522 IALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEAK 581
           I LS+L KYFD+PI EASR LNVGLTVLKRKCREFGI RWPHRKIKS+D LI ++QEEA 
Sbjct: 718 ITLSELVKYFDIPIVEASRRLNVGLTVLKRKCREFGIPRWPHRKIKSLDSLIHEIQEEAN 777

Query: 582 HREEDQK-ALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKARA---LT 619
           ++E D K A +A  +++ ML++E+E+IER PF +++SETK+ RQD+FKRRH+ARA   L 
Sbjct: 778 NQESDDKAAALAAIEKRRMLESEKENIERKPFMDIQSETKKLRQDIFKRRHRARAQKHLG 837

BLAST of CmoCh03G006520 vs. ExPASy TrEMBL
Match: A0A444WWK2 (Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_B10g100373 PE=3 SV=1)

HSP 1 Score: 572.8 bits (1475), Expect = 1.8e-159
Identity = 337/619 (54.44%), Positives = 410/619 (66.24%), Query Frame = 0

Query: 42  AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGY-GSFPPTHAFPVVRLRGL 101
           AMLGSGGVSDGYEVGSKRQRMME NPYFAVSS T  FQPYGY G F P   FPVVRLRGL
Sbjct: 238 AMLGSGGVSDGYEVGSKRQRMMESNPYFAVSSGTGSFQPYGYGGGFQPPPPFPVVRLRGL 297

Query: 102 PFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYV 161
           PFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG++QVEFALQRDRQNMGRRYV
Sbjct: 298 PFNCTDIDILKFFAGLTIVDVLLVNKSGRFSGEAFVVFAGAMQVEFALQRDRQNMGRRYV 357

Query: 162 EVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPP-RQKRISDKDQMEYTEILKLRGLPFS 221
           EVF CK+QDYYNAVA+EVNYEGIYDNDY+GSPPP R KR SDKDQMEYTEILK+RGLPFS
Sbjct: 358 EVFRCKKQDYYNAVASEVNYEGIYDNDYHGSPPPSRTKRFSDKDQMEYTEILKMRGLPFS 417

Query: 222 VTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYV 281
            TK+ II+FF +F L E+R+HIA R DGKATGEAYVEF S ++AKRAM KDKMTIGSRYV
Sbjct: 418 ATKAQIIDFFKDFKLIEDRVHIACRPDGKATGEAYVEFVSPDEAKRAMCKDKMTIGSRYV 477

Query: 282 ELFPSTPNEARRAESRVTLLCAVYLKNEMRPSRQLLKFNYSRKGKSLKTSPDGFQAFKRW 341
           ELFPST +EARRAESR   +  + +KN +  +  LL F  + + + L  S   +Q     
Sbjct: 478 ELFPSTTDEARRAESRSRHIFNLAMKNTLLTT--LLVFKNTIR-EELIRSVHVYQIKDGK 537

Query: 342 ATENGNLCVILKPNDEEVQHPITIPSPEEPLLVHSLFSAWSFLLFEFVFFTEQNKSSFHM 401
             E     V  +           I   ++ L V  +   +   ++  +F       SFH 
Sbjct: 538 VREVEREFVFSESGSYGEMRSTPILRLQKKLCVAEVSEGYQNGVWLCIF-------SFHR 597

Query: 402 TSSGFSTEFPNVADPMHDSYLSCFP----------KLNDEK------------------- 461
                 +  PN+     +  L   P          KL  +K                   
Sbjct: 598 DHKPQFSSIPNLLLLSRNLKLRTIPTLLRDLRVIYKLEKDKDVLSDSTGEERQGNNKDSQ 657

Query: 462 -------FLHQDLNFLPCSSSMAVSKGPEIHQMKELCEPGGEFPVLAKKKRRATSEHIAG 521
                   L QDLNFLP    M  S+ P+ +++     P  +    A+KK+RA+S+ +A 
Sbjct: 658 PLRKVVPVLTQDLNFLPYEDDM--SESPD-NKLDVQALPDAD---SAEKKKRASSDRVAK 717

Query: 522 IALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEAK 581
           I LS+L KYFD+PI EASR LNVGLTVLKRKCREFGI RWPHRKIKS+D LI ++QEEA 
Sbjct: 718 ITLSELVKYFDIPIVEASRRLNVGLTVLKRKCREFGIPRWPHRKIKSLDSLIHEIQEEAN 777

Query: 582 HREEDQK-ALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKARA---LT 619
           ++E D K A +A  +++ ML++E+E+IER PF +++SETK+ RQD+FKRRH+ARA   L 
Sbjct: 778 NQESDDKAAALAAIEKRRMLESEKENIERKPFMDIQSETKKLRQDIFKRRHRARAQKHLG 837

BLAST of CmoCh03G006520 vs. ExPASy TrEMBL
Match: A0A6J1GEF2 (heterogeneous nuclear ribonucleoprotein H-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111453406 PE=4 SV=1)

HSP 1 Score: 511.9 bits (1317), Expect = 3.8e-141
Identity = 254/254 (100.00%), Positives = 254/254 (100.00%), Query Frame = 0

Query: 42  AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGYGSFPPTHAFPVVRLRGLP 101
           AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGYGSFPPTHAFPVVRLRGLP
Sbjct: 7   AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGYGSFPPTHAFPVVRLRGLP 66

Query: 102 FNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYVE 161
           FNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYVE
Sbjct: 67  FNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYVE 126

Query: 162 VFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYTEILKLRGLPFSVT 221
           VFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYTEILKLRGLPFSVT
Sbjct: 127 VFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYTEILKLRGLPFSVT 186

Query: 222 KSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYVEL 281
           KSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYVEL
Sbjct: 187 KSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYVEL 246

Query: 282 FPSTPNEARRAESR 296
           FPSTPNEARRAESR
Sbjct: 247 FPSTPNEARRAESR 260

BLAST of CmoCh03G006520 vs. NCBI nr
Match: KAG6603861.1 (hypothetical protein SDJN03_04470, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 775.0 bits (2000), Expect = 5.0e-220
Identity = 417/557 (74.87%), Positives = 419/557 (75.22%), Query Frame = 0

Query: 42  AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGYGSFPPTHAFPVVRLRGLP 101
           AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGYGSFPPTHAFPVVRLRGLP
Sbjct: 292 AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGYGSFPPTHAFPVVRLRGLP 351

Query: 102 FNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYVE 161
           FNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYVE
Sbjct: 352 FNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYVE 411

Query: 162 VFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYTEILKLRGLPFSVT 221
           VFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYTEILKLRGLPFSVT
Sbjct: 412 VFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPPRQKRISDKDQMEYTEILKLRGLPFSVT 471

Query: 222 KSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYVEL 281
           KSNIIEFFGEFDLAEERIHIASR DGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYVEL
Sbjct: 472 KSNIIEFFGEFDLAEERIHIASRPDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYVEL 531

Query: 282 FPSTPNEARRAESRVTLLCAVYLKNEMRPSRQLLKFNYSRKGKSLKTSPDGFQAFKRWAT 341
           FPSTPNEARRAESR  ++                                          
Sbjct: 532 FPSTPNEARRAESRAFVVS----------------------------------------- 591

Query: 342 ENGNLCVILKPNDEEVQHPITIPSPEEPLLVHSLFSAWSFLLFEFVFFTEQNKSSFHMTS 401
                                                                    MTS
Sbjct: 592 --------------------------------------------------------QMTS 651

Query: 402 SGFSTEFPNVADPMHDSYLSCFPKLNDEKFLHQDLNFLPCSSSMAVSKGPEIHQMKELCE 461
           SGFSTEFPNVADPMHDSYLSCFPKLNDEKFLHQDLNFLPCSSSMAVSKGPEIHQMKELCE
Sbjct: 652 SGFSTEFPNVADPMHDSYLSCFPKLNDEKFLHQDLNFLPCSSSMAVSKGPEIHQMKELCE 711

Query: 462 PGGEFPVLAKKKRRATSEHIAGIALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIH 521
           P                                    EASRSLNVGLTVLKRKCREFGIH
Sbjct: 712 P------------------------------------EASRSLNVGLTVLKRKCREFGIH 715

Query: 522 RWPHRKIKSIDGLIRDLQEEAKHREEDQKALMAVTKRQMMLQNERESIERTPFRELESET 581
           RWPHRKIKSIDGLIRDLQEEAKHREEDQKALMAVTKRQMMLQNERESIERTPFRELESET
Sbjct: 772 RWPHRKIKSIDGLIRDLQEEAKHREEDQKALMAVTKRQMMLQNERESIERTPFRELESET 715

Query: 582 KRFRQDVFKRRHKARAL 599
           KRFRQDVFKRRHKARAL
Sbjct: 832 KRFRQDVFKRRHKARAL 715

BLAST of CmoCh03G006520 vs. NCBI nr
Match: RYQ81764.1 (hypothetical protein Ahy_B10g100373 isoform A [Arachis hypogaea])

HSP 1 Score: 572.8 bits (1475), Expect = 3.7e-159
Identity = 337/619 (54.44%), Positives = 410/619 (66.24%), Query Frame = 0

Query: 42  AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGY-GSFPPTHAFPVVRLRGL 101
           AMLGSGGVSDGYEVGSKRQRMME NPYFAVSS T  FQPYGY G F P   FPVVRLRGL
Sbjct: 238 AMLGSGGVSDGYEVGSKRQRMMESNPYFAVSSGTGSFQPYGYGGGFQPPPPFPVVRLRGL 297

Query: 102 PFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYV 161
           PFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG++QVEFALQRDRQNMGRRYV
Sbjct: 298 PFNCTDIDILKFFAGLTIVDVLLVNKSGRFSGEAFVVFAGAMQVEFALQRDRQNMGRRYV 357

Query: 162 EVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPP-RQKRISDKDQMEYTEILKLRGLPFS 221
           EVF CK+QDYYNAVA+EVNYEGIYDNDY+GSPPP R KR SDKDQMEYTEILK+RGLPFS
Sbjct: 358 EVFRCKKQDYYNAVASEVNYEGIYDNDYHGSPPPSRTKRFSDKDQMEYTEILKMRGLPFS 417

Query: 222 VTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYV 281
            TK+ II+FF +F L E+R+HIA R DGKATGEAYVEF S ++AKRAM KDKMTIGSRYV
Sbjct: 418 ATKAQIIDFFKDFKLIEDRVHIACRPDGKATGEAYVEFVSPDEAKRAMCKDKMTIGSRYV 477

Query: 282 ELFPSTPNEARRAESRVTLLCAVYLKNEMRPSRQLLKFNYSRKGKSLKTSPDGFQAFKRW 341
           ELFPST +EARRAESR   +  + +KN +  +  LL F  + + + L  S   +Q     
Sbjct: 478 ELFPSTTDEARRAESRSRHIFNLAMKNTLLTT--LLVFKNTIR-EELIRSVHVYQIKDGK 537

Query: 342 ATENGNLCVILKPNDEEVQHPITIPSPEEPLLVHSLFSAWSFLLFEFVFFTEQNKSSFHM 401
             E     V  +           I   ++ L V  +   +   ++  +F       SFH 
Sbjct: 538 VREVEREFVFSESGSYGEMRSTPILRLQKKLCVAEVSEGYQNGVWLCIF-------SFHR 597

Query: 402 TSSGFSTEFPNVADPMHDSYLSCFP----------KLNDEK------------------- 461
                 +  PN+     +  L   P          KL  +K                   
Sbjct: 598 DHKPQFSSIPNLLLLSRNLKLRTIPTLLRDLRVIYKLEKDKDVLSDSTGEERQGNNKDSQ 657

Query: 462 -------FLHQDLNFLPCSSSMAVSKGPEIHQMKELCEPGGEFPVLAKKKRRATSEHIAG 521
                   L QDLNFLP    M  S+ P+ +++     P  +    A+KK+RA+S+ +A 
Sbjct: 658 PLRKVVPVLTQDLNFLPYEDDM--SESPD-NKLDVQALPDAD---SAEKKKRASSDRVAK 717

Query: 522 IALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEAK 581
           I LS+L KYFD+PI EASR LNVGLTVLKRKCREFGI RWPHRKIKS+D LI ++QEEA 
Sbjct: 718 ITLSELVKYFDIPIVEASRRLNVGLTVLKRKCREFGIPRWPHRKIKSLDSLIHEIQEEAN 777

Query: 582 HREEDQK-ALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKARA---LT 619
           ++E D K A +A  +++ ML++E+E+IER PF +++SETK+ RQD+FKRRH+ARA   L 
Sbjct: 778 NQESDDKAAALAAIEKRRMLESEKENIERKPFMDIQSETKKLRQDIFKRRHRARAQKHLG 837

BLAST of CmoCh03G006520 vs. NCBI nr
Match: RYQ81765.1 (hypothetical protein Ahy_B10g100373 isoform C [Arachis hypogaea])

HSP 1 Score: 572.8 bits (1475), Expect = 3.7e-159
Identity = 337/619 (54.44%), Positives = 410/619 (66.24%), Query Frame = 0

Query: 42  AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGY-GSFPPTHAFPVVRLRGL 101
           AMLGSGGVSDGYEVGSKRQRMME NPYFAVSS T  FQPYGY G F P   FPVVRLRGL
Sbjct: 238 AMLGSGGVSDGYEVGSKRQRMMESNPYFAVSSGTGSFQPYGYGGGFQPPPPFPVVRLRGL 297

Query: 102 PFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYV 161
           PFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG++QVEFALQRDRQNMGRRYV
Sbjct: 298 PFNCTDIDILKFFAGLTIVDVLLVNKSGRFSGEAFVVFAGAMQVEFALQRDRQNMGRRYV 357

Query: 162 EVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPP-RQKRISDKDQMEYTEILKLRGLPFS 221
           EVF CK+QDYYNAVA+EVNYEGIYDNDY+GSPPP R KR SDKDQMEYTEILK+RGLPFS
Sbjct: 358 EVFRCKKQDYYNAVASEVNYEGIYDNDYHGSPPPSRTKRFSDKDQMEYTEILKMRGLPFS 417

Query: 222 VTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYV 281
            TK+ II+FF +F L E+R+HIA R DGKATGEAYVEF S ++AKRAM KDKMTIGSRYV
Sbjct: 418 ATKAQIIDFFKDFKLIEDRVHIACRPDGKATGEAYVEFVSPDEAKRAMCKDKMTIGSRYV 477

Query: 282 ELFPSTPNEARRAESRVTLLCAVYLKNEMRPSRQLLKFNYSRKGKSLKTSPDGFQAFKRW 341
           ELFPST +EARRAESR   +  + +KN +  +  LL F  + + + L  S   +Q     
Sbjct: 478 ELFPSTTDEARRAESRSRHIFNLAMKNTLLTT--LLVFKNTIR-EELIRSVHVYQIKDGK 537

Query: 342 ATENGNLCVILKPNDEEVQHPITIPSPEEPLLVHSLFSAWSFLLFEFVFFTEQNKSSFHM 401
             E     V  +           I   ++ L V  +   +   ++  +F       SFH 
Sbjct: 538 VREVEREFVFSESGSYGEMRSTPILRLQKKLCVAEVSEGYQNGVWLCIF-------SFHR 597

Query: 402 TSSGFSTEFPNVADPMHDSYLSCFP----------KLNDEK------------------- 461
                 +  PN+     +  L   P          KL  +K                   
Sbjct: 598 DHKPQFSSIPNLLLLSRNLKLRTIPTLLRDLRVIYKLEKDKDVLSDSTGEERQGNNKDSQ 657

Query: 462 -------FLHQDLNFLPCSSSMAVSKGPEIHQMKELCEPGGEFPVLAKKKRRATSEHIAG 521
                   L QDLNFLP    M  S+ P+ +++     P  +    A+KK+RA+S+ +A 
Sbjct: 658 PLRKVVPVLTQDLNFLPYEDDM--SESPD-NKLDVQALPDAD---SAEKKKRASSDRVAK 717

Query: 522 IALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEAK 581
           I LS+L KYFD+PI EASR LNVGLTVLKRKCREFGI RWPHRKIKS+D LI ++QEEA 
Sbjct: 718 ITLSELVKYFDIPIVEASRRLNVGLTVLKRKCREFGIPRWPHRKIKSLDSLIHEIQEEAN 777

Query: 582 HREEDQK-ALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKARA---LT 619
           ++E D K A +A  +++ ML++E+E+IER PF +++SETK+ RQD+FKRRH+ARA   L 
Sbjct: 778 NQESDDKAAALAAIEKRRMLESEKENIERKPFMDIQSETKKLRQDIFKRRHRARAQKHLG 837

BLAST of CmoCh03G006520 vs. NCBI nr
Match: RYQ81766.1 (hypothetical protein Ahy_B10g100373 isoform B [Arachis hypogaea])

HSP 1 Score: 572.8 bits (1475), Expect = 3.7e-159
Identity = 337/619 (54.44%), Positives = 410/619 (66.24%), Query Frame = 0

Query: 42  AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGY-GSFPPTHAFPVVRLRGL 101
           AMLGSGGVSDGYEVGSKRQRMME NPYFAVSS T  FQPYGY G F P   FPVVRLRGL
Sbjct: 238 AMLGSGGVSDGYEVGSKRQRMMESNPYFAVSSGTGSFQPYGYGGGFQPPPPFPVVRLRGL 297

Query: 102 PFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYV 161
           PFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG++QVEFALQRDRQNMGRRYV
Sbjct: 298 PFNCTDIDILKFFAGLTIVDVLLVNKSGRFSGEAFVVFAGAMQVEFALQRDRQNMGRRYV 357

Query: 162 EVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPP-RQKRISDKDQMEYTEILKLRGLPFS 221
           EVF CK+QDYYNAVA+EVNYEGIYDNDY+GSPPP R KR SDKDQMEYTEILK+RGLPFS
Sbjct: 358 EVFRCKKQDYYNAVASEVNYEGIYDNDYHGSPPPSRTKRFSDKDQMEYTEILKMRGLPFS 417

Query: 222 VTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYV 281
            TK+ II+FF +F L E+R+HIA R DGKATGEAYVEF S ++AKRAM KDKMTIGSRYV
Sbjct: 418 ATKAQIIDFFKDFKLIEDRVHIACRPDGKATGEAYVEFVSPDEAKRAMCKDKMTIGSRYV 477

Query: 282 ELFPSTPNEARRAESRVTLLCAVYLKNEMRPSRQLLKFNYSRKGKSLKTSPDGFQAFKRW 341
           ELFPST +EARRAESR   +  + +KN +  +  LL F  + + + L  S   +Q     
Sbjct: 478 ELFPSTTDEARRAESRSRHIFNLAMKNTLLTT--LLVFKNTIR-EELIRSVHVYQIKDGK 537

Query: 342 ATENGNLCVILKPNDEEVQHPITIPSPEEPLLVHSLFSAWSFLLFEFVFFTEQNKSSFHM 401
             E     V  +           I   ++ L V  +   +   ++  +F       SFH 
Sbjct: 538 VREVEREFVFSESGSYGEMRSTPILRLQKKLCVAEVSEGYQNGVWLCIF-------SFHR 597

Query: 402 TSSGFSTEFPNVADPMHDSYLSCFP----------KLNDEK------------------- 461
                 +  PN+     +  L   P          KL  +K                   
Sbjct: 598 DHKPQFSSIPNLLLLSRNLKLRTIPTLLRDLRVIYKLEKDKDVLSDSTGEERQGNNKDSQ 657

Query: 462 -------FLHQDLNFLPCSSSMAVSKGPEIHQMKELCEPGGEFPVLAKKKRRATSEHIAG 521
                   L QDLNFLP    M  S+ P+ +++     P  +    A+KK+RA+S+ +A 
Sbjct: 658 PLRKVVPVLTQDLNFLPYEDDM--SESPD-NKLDVQALPDAD---SAEKKKRASSDRVAK 717

Query: 522 IALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEAK 581
           I LS+L KYFD+PI EASR LNVGLTVLKRKCREFGI RWPHRKIKS+D LI ++QEEA 
Sbjct: 718 ITLSELVKYFDIPIVEASRRLNVGLTVLKRKCREFGIPRWPHRKIKSLDSLIHEIQEEAN 777

Query: 582 HREEDQK-ALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKARA---LT 619
           ++E D K A +A  +++ ML++E+E+IER PF +++SETK+ RQD+FKRRH+ARA   L 
Sbjct: 778 NQESDDKAAALAAIEKRRMLESEKENIERKPFMDIQSETKKLRQDIFKRRHRARAQKHLG 837

BLAST of CmoCh03G006520 vs. NCBI nr
Match: RYQ81767.1 (hypothetical protein Ahy_B10g100373 isoform D [Arachis hypogaea])

HSP 1 Score: 572.8 bits (1475), Expect = 3.7e-159
Identity = 337/619 (54.44%), Positives = 410/619 (66.24%), Query Frame = 0

Query: 42  AMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGY-GSFPPTHAFPVVRLRGL 101
           AMLGSGGVSDGYEVGSKRQRMME NPYFAVSS T  FQPYGY G F P   FPVVRLRGL
Sbjct: 238 AMLGSGGVSDGYEVGSKRQRMMESNPYFAVSSGTGSFQPYGYGGGFQPPPPFPVVRLRGL 297

Query: 102 PFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYV 161
           PFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG++QVEFALQRDRQNMGRRYV
Sbjct: 298 PFNCTDIDILKFFAGLTIVDVLLVNKSGRFSGEAFVVFAGAMQVEFALQRDRQNMGRRYV 357

Query: 162 EVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPP-RQKRISDKDQMEYTEILKLRGLPFS 221
           EVF CK+QDYYNAVA+EVNYEGIYDNDY+GSPPP R KR SDKDQMEYTEILK+RGLPFS
Sbjct: 358 EVFRCKKQDYYNAVASEVNYEGIYDNDYHGSPPPSRTKRFSDKDQMEYTEILKMRGLPFS 417

Query: 222 VTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMTIGSRYV 281
            TK+ II+FF +F L E+R+HIA R DGKATGEAYVEF S ++AKRAM KDKMTIGSRYV
Sbjct: 418 ATKAQIIDFFKDFKLIEDRVHIACRPDGKATGEAYVEFVSPDEAKRAMCKDKMTIGSRYV 477

Query: 282 ELFPSTPNEARRAESRVTLLCAVYLKNEMRPSRQLLKFNYSRKGKSLKTSPDGFQAFKRW 341
           ELFPST +EARRAESR   +  + +KN +  +  LL F  + + + L  S   +Q     
Sbjct: 478 ELFPSTTDEARRAESRSRHIFNLAMKNTLLTT--LLVFKNTIR-EELIRSVHVYQIKDGK 537

Query: 342 ATENGNLCVILKPNDEEVQHPITIPSPEEPLLVHSLFSAWSFLLFEFVFFTEQNKSSFHM 401
             E     V  +           I   ++ L V  +   +   ++  +F       SFH 
Sbjct: 538 VREVEREFVFSESGSYGEMRSTPILRLQKKLCVAEVSEGYQNGVWLCIF-------SFHR 597

Query: 402 TSSGFSTEFPNVADPMHDSYLSCFP----------KLNDEK------------------- 461
                 +  PN+     +  L   P          KL  +K                   
Sbjct: 598 DHKPQFSSIPNLLLLSRNLKLRTIPTLLRDLRVIYKLEKDKDVLSDSTGEERQGNNKDSQ 657

Query: 462 -------FLHQDLNFLPCSSSMAVSKGPEIHQMKELCEPGGEFPVLAKKKRRATSEHIAG 521
                   L QDLNFLP    M  S+ P+ +++     P  +    A+KK+RA+S+ +A 
Sbjct: 658 PLRKVVPVLTQDLNFLPYEDDM--SESPD-NKLDVQALPDAD---SAEKKKRASSDRVAK 717

Query: 522 IALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEAK 581
           I LS+L KYFD+PI EASR LNVGLTVLKRKCREFGI RWPHRKIKS+D LI ++QEEA 
Sbjct: 718 ITLSELVKYFDIPIVEASRRLNVGLTVLKRKCREFGIPRWPHRKIKSLDSLIHEIQEEAN 777

Query: 582 HREEDQK-ALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKARA---LT 619
           ++E D K A +A  +++ ML++E+E+IER PF +++SETK+ RQD+FKRRH+ARA   L 
Sbjct: 778 NQESDDKAAALAAIEKRRMLESEKENIERKPFMDIQSETKKLRQDIFKRRHRARAQKHLG 837

BLAST of CmoCh03G006520 vs. TAIR 10
Match: AT5G66010.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 333.2 bits (853), Expect = 4.7e-91
Identity = 170/262 (64.89%), Positives = 202/262 (77.10%), Query Frame = 0

Query: 38  GNGMAMLGSGGVSDGYEVGSKRQRMMEPNPYFAVSSSTAGFQPYGYGSFPPTHAFPVVRL 97
           G+  AM GSG    GYEVGSKRQRMM+ NPY AV +    F P+GY        FPVVRL
Sbjct: 3   GSRGAMFGSG----GYEVGSKRQRMMQSNPYLAVGTGPTSFPPFGYAG-----GFPVVRL 62

Query: 98  RGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGR 157
           RGLPFNC DIDIF+FFAGL+IVDVLLV+KNG+F GEAFVVFAG +QVE ALQRDR NMGR
Sbjct: 63  RGLPFNCADIDIFEFFAGLNIVDVLLVSKNGKFSGEAFVVFAGPMQVEIALQRDRHNMGR 122

Query: 158 RYVEVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPP----RQKRISDKDQMEYTEILKL 217
           RYVEVF C +QDYYNAVAAE   EG Y+ +   SPPP    R KR S+K+++EYTE+LK+
Sbjct: 123 RYVEVFRCSKQDYYNAVAAE---EGAYEYEVRASPPPTGPSRAKRFSEKEKLEYTEVLKM 182

Query: 218 RGLPFSVTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEFASAEDAKRAMSKDKMT 277
           RGLP+SV K  IIEFF  + + + R+ +  R DGKATGEA+VEF + E+A+RAM+KDKM+
Sbjct: 183 RGLPYSVNKPQIIEFFSGYKVIQGRVQVVCRPDGKATGEAFVEFETGEEARRAMAKDKMS 242

Query: 278 IGSRYVELFPSTPNEARRAESR 296
           IGSRYVELFP+T  EARRAE+R
Sbjct: 243 IGSRYVELFPTTREEARRAEAR 252

BLAST of CmoCh03G006520 vs. TAIR 10
Match: AT3G20890.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 236.1 bits (601), Expect = 7.7e-62
Identity = 130/286 (45.45%), Positives = 179/286 (62.59%), Query Frame = 0

Query: 45  GSGGVSDGYEVGSKRQRMME---PNPYFAV-SSSTAGFQPYGYGSFPPTHAFPVVRLRGL 104
           G G   DG E+G KRQRM++   P P++    SS   + PYG+ + PP   FP VRLRGL
Sbjct: 5   GYGDGPDGREMGPKRQRMIDQGPPGPFYGPHPSSGFMYNPYGFVAPPPPPPFPAVRLRGL 64

Query: 105 PFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSLQVEFALQRDRQNMGRRYV 164
           PF+C ++D+ +FF GLD+VDVL V++N +  GEAF V    LQV+FALQ++RQNMGRRYV
Sbjct: 65  PFDCAELDVVEFFHGLDVVDVLFVHRNNKVTGEAFCVLGYPLQVDFALQKNRQNMGRRYV 124

Query: 165 EVFSCKRQDYYNAVAAEVNYEGIY--------------------------DNDYNGSPPP 224
           EVF   +Q+YY A+A EV    ++                               GS P 
Sbjct: 125 EVFRSTKQEYYKAIANEVAESRVHGMASGGGGGLGGGNGSGGGGGGGGGGGRISGGSSPR 184

Query: 225 R---QKRISD--KDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLAEERIHIASRSDGKA 284
           R   + R SD  K+ +E+T IL+LRGLPFS  K +I++FF +F+L+E+ +H+    +G+ 
Sbjct: 185 RHVQRARSSDDGKEDIEHTGILRLRGLPFSAGKEDILDFFKDFELSEDFVHVTVNGEGRP 244

Query: 285 TGEAYVEFASAEDAKRAMSKDKMTIGSRYVELFPSTPNEARRAESR 296
           TGEA+VEF +AED++ AM KD+ T+GSRY+ELFPS+  E   A SR
Sbjct: 245 TGEAFVEFRNAEDSRAAMVKDRKTLGSRYIELFPSSVEELEEALSR 290

BLAST of CmoCh03G006520 vs. TAIR 10
Match: AT5G66010.2 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 201.8 bits (512), Expect = 1.6e-51
Identity = 100/158 (63.29%), Positives = 124/158 (78.48%), Query Frame = 0

Query: 142 LQVEFALQRDRQNMGRRYVEVFSCKRQDYYNAVAAEVNYEGIYDNDYNGSPPP----RQK 201
           +QVE ALQRDR NMGRRYVEVF C +QDYYNAVAAE   EG Y+ +   SPPP    R K
Sbjct: 1   MQVEIALQRDRHNMGRRYVEVFRCSKQDYYNAVAAE---EGAYEYEVRASPPPTGPSRAK 60

Query: 202 RISDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLAEERIHIASRSDGKATGEAYVEF 261
           R S+K+++EYTE+LK+RGLP+SV K  IIEFF  + + + R+ +  R DGKATGEA+VEF
Sbjct: 61  RFSEKEKLEYTEVLKMRGLPYSVNKPQIIEFFSGYKVIQGRVQVVCRPDGKATGEAFVEF 120

Query: 262 ASAEDAKRAMSKDKMTIGSRYVELFPSTPNEARRAESR 296
            + E+A+RAM+KDKM+IGSRYVELFP+T  EARRAE+R
Sbjct: 121 ETGEEARRAMAKDKMSIGSRYVELFPTTREEARRAEAR 155

BLAST of CmoCh03G006520 vs. TAIR 10
Match: AT4G35590.1 (RWP-RK domain-containing protein )

HSP 1 Score: 154.8 bits (390), Expect = 2.3e-37
Identity = 89/172 (51.74%), Positives = 120/172 (69.77%), Query Frame = 0

Query: 432 LHQDLNFLPCS---SSMAVSKGPEIHQMK----ELCEPGGEFPVLAKKKRRATSEHIAGI 491
           L QDLN LP S   S  +V++  E  + +    E  E   +  +L KKK+R  S H+A +
Sbjct: 184 LKQDLNCLPDSETESEESVNEKTEHSEFENDKTEQSESDAKTEIL-KKKKRTPSRHVAEL 243

Query: 492 ALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEA-K 551
           +L +L+KYFD+ I EASR+L VGLTVLK+KCREFGI RWPHRKIKS+D LI DLQ EA K
Sbjct: 244 SLEELSKYFDLTIVEASRNLKVGLTVLKKKCREFGIPRWPHRKIKSLDCLIHDLQREAEK 303

Query: 552 HREEDQKALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRRHKA 596
            +E+++ A MAV K+Q  L+ E+ +I + PF E+  ETK+FRQ+ FK+RH+A
Sbjct: 304 QQEKNEAAAMAVAKKQEKLETEKRNIVKRPFMEIGIETKKFRQENFKKRHRA 354

BLAST of CmoCh03G006520 vs. TAIR 10
Match: AT5G53040.1 (RWP-RK domain-containing protein )

HSP 1 Score: 98.2 bits (243), Expect = 2.5e-20
Identity = 65/184 (35.33%), Positives = 98/184 (53.26%), Query Frame = 0

Query: 416 HDSYLSCFPKLNDEKFLHQDLNFL---PCSSSMAVSKGPEIHQMKELCEPGGEFPVLAKK 475
           ++S+   F  +  +  +  D++ L   P  SS + S  P   Q   L        V  KK
Sbjct: 77  YNSFEDFFENIEVDNTIPSDIHLLTQEPYFSSDSSSSSPLAIQNDGLISNVKVEKVTVKK 136

Query: 476 KRRATSEHIAGIALSDLAKYFDVPITEASRSLNVGLTVLKRKCREFGIHRWPHRKIKSID 535
           KR    +    + +S++ ++FD PI +A++ LNVGLTVLK++CRE GI+RWPHRK+KS++
Sbjct: 137 KRNLKKKRQDKLEMSEIKQFFDRPIMKAAKELNVGLTVLKKRCRELGIYRWPHRKLKSLN 196

Query: 536 GLIRDLQEEAKHREEDQKALMAVTKRQMMLQNERESIERTPFRELESETKRFRQDVFKRR 595
            LI++L+      EE+ K           L+  R  IE+ P  EL   TK+ RQ  FK  
Sbjct: 197 SLIKNLKNVG--MEEEVK----------NLEEHRFLIEQEPDAELSDGTKKLRQACFKAN 248

Query: 596 HKAR 597
           +K R
Sbjct: 257 YKRR 248
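
The Identity and Positives percentages in each HSP header above are the matched counts divided by the alignment length. As a minimal sketch (the function name `hsp_percentages` is hypothetical and not part of any BLAST tool), the reported figures can be rechecked in Python:

```python
import re

# Matches e.g. "Identity = 170/262 (64.89%), Positives = 202/262 (77.10%)".
# The optional "i" also tolerates a "Postives" misspelling in the label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(header: str) -> tuple[float, float]:
    """Recompute identity and positives percentages from an HSP header line."""
    m = HSP_STATS.search(header)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, length, _, pos, pos_length, _ = m.groups()
    return (
        round(100 * int(ident) / int(length), 2),
        round(100 * int(pos) / int(pos_length), 2),
    )

# Header of the AT5G66010.1 hit, copied from this page.
line = "Identity = 170/262 (64.89%), Positives = 202/262 (77.10%), Query Frame = 0"
print(hsp_percentages(line))
```

The recomputed values should agree with the percentages printed in parentheses; a mismatch would suggest the header line was damaged during export.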

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
O81791	3.2e-36	51.74	Protein RKD5 OS=Arabidopsis thaliana OX=3702 GN=RKD5 PE=3 SV=1
Q3US41	5.6e-25	35.10	Epithelial splicing regulatory protein 1 OS=Mus musculus OX=10090 GN=Esrp1 PE=1 ...
Q6NXG1	7.4e-25	35.10	Epithelial splicing regulatory protein 1 OS=Homo sapiens OX=9606 GN=ESRP1 PE=1 S...
B2RYD2	7.4e-25	35.10	Epithelial splicing regulatory protein 1 OS=Rattus norvegicus OX=10116 GN=Esrp1 ...
Q5E9J1	1.6e-24	36.14	Heterogeneous nuclear ribonucleoprotein F OS=Bos taurus OX=9913 GN=HNRNPF PE=2 S...

Match Name	E-value	Identity (%)	Description
A0A444WWB9	1.8e-159	54.44	Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_B10g100373 PE=3 SV=1
A0A444WWD1	1.8e-159	54.44	Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_B10g100373 PE=3 SV=1
A0A444WWG1	1.8e-159	54.44	Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_B10g100373 PE=3 SV=1
A0A444WWK2	1.8e-159	54.44	Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_B10g100373 PE=3 SV=1
A0A6J1GEF2	3.8e-141	100.00	heterogeneous nuclear ribonucleoprotein H-like isoform X1 OS=Cucurbita moschata ...

Match Name	E-value	Identity (%)	Description
KAG6603861.1	5.0e-220	74.87	hypothetical protein SDJN03_04470, partial [Cucurbita argyrosperma subsp. sorori...
RYQ81764.1	3.7e-159	54.44	hypothetical protein Ahy_B10g100373 isoform A [Arachis hypogaea]
RYQ81765.1	3.7e-159	54.44	hypothetical protein Ahy_B10g100373 isoform C [Arachis hypogaea]
RYQ81766.1	3.7e-159	54.44	hypothetical protein Ahy_B10g100373 isoform B [Arachis hypogaea]
RYQ81767.1	3.7e-159	54.44	hypothetical protein Ahy_B10g100373 isoform D [Arachis hypogaea]

Match Name	E-value	Identity (%)	Description
AT5G66010.1	4.7e-91	64.89	RNA-binding (RRM/RBD/RNP motifs) family protein
AT3G20890.1	7.7e-62	45.45	RNA-binding (RRM/RBD/RNP motifs) family protein
AT5G66010.2	1.6e-51	63.29	RNA-binding (RRM/RBD/RNP motifs) family protein
AT4G35590.1	2.3e-37	51.74	RWP-RK domain-containing protein
AT5G53040.1	2.5e-20	35.33	RWP-RK domain-containing protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000504	RNA recognition motif domain	SMART	SM00360	rrm1_1	coord: 210..282; e-value: 1.3E-8; score: 44.6
IPR000504	RNA recognition motif domain	SMART	SM00360	rrm1_1	coord: 94..163; e-value: 2.5E-5; score: 33.7
IPR000504	RNA recognition motif domain	PFAM	PF00076	RRM_1	coord: 96..158; e-value: 2.1E-5; score: 24.2
IPR000504	RNA recognition motif domain	PFAM	PF00076	RRM_1	coord: 213..279; e-value: 8.2E-9; score: 35.2
IPR000504	RNA recognition motif domain	PROSITE	PS50102	RRM	coord: 209..286; score: 10.496632
IPR012677	Nucleotide-binding alpha-beta plait domain superfamily	GENE3D	3.30.70.330		coord: 91..176; e-value: 6.4E-22; score: 79.5
IPR012677	Nucleotide-binding alpha-beta plait domain superfamily	GENE3D	3.30.70.330		coord: 194..297; e-value: 1.1E-28; score: 101.4
IPR003035	RWP-RK domain	PFAM	PF02042	RWP-RK	coord: 484..531; e-value: 1.3E-21; score: 76.3
IPR003035	RWP-RK domain	PROSITE	PS51519	RWP_RK	coord: 466..551; score: 18.667007
None	No IPR available	PANTHER	PTHR13976:SF71	RNA-BINDING (RRM/RBD/RNP MOTIFS) FAMILY PROTEIN	coord: 42..296
None	No IPR available	PANTHER	PTHR13976	HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN-RELATED	coord: 42..296
None	No IPR available	CDD	cd12254	RRM_hnRNPH_ESRPs_RBM12_like	coord: 210..282; e-value: 4.17896E-28; score: 105.332
None	No IPR available	CDD	cd12254	RRM_hnRNPH_ESRPs_RBM12_like	coord: 94..163; e-value: 5.94729E-24; score: 93.7759
IPR035979	RNA-binding domain superfamily	SUPERFAMILY	54928	RNA-binding domain, RBD	coord: 87..172
IPR035979	RNA-binding domain superfamily	SUPERFAMILY	54928	RNA-binding domain, RBD	coord: 208..294
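
One way to read the InterPro coordinates together with the TAIR 10 BLAST hits is to check which annotated domains fall inside the query range covered by a given hit. The following is a minimal sketch only: the coordinates are copied by hand from this page, and the helper `domains_within` is a hypothetical name introduced for illustration.

```python
# Domain coordinates (1-based, inclusive) copied from the InterPro table on this page.
DOMAINS = {
    "RRM_1 (PF00076), first": (96, 158),
    "RRM_1 (PF00076), second": (213, 279),
    "RWP-RK (PF02042)": (484, 531),
}

# Query ranges of two TAIR 10 hits, read from the first and last "Query:" lines
# of their alignments.
HITS = {
    "AT5G66010.1": (38, 296),
    "AT4G35590.1": (432, 596),
}

def domains_within(hit_range, domains):
    """Return names of domains that lie entirely inside the hit's query range."""
    start, end = hit_range
    return [
        name
        for name, (d_start, d_end) in domains.items()
        if start <= d_start and d_end <= end
    ]

for hit, query_range in HITS.items():
    print(hit, domains_within(query_range, DOMAINS))
```

Under these assumptions, the RNA-binding family hit spans both RRM domains, while the RWP-RK domain-containing hit spans only the C-terminal RWP-RK domain, which is consistent with the hit descriptions.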

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmoCh03G006520.1	CmoCh03G006520.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009987 cellular process
cellular_component GO:1990904 ribonucleoprotein complex
molecular_function GO:0016301 kinase activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0003676 nucleic acid binding