Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGACGATGTTCGAGTTAAAACAATTGTCGTGGCAATCTGTTCGCGTAATGTGCTAAGGATTCTATGGTGAAGGCAAGAGAACAGATTGGGGAAATGATGCAGAACCATTGTTAGGAGCAGGTTTGATGTTGAGAATTGAAGTAACCAAGTTGTGTATCTGAACGTTTTCGAAAGATGGGACTTTTCTCCCTCTTGCAAACCGTGGACCATGGCCAATGAAAATGGTTCTCATGGAAAATATGGCATTGTCATACCCATGATCTCCTCCACACTCCTTCCCTTTACTGTTCTTCATCTCCACCTTAAATCCCTCGTCAACCAACCCGATCACTGGAGGAATCCGATCGCTCGCACTGTAGTGAAGCCGAGACGGAAGCTCTTCCTTGACATACATTTTCAGATATTTCCCATTCGAAACTTTCCCAGATTGCAAAGCTTCATTCATTTTCAAGACAACATCAGCAGGTGAAACATCAGAAGGTGGACGAATTGCCAGCAGAGGTGTTCTTGATTGAACCCACCTCTCTGGAATAACAATCCATGGAGATAAATCCTCAAGAAAAATCAACTTCTTATCACAGTTTCCGACCATTCCATGATCACCAACCAAGATTACATTAACATCCTCGAAAACCCCTCTTTTTTCCAGGCCAGAAATCAGTTTACCAAGCATTAGATCAATTCTAGCTACAGCATCAGTGATTTGTGGATCATCAGGACCAACCTTGTGACCCTGATGATCTGGGTCCTCAAAATACAGAGTCATGAACACAGGAATTTCCTCACTAGGCAAATCAAAGTACTGCAAAATCATATCTACTCGTTCCTCAAATGGAACCGAACCATTATAGTGATGGCAGAAATTTACAGGGCAAGACCATGAGCCTTTGTTTACCTCAGCACCGACCCAGAAGACTGTTGCTGCACTGAGCCCTTGATTGACCACCGTCTCCCACAGCGGCTCGCCGAGCCACCACTTAGGGTCGTGATTACCCATCGTGAAGGCGTCTCCGGTCACCGGATCGAGAAAAAAGTTGTTGATTATGCCGTGATGGGCGGGATAAAGGCCAGTGACGATCGAGTAATGATTAGGAAAAGTTAGAGTTGGAAAAACAGGAATCAAACCCCTCTCGGCCTCGGTCCCATTTGCAATCAGACGGTCAATGTTTGGGGTTGAGGTTTTGAATTGATACCCAAATCGAAACCCATCGGAGGAGATCAATAGAACCACTGGACGTTTCAGCTTAGAGAGAGTACGGGCGGTGGAGTTGACGCCGTCGGAAGTGGAGGCGGCGGAGAAGAAGAGGAAGGCGAAGGCAAAGGCGGCGGAGAGAGCGAAACAAGTACTGGAAGTAAGAGAAATGAATACAATGCTTCTTTTGTGAGATGGGTTTGAAGAGAGCAAAGCGGTGGACGGGCTAGACGAGTCTTCTTCCTGATCTGCCGTCGGGCTAGAGCAAAGCGGCGGATTGGAATCGGGACCCATATCGGAAGAACCAGTTCGTATACTCTCGTCTTCTTCCAGTCAGCAGCAGAGTGTATGATTCGCAGTTGTAATTTTTTTTTTTAATTTAAATTTATTATAGTACTTGGCGGCATAAATCATTTTATTTATACTAATCTCTCATGGGCTTGGCCCAAGGAAAACCGGCCTCCCGAATTAACTTGAATGGGCCGATCTAATCGAATAGGCCCAGAATAAAGTATCATCAAAACAGATGCGAAATCAGGAAACGACACCGTTTGGCGTCGCCGTAGAATTGACAGACAGAGGTAAGGAGCAGGACTGTAGAAGAAAGGCTTGGAGAACTGCAGCAGAGAGGGAGGCGGCAATGGCGTACCTGAGCATGGGAGAGGCACATAGAAGAATTACAGAGTACTTGAATCGATTTTCGGACTCTGTTGCGTCACAAGATGGAGCTTCTCTCAAATCTCTTCTTTCTCTCTCCTCCAATTCCCCCAATCTTCTCGCCCTCGCCGACTCTCTCAACGTTTTCCAGGTACTGTATCAAGTCTACAAGAGCTCTGATAATCGTCACCGTTTTTAATCATTGGTGAAGTTTAAGTTTGAGGATTGGTTTTTGACAGGATGCCAATCGTTTAATCAGACAGTCGGATAGATATTCTCAGTTTGGAGATATATTGGTGAACTTCTTTCGTGCATTGCAGTGTTACCGTCTCGGAAATTTGGTTGATGCCTATCATGCTTTCGAAAAATCCGCCAAGTATTTGTTTCCCCTCCAATCTACTTGATTTTTGATGCTTTACTTTTAGTTTGAAGTCGTGTAATTGTGCTTGTAATTGTGGAAATTTTCCAGCGCTTTCACTCAGGAATTTCGGTCTTGGGATTCAGCTTGGGCGTTGGAAGCATTGTATGTAGTTGCTTACGAGATTAGGATCATTGCGGAGAGGGTATGATCTTAGGTCTTATTTGGAATTTCATATGTTCGTTAACTGGTTAATTAGGCTTCTGTATTCACAGGCCGACCGAGAGCTCGCTTCAAATGGAAAATCCCCAGAAAAGTTGAAAGGAGCTGGCTCATTTCTTATGAAAGTGTTTGGTGTTCTTGCCGTATGTTTCTTTGTTCAAACAATTTATTGCGCTATTTTCCGCATTGATACTTTAGATTTGCGGATTTTTCTTGCATACATAAACTTTGTTGATTGTATTTTAGGGAAAAGGCCCTAAACGTGTTGGAGCATTGTATGTGACATGTCAGTTGTTCAAGATATACTTCAAGGTAAGAAATTGTTACTATAATCTTCAGATTCAGTTTGATATTTCTTTTCCCCGTTTTGATAACAATTGTTTTCGAAAAGTACCCAACTTTCCCTGCTGCACACTATCCATTGCATTTGAATTAATGACAAATAAATTCAAGTTGCCATATATCTGTGACAATGTTATCTGCCAGGATATCGTTGCGATCGAAATTAGAGAAACGTAGTTGGAGAAAGTCTCTGTTTAGTTGAAACGTTACGATTTATAGCAACTTTTTAAAATCATATCTGATGCTTAGCTTACTCTATGTATCTCTAGTTATATAATGATGAAGTAACTAATTTGATTGCAGCTTGGTACTGTGCACCTGTGCCGTAGTGTGATAAGGAGCATTGAAACAGCTCGAATATTTGACTTTGAGGAGTTCCCTAAAAGAGACAGGGTCAGAATTGGTTTTGGTTGCCTCCATTCCCACACACCCACCTACCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCCCCCCCCCCCCCCCCCCCCCCCCGGATCGTCATCATCCTGTACTTTCATCAGGATGGTTCGGCCATCTTGGTTTATATTGAAAAGTTCTGACCAAAGTAAATGTGTTATTTAGGTGACCTACATGTATTATACAGGACGTCTGGAAGTTTTCAACGAGAATTTCCCTGCTGTGAGTAATTCTATGGCTTTATCATTTACGGATTTGGAATCTAAATTGATCAATTAATGCTAGTAGTTCTATGGTTTTATCATTTTTTCTTACCATGTTGTGAATAAATAATACGTGAAGCCTTTTTTCCCTTTCTTCTCAACCACTTTTTTACTGAATCACTTCTGTGATATGTCAGGCTGACCAGAAGTTATCATACGCCTTGATGCATTGCAACCCTCAAAGAGAAGCTAATATAAGGTTTGACTCAGGTGTGCTTGCTGCATATAATTTGCTTTTGCTTAGACAATTATGAATAGATTATTGAGTGATTTATTTAGTATACTTTTGTATTTATGGCTGAATGCTTCAAATAGATAAATCTTACAACTATATGCTGATGAGAGGGTACATTTAATATTTTCGAAAATTAACAGCTCGTGCATCTTCTTGTAAGAATATATGTTGCTCAAGCCCACCGCTAGTAGATATTGTCCTCTTTGGGATTTCTCTTCTGGACTTCCCCTCAAGATTTTTAAAACGCGTCTGCTAAGGAGGGGTTTCCACACCCTTATAAAGAATGTTTCGTTCTCCTCCCCAAACCGATGTGGGATCTCACAATATAAGAGGAAAGTCTGGAGGGGCTCATGACAAGATGACCTACACTCCTATGGTTTGTCATAGAATTTAATGAGTGGTTTTAATTTGGGAGAGGCCCTCCTGTCCGGTCATCATCTTTTCTTAGGAATTAACATGATTATTAATCTTTTCAACATTATGAATGCCCTTGAGAAAGCATCCATCCTGGACATAAAATTGCTGCACACCTTAGATTCAGGCCTCAACTGTGAAGCATTGTATAGGGAGCTCTTAACCTCTAGTATCAGGTGTAAGCTTGCAGGAAGAACATTGCAGCAAAACATGGAACTACCTGCCTAGGGCTTGGATCTTTCTATCTGTAAATGATTCTTGGGGGTGTTGGGACTTGGACTTTGTAAGATTTTTTCAGTATATTGAGATATGGGGGTGTGCTTCAGTCTCCCACTTGTTTTCTTCCTTCGGCGTTTCTGGTTCCTTCTCAGTTCGACCTCTTTCTAATGATGTCATCACTACCGCTTCCCATGATAACGCCTTGCTACTTCAACCCATAGAAATTCAAGTTTTTTTTTTTTTTTCCCTTGGGAACTCTGTCATTTAAGCTTAAACATCCCTCTATGGCTCTGATCCATGGATGGTATCTGTCATTCAAAGAAGAATGGAATCATAAAGGCACATCTTCATCAACTATTAATTTGCTGAAAATTTCTGGAGCTTGGTCAAAGACAAAAATACATTCATTACTGTGATAGAGAAGGAAGATACTTCGGCCCCACGGTTCTAGAGCTTGGCGAAAGCCTGTTTTTCCTTCCAACTATTATACTATTGGATGAACCTTTGTAATCTGCTTGGCTTGTGACTTGGCTTTGCTCTCTCTTCTTCGTCTTCTCATTCGTTTTTCATGGTAATAGTTATTATAACTTCATATCCTAGACTAATTGTTTGTGCCCGTTAAAATATTCTTTGGCTTTATTATCTATCATTATATAGTTGTTTTGATTCATTTGGTTATCAATTATCTAGTTACTTCATGAGTAGTTCTCTGTAGATTGTGAAATTGAAGGGAGTCTAACCCATGCTAAGTCGCAATAATTTTGGAACAGAATGATATTAAAATATCTCATACCTGTGAAGCTTTCGATGGGCATCTTGCCTACAAGGGTGCTTCTCGAGAAGTATAACCTTTTTGAGGTATGGCAAATTTTTACAAATGTTCTCAGCCTAACTTATTTAAGCTACTTGTGAAGCTTTCTAGAACATAATGATTTTTATGCATCATAAAGCTTACTAATAATGTTCTTCTCGCAGTATGGAAATGTTGTGCAATCTCTTACAAGAGGTGATCCGAGGCTTCTTCGACAAGCTCTTCAAGAGCACGAAGATCGGTAAACCTGTTTTTTCAATTTTTTGTCTGTGAGAATATTTTGACCGAGGATGTATACTCAAAGTTGTTTCCTTTGTTTTATCAGGTTCTTAAGATCAGGGGTGTATCTCGTCTTAGAGAAGTTGGAACTTCAAGTTTACCAAAGATTGGTGAAGAAAATGTAAGCTCACCGTTCTTGTATTTTGTGTCCTTCTAGTTGATGGATAGCTCAAACGAATCTACGGACTTAAAGAGGCTCCTCCTCACTATTTGTTCATTATTTCTGATTCACTACACTTTCTCCAGCTACTTTATACAAAGGCAAAAGGATCCAAACAAAGCTCACCAGATCAAATTAGAAGTAATTGTAAAAGCTTTACAATGGTTGGAAGTAGACATGGACGTTGATGAGGTATTTGTTTATCCGTCGCTCAATGTTGTAATAGTCGAAAACCGACCCGTGTTGTCTCATACATGCCTTAAACTTGCAGGTAGAGTGCATCATGGCCATACTCATAAATAAGGGTCTTGTGAAGGGGTACTTCGCCCACAAGAGTAAAGTTGCAGTGGTAAGCAAACAAGATCCTTTTCCTCGACTAAATGGAAAACCAGTCGGTTCATGAACCAATAACTTAACAAGGTTGTTTTGTAGTCATTAGTCTATTTTATAGTGGCAGCTTTGAAGTGTTTCATTTCTCTGGATCTCTTCTGCTTAAACACTTATTTGACTTTGAACATATTTAAACGATAGACTTGAATCAAAACTACACCATCTTGTACGACAAAAATGGTGTGCAGAGCAGAATTAAGAAGCTTAGTCGAAGGTTAAATTTGTTCATTAGTGGTTTAAAGATTTATTACATTCAAGATAGCCTTTGAAATACTTGTGAATTTTAGGAATTAAAG
mRNA sequence
CGACGATGTTCGAGTTAAAACAATTGTCGTGGCAATCTGTTCGCGTAATGTGCTAAGGATTCTATGGTGAAGGCAAGAGAACAGATTGGGGAAATGATGCAGAACCATTGTTAGGAGCAGGTTTGATGTTGAGAATTGAAGTAACCAAGTTGTGTATCTGAACGTTTTCGAAAGATGGGACTTTTCTCCCTCTTGCAAACCGTGGACCATGGCCAATGAAAATGGTTCTCATGGAAAATATGGCATTGTCATACCCATGATCTCCTCCACACTCCTTCCCTTTACTGTTCTTCATCTCCACCTTAAATCCCTCGTCAACCAACCCGATCACTGGAGGAATCCGATCGCTCGCACTGTAGTGAAGCCGAGACGGAAGCTCTTCCTTGACATACATTTTCAGATATTTCCCATTCGAAACTTTCCCAGATTGCAAAGCTTCATTCATTTTCAAGACAACATCAGCAGGTGAAACATCAGAAGGTGGACGAATTGCCAGCAGAGGTGTTCTTGATTGAACCCACCTCTCTGGAATAACAATCCATGGAGATAAATCCTCAAGAAAAATCAACTTCTTATCACAGTTTCCGACCATTCCATGATCACCAACCAAGATTACATTAACATCCTCGAAAACCCCTCTTTTTTCCAGGCCAGAAATCAGTTTACCAAGCATTAGATCAATTCTAGCTACAGCATCAGTGATTTGTGGATCATCAGGACCAACCTTGTGACCCTGATGATCTGGGTCCTCAAAATACAGAGTCATGAACACAGGAATTTCCTCACTAGGCAAATCAAAGTACTGCAAAATCATATCTACTCGTTCCTCAAATGGAACCGAACCATTATAGTGATGGCAGAAATTTACAGGGCAAGACCATGAGCCTTTGTTTACCTCAGCACCGACCCAGAAGACTGTTGCTGCACTGAGCCCTTGATTGACCACCGTCTCCCACAGCGGCTCGCCGAGCCACCACTTAGGGTCGTGATTACCCATCGTGAAGGCGTCTCCGGTCACCGGATCGAGAAAAAAGTTGTTGATTATGCCGTGATGGGCGGGATAAAGGCCAGTGACGATCGAGTAATGATTAGGAAAAGTTAGAGTTGGAAAAACAGGAATCAAACCCCTCTCGGCCTCGGTCCCATTTGCAATCAGACGGTCAATGTTTGGGGTTGAGGTTTTGAATTGATACCCAAATCGAAACCCATCGGAGGAGATCAATAGAACCACTGGACGTTTCAGCTTAGAGAGAGTACGGGCGGTGGAGTTGACGCCGTCGGAAGTGGAGGCGGCGGAGAAGAAGAGGAAGGCGAAGGCAAAGGCGGCGGAGAGAGCGAAACAAGTACTGGAAGTAAGAGAAATGAATACAATGCTTCTTTTGTGAGATGGGTTTGAAGAGAGCAAAGCGGTGGACGGGCTAGACGAGTCTTCTTCCTGATCTGCCGTCGGGCTAGAGCAAAGCGGCGGATTGGAATCGGGACCCATATCGGAAGAACCAGTTCGTATACTCTCGTCTTCTTCCAGTCAGCAGCAGAGTGTATGATTCGCAGTTGTAATTTTTTTTTTTAATTTAAATTTATTATAGTACTTGGCGGCATAAATCATTTTATTTATACTAATCTCTCATGGGCTTGGCCCAAGGAAAACCGGCCTCCCGAATTAACTTGAATGGGCCGATCTAATCGAATAGGCCCAGAATAAAGTATCATCAAAACAGATGCGAAATCAGGAAACGACACCGTTTGGCGTCGCCGTAGAATTGACAGACAGAGGTAAGGAGCAGGACTGTAGAAGAAAGGCTTGGAGAACTGCAGCAGAGAGGGAGGCGGCAATGGCGTACCTGAGCATGGGAGAGGCACATAGAAGAATTACAGAGTACTTGAATCGATTTTCGGACTCTGTTGCGTCACAAGATGGAGCTTCTCTCAAATCTCTTCTTTCTCTCTCCTCCAATTCCCCCAATCTTCTCGCCCTCGCCGACTCTCTCAACGTTTTCCAGGATGCCAATCGTTTAATCAGACAGTCGGATAGATATTCTCAGTTTGGAGATATATTGGTGAACTTCTTTCGTGCATTGCAGTGTTACCGTCTCGGAAATTTGGTTGATGCCTATCATGCTTTCGAAAAATCCGCCAACGCTTTCACTCAGGAATTTCGGTCTTGGGATTCAGCTTGGGCGTTGGAAGCATTGTATGTAGTTGCTTACGAGATTAGGATCATTGCGGAGAGGGCCGACCGAGAGCTCGCTTCAAATGGAAAATCCCCAGAAAAGTTGAAAGGAGCTGGCTCATTTCTTATGAAAGTGTTTGGTGTTCTTGCCGGAAAAGGCCCTAAACGTGTTGGAGCATTGTATGTGACATGTCAGTTGTTCAAGATATACTTCAAGCTTGGTACTGTGCACCTGTGCCGTAGTGTGATAAGGAGCATTGAAACAGCTCGAATATTTGACTTTGAGGAGTTCCCTAAAAGAGACAGGGTGACCTACATGTATTATACAGGACGTCTGGAAGTTTTCAACGAGAATTTCCCTGCTGCTGACCAGAAGTTATCATACGCCTTGATGCATTGCAACCCTCAAAGAGAAGCTAATATAAGAATGATATTAAAATATCTCATACCTGTGAAGCTTTCGATGGGCATCTTGCCTACAAGGGTGCTTCTCGAGAAGTATAACCTTTTTGAGTATGGAAATGTTGTGCAATCTCTTACAAGAGGTGATCCGAGGCTTCTTCGACAAGCTCTTCAAGAGCACGAAGATCGGTTCTTAAGATCAGGGGTGTATCTCGTCTTAGAGAAGTTGGAACTTCAAGTTTACCAAAGATTGGTGAAGAAAATCTACTTTATACAAAGGCAAAAGGATCCAAACAAAGCTCACCAGATCAAATTAGAAGTAATTGTAAAAGCTTTACAATGGTTGGAAGTAGACATGGACGTTGATGAGGTAGAGTGCATCATGGCCATACTCATAAATAAGGGTCTTGTGAAGGGGTACTTCGCCCACAAGAGTAAAGTTGCAGTGGTAAGCAAACAAGATCCTTTTCCTCGACTAAATGGAAAACCAGTCGGTTCATGAACCAATAACTTAACAAGGTTGTTTTGTAGTCATTAGTCTATTTTATAGTGGCAGCTTTGAAGTGTTTCATTTCTCTGGATCTCTTCTGCTTAAACACTTATTTGACTTTGAACATATTTAAACGATAGACTTGAATCAAAACTACACCATCTTGTACGACAAAAATGGTGTGCAGAGCAGAATTAAGAAGCTTAGTCGAAGGTTAAATTTGTTCATTAGTGGTTTAAAGATTTATTACATTCAAGATAGCCTTTGAAATACTTGTGAATTTTAGGAATTAAAG
Coding sequence (CDS)
ATGCGAAATCAGGAAACGACACCGTTTGGCGTCGCCGTAGAATTGACAGACAGAGGTAAGGAGCAGGACTGTAGAAGAAAGGCTTGGAGAACTGCAGCAGAGAGGGAGGCGGCAATGGCGTACCTGAGCATGGGAGAGGCACATAGAAGAATTACAGAGTACTTGAATCGATTTTCGGACTCTGTTGCGTCACAAGATGGAGCTTCTCTCAAATCTCTTCTTTCTCTCTCCTCCAATTCCCCCAATCTTCTCGCCCTCGCCGACTCTCTCAACGTTTTCCAGGATGCCAATCGTTTAATCAGACAGTCGGATAGATATTCTCAGTTTGGAGATATATTGGTGAACTTCTTTCGTGCATTGCAGTGTTACCGTCTCGGAAATTTGGTTGATGCCTATCATGCTTTCGAAAAATCCGCCAACGCTTTCACTCAGGAATTTCGGTCTTGGGATTCAGCTTGGGCGTTGGAAGCATTGTATGTAGTTGCTTACGAGATTAGGATCATTGCGGAGAGGGCCGACCGAGAGCTCGCTTCAAATGGAAAATCCCCAGAAAAGTTGAAAGGAGCTGGCTCATTTCTTATGAAAGTGTTTGGTGTTCTTGCCGGAAAAGGCCCTAAACGTGTTGGAGCATTGTATGTGACATGTCAGTTGTTCAAGATATACTTCAAGCTTGGTACTGTGCACCTGTGCCGTAGTGTGATAAGGAGCATTGAAACAGCTCGAATATTTGACTTTGAGGAGTTCCCTAAAAGAGACAGGGTGACCTACATGTATTATACAGGACGTCTGGAAGTTTTCAACGAGAATTTCCCTGCTGCTGACCAGAAGTTATCATACGCCTTGATGCATTGCAACCCTCAAAGAGAAGCTAATATAAGAATGATATTAAAATATCTCATACCTGTGAAGCTTTCGATGGGCATCTTGCCTACAAGGGTGCTTCTCGAGAAGTATAACCTTTTTGAGTATGGAAATGTTGTGCAATCTCTTACAAGAGGTGATCCGAGGCTTCTTCGACAAGCTCTTCAAGAGCACGAAGATCGGTTCTTAAGATCAGGGGTGTATCTCGTCTTAGAGAAGTTGGAACTTCAAGTTTACCAAAGATTGGTGAAGAAAATCTACTTTATACAAAGGCAAAAGGATCCAAACAAAGCTCACCAGATCAAATTAGAAGTAATTGTAAAAGCTTTACAATGGTTGGAAGTAGACATGGACGTTGATGAGGTAGAGTGCATCATGGCCATACTCATAAATAAGGGTCTTGTGAAGGGGTACTTCGCCCACAAGAGTAAAGTTGCAGTGGTAAGCAAACAAGATCCTTTTCCTCGACTAAATGGAAAACCAGTCGGTTCATGA
Protein sequence
MRNQETTPFGVAVELTDRGKEQDCRRKAWRTAAEREAAMAYLSMGEAHRRITEYLNRFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANRLIRQSDRYSQFGDILVNFFRALQCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEALYVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLFKIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLSYALMHCNPQREANIRMILKYLIPVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLLRQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQWLEVDMDVDEVECIMAILINKGLVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS
Homology
BLAST of CmoCh12G013280 vs. ExPASy Swiss-Prot
Match:
Q8GWE6 (Enhanced ethylene response protein 5 OS=Arabidopsis thaliana OX=3702 GN=EER5 PE=1 SV=1)
HSP 1 Score: 644.0 bits (1660), Expect = 1.2e-183
Identity = 321/413 (77.72%), Postives = 367/413 (88.86%), Query Frame = 0
Query: 39 MAYLSMGEAHRRITEYLNRFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANR 98
MAY+SMGEAHRRITEYLNRF D+V+ QD ++L LLS SSNSP LL+LAD+LNVFQD++
Sbjct: 1 MAYVSMGEAHRRITEYLNRFCDAVSYQDSSTLCRLLSFSSNSPPLLSLADALNVFQDSSS 60
Query: 99 LIRQSDRYSQFGDILVNFFRALQCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEAL 158
LIRQSDR+S++G+IL + FR+LQ YR+GNLV+AY AF+K ANAF QEFR+W+SAWALEAL
Sbjct: 61 LIRQSDRFSEYGEILAHVFRSLQSYRVGNLVEAYLAFDKFANAFVQEFRNWESAWALEAL 120
Query: 159 YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF 218
YVV YEIR++AE+AD++L SNGKSPEKLK AGS LMKVFGVLAGKGPKRVGALYVTCQLF
Sbjct: 121 YVVCYEIRVLAEKADKDLTSNGKSPEKLKAAGSLLMKVFGVLAGKGPKRVGALYVTCQLF 180
Query: 219 KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS 278
K YFKLGTV+LCRSVIRSIETARIFDFEEFP+RD+VTYMYYTGRLEVFNENFPAAD KLS
Sbjct: 181 KTYFKLGTVNLCRSVIRSIETARIFDFEEFPRRDKVTYMYYTGRLEVFNENFPAADTKLS 240
Query: 279 YALMHCNPQREANIRMILKYLIPVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLL 338
YAL +CNP+RE NIRMILKYL+PVKLS+GI+P LL YNL EY +VQ+L +GD RLL
Sbjct: 241 YALQNCNPKRERNIRMILKYLVPVKLSLGIIPKDELLRNYNLHEYTKIVQALRKGDLRLL 300
Query: 339 RQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQ 398
R ALQEHEDRFLRSGVYLVLEKLELQVYQRL+KKIY Q+ DP +AHQ+KLE I KAL+
Sbjct: 301 RHALQEHEDRFLRSGVYLVLEKLELQVYQRLMKKIYINQKLSDPARAHQLKLEGIAKALR 360
Query: 399 WLEVDMDVDEVECIMAILINKGLVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 452
WL++DMD+DEVECIM ILI K LVKGY AHKSKV V+SKQDPFP+LNGKPV S
Sbjct: 361 WLDMDMDLDEVECIMTILIYKNLVKGYLAHKSKVVVLSKQDPFPKLNGKPVSS 413
BLAST of CmoCh12G013280 vs. ExPASy Swiss-Prot
Match:
Q5JVF3 (PCI domain-containing protein 2 OS=Homo sapiens OX=9606 GN=PCID2 PE=1 SV=2)
HSP 1 Score: 250.0 bits (637), Expect = 5.2e-65
Identity = 152/417 (36.45%), Postives = 246/417 (58.99%), Query Frame = 0
Query: 47 AHRRITEYLNRFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANRLIRQSDRY 106
AH I +YL + +++ S+DGAS L+S P++ AN ++ +
Sbjct: 2 AHITINQYLQQVYEAIDSRDGASCAELVSF--KHPHV------------ANPRLQMASPE 61
Query: 107 SQFGDIL-----VNFFRALQC-YRLGN--LVDAYHAFEKSANAFTQEFRSW-DSAWALEA 166
+ +L F L+C Y +GN ++AY +F + F++ + WAL
Sbjct: 62 EKCQQVLEPPYDEMFAAHLRCTYAVGNHDFIEAYKCQTVIVQSFLRAFQAHKEENWALPV 121
Query: 167 LYVVAYEIRIIAERADRELASNGKSP--EKLKGAGSFLMKVFGVLAG------KGPKRVG 226
+Y VA ++R+ A AD++L GKS + L+ A LM F V A + K+ G
Sbjct: 122 MYAVALDLRVFANNADQQLVKKGKSKVGDMLEKAAELLMSCFRVCASDTRAGIEDSKKWG 181
Query: 227 ALYVTCQLFKIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNEN 286
L++ QLFKIYFK+ +HLC+ +IR+I+++ + D ++ RVTY YY GR +F+ +
Sbjct: 182 MLFLVNQLFKIYFKINKLHLCKPLIRAIDSSNLKD--DYSTAQRVTYKYYVGRKAMFDSD 241
Query: 287 FPAADQKLSYALMHCNPQREANIRMILKYLIPVKLSMGILPTRVLLEKYNLFEYGNVVQS 346
F A++ LS+A HC+ + N RMIL YL+PVK+ +G +PT LL+KY+L ++ V ++
Sbjct: 242 FKQAEEYLSFAFEHCHRSSQKNKRMILIYLLPVKMLLGHMPTVELLKKYHLMQFAEVTRA 301
Query: 347 LTRGDPRLLRQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIK 406
++ G+ LL +AL +HE F+R G++L+LEKL++ Y+ L KK+Y + K HQ+
Sbjct: 302 VSEGNLLLLHEALAKHEAFFIRCGIFLILEKLKIITYRNLFKKVYLLL------KTHQLS 361
Query: 407 LEVIVKALQWLEV-DMDVDEVECIMAILINKGLVKGYFAHKSKVAVVSKQDPFPRLN 446
L+ + AL++++V D+D+DEV+CI+A LI G VKGY +H+ + VVSKQ+PFP L+
Sbjct: 362 LDAFLVALKFMQVEDVDIDEVQCILANLIYMGHVKGYISHQHQKLVVSKQNPFPPLS 396
BLAST of CmoCh12G013280 vs. ExPASy Swiss-Prot
Match:
Q8BFV2 (PCI domain-containing protein 2 OS=Mus musculus OX=10090 GN=Pcid2 PE=1 SV=1)
HSP 1 Score: 247.7 bits (631), Expect = 2.6e-64
Identity = 150/417 (35.97%), Postives = 244/417 (58.51%), Query Frame = 0
Query: 47 AHRRITEYLNRFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANRLIRQSDRY 106
AH I +YL + +++ ++DGAS L+S P++ AN ++ +
Sbjct: 2 AHITINQYLQQVYEAIDTRDGASCAELVSF--KHPHV------------ANPRLQMASPE 61
Query: 107 SQFGDIL-----VNFFRALQC-YRLGN--LVDAYHAFEKSANAFTQEFRSW-DSAWALEA 166
+ +L F L+C Y +GN ++AY +F + F++ + WAL
Sbjct: 62 EKCQQVLEPPYDEMFAAHLRCTYAVGNHDFIEAYKCQTVIVQSFLRAFQAHKEENWALPV 121
Query: 167 LYVVAYEIRIIAERADRELASNGKSP--EKLKGAGSFLMKVFGVLAG------KGPKRVG 226
+Y VA ++RI A AD++L GKS + L+ A LM F V A + K+ G
Sbjct: 122 MYAVALDLRIFANNADQQLVKKGKSKVGDMLEKAAELLMSCFRVCASDTRAGIEDSKKWG 181
Query: 227 ALYVTCQLFKIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNEN 286
L++ QLFKIYFK+ +HLC+ +IR+I+++ + D ++ R+TY YY GR +F+ +
Sbjct: 182 MLFLVNQLFKIYFKINKLHLCKPLIRAIDSSNLKD--DYSTAQRITYKYYVGRKAMFDSD 241
Query: 287 FPAADQKLSYALMHCNPQREANIRMILKYLIPVKLSMGILPTRVLLEKYNLFEYGNVVQS 346
F A++ LS+A HC+ + N RMIL YL+PVK+ +G +PT LL KY+L ++ V ++
Sbjct: 242 FKQAEEYLSFAFEHCHRSSQKNKRMILIYLLPVKMLLGHMPTIELLRKYHLMQFSEVTKA 301
Query: 347 LTRGDPRLLRQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIK 406
++ G+ LL +AL +HE F+R G++L+LEKL++ Y+ L KK+Y + K HQ+
Sbjct: 302 VSEGNLLLLNEALAKHETFFIRCGIFLILEKLKIITYRNLFKKVYLLL------KTHQLS 361
Query: 407 LEVIVKALQWLEV-DMDVDEVECIMAILINKGLVKGYFAHKSKVAVVSKQDPFPRLN 446
L+ + AL+++ V D+D+DEV+CI+A LI G +KGY +H+ + VVSKQ+PFP L+
Sbjct: 362 LDAFLVALKFMHVEDVDIDEVQCILANLIYMGHIKGYISHQHQKLVVSKQNPFPPLS 396
BLAST of CmoCh12G013280 vs. ExPASy Swiss-Prot
Match:
Q2TBN6 (PCI domain-containing protein 2 OS=Bos taurus OX=9913 GN=PCID2 PE=2 SV=1)
HSP 1 Score: 241.5 bits (615), Expect = 1.8e-62
Identity = 149/420 (35.48%), Postives = 243/420 (57.86%), Query Frame = 0
Query: 47 AHRRITEYLNRFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANRLIRQSDRY 106
AH I +YL + +++ ++DGASL L+S P++ AN ++ +
Sbjct: 2 AHITINQYLQQVCEAIDTRDGASLAELVSF--KHPHV------------ANPRLQMASPE 61
Query: 107 SQFGDIL-----VNFFRALQC-YRLGN--LVDAYHAFEKSANAFTQEFRSW-DSAWALEA 166
+ +L F L+C Y +GN ++AY +F + F++ + WAL
Sbjct: 62 EKCQQVLEPPYDEMFAAHLRCTYAVGNHDFIEAYKCQTVIVQSFLRAFQAHKEENWALPV 121
Query: 167 LYVVAYEIRIIAERADRELASNGKSP--EKLKGAGSFLMKVFGVLAG------KGPKRVG 226
+Y VA ++RI A AD++L GKS + L+ A LM F V A + K+ G
Sbjct: 122 MYAVALDLRIFANNADQQLVKKGKSKVGDMLEKAAELLMGCFRVCASDTRAGIEDSKKRG 181
Query: 227 ALYVTCQLFKIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNEN 286
L++ QLFKIYFK+ +HLC+ +IR+I+++ + D ++ RVTY YY GR +F+ +
Sbjct: 182 MLFLVNQLFKIYFKINKLHLCKPLIRAIDSSNLKD--DYSTAQRVTYRYYVGRKAMFDSD 241
Query: 287 FPAADQKLSYALMHCNPQREANIRMILKYLIPVKLSMGILPTRVLLEKYNLFEYGNVVQS 346
F A++ LS+A HC+ + N RM+L YL+PVK+ +G +PT LL KY+L ++ V ++
Sbjct: 242 FKQAEEYLSFAFEHCHRSSQKNKRMVLIYLLPVKMLLGHMPTIELLRKYHLMQFAEVTRA 301
Query: 347 LTRGDPRLLRQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPN---KAH 406
++ G+ LL +AL HE F+R G++L+LEKL++ Y+ L KK+ + K H
Sbjct: 302 VSEGNLLLLNEALAAHETFFIRCGIFLILEKLKIITYRNLFKKVNSLSSASSRYLLLKTH 361
Query: 407 QIKLEVIVKALQWLEV-DMDVDEVECIMAILINKGLVKGYFAHKSKVAVVSKQDPFPRLN 446
Q+ L+ + AL++++V D+D+ EV+CI+A LI G +KGY +H+ + VVSKQ+PFP L+
Sbjct: 362 QLSLDAFLVALKFMQVEDVDIAEVQCILANLIYMGHIKGYISHQHQKLVVSKQNPFPPLS 405
BLAST of CmoCh12G013280 vs. ExPASy Swiss-Prot
Match:
Q5FWP8 (PCI domain-containing protein 2 OS=Xenopus laevis OX=8355 GN=pcid2 PE=2 SV=1)
HSP 1 Score: 240.0 bits (611), Expect = 5.3e-62
Identity = 143/410 (34.88%), Postives = 239/410 (58.29%), Query Frame = 0
Query: 47 AHRRITEYLNRFSDSVASQDGASLKSLLSLSSNSPNLL-ALADSLNVFQDANRLIRQSDR 106
AH I +YL + +++ S+DG + L+S P++ A L+ + +++
Sbjct: 2 AHITINQYLQQVQEAIDSKDGFNCADLVSF--KHPHVANARLQLLSPEEKCQQVLE---- 61
Query: 107 YSQFGDILVNFFRALQCYRLGNLVDAYHAFEKSANAFTQEFRSW-DSAWALEALYVVAYE 166
+ ++ R + + V+AY +F + F++ + WAL +Y + +
Sbjct: 62 -PPYDEMFAAHLRCINAASNHDFVEAYKYQTLVVQSFLKSFQAHKEENWALPIMYSITLD 121
Query: 167 IRIIAERADRELASNGKSP--EKLKGAGSFLMKVFGVLAG------KGPKRVGALYVTCQ 226
+RI A AD++L GK + L+ A LM F V A + K+ G L++ Q
Sbjct: 122 LRIFANNADQQLVKKGKGKVGDMLEKAAEILMSCFRVCASDTRAAFEDSKKWGMLFLVNQ 181
Query: 227 LFKIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQK 286
LFKIYFK+ +HLC+ +IR+I+++ EE+ RVT+ YY GR +F+ +F A++
Sbjct: 182 LFKIYFKISKLHLCKPLIRAIDSSNF--KEEYTMAQRVTFKYYVGRKSMFDSDFKKAEEY 241
Query: 287 LSYALMHCNPQREANIRMILKYLIPVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPR 346
LS+A HC+ + N RMIL YL+PVK+ +G +PT LL+KY+L ++ V ++++ G+
Sbjct: 242 LSFAFEHCHRSSQKNKRMILIYLLPVKMLLGHMPTIHLLKKYDLMQFAEVTKAVSEGNLL 301
Query: 347 LLRQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKA 406
LL +AL +HE F+R G++L+LEKL++ Y+ L KK+Y + K HQ+ L+ + A
Sbjct: 302 LLTEALTKHETFFIRCGIFLILEKLKIISYRNLFKKVYLLL------KTHQLSLDAFLVA 361
Query: 407 LQWLEV-DMDVDEVECIMAILINKGLVKGYFAHKSKVAVVSKQDPFPRLN 446
L+++EV D+D+DEV+CI+A LI G +KGY +H+ + VVSKQ+PFP L+
Sbjct: 362 LKFMEVGDVDIDEVQCIIANLIYMGHIKGYISHQHQKLVVSKQNPFPPLS 396
BLAST of CmoCh12G013280 vs. ExPASy TrEMBL
Match:
A0A6J1FC68 (enhanced ethylene response protein 5 OS=Cucurbita moschata OX=3662 GN=LOC111444072 PE=4 SV=1)
HSP 1 Score: 883.6 bits (2282), Expect = 3.4e-253
Identity = 451/451 (100.00%), Postives = 451/451 (100.00%), Query Frame = 0
Query: 1 MRNQETTPFGVAVELTDRGKEQDCRRKAWRTAAEREAAMAYLSMGEAHRRITEYLNRFSD 60
MRNQETTPFGVAVELTDRGKEQDCRRKAWRTAAEREAAMAYLSMGEAHRRITEYLNRFSD
Sbjct: 1 MRNQETTPFGVAVELTDRGKEQDCRRKAWRTAAEREAAMAYLSMGEAHRRITEYLNRFSD 60
Query: 61 SVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANRLIRQSDRYSQFGDILVNFFRAL 120
SVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANRLIRQSDRYSQFGDILVNFFRAL
Sbjct: 61 SVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANRLIRQSDRYSQFGDILVNFFRAL 120
Query: 121 QCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEALYVVAYEIRIIAERADRELASNG 180
QCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEALYVVAYEIRIIAERADRELASNG
Sbjct: 121 QCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEALYVVAYEIRIIAERADRELASNG 180
Query: 181 KSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLFKIYFKLGTVHLCRSVIRSIETA 240
KSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLFKIYFKLGTVHLCRSVIRSIETA
Sbjct: 181 KSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLFKIYFKLGTVHLCRSVIRSIETA 240
Query: 241 RIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLSYALMHCNPQREANIRMILKYLI 300
RIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLSYALMHCNPQREANIRMILKYLI
Sbjct: 241 RIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLSYALMHCNPQREANIRMILKYLI 300
Query: 301 PVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLLRQALQEHEDRFLRSGVYLVLEK 360
PVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLLRQALQEHEDRFLRSGVYLVLEK
Sbjct: 301 PVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLLRQALQEHEDRFLRSGVYLVLEK 360
Query: 361 LELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQWLEVDMDVDEVECIMAILINKG 420
LELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQWLEVDMDVDEVECIMAILINKG
Sbjct: 361 LELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQWLEVDMDVDEVECIMAILINKG 420
Query: 421 LVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 452
LVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS
Sbjct: 421 LVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 451
BLAST of CmoCh12G013280 vs. ExPASy TrEMBL
Match:
A0A6J1HL42 (enhanced ethylene response protein 5 OS=Cucurbita maxima OX=3661 GN=LOC111465537 PE=4 SV=1)
HSP 1 Score: 879.4 bits (2271), Expect = 6.4e-252
Identity = 448/451 (99.33%), Postives = 450/451 (99.78%), Query Frame = 0
Query: 1 MRNQETTPFGVAVELTDRGKEQDCRRKAWRTAAEREAAMAYLSMGEAHRRITEYLNRFSD 60
MRNQETTPFG+AVELT+RGKEQDCRRKAWR AAEREAAMAYLSMGEAHRRITEYLNRFSD
Sbjct: 1 MRNQETTPFGIAVELTERGKEQDCRRKAWRIAAEREAAMAYLSMGEAHRRITEYLNRFSD 60
Query: 61 SVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANRLIRQSDRYSQFGDILVNFFRAL 120
SVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANRLIRQSDRYSQFGDILVNFFRAL
Sbjct: 61 SVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANRLIRQSDRYSQFGDILVNFFRAL 120
Query: 121 QCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEALYVVAYEIRIIAERADRELASNG 180
QCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEALYVVAYEIRIIAERADRELASNG
Sbjct: 121 QCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEALYVVAYEIRIIAERADRELASNG 180
Query: 181 KSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLFKIYFKLGTVHLCRSVIRSIETA 240
KSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLFKIYFKLGTVHLCRSVIRSIETA
Sbjct: 181 KSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLFKIYFKLGTVHLCRSVIRSIETA 240
Query: 241 RIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLSYALMHCNPQREANIRMILKYLI 300
RIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLSYALMHCNPQREANIRMILKYLI
Sbjct: 241 RIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLSYALMHCNPQREANIRMILKYLI 300
Query: 301 PVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLLRQALQEHEDRFLRSGVYLVLEK 360
PVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLLRQALQEHEDRFLRSGVYLVLEK
Sbjct: 301 PVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLLRQALQEHEDRFLRSGVYLVLEK 360
Query: 361 LELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQWLEVDMDVDEVECIMAILINKG 420
LELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQWLEVDMDVDEVECIMAILINKG
Sbjct: 361 LELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQWLEVDMDVDEVECIMAILINKG 420
Query: 421 LVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 452
LVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS
Sbjct: 421 LVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 451
BLAST of CmoCh12G013280 vs. ExPASy TrEMBL
Match:
A0A5D3CHS8 (Enhanced ethylene response protein 5 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold332G00910 PE=4 SV=1)
HSP 1 Score: 779.6 bits (2012), Expect = 6.9e-222
Identity = 396/413 (95.88%), Postives = 405/413 (98.06%), Query Frame = 0
Query: 39 MAYLSMGEAHRRITEYLNRFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANR 98
MAYLSMGEAHRRITEYLNRFSDSV+SQDG SLKSLL+LSSNSPNLLALADSLNVFQDANR
Sbjct: 1 MAYLSMGEAHRRITEYLNRFSDSVSSQDGVSLKSLLALSSNSPNLLALADSLNVFQDANR 60
Query: 99 LIRQSDRYSQFGDILVNFFRALQCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEAL 158
LIRQSDRYSQFG++LVNFFRALQCYRLGNLVDAY AFEK ANAFTQEFRSWDSAWALEAL
Sbjct: 61 LIRQSDRYSQFGEMLVNFFRALQCYRLGNLVDAYQAFEKFANAFTQEFRSWDSAWALEAL 120
Query: 159 YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF 218
YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF
Sbjct: 121 YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF 180
Query: 219 KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS 278
KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS
Sbjct: 181 KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS 240
Query: 279 YALMHCNPQREANIRMILKYLIPVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLL 338
YALMHCNPQRE+NIRMILKYLIPVKLSMGILPT+ LLEKYNLFEY NVVQ+L RGDPRLL
Sbjct: 241 YALMHCNPQRESNIRMILKYLIPVKLSMGILPTKSLLEKYNLFEYENVVQALKRGDPRLL 300
Query: 339 RQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQ 398
R ALQEHED+FLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQ
Sbjct: 301 RHALQEHEDQFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQ 360
Query: 399 WLEVDMDVDEVECIMAILINKGLVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 452
WLEVDMD+DEVECIMAILINK LVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS
Sbjct: 361 WLEVDMDIDEVECIMAILINKSLVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 413
BLAST of CmoCh12G013280 vs. ExPASy TrEMBL
Match:
A0A6J1DPL3 (enhanced ethylene response protein 5 OS=Momordica charantia OX=3673 GN=LOC111021916 PE=4 SV=1)
HSP 1 Score: 776.9 bits (2005), Expect = 4.5e-221
Identity = 397/413 (96.13%), Postives = 404/413 (97.82%), Query Frame = 0
Query: 39 MAYLSMGEAHRRITEYLNRFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANR 98
MAYLSMGEAHRRITEYL+RFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANR
Sbjct: 1 MAYLSMGEAHRRITEYLSRFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANR 60
Query: 99 LIRQSDRYSQFGDILVNFFRALQCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEAL 158
LIRQSDRYSQFG+ILVNFFRALQ YR+GNL+DAY AFEKSANAFTQEFR+WDSAWALEAL
Sbjct: 61 LIRQSDRYSQFGEILVNFFRALQSYRIGNLLDAYQAFEKSANAFTQEFRTWDSAWALEAL 120
Query: 159 YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF 218
YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF
Sbjct: 121 YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF 180
Query: 219 KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS 278
KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS
Sbjct: 181 KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS 240
Query: 279 YALMHCNPQREANIRMILKYLIPVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLL 338
YALMHCNPQREANIRMILKYLIPVKLSMGILPT LL KYNL EY NVVQ+L RGDPRLL
Sbjct: 241 YALMHCNPQREANIRMILKYLIPVKLSMGILPTNSLLGKYNLSEYANVVQALKRGDPRLL 300
Query: 339 RQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQ 398
RQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKK+YFIQRQKDPNKAHQIKLEVIVKALQ
Sbjct: 301 RQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKVYFIQRQKDPNKAHQIKLEVIVKALQ 360
Query: 399 WLEVDMDVDEVECIMAILINKGLVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 452
WLEVDMDVDEVECIMAILINK LVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS
Sbjct: 361 WLEVDMDVDEVECIMAILINKNLVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 413
BLAST of CmoCh12G013280 vs. ExPASy TrEMBL
Match:
A0A1S4DZE8 (enhanced ethylene response protein 5 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498473 PE=4 SV=1)
HSP 1 Score: 776.9 bits (2005), Expect = 4.5e-221
Identity = 395/413 (95.64%), Postives = 404/413 (97.82%), Query Frame = 0
Query: 39 MAYLSMGEAHRRITEYLNRFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANR 98
MAYLSMGEAHRRITEYLNRFSDSV+SQDG SLKSLL+LSSNSPNLLALADSLNVFQDANR
Sbjct: 1 MAYLSMGEAHRRITEYLNRFSDSVSSQDGVSLKSLLALSSNSPNLLALADSLNVFQDANR 60
Query: 99 LIRQSDRYSQFGDILVNFFRALQCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEAL 158
LIRQSDRYSQFG++LVNFFRALQCYRLGNLVDAY AFEK ANAFTQEFRSWDSAWALEAL
Sbjct: 61 LIRQSDRYSQFGEMLVNFFRALQCYRLGNLVDAYQAFEKFANAFTQEFRSWDSAWALEAL 120
Query: 159 YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF 218
YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF
Sbjct: 121 YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF 180
Query: 219 KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS 278
KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS
Sbjct: 181 KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS 240
Query: 279 YALMHCNPQREANIRMILKYLIPVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLL 338
YALMHCNPQRE+NIRMILKYLIPVKLSMGILPT+ LLEKYNLFEY NVVQ+L RGDPR L
Sbjct: 241 YALMHCNPQRESNIRMILKYLIPVKLSMGILPTKSLLEKYNLFEYENVVQALKRGDPRPL 300
Query: 339 RQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQ 398
R ALQEHED+FLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQ
Sbjct: 301 RHALQEHEDQFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQ 360
Query: 399 WLEVDMDVDEVECIMAILINKGLVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 452
WLEVDMD+DEVECIMAILINK LVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS
Sbjct: 361 WLEVDMDIDEVECIMAILINKSLVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 413
BLAST of CmoCh12G013280 vs. TAIR 10
Match:
AT2G19560.1 (proteasome family protein )
HSP 1 Score: 644.0 bits (1660), Expect = 8.7e-185
Identity = 321/413 (77.72%), Postives = 367/413 (88.86%), Query Frame = 0
Query: 39 MAYLSMGEAHRRITEYLNRFSDSVASQDGASLKSLLSLSSNSPNLLALADSLNVFQDANR 98
MAY+SMGEAHRRITEYLNRF D+V+ QD ++L LLS SSNSP LL+LAD+LNVFQD++
Sbjct: 1 MAYVSMGEAHRRITEYLNRFCDAVSYQDSSTLCRLLSFSSNSPPLLSLADALNVFQDSSS 60
Query: 99 LIRQSDRYSQFGDILVNFFRALQCYRLGNLVDAYHAFEKSANAFTQEFRSWDSAWALEAL 158
LIRQSDR+S++G+IL + FR+LQ YR+GNLV+AY AF+K ANAF QEFR+W+SAWALEAL
Sbjct: 61 LIRQSDRFSEYGEILAHVFRSLQSYRVGNLVEAYLAFDKFANAFVQEFRNWESAWALEAL 120
Query: 159 YVVAYEIRIIAERADRELASNGKSPEKLKGAGSFLMKVFGVLAGKGPKRVGALYVTCQLF 218
YVV YEIR++AE+AD++L SNGKSPEKLK AGS LMKVFGVLAGKGPKRVGALYVTCQLF
Sbjct: 121 YVVCYEIRVLAEKADKDLTSNGKSPEKLKAAGSLLMKVFGVLAGKGPKRVGALYVTCQLF 180
Query: 219 KIYFKLGTVHLCRSVIRSIETARIFDFEEFPKRDRVTYMYYTGRLEVFNENFPAADQKLS 278
K YFKLGTV+LCRSVIRSIETARIFDFEEFP+RD+VTYMYYTGRLEVFNENFPAAD KLS
Sbjct: 181 KTYFKLGTVNLCRSVIRSIETARIFDFEEFPRRDKVTYMYYTGRLEVFNENFPAADTKLS 240
Query: 279 YALMHCNPQREANIRMILKYLIPVKLSMGILPTRVLLEKYNLFEYGNVVQSLTRGDPRLL 338
YAL +CNP+RE NIRMILKYL+PVKLS+GI+P LL YNL EY +VQ+L +GD RLL
Sbjct: 241 YALQNCNPKRERNIRMILKYLVPVKLSLGIIPKDELLRNYNLHEYTKIVQALRKGDLRLL 300
Query: 339 RQALQEHEDRFLRSGVYLVLEKLELQVYQRLVKKIYFIQRQKDPNKAHQIKLEVIVKALQ 398
R ALQEHEDRFLRSGVYLVLEKLELQVYQRL+KKIY Q+ DP +AHQ+KLE I KAL+
Sbjct: 301 RHALQEHEDRFLRSGVYLVLEKLELQVYQRLMKKIYINQKLSDPARAHQLKLEGIAKALR 360
Query: 399 WLEVDMDVDEVECIMAILINKGLVKGYFAHKSKVAVVSKQDPFPRLNGKPVGS 452
WL++DMD+DEVECIM ILI K LVKGY AHKSKV V+SKQDPFP+LNGKPV S
Sbjct: 361 WLDMDMDLDEVECIMTILIYKNLVKGYLAHKSKVVVLSKQDPFPKLNGKPVSS 413
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q8GWE6 | 1.2e-183 | 77.72 | Enhanced ethylene response protein 5 OS=Arabidopsis thaliana OX=3702 GN=EER5 PE=... | [more] |
Q5JVF3 | 5.2e-65 | 36.45 | PCI domain-containing protein 2 OS=Homo sapiens OX=9606 GN=PCID2 PE=1 SV=2 | [more] |
Q8BFV2 | 2.6e-64 | 35.97 | PCI domain-containing protein 2 OS=Mus musculus OX=10090 GN=Pcid2 PE=1 SV=1 | [more] |
Q2TBN6 | 1.8e-62 | 35.48 | PCI domain-containing protein 2 OS=Bos taurus OX=9913 GN=PCID2 PE=2 SV=1 | [more] |
Q5FWP8 | 5.3e-62 | 34.88 | PCI domain-containing protein 2 OS=Xenopus laevis OX=8355 GN=pcid2 PE=2 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1FC68 | 3.4e-253 | 100.00 | enhanced ethylene response protein 5 OS=Cucurbita moschata OX=3662 GN=LOC1114440... | [more] |
A0A6J1HL42 | 6.4e-252 | 99.33 | enhanced ethylene response protein 5 OS=Cucurbita maxima OX=3661 GN=LOC111465537... | [more] |
A0A5D3CHS8 | 6.9e-222 | 95.88 | Enhanced ethylene response protein 5 isoform X1 OS=Cucumis melo var. makuwa OX=1... | [more] |
A0A6J1DPL3 | 4.5e-221 | 96.13 | enhanced ethylene response protein 5 OS=Momordica charantia OX=3673 GN=LOC111021... | [more] |
A0A1S4DZE8 | 4.5e-221 | 95.64 | enhanced ethylene response protein 5 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
Match Name | E-value | Identity | Description | |
AT2G19560.1 | 8.7e-185 | 77.72 | proteasome family protein | [more] |