Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, CDS, polypeptide, three_prime_UTR
ATGAAGGACCCGTTGGAGCGGACCAAAGTAGTGATTAGGCACTTGCCCCCTTCTCTTCCTCACTCAGATCTCTTCCACAATATTGATGATCGATTCGCTGGTCGCTACAATTGGTGTTACTATCGTCCTGGAAAGACCAGGTTCGCCCCTTTTCTTCTTCTTCTTCTTCTTCTTCTTCTTGTCGCTTACCTTTTCTCTAATTCCAGCTTTCAAACGAAAAATTAGTCTTATAAACTGGGTTTTCAGTTATATGTTGTCATATGTTGTTATATGTCGTCTTGGATTTTCAGAAATCTGGAGGTTGAATGAGTAGGGCCATGCGCGAAGTGGGTATCTCTTGGATTGTGTCTTATGTGCTTTACGCTCTTTCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCCCTTAGGTGGGCTATTGGGGAGGGTTGGTGATTAGAATAATGTAGGAGACTTTTTTCTCCCCTCCAATTTCATGCCCAAATGTGTATGATTATGATTCTGGAAACGAAATCCAGCAATCGTTCTAGAGTTCGTGTGTTTTTTTGGTTAGAATCGGCTAAGTTCTTACTTTCTCATGTTGTGTACTTTCTCTTGGTTTCTAAAGCTCACAATACTGAAGGTTTTTTACTCATTCAGAGGTTTCTTCTGAAATAATTTTGCAGTCAGAAGGACCAGAGATATGCTCGAGCTTACATAGATTTTACGCGACCTGAAGATGTTTTTGAGTTTGCGGAGTTTTTTGATGGGCATGTTTTCGTTAATGAGAAGGGTAAAGCCGACAGCTTGATCTTTTTTCTTTATTGAACTTATGAGTAGCTTCTTGTTTCTTGTGGGTTTGGTGGGTGATTCATTGAAACCTTACCCATTTCTTAGAAGTGGGAGATAACGAATTTATGAATGATTCCATTGCTTAATGTTGATTGATATGCTTACAGATTGCTCCCTTTGAGGTTTTCTGATGTGCATTACCCATTTCTAAGAACTGTTTAATGCTTGGATCATGTTTCTCTCCATATGTTTAAAGTTTCGATTGAAGTTAACTTATTTAGTCAAATAATTACAGGTGCTCAGTTTAAGGCTGTAGTTGAATATGCACCTTCACAGCGTGTTCCCAGATCATCGACCAAAAAGGATGGTCGTGAGGGGACTATATACAAAGGTAATGATCGGTTGGTGAGAAGCGGAGAGTACGTGGTTGATCTTGTTTTATATTGGTTATTATTTTAGTGTAAATTGTCTTTTCTCGATGAAAGCAGATCCGGATTATTTGGAGTTTCTCAAACTTATTGCAAAGCCTGCTGAGCATCTTCCAAGTGCTGAAATACAGTTGGAAAGAAAAGAAGCA
GAGCAATCTGGTTTGTTATTATGTTCACTGACTGACTTATTGATATTACTCGCGTTCGTATATGTTTATAATGTATAATAGGGTGCCGAAAAGTTGCTGAATACTCATTATGTTACTTTGCTATGAGCTTCCATAGGGATACTATTCACTTTAAATTCAGATAATTCTTGAAGTTTTATATTTCTACTTTTGTTTGCAGCTGCTGCGAAAGAAACTCCAATCGTTACCCCTCTCATGGAGTTTGTTCGCCAGAAACGAGCAGTTGAGAGCGGAACGCAGGTGAGATTGCTGTATTCTTAATACATGGACCTGTGTGCTATATTTTAATCATTCAGAGAAAATGAAACCACTTTCTTTATTTTTTGTACTTCATAAGGTGTTGGTTCTGTCCATGTAGAGATTATTATTTATACATTCACTTCAACATTAAGTCGAGATTCATTCATTGGAACTCGTCTGCTGCTTGTGTAATGATTTCAGTATCTTTTCCTTTCGTATTGGTTAATACCATGCATTATGTGAATCCAAATGCTCTTACGATGGTTTTCTATGTGAGCTTAAAGATTGTATTCCTCTTCTAATTTATAATATACCAATTCCTCATTATCTCATGTAAAGTTTTTTTTGGTTCAATAGGGATCTTCAGTACCTCGGAAGGTCAAAAGAGGTGGGGGAGCATCCTCTAGAAAGCCTGAGTCGAATTCCGTGAAGCGAGGGATGGAGAAGAAGAAGGCATGTGCTACTTCTTCAAAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAACTTTATAAAATTCACAAGTGTTGGTTTGCTAGTGTCTCCACTATTTTTCCCCTATAGTTTTTGGGAAAATAAGCTTTCTTTGCTCATGTTAGATCTAAGGATGGGAAGCAGAATGTTTTTAACTAACAAAATAGGATAATTACAAAAGTCCTTACAAACCAAGCAACTCTTAAGGCGTCAACCAAAATTGGAAATGAAGGCAGCAAACTGCCATCTCCCAAGGATGCGATCTAAATCCTCAAATGCTCTCCTTCTAAGAATGCATTGGTGCAGCCTTAACACACAAAGGAAGATTGCTTCTGTATTTCATATGTAGTGTCAACTCCCTAATGCAAGCCCATACTCCTAGTCCTGCAGTTCTAGAAACTTGACCTTCTTGGAAATTTGTACTTTCGACAGTGAGGTGAAGCAAAAACTAATGATATCAATGGTTTTGACTGCAGAAACCAATCTAAAGTTTCTAATGTTATTGGTGGAAATAGTTTGCCTAACTGTGGTGTAAACTGTGTCATGCTACACCACGTCGAGGTTATGATAAGTGAACGTTGAAATTAAGGCTTTAGTTTTTTGCCCCTCGCACTTACTTCGTGGATACCCTTTTTTTGTGAACAAATTGTGTCTATATTTTTGTCTGAAGACTTTAGTTCTGTTATACATGCAGTATATTTTAAAGGAGAGTGTGAAGAACACAAATCGCAGAGACAAGTCAAATTTCATTCTGGTGCCCAGGCGAGAGGATCAGTTAGCTACCTCAGGTGGAATTGGAATTTCTGATGTTGGTACAGGTATTTCTATTTAGGTTCTTTGTGGTCAGCGACATCATTTCTGTCATTTTTGTTTTGTTCTAAGTTATTGATCCGATATTTTATTGTTAGGCTGTTGAAAAATATTATCTTAGCAATTCAGTGTTTCACGTCATTGAGGTGCAGGGGAACGTGAGCCTATTTGATTTAAGATCTGAACACTTTATGCACTATTCCGCTGCTGGATTTTGTTTGTTCTTTTTTTTTTTTTTTTTTGGAAGAGTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGAAGAGTTCTCACAATCATTTTATATTTTTTCCCATTTCAAATTACCCTGAGCCCAGTAGTAATGTTCTTGTAATTTTCACTTTCCTCTTGATTAATAATTGTTTTCTTGTTTCTTTCTTGGATTCTGCCATATTGTGCTGTTGCAGATTGCAGGATTGTAATATTTGAACATGCATTTTAATATACGCATTATTAAGTTACCTTAAACACTTGGAAGTGATATATTTGAGGGATGGTAGGGAAAAGGAAAAAGGAAAAAGGAAAACAAGGTTGTTCATTAGATAGTGCACATTGTCTGAAATTTTGAAATATGTTCTTTCCCTGCATGGGACTTTGCTACACATTTGTTTCTTTTAATTTATCAATATGCTCTACATGTTTATAATTAACTTGCAGTCTAGAATTTGAAGCCTTTGTTTCATATGTATCTGTAACAGCTGACTCTGGAAAGAAAAAAATTCTGCTTCTCAAGGGGAAAGAGCGAGATATATCTCATGTACGTTTCTTTAGTTCTCTCGACTAATTATTCATTTCCAACTTTGTTACTCGCTTTAGAATTATTAATTTGTGTATCAATTGTCACCCAAGGCTGATTTTGGAGGGCACAAATTCTATTCTCTTGCCTCAAATTTTGACACTTGCTTATCTATTTCTTGGATAATGATACATTTGCAGGTTTCTGATGGTATGCTACAGTTGCAGAGCGCGACATCTTCTGGGACTTCTCCTGCTAGTGCATCAAAACATAATCAGAGGCGTGAGGCTAGTGGAAGCATGATCAGAAGCATTCTTTTAAATAATGAATCACGTCATGGACAGACTTCATCAGGAGCTCCATCTCATCAAAAAATTCAGATCCTGAACTCAGACAATGGGAAGCGACTACCTCGTTCCATTAATGCTCGATCTGGGTCTAATGATATGTCCAGTAATGAACCAAATCCATCTGGTTTGGAAGGGGATGGAAAAAGGCCTTCAGATGGTAAACCTAGTAAGAAGGAACTTCATGGCCTGGGTAGTCTTAGCGAGAAGCAAGAAAAACGAATAAGAAACAAAGATAGACCTGATCGTGGTGTGTGGGCGCCACGTAGTCGTTCAGATGCTTCAGTCTCACAACTTGAGGAATCCTCTGTTGCACAATCTAGCCATTTGCTTTCTGACTCAGTTGAAGGTACTACTATTATAATGCTTTGGAGACGATGAGGTGCATATATATACAATCATATTATTGAAGTATCGGACCTTGATCATTGTGTCAAATCATACTGCTTGATATGAGCTAAATGATTTATATAGACTCTGTTTCACGAGAAATTATAACCCAATATATATTTTATGCATTTACAGCATACCGTGGGGAAATGAAAGAAGATATTCATGTAAGCAGGACTGGGGATGTTACAACTACTGTGAGTGGACGTAGCAGTTCAGTGGAAAATGGTTGTACTTCCTCACGAACGCTTCTT
ACGTTCACTTATTTTAGGAAACTTTAAAAGTACTTTCTACTTTTCGTTTAATCTAAAACATGTTGTGCTTTCTTTAGGTTCTGTTAGACATGGTGGACGTCGTGGGGCTGGGCATGTCATGAAGGATGATGGGTCATCGAACCCAAATGAAGGGAAGCCGTCTAAAAGAGGTGTTGTCAGTGGTCATGAGGTTCGCTTCTGCATCCATGCTAGTGATTGTATCTCTCGTGTTACATGATCACTTCTTTTTCAGAAACTCATTATCTTTGATGCCTATGCAGAAACAAGTATGGGTTCAGAAGTCATCTTCAGGTTCTTAACATATTTAAATCATGTGAGTCTTTTGCATATTAGCAATTTTTTATCTCCTTTATGATACGAGCAAGACTGTTTTAAGAGGTTTTTGTTCCCAGAGATGTCCAACCGAACGACAAAGCAGTGTCTAAATTAACTAAAGATTCTGGTTTAAAGTAACCATAGCACATGTTTGAATCCATCCATCGTGCCAACATTTTGCTAATATAATTAAGAGGTTATTGCTTATATTGCTTAGATTCATCGCCTTCCTCACTGCTGCCATGCATTCTCCTTGAATTCATCTGATATCTGCATATAATTAACATATTGGCTGGATTTTGTGTGCTGTTTTATTTATTGTTCTCTATTGGATCGTGACGATGTGTATGCTATCGAACAGATAATGGCATCCAGACAGGACTATGGTTCCTCAAACTTCCAGGGGGAATATCTCCATTCTGAGGACTTAATGCACATCCTCAGAAGGCATCATCATGTAAAATGGAGATATGCCCTGGAAGCAGTTGGTAACAGTAATAGGTTTCATGAACTGGATCGCAAGATGCAATTGTTTCGTATACGGGTGGTTACAAAACACAGAAGTGTTCTCTGGTTGAAGACCGAATATGCGTTCTATGGATTTCTGGATTTACAAAATCGAAGTTTTTTTGGTTGAAGACCAATATGGTTTGTGGATTTTGGTTTGGTATGATCTGATTTTTTTGCCCAAACATACCAAGTGTGACACTGCTTCACCTGGAGGAACGTGCTGGTATATACATACGTGGCTTGCGTGACACGAGATCAAAGCCAAGAGGGAGATTCATTCTAACTTTCCCTTTGCTTGATGTTTTGGTCATAATGCTATGTTTCTTTTCCCTTAGGAGCTTGAAATATAGAAAGTACCTAGTAGTACTCTGCTTACCATAATCTCACTGGGGGTGGTTAGTACACAATTGTACTTGATTCTTTGCGTGGGGCTGTGAATCAGAATCATCAGACTTAATACTCTTTCATGTCGTATATGAAGAATGTGATGGATGATTATGATATATGTGGACTAATATATCACTTTTATTTA
mRNA sequence
ATGAAGGACCCGTTGGAGCGGACCAAAGTAGTGATTAGGCACTTGCCCCCTTCTCTTCCTCACTCAGATCTCTTCCACAATATTGATGATCGATTCGCTGGTCGCTACAATTGGTGTTACTATCGTCCTGGAAAGACCAGGTTCGCCCCTTTTCTTCTTCTTCTTCTTCTTCTTCTTCTTGTGGGCTATTGGGGAGGGTTGGTGATTAGAATAATTCAGAAGGACCAGAGATATGCTCGAGCTTACATAGATTTTACGCGACCTGAAGATGTTTTTGAGTTTGCGGAGTTTTTTGATGGGCATGTTTTCGTTAATGAGAAGGGTGCTCAGTTTAAGGCTGTAGTTGAATATGCACCTTCACAGCGTGTTCCCAGATCATCGACCAAAAAGGATGGTCGTGAGGGGACTATATACAAAGATCCGGATTATTTGGAGTTTCTCAAACTTATTGCAAAGCCTGCTGAGCATCTTCCAAGTGCTGAAATACAGTTGGAAAGAAAAGAAGCAGAGCAATCTGCTGCTGCGAAAGAAACTCCAATCGTTACCCCTCTCATGGAGTTTGTTCGCCAGAAACGAGCAGTTGAGAGCGGAACGCAGTATATTTTAAAGGAGAGTGTGAAGAACACAAATCGCAGAGACAAGTCAAATTTCATTCTGGTGCCCAGGCGAGAGGATCAGTTAGCTACCTCAGGTGGAATTGGAATTTCTGATGCTGTTGAAAAATATTATCTTAGCAATTCAGTGTTTCACGTCATTGAGGTGCAGGGGAACATTGCAGGATTTCTAGAATTTGAAGCCTTTGTTTCATATGTATCTGTAACAGCTGACTCTGGAAAGAAAAAAATTCTGCTTCTCAAGGGGAAAGAGCGAGATATATCTCATGTTTCTGATGGTATGCTACAGTTGCAGAGCGCGACATCTTCTGGGACTTCTCCTGCTAGTGCATCAAAACATAATCAGAGGCGTGAGGCTAGTGGAAGCATGATCAGAAGCATTCTTTTAAATAATGAATCACGTCATGGACAGACTTCATCAGGAGCTCCATCTCATCAAAAAATTCAGATCCTGAACTCAGACAATGGGAAGCGACTACCTCGTTCCATTAATGCTCGATCTGGGTCTAATGATATGTCCAGTAATGAACCAAATCCATCTGGTTTGGAAGGGGATGGAAAAAGGCCTTCAGATGGTAAACCTAGTAAGAAGGAACTTCATGGCCTGGGTAGTCTTAGCGAGAAGCAAGAAAAACGAATAAGAAACAAAGATAGACCTGATCGTGGTGTGTGGGCGCCACGTAGTCGTTCAGATGCTTCAGTCTCACAACTTGAGGAATCCTCTGTTGCACAATCTAGCCATTTGCTTTCTGACTCAGTTGAAGACTCTGTTTCACGAGAAATTATAACCCAATATATATTTTATGCATTTACAGCATACCGTGGGGAAATGAAAGAAGATATTCATGTAAGCAGGACTGGGGATGTTACAACTACTGTGAGTGGACGAAACTTTAAAAGTACTTTCTACTTTTCGTTTAATCTAAAACATGTTGTGCTTTCTTTAGGTTCTGTTAGACATGGTGGACGTCGTGGGGCTGGGCATGTCATGAAGGATGATGGGTCATCGAACCCAAATGAAGGGAAGCCGTCTAAAAGAGGTGTTGTCAGTGGTCATGAGGTTCGCTTCTGCATCCATGCTAGTGATTAAACAAGTATGGGTTCAGAAGTCATCTTCAGGTTCTTAACATATTTAAATCATATAATGGCATCCAGACAGGACTATGGTTCCTCAAACTTCCAGGGGGAATATCTCCATTCTGAGGACTTAATGCACATCCTCAGAAGGCATCATCATGTAAAATGGAGATATGCCCTGGAAGCAGTTGGTAACAGTAATAGGTTTCATGAACTGGATCGCAAGATGCAATTGTTTCGTATACGGGTGGTTACAAAACACAGAAGTGTTCTCTGGTTGAAGACCGAATATGCGTTCTATGGATTTCT
GGATTTACAAAATCGAAGTTTTTTTGGTTGAAGACCAATATGGTTTGTGGATTTTGGTTTGGTATGATCTGATTTTTTTGCCCAAACATACCAAGTGTGACACTGCTTCACCTGGAGGAACGTGCTGGTATATACATACGTGGCTTGCGTGACACGAGATCAAAGCCAAGAGGGAGATTCATTCTAACTTTCCCTTTGCTTGATGTTTTGGTCATAATGCTATGTTTCTTTTCCCTTAGGAGCTTGAAATATAGAAAGTACCTAGTAGTACTCTGCTTACCATAATCTCACTGGGGGTGGTTAGTACACAATTGTACTTGATTCTTTGCGTGGGGCTGTGAATCAGAATCATCAGACTTAATACTCTTTCATGTCGTATATGAAGAATGTGATGGATGATTATGATATATGTGGACTAATATATCACTTTTATTTA
Coding sequence (CDS)
ATGAAGGACCCGTTGGAGCGGACCAAAGTAGTGATTAGGCACTTGCCCCCTTCTCTTCCTCACTCAGATCTCTTCCACAATATTGATGATCGATTCGCTGGTCGCTACAATTGGTGTTACTATCGTCCTGGAAAGACCAGGTTCGCCCCTTTTCTTCTTCTTCTTCTTCTTCTTCTTCTTGTGGGCTATTGGGGAGGGTTGGTGATTAGAATAATTCAGAAGGACCAGAGATATGCTCGAGCTTACATAGATTTTACGCGACCTGAAGATGTTTTTGAGTTTGCGGAGTTTTTTGATGGGCATGTTTTCGTTAATGAGAAGGGTGCTCAGTTTAAGGCTGTAGTTGAATATGCACCTTCACAGCGTGTTCCCAGATCATCGACCAAAAAGGATGGTCGTGAGGGGACTATATACAAAGATCCGGATTATTTGGAGTTTCTCAAACTTATTGCAAAGCCTGCTGAGCATCTTCCAAGTGCTGAAATACAGTTGGAAAGAAAAGAAGCAGAGCAATCTGCTGCTGCGAAAGAAACTCCAATCGTTACCCCTCTCATGGAGTTTGTTCGCCAGAAACGAGCAGTTGAGAGCGGAACGCAGTATATTTTAAAGGAGAGTGTGAAGAACACAAATCGCAGAGACAAGTCAAATTTCATTCTGGTGCCCAGGCGAGAGGATCAGTTAGCTACCTCAGGTGGAATTGGAATTTCTGATGCTGTTGAAAAATATTATCTTAGCAATTCAGTGTTTCACGTCATTGAGGTGCAGGGGAACATTGCAGGATTTCTAGAATTTGAAGCCTTTGTTTCATATGTATCTGTAACAGCTGACTCTGGAAAGAAAAAAATTCTGCTTCTCAAGGGGAAAGAGCGAGATATATCTCATGTTTCTGATGGTATGCTACAGTTGCAGAGCGCGACATCTTCTGGGACTTCTCCTGCTAGTGCATCAAAACATAATCAGAGGCGTGAGGCTAGTGGAAGCATGATCAGAAGCATTCTTTTAAATAATGAATCACGTCATGGACAGACTTCATCAGGAGCTCCATCTCATCAAAAAATTCAGATCCTGAACTCAGACAATGGGAAGCGACTACCTCGTTCCATTAATGCTCGATCTGGGTCTAATGATATGTCCAGTAATGAACCAAATCCATCTGGTTTGGAAGGGGATGGAAAAAGGCCTTCAGATGGTAAACCTAGTAAGAAGGAACTTCATGGCCTGGGTAGTCTTAGCGAGAAGCAAGAAAAACGAATAAGAAACAAAGATAGACCTGATCGTGGTGTGTGGGCGCCACGTAGTCGTTCAGATGCTTCAGTCTCACAACTTGAGGAATCCTCTGTTGCACAATCTAGCCATTTGCTTTCTGACTCAGTTGAAGACTCTGTTTCACGAGAAATTATAACCCAATATATATTTTATGCATTTACAGCATACCGTGGGGAAATGAAAGAAGATATTCATGTAAGCAGGACTGGGGATGTTACAACTACTGTGAGTGGACGAAACTTTAAAAGTACTTTCTACTTTTCGTTTAATCTAAAACATGTTGTGCTTTCTTTAGGTTCTGTTAGACATGGTGGACGTCGTGGGGCTGGGCATGTCATGAAGGATGATGGGTCATCGAACCCAAATGAAGGGAAGCCGTCTAAAAGAGGTGTTGTCAGTGGTCATGAGGTTCGCTTCTGCATCCATGCTAGTGATTAA
Protein sequence
MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTRFAPFLLLLLLLLLVGYWGGLVIRIIQKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPSQRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPIVTPLMEFVRQKRAVESGTQYILKESVKNTNRRDKSNFILVPRREDQLATSGGIGISDAVEKYYLSNSVFHVIEVQGNIAGFLEFEAFVSYVSVTADSGKKKILLLKGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSMIRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLEGDGKRPSDGKPSKKELHGLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVAQSSHLLSDSVEDSVSREIITQYIFYAFTAYRGEMKEDIHVSRTGDVTTTVSGRNFKSTFYFSFNLKHVVLSLGSVRHGGRRGAGHVMKDDGSSNPNEGKPSKRGVVSGHEVRFCIHASD
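The protein sequence above should be the frame-0 translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene, the first and last codons of the CDS can be checked against the two ends of the protein. The helper name `translate` is illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt of the CDS listed above: matches the protein start.
print(translate("ATGAAGGACCCGTTGGAGCGGACC"))  # MKDPLERT
# Last 15 nt of the CDS (in frame), ending in the TAA stop codon.
print(translate("CATGCTAGTGATTAA"))  # HASD
```

Passing the full CDS string to the same function should reproduce the whole protein sequence, provided the assumption about the genetic code holds.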
Homology
BLAST of CmoCh09G013210 vs. ExPASy Swiss-Prot
Match:
Q9FVW4 (Regulator of nonsense transcripts UPF3 OS=Arabidopsis thaliana OX=3702 GN=UPF3 PE=1 SV=1)
HSP 1 Score: 322.8 bits (826), Expect = 7.9e-87
Identity = 220/571 (38.53%), Positives = 296/571 (51.84%), Query Frame = 0
Query: 1 MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTRFAPFLLLLLLLLL 60
MK+PL++ KVV+RHLPPSL SDL ID RFA RYNW +RPGK+ +
Sbjct: 1 MKEPLQKKKVVVRHLPPSLSQSDLLSQIDPRFADRYNWVSFRPGKSSY------------ 60
Query: 61 VGYWGGLVIRIIQKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS 120
K+Q+Y+RAY+ F PEDV+EFA FF+GHVFVNEKGAQFKA+VEYAPS
Sbjct: 61 -------------KNQKYSRAYVSFKAPEDVYEFAAFFNGHVFVNEKGAQFKAIVEYAPS 120
Query: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPI 180
QRVP+ S KKD REG+I KDPDYLEFLK+IA+P E+LPSAEIQLER+EAEQS A+K PI
Sbjct: 121 QRVPKPSDKKDPREGSISKDPDYLEFLKVIAQPVENLPSAEIQLERREAEQSGASKAAPI 180
Query: 181 VTPLMEFVRQKRAVESGTQYIL---------------KESVKNTNRRDKSNFILVPRRED 240
VTPLMEF+RQKRA G Q + K S + + R + +
Sbjct: 181 VTPLMEFIRQKRATVMGPQGLSDIRRGGRRTRVVSANKPSPRPSKRNSEKKKYVEKESSK 240
Query: 241 QLATSGGIGISDAVEKYYLSNSVFHVIEVQGNIAGFLEFEAFVSYVSVTADSGKKKILLL 300
+ +S + Y SNS E+ GN + ++ +++T DSGKKKILLL
Sbjct: 241 NVPRKTTADVSSSKPDYRQSNSSGK--ELPGNETAAI-IDSSPPGIALTMDSGKKKILLL 300
Query: 301 KGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSMIRSILLNNESRHGQTSS 360
+ K+RD + Q ++ + ++ S+ NQ+ + G +I+ ILL N+SR Q+S+
Sbjct: 301 RSKDRDNPDNPPPQPE-QHIDTNLSRNSTDSRQNQKSDVGGRLIKGILLRNDSRPSQSST 360
Query: 361 GAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLEGDGKRPSDGKPSKKELH 420
S Q+++ ++N KR R N R+G K+ H
Sbjct: 361 FVQSEQRVEPSEAENYKRPSRPANTRAG----------------------------KDYH 420
Query: 421 GLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVAQSSHLLSDSVEDSVSRE 480
G++SEKQE+R RNKDRPDR +WAP R D S Q S+
Sbjct: 421 TSGTISEKQERRTRNKDRPDRVMWAP--RRDGSEDQPLSSA------------------- 465
Query: 481 IITQYIFYAFTAYRGEMKEDIHVSRTGDVTTTVSGRNFKSTFYFSFNLKHVVLSLGSVRH 540
GE+K+ + R+G+V + G ++ GS RH
Sbjct: 481 -----------GNNGEVKDRMFSQRSGEVVNSSGGHTLEN---------------GSARH 465
Query: 541 GGRRGAGHVMKDDGSSNPNEGKPSKRGVVSG 557
RR G K++ EGK S+RG G
Sbjct: 541 SSRRVGGRNRKEEVVI--GEGKTSRRGSGGG 465
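The percentages in an HSP header are the match counts divided by the alignment length (for the hit above, 220 of 571 aligned columns are identical). As a rough sketch, the statistics line can be parsed and the percentages recomputed as follows. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of BLAST itself.

```python
import re

# Matches "Identity = 220/571" and "Positives = 296/571";
# the optional "i" tolerates a label written without it.
STAT = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")


def hsp_percentages(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP statistics line."""
    result = {}
    for name, count, length in STAT.findall(line):
        key = "identity" if name == "Identity" else "positives"
        result[key] = round(100 * int(count) / int(length), 2)
    return result


line = "Identity = 220/571 (38.53%), Positives = 296/571 (51.84%), Query Frame = 0"
print(hsp_percentages(line))  # {'identity': 38.53, 'positives': 51.84}
```

The denominator is the alignment length including gap columns, which is why it can exceed the length of the aligned region in either sequence.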
BLAST of CmoCh09G013210 vs. ExPASy Swiss-Prot
Match:
B0S733 (Regulator of nonsense transcripts 3A OS=Danio rerio OX=7955 GN=upf3a PE=1 SV=2)
HSP 1 Score: 82.8 bits (203), Expect = 1.4e-14
Identity = 53/139 (38.13%), Positives = 84/139 (60.43%), Query Frame = 0
Query: 78 YARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPSQRVPRSS-TKKDGREGT 137
++RAYI+F PED+ F + FDG+VF++ KG ++ AVVE+AP Q+V + KKD + GT
Sbjct: 86 FSRAYINFKNPEDIIIFRDRFDGYVFIDNKGQEYPAVVEFAPFQKVSKKKLKKKDAKAGT 145
Query: 138 IYKDPDYLEFLKLIAKPAE-HLPSAEIQLERKEAE-QSAAAKETPIVTPLMEFVRQKRAV 197
I +DP+Y FL+ + E + + E L EA+ + AK T TPL+E+++ K+
Sbjct: 146 IEEDPEYRRFLENYSCDEEKSMANPETLLGEIEAKTRELIAKRT---TPLLEYIKNKKLE 205
Query: 198 ESGTQYILKESVKNTNRRD 214
+ Q I +E + RR+
Sbjct: 206 K---QRIREEKREERRRRE 218
BLAST of CmoCh09G013210 vs. ExPASy Swiss-Prot
Match:
Q9H1J1 (Regulator of nonsense transcripts 3A OS=Homo sapiens OX=9606 GN=UPF3A PE=1 SV=1)
HSP 1 Score: 77.8 bits (190), Expect = 4.4e-13
Identity = 48/139 (34.53%), Positives = 82/139 (58.99%), Query Frame = 0
Query: 78 YARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPSQRVPRSS-TKKDGREGT 137
Y+RAYI+F P+D+ F + FDG++F++ KG ++ AVVE+AP Q++ + KKD + G+
Sbjct: 111 YSRAYINFRNPDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGS 170
Query: 138 IYKDPDYLEFLKLIAKPAEHL-PSAEIQLERKEAE-QSAAAKETPIVTPLMEFVRQKRAV 197
I DP+Y +FL+ E + E L EA+ + A+ T TPL+E+++ ++
Sbjct: 171 IEDDPEYKKFLETYCVEEEKTSANPETLLGEMEAKTRELIARRT---TPLLEYIKNRKLE 230
Query: 198 ESGTQYILKESVKNTNRRD 214
+ Q I +E + RR+
Sbjct: 231 K---QRIREEKREERRRRE 243
BLAST of CmoCh09G013210 vs. ExPASy Swiss-Prot
Match:
Q10267 (Nonsense-mediated mRNA decay protein 3 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=upf3 PE=3 SV=1)
HSP 1 Score: 69.3 bits (168), Expect = 1.6e-10
Identity = 51/205 (24.88%), Positives = 91/205 (44.39%), Query Frame = 0
Query: 9 KVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTRFAPFLLLLLLLLLVGYWGGLV 68
KV++ +LPP+LP +I+ F W + GK
Sbjct: 13 KVLVFNLPPTLPEQVFLQSINS-FLPHVEWHRFSKGKA---------------------- 72
Query: 69 IRIIQKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPSQRVPRSST 128
+ + + + AY+ F V EF + GH F+++K ++A+V AP Q++P S
Sbjct: 73 -TVGTRSELLSFAYLKFQSATAVQEFFRVYQGHTFIDKKNNTYRAIVTIAPYQKIPPSKV 132
Query: 129 KKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAA----AKETPIVTPL 188
K D EG++ +DP + EF +++R+ Q+A+ ++ TPL
Sbjct: 133 KADSLEGSLEQDPKFQEF----------------KVQRESYSQTASNDDVIEKLQTSTPL 177
Query: 189 MEFVRQKR--AVESGTQYILKESVK 208
++++ +K+ VE G K+SVK
Sbjct: 193 LQYLAEKKNAVVEKGKSKPSKKSVK 177
BLAST of CmoCh09G013210 vs. ExPASy Swiss-Prot
Match:
Q9BZI7 (Regulator of nonsense transcripts 3B OS=Homo sapiens OX=9606 GN=UPF3B PE=1 SV=1)
HSP 1 Score: 67.0 bits (162), Expect = 7.8e-10
Identity = 100/387 (25.84%), Positives = 170/387 (43.93%), Query Frame = 0
Query: 78 YARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPSQRVPRSSTKK-DGREGT 137
YARAYI+F ED+ F + FDG+VF++ KG ++ A+VE+AP Q+ + TKK D + GT
Sbjct: 94 YARAYINFKNQEDIILFRDRFDGYVFLDNKGQEYPAIVEFAPFQKAAKKKTKKRDTKVGT 153
Query: 138 IYKDPDYLEFLKLIAKPAEHLPSA-EIQLERKEAE-QSAAAKETPIVTPLMEFVRQK-RA 197
I DP+Y +FL+ A E + S E LE EA+ + AK+T TPL+ F++ K R
Sbjct: 154 IDDDPEYRKFLESYATDNEKMTSTPETLLEEIEAKNRELIAKKT---TPLLSFLKNKQRM 213
Query: 198 VESGTQYILKESVKNTNRRDKSNFILVPRREDQLATSGGIGISDAVEKYYLSNSVFHVIE 257
E + + ++ +R++ +E++ I +++ + + E
Sbjct: 214 REEKREERRRREIERKRQREEER---RKWKEEEKRKRKDIEKLKKIDRIPERDKLKD--E 273
Query: 258 VQGNIAGFLEFEAFVSYVSVTADSGKKKIL--LLKGKERDISHVSDGMLQLQSATSSGTS 317
+ + FL + + G +K L K K+ D ++SD QS T S
Sbjct: 274 PKIKVHRFLLQAVNQKNLLKKPEKGDEKELDKREKAKKLDKENLSDERASGQSCTLPKRS 333
Query: 318 PASASKHNQRR--EASGSMIRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSIN 377
+ +R + SG R R + ++ ++ + +R +
Sbjct: 334 DSELKDEKPKRPEDESGRDYR-----EREREYERDQERILRERERLKRQEEERRRQKERY 393
Query: 378 ARSGSNDMSSNEPNPSGLEGDGKRPSDGKPSKKELHGLGSLSEKQEK-----RIRNKDRP 437
+ + E E D R K E G +EK+E+ RIRNKDRP
Sbjct: 394 EKEKTFKRKEEEMKK---EKDTLRDKGKKAESTESIGSSEKTEKKEEVVKRDRIRNKDRP 453
Query: 438 DRGVWAPRSRSDASVSQLEESSVAQSS 452
++ P +RS + ++S+ + S
Sbjct: 454 AMQLYQPGARSRNRLCPPDDSTKSGDS 464
BLAST of CmoCh09G013210 vs. ExPASy TrEMBL
Match:
A0A6J1EI29 (regulator of nonsense transcripts UPF3-like OS=Cucurbita moschata OX=3662 GN=LOC111432763 PE=3 SV=1)
HSP 1 Score: 825.1 bits (2130), Expect = 1.8e-235
Identity = 461/598 (77.09%), Positives = 465/598 (77.76%), Query Frame = 0
Query: 1 MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTRFAPFLLLLLLLLL 60
MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKT
Sbjct: 1 MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTS------------- 60
Query: 61 VGYWGGLVIRIIQKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS 120
QKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS
Sbjct: 61 ------------QKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS 120
Query: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPI 180
QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPI
Sbjct: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPI 180
Query: 181 VTPLMEFVRQKRAVESGTQ--------------------------------YILKESVKN 240
VTPLMEFVRQKRAVESGTQ YILKESVKN
Sbjct: 181 VTPLMEFVRQKRAVESGTQGSSVPRKVKRGGGASSRKPESNSVKRGMEKKKYILKESVKN 240
Query: 241 TNRRDKSNFILVPRREDQLATSGGIGISDAVEKYYLSNSVFHVIEVQGNIAGFLEFEAFV 300
TNRRDKSNFILVPRREDQLATSGGIGISD
Sbjct: 241 TNRRDKSNFILVPRREDQLATSGGIGISDV------------------------------ 300
Query: 301 SYVSVTADSGKKKILLLKGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSM 360
TADSGKKKILLLKGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSM
Sbjct: 301 ----GTADSGKKKILLLKGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSM 360
Query: 361 IRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLE 420
IRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLE
Sbjct: 361 IRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLE 420
Query: 421 GDGKRPSDGKPSKKELHGLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVA 480
GDGKRPSDGKPSKKELHGLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVA
Sbjct: 421 GDGKRPSDGKPSKKELHGLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVA 480
Query: 481 QSSHLLSDSVEDSVSREIITQYIFYAFTAYRGEMKEDIHVSRTGDVTTTVSGRNFKSTFY 540
QSSHLLSDSVE AYRGEMKEDIHVSRTGDVTTTVSGR+
Sbjct: 481 QSSHLLSDSVE-----------------AYRGEMKEDIHVSRTGDVTTTVSGRSSS---- 509
Query: 541 FSFNLKHVVLSLGSVRHGGRRGAGHVMKDDGSSNPNEGKPSKRGVVSGHEVRFCIHAS 567
+ GSVRHGGRRGAGHVMKDDGSSNPNEGKPSKRGVVSGHE + + S
Sbjct: 541 ---------VENGSVRHGGRRGAGHVMKDDGSSNPNEGKPSKRGVVSGHEKQVWVQKS 509
BLAST of CmoCh09G013210 vs. ExPASy TrEMBL
Match:
A0A6J1IG63 (regulator of nonsense transcripts UPF3-like OS=Cucurbita maxima OX=3661 GN=LOC111472590 PE=3 SV=1)
HSP 1 Score: 813.5 bits (2100), Expect = 5.4e-232
Identity = 455/598 (76.09%), Positives = 459/598 (76.76%), Query Frame = 0
Query: 1 MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTRFAPFLLLLLLLLL 60
MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKT
Sbjct: 1 MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTS------------- 60
Query: 61 VGYWGGLVIRIIQKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS 120
QKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS
Sbjct: 61 ------------QKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS 120
Query: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPI 180
QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQS AAKETPI
Sbjct: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSGAAKETPI 180
Query: 181 VTPLMEFVRQKRAVESGTQ--------------------------------YILKESVKN 240
VTPLMEFVRQKRAVESGTQ YILKESVKN
Sbjct: 181 VTPLMEFVRQKRAVESGTQGSSLPRKVKRGGGASSRKPESNSAKRGMEKKKYILKESVKN 240
Query: 241 TNRRDKSNFILVPRREDQLATSGGIGISDAVEKYYLSNSVFHVIEVQGNIAGFLEFEAFV 300
TNRRDKSNFILVPRREDQLATSGGIGISD
Sbjct: 241 TNRRDKSNFILVPRREDQLATSGGIGISDV------------------------------ 300
Query: 301 SYVSVTADSGKKKILLLKGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSM 360
ADSGKKKILLLKGKERDISHVSDGMLQLQS TS+GTSPASASKHNQRREASGSM
Sbjct: 301 ----GIADSGKKKILLLKGKERDISHVSDGMLQLQSVTSTGTSPASASKHNQRREASGSM 360
Query: 361 IRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLE 420
IRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPN SGLE
Sbjct: 361 IRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNSSGLE 420
Query: 421 GDGKRPSDGKPSKKELHGLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVA 480
GDGKRPSDGKPSKKELHGLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVA
Sbjct: 421 GDGKRPSDGKPSKKELHGLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVA 480
Query: 481 QSSHLLSDSVEDSVSREIITQYIFYAFTAYRGEMKEDIHVSRTGDVTTTVSGRNFKSTFY 540
QSSHLLSDSVE AYRGEMKEDIHVSRTGDVTTTVSGRN
Sbjct: 481 QSSHLLSDSVE-----------------AYRGEMKEDIHVSRTGDVTTTVSGRNSS---- 509
Query: 541 FSFNLKHVVLSLGSVRHGGRRGAGHVMKDDGSSNPNEGKPSKRGVVSGHEVRFCIHAS 567
+ GSVRHGGRRGAGHVMKDDGS NPNEGKPSKRGV SGHE + + S
Sbjct: 541 ---------VENGSVRHGGRRGAGHVMKDDGSLNPNEGKPSKRGVASGHEKQVWVQKS 509
BLAST of CmoCh09G013210 vs. ExPASy TrEMBL
Match:
A0A0A0KSY5 (Smg4_UPF3 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G650630 PE=3 SV=1)
HSP 1 Score: 741.1 bits (1912), Expect = 3.4e-210
Identity = 423/598 (70.74%), Positives = 435/598 (72.74%), Query Frame = 0
Query: 1 MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTRFAPFLLLLLLLLL 60
MKDPLERTKVVIRHLPPSL HSDLFH+I DRFAGR+NW YYRPGKT
Sbjct: 1 MKDPLERTKVVIRHLPPSLSHSDLFHHIHDRFAGRFNWSYYRPGKTS------------- 60
Query: 61 VGYWGGLVIRIIQKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS 120
QKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQ+KAVVEYAPS
Sbjct: 61 ------------QKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQYKAVVEYAPS 120
Query: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPI 180
QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQS AAKETPI
Sbjct: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSGAAKETPI 180
Query: 181 VTPLMEFVRQKRAVESGTQ--------------------------------YILKESVKN 240
VTPLMEFVRQKRAVESGTQ YILK+SVKN
Sbjct: 181 VTPLMEFVRQKRAVESGTQGSSVPRKVKRGGAASSRKPESNSMKRGMEKKKYILKDSVKN 240
Query: 241 TNRRDKSNFILVPRREDQLATSGGIGISDAVEKYYLSNSVFHVIEVQGNIAGFLEFEAFV 300
TNRRDKSNFILVPRREDQ ATS IGISD
Sbjct: 241 TNRRDKSNFILVPRREDQSATSSAIGISDV------------------------------ 300
Query: 301 SYVSVTADSGKKKILLLKGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSM 360
TAD GKKKILLLKGKERDISHVSD MLQLQSATSSG SPASASKHN RREA G +
Sbjct: 301 ----GTADFGKKKILLLKGKERDISHVSDDMLQLQSATSSGNSPASASKHNHRREAGGGV 360
Query: 361 IRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLE 420
IRSILLNNE+RHGQ+SS A SHQKIQILNSDNGKR PR NARSGSND+SSNEPNPSG E
Sbjct: 361 IRSILLNNEARHGQSSSVAQSHQKIQILNSDNGKRPPRPTNARSGSNDISSNEPNPSGSE 420
Query: 421 GDGKRPSDGKPSKKELHGLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVA 480
GDGKR SD K SKKELHGLGS SEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSV
Sbjct: 421 GDGKRASDNKFSKKELHGLGSASEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVP 480
Query: 481 QSSHLLSDSVEDSVSREIITQYIFYAFTAYRGEMKEDIHVSRTGDVTTTVSGRNFKSTFY 540
QSSHLLSDSVE A+RGEMKEDIH SRTGDVTT VSGRN
Sbjct: 481 QSSHLLSDSVE-----------------AFRGEMKEDIHGSRTGDVTTIVSGRNSS---- 509
Query: 541 FSFNLKHVVLSLGSVRHGGRRGAGHVMKDDGSSNPNEGKPSKRGVVSGHEVRFCIHAS 567
+ GSVRH GRRGAGHVMKDDGS NPNEGKPSKRGV GHE + + S
Sbjct: 541 ---------VENGSVRHVGRRGAGHVMKDDGSLNPNEGKPSKRGVAGGHEKQVWVQKS 509
BLAST of CmoCh09G013210 vs. ExPASy TrEMBL
Match:
A0A1S3CB69 (regulator of nonsense transcripts UPF3 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498712 PE=3 SV=1)
HSP 1 Score: 738.0 bits (1904), Expect = 2.9e-209
Identity = 421/598 (70.40%), Positives = 434/598 (72.58%), Query Frame = 0
Query: 1 MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTRFAPFLLLLLLLLL 60
MKDPLERTKVVIRHLPPSL HSDLFH+I DRFAGR+NW YYRPGKT
Sbjct: 1 MKDPLERTKVVIRHLPPSLSHSDLFHHIHDRFAGRFNWSYYRPGKTS------------- 60
Query: 61 VGYWGGLVIRIIQKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS 120
QKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQ+KAVVEYAPS
Sbjct: 61 ------------QKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQYKAVVEYAPS 120
Query: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPI 180
QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQS AAKETPI
Sbjct: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSGAAKETPI 180
Query: 181 VTPLMEFVRQKRAVESGTQ--------------------------------YILKESVKN 240
VTPLMEFVRQKRAVESGTQ YILK+SVKN
Sbjct: 181 VTPLMEFVRQKRAVESGTQGSSVPRKVKRGGAASSRKPESNSMKRGMEKKKYILKDSVKN 240
Query: 241 TNRRDKSNFILVPRREDQLATSGGIGISDAVEKYYLSNSVFHVIEVQGNIAGFLEFEAFV 300
TNRRDKSNFILVPRR+DQ A S GIGISD
Sbjct: 241 TNRRDKSNFILVPRRDDQSANSSGIGISDV------------------------------ 300
Query: 301 SYVSVTADSGKKKILLLKGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSM 360
AD GKKKILLLKGKERDISHVSD MLQLQSATSSG SPASASKHN RREA GS+
Sbjct: 301 ----GAADFGKKKILLLKGKERDISHVSDDMLQLQSATSSGNSPASASKHNHRREAGGSV 360
Query: 361 IRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLE 420
IRSILLNNE+RHGQ+SS A SHQKIQILNSDNGKR PR NARSGSND+SSNEPNPSG E
Sbjct: 361 IRSILLNNEARHGQSSSVAQSHQKIQILNSDNGKRPPRPTNARSGSNDISSNEPNPSGSE 420
Query: 421 GDGKRPSDGKPSKKELHGLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVA 480
GDGKR D K SKKELHGLGS SEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSV+
Sbjct: 421 GDGKRAPDNKFSKKELHGLGSASEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVS 480
Query: 481 QSSHLLSDSVEDSVSREIITQYIFYAFTAYRGEMKEDIHVSRTGDVTTTVSGRNFKSTFY 540
QSSHLLSDSVE AYRGEMKEDIH SRT DVTT VSGRN
Sbjct: 481 QSSHLLSDSVE-----------------AYRGEMKEDIHGSRTADVTTIVSGRNSS---- 509
Query: 541 FSFNLKHVVLSLGSVRHGGRRGAGHVMKDDGSSNPNEGKPSKRGVVSGHEVRFCIHAS 567
+ GSVRH GRRGAGHVMKDDGS NPNEGKPSKRGV GHE + + S
Sbjct: 541 ---------VENGSVRHVGRRGAGHVMKDDGSLNPNEGKPSKRGVAGGHEKQVWVQKS 509
BLAST of CmoCh09G013210 vs. ExPASy TrEMBL
Match:
A0A6J1D8M3 (regulator of nonsense transcripts UPF3 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018259 PE=3 SV=1)
HSP 1 Score: 731.1 bits (1886), Expect = 3.5e-207
Identity = 411/598 (68.73%), Positives = 435/598 (72.74%), Query Frame = 0
Query: 1 MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTRFAPFLLLLLLLLL 60
MKDPLERTKVVIRHLPPSLPHSDLFH+IDDRFAGRYNWCYYRPGKT
Sbjct: 1 MKDPLERTKVVIRHLPPSLPHSDLFHHIDDRFAGRYNWCYYRPGKTS------------- 60
Query: 61 VGYWGGLVIRIIQKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS 120
QKDQRY+RAYIDFTRPEDVFEFAEFFDGHVFVNEKG Q+KAV+EYAPS
Sbjct: 61 ------------QKDQRYSRAYIDFTRPEDVFEFAEFFDGHVFVNEKGTQYKAVIEYAPS 120
Query: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPI 180
QRVPRSSTKKDGR+GTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQS AKETPI
Sbjct: 121 QRVPRSSTKKDGRDGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSGVAKETPI 180
Query: 181 VTPLMEFVRQKRAVESGTQ--------------------------------YILKESVKN 240
VTPLMEFVRQKRAVESGTQ YILK+SVKN
Sbjct: 181 VTPLMEFVRQKRAVESGTQGSSVSRKVKRGGGASSRKPESNSMKRGMEKKKYILKDSVKN 240
Query: 241 TNRRDKSNFILVPRREDQLATSGGIGISDAVEKYYLSNSVFHVIEVQGNIAGFLEFEAFV 300
TNRRDKSNFILVPRR+DQ ATS GIGISD
Sbjct: 241 TNRRDKSNFILVPRRDDQPATSSGIGISDV------------------------------ 300
Query: 301 SYVSVTADSGKKKILLLKGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSM 360
TADSGKKKILLLKGKERDISHV+DGML LQSA+SSG SP +A+K NQRREASGS+
Sbjct: 301 ----TTADSGKKKILLLKGKERDISHVADGMLHLQSASSSGNSPTTATKQNQRREASGSL 360
Query: 361 IRSILLNNESRHGQTSSGAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLE 420
IRSILLNNE RHGQ SS + SHQKIQILNSDNGKR PR INARSGSNDMS+NEPNPSG E
Sbjct: 361 IRSILLNNEPRHGQGSSVSQSHQKIQILNSDNGKRPPRPINARSGSNDMSNNEPNPSGSE 420
Query: 421 GDGKRPSDGKPSKKELHGLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVA 480
GDGKR + K SKKELHGLGS SEKQEKRIRNKDRPDRGVWAPRSRSDA VSQL+ESS++
Sbjct: 421 GDGKRAPESKFSKKELHGLGSASEKQEKRIRNKDRPDRGVWAPRSRSDALVSQLDESSIS 480
Query: 481 QSSHLLSDSVEDSVSREIITQYIFYAFTAYRGEMKEDIHVSRTGDVTTTVSGRNFKSTFY 540
QSSHLLSDSVE A+R EMK+DIH SRTGDVTTTVSGR
Sbjct: 481 QSSHLLSDSVE-----------------AFRVEMKDDIHGSRTGDVTTTVSGRGSS---- 509
Query: 541 FSFNLKHVVLSLGSVRHGGRRGAGHVMKDDGSSNPNEGKPSKRGVVSGHEVRFCIHAS 567
+ GSVRH GRRGAGHVMKDDGS NPNEGK SKRG+ GHE + + S
Sbjct: 541 ---------VENGSVRHVGRRGAGHVMKDDGSLNPNEGKSSKRGLAGGHEKQVWVQKS 509
BLAST of CmoCh09G013210 vs. TAIR 10
Match:
AT1G33980.2 (Smg-4/UPF3 family protein)
HSP 1 Score: 326.2 bits (835), Expect = 5.1e-89
Identity = 223/571 (39.05%), Postives = 299/571 (52.36%), Query Frame = 0
Query: 1 MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTRFAPFLLLLLLLLL 60
MK+PL++ KVV+RHLPPSL SDL ID RFA RYNW +RPGK+R
Sbjct: 1 MKEPLQKKKVVVRHLPPSLSQSDLLSQIDPRFADRYNWVSFRPGKSR------------- 60
Query: 61 VGYWGGLVIRIIQKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS 120
+GY K+Q+Y+RAY+ F PEDV+EFA FF+GHVFVNEKGAQFKA+VEYAPS
Sbjct: 61 LGY----------KNQKYSRAYVSFKAPEDVYEFAAFFNGHVFVNEKGAQFKAIVEYAPS 120
Query: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPI 180
QRVP+ S KKD REG+I KDPDYLEFLK+IA+P E+LPSAEIQLER+EAEQS A+K PI
Sbjct: 121 QRVPKPSDKKDPREGSISKDPDYLEFLKVIAQPVENLPSAEIQLERREAEQSGASKAAPI 180
Query: 181 VTPLMEFVRQKRAVESGTQYIL---------------KESVKNTNRRDKSNFILVPRRED 240
VTPLMEF+RQKRA G Q + K S + + R + +
Sbjct: 181 VTPLMEFIRQKRATVMGPQGLSDIRRGGRRTRVVSANKPSPRPSKRNSEKKKYVEKESSK 240
Query: 241 QLATSGGIGISDAVEKYYLSNSVFHVIEVQGNIAGFLEFEAFVSYVSVTADSGKKKILLL 300
+ +S + Y SNS E+ GN + ++ +++T DSGKKKILLL
Sbjct: 241 NVPRKTTADVSSSKPDYRQSNSSGK--ELPGNETAAI-IDSSPPGIALTMDSGKKKILLL 300
Query: 301 KGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSMIRSILLNNESRHGQTSS 360
+ K+RD + Q ++ + ++ S+ NQ+ + G +I+ ILL N+SR Q+S+
Sbjct: 301 RSKDRDNPDNPPPQPE-QHIDTNLSRNSTDSRQNQKSDVGGRLIKGILLRNDSRPSQSST 360
Query: 361 GAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLEGDGKRPSDGKPSKKELH 420
S Q+++ ++N KR R N R+G K+ H
Sbjct: 361 FVQSEQRVEPSEAENYKRPSRPANTRAG----------------------------KDYH 420
Query: 421 GLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVAQSSHLLSDSVEDSVSRE 480
G++SEKQE+R RNKDRPDR +WAP R D S Q S+
Sbjct: 421 TSGTISEKQERRTRNKDRPDRVMWAP--RRDGSEDQPLSSA------------------- 467
Query: 481 IITQYIFYAFTAYRGEMKEDIHVSRTGDVTTTVSGRNFKSTFYFSFNLKHVVLSLGSVRH 540
GE+K+ + R+G+V + G ++ GS RH
Sbjct: 481 -----------GNNGEVKDRMFSQRSGEVVNSSGGHTLEN---------------GSARH 467
Query: 541 GGRRGAGHVMKDDGSSNPNEGKPSKRGVVSG 557
RR G K++ EGK S+RG G
Sbjct: 541 SSRRVGGRNRKEEVVI--GEGKTSRRGSGGG 467
BLAST of CmoCh09G013210 vs. TAIR 10
Match:
AT1G33980.1 (Smg-4/UPF3 family protein)
HSP 1 Score: 322.8 bits (826), Expect = 5.6e-88
Identity = 220/571 (38.53%), Postives = 296/571 (51.84%), Query Frame = 0
Query: 1 MKDPLERTKVVIRHLPPSLPHSDLFHNIDDRFAGRYNWCYYRPGKTRFAPFLLLLLLLLL 60
MK+PL++ KVV+RHLPPSL SDL ID RFA RYNW +RPGK+ +
Sbjct: 1 MKEPLQKKKVVVRHLPPSLSQSDLLSQIDPRFADRYNWVSFRPGKSSY------------ 60
Query: 61 VGYWGGLVIRIIQKDQRYARAYIDFTRPEDVFEFAEFFDGHVFVNEKGAQFKAVVEYAPS 120
K+Q+Y+RAY+ F PEDV+EFA FF+GHVFVNEKGAQFKA+VEYAPS
Sbjct: 61 -------------KNQKYSRAYVSFKAPEDVYEFAAFFNGHVFVNEKGAQFKAIVEYAPS 120
Query: 121 QRVPRSSTKKDGREGTIYKDPDYLEFLKLIAKPAEHLPSAEIQLERKEAEQSAAAKETPI 180
QRVP+ S KKD REG+I KDPDYLEFLK+IA+P E+LPSAEIQLER+EAEQS A+K PI
Sbjct: 121 QRVPKPSDKKDPREGSISKDPDYLEFLKVIAQPVENLPSAEIQLERREAEQSGASKAAPI 180
Query: 181 VTPLMEFVRQKRAVESGTQYIL---------------KESVKNTNRRDKSNFILVPRRED 240
VTPLMEF+RQKRA G Q + K S + + R + +
Sbjct: 181 VTPLMEFIRQKRATVMGPQGLSDIRRGGRRTRVVSANKPSPRPSKRNSEKKKYVEKESSK 240
Query: 241 QLATSGGIGISDAVEKYYLSNSVFHVIEVQGNIAGFLEFEAFVSYVSVTADSGKKKILLL 300
+ +S + Y SNS E+ GN + ++ +++T DSGKKKILLL
Sbjct: 241 NVPRKTTADVSSSKPDYRQSNSSGK--ELPGNETAAI-IDSSPPGIALTMDSGKKKILLL 300
Query: 301 KGKERDISHVSDGMLQLQSATSSGTSPASASKHNQRREASGSMIRSILLNNESRHGQTSS 360
+ K+RD + Q ++ + ++ S+ NQ+ + G +I+ ILL N+SR Q+S+
Sbjct: 301 RSKDRDNPDNPPPQPE-QHIDTNLSRNSTDSRQNQKSDVGGRLIKGILLRNDSRPSQSST 360
Query: 361 GAPSHQKIQILNSDNGKRLPRSINARSGSNDMSSNEPNPSGLEGDGKRPSDGKPSKKELH 420
S Q+++ ++N KR R N R+G K+ H
Sbjct: 361 FVQSEQRVEPSEAENYKRPSRPANTRAG----------------------------KDYH 420
Query: 421 GLGSLSEKQEKRIRNKDRPDRGVWAPRSRSDASVSQLEESSVAQSSHLLSDSVEDSVSRE 480
G++SEKQE+R RNKDRPDR +WAP R D S Q S+
Sbjct: 421 TSGTISEKQERRTRNKDRPDRVMWAP--RRDGSEDQPLSSA------------------- 465
Query: 481 IITQYIFYAFTAYRGEMKEDIHVSRTGDVTTTVSGRNFKSTFYFSFNLKHVVLSLGSVRH 540
GE+K+ + R+G+V + G ++ GS RH
Sbjct: 481 -----------GNNGEVKDRMFSQRSGEVVNSSGGHTLEN---------------GSARH 465
Query: 541 GGRRGAGHVMKDDGSSNPNEGKPSKRGVVSG 557
RR G K++ EGK S+RG G
Sbjct: 541 SSRRVGGRNRKEEVVI--GEGKTSRRGSGGG 465
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9FVW4 | 7.9e-87 | 38.53 | Regulator of nonsense transcripts UPF3 OS=Arabidopsis thaliana OX=3702 GN=UPF3 P...
B0S733 | 1.4e-14 | 38.13 | Regulator of nonsense transcripts 3A OS=Danio rerio OX=7955 GN=upf3a PE=1 SV=2
Q9H1J1 | 4.4e-13 | 34.53 | Regulator of nonsense transcripts 3A OS=Homo sapiens OX=9606 GN=UPF3A PE=1 SV=1
Q10267 | 1.6e-10 | 24.88 | Nonsense-mediated mRNA decay protein 3 OS=Schizosaccharomyces pombe (strain 972 ...
Q9BZI7 | 7.8e-10 | 25.84 | Regulator of nonsense transcripts 3B OS=Homo sapiens OX=9606 GN=UPF3B PE=1 SV=1

Match Name | E-value | Identity (%) | Description
A0A6J1EI29 | 1.8e-235 | 77.09 | regulator of nonsense transcripts UPF3-like OS=Cucurbita moschata OX=3662 GN=LOC...
A0A6J1IG63 | 5.4e-232 | 76.09 | regulator of nonsense transcripts UPF3-like OS=Cucurbita maxima OX=3661 GN=LOC11...
A0A0A0KSY5 | 3.4e-210 | 70.74 | Smg4_UPF3 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G650630 P...
A0A1S3CB69 | 2.9e-209 | 70.40 | regulator of nonsense transcripts UPF3 isoform X1 OS=Cucumis melo OX=3656 GN=LOC...
A0A6J1D8M3 | 3.5e-207 | 68.73 | regulator of nonsense transcripts UPF3 isoform X1 OS=Momordica charantia OX=3673...
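For downstream filtering, the pipe-delimited summary rows above can be read into typed records and ranked by E-value. This is only a sketch under the assumption that the first four fields are always accession, E-value, percent identity and description. The `Hit` type and `parse_row` helper are hypothetical names, and any trailing fields are ignored.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    accession: str
    evalue: float
    identity: float  # percent identity
    description: str


def parse_row(row: str) -> Hit:
    """Parse one summary row; fields beyond the first four are ignored."""
    fields = [field.strip() for field in row.split("|")]
    return Hit(fields[0], float(fields[1]), float(fields[2]), fields[3])


# Three rows copied from the tables above (descriptions left as displayed).
rows = [
    "Q9FVW4 | 7.9e-87 | 38.53 | Regulator of nonsense transcripts UPF3 OS=Arabidopsis thaliana OX=3702 GN=UPF3 P...",
    "A0A6J1EI29 | 1.8e-235 | 77.09 | regulator of nonsense transcripts UPF3-like OS=Cucurbita moschata OX=3662 GN=LOC...",
    "B0S733 | 1.4e-14 | 38.13 | Regulator of nonsense transcripts 3A OS=Danio rerio OX=7955 GN=upf3a PE=1 SV=2",
]

hits = sorted((parse_row(r) for r in rows), key=lambda h: h.evalue)
for hit in hits:
    print(hit.accession, hit.evalue, hit.identity)
```

Sorting by E-value puts the most significant match first. Note that E-values from searches against different databases are not directly comparable, so ranking is most meaningful within a single table.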