Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
TAGCCGACAATTAGCTTTGAGAAGAATCGGCGCCATTGCTTAAAACGCGGGGAATGAAATGAAACCCTGAGAATCATTTTCATCACGGCGTTCGTCTCAGCCGTCGCGTTTTGCTTTGGATAGCTCAGGAAGTTCTGCTATCGCGATTTACCGCCGTCGAATATGTTCTATTGGACTACTGTTTCCATATTTTCTGATTCTTTGTAATCTATAGAACCTGATTTTAGCGGCTGAACTACTTCGATTCGGTTCTACATCTATTATTTCTAAATCGTCGCTGCGGAGCGTTTTCAGCCGGCGATTGCTGCGAATTTGACTATATACTTATTTTCAGTTACTGTTTATGTTCAACCATGGTGGTTGGGAGCAAACTCTTCAAAACGAAGTTATGCGTTCTTTACCAGAAAGGCCGCTGTTCTCGGCCAAGTTGTTCGTTCGCGCATGGAACCGCTGAACTCAGGCGATTTGCTGGTGCTCACGACGGTAATTTGCTCCTTATATCTTTAATCTTTCAATGTGTTCTCTAATTTTGGGATTTAGGTTGATGAAAATGGAAGTTTATAGTAGTTAGAGGCTACATGTGTCTGCAAGTTTGGTTATATGTTGTTAACAGTGGTATAGTGGTGAGATGATGTTGTTGAGTTTAGTTGTTTGGGTATGAGTTTGATTCAAGGGATCTCTGTGTTTAATCCCTGGGGAGAGGAATAGAGTAGTTTAGGATTAATGGTGCTTCTTTCTTCCATTATGATCGATCGATTTGTTATAATTAATGTTGAGGGCTTGATTTGCAGCATTCTATTTTTTAGTCCTATTAATGTCTGGTCTTCATATTCTTTCTTCTTTCTTTCTGTTCCTGTTAAGTACCGTTTTCTAAGGTGTATGTTTTTTTTTCTTAATTATACGTTCATATTTCCCGATTAGAGGCGCTTCTTTAGATTAATAAGTAGGTTCTTTCAGTTAATGATCTGAAAGTATTCTGTCTCCTGTGTATTTATAGCTATACGAACACAATCTTCCTCATCTATGGCTCTATGGTTTGGGTATCTTGTTGCTTGCTATCTTCAATATTTTAATTGGTTCAGCACGAGTTGTCACATGAGATTCCACGCCTCTACCAATGGATATTTATTATAGAAGGAAGTCGAATGATTCCGAATAAAAATTTCTTCAAGTTGCTGGCCACTGAAGTGCTAGTAAAAAATAATCATCGGGATCAGAGTTTTTTTTTCATTGGAATGGATATGGTGGATTTTTCTCTATAGAGCTGCCTGTTATATATTCCTGGAGATGTTTATGGCCCTTGTGGTGCATCAACTAAACCAGAAAGTACTTGTTGAAGAGTATAATTGTTTTCCCTAATTTATATATCTCCCACGGCTTTTGTTAATTATTCCATTATGGTTTTCTCGGTTATGCTACTGTTGTTCACGGGGCTTCATTTATCTTAGAAGAGGTTCTTTGTTGAAGAGTATAATTGTTTTCCCTAATTTATATATCTCCCACGGCTTTTGTTAATTATTCCGTTATGGTTTTCTCGGTTAAGCCACCGTTGTTCACGGGGCTTCATTGATCTTAGAAGAGGTTCTTCATTGGTTTATGCCTCCTGAGTATGCCAAATGGACATCTTGTTGGAGTGATGCAATTTTTATGGGAGTATTGCATGAAGAATTGAAATTGAATCGATATCATTGGAATGTACTTGTATCATTATGTGGAATCCCGGAGATATCTCCCATTTAGATAAAAGTTATGTGAACTTATTGTTCATCGAGGACACAGGAGTTAATCCTTCGTTCTATTCTGAATATGAAGGTTATCCAAAGGGTTTTATCCCAGGCATGGATTTTTCTATATCTTTTCATTATTTCTATGGCCCTTTTGGAACTGGCATGGCGGCACTATTACTGTTCCTTATTTTGTTTCTCATCTTTTCCATGTAATGGTAAGATAATCCTTGTAAGGACTCCATTGCTTCATTAACTTTTTTAGAAAGAATATCT
GGCGATAACTAAACTTGTTAAAATAAAATTTCCCCATCTTTGCAGGTAGACGGGAATATAGAGGAAATGATTTAAGGCATAAGCTTGACAGCAGGCATTCTCCTTTGCAGGAAAGGGATTCCAGGGGGCGACATGGACCTCGTGGTATTAAAGTTATATGTTTTTCTTCGTGTGTGTGTATGTGTGTGTCTCCTTCCCCCTCCACCAAGTAGTCTTTTCTCGATGTCACCTATGCTGCGATCCCTAATTTATATGATGCTTTCTTTTTACATTTTCTAATTAAGGAATGTAGGAAGCATGATTCTTAACATACTGATGCAATTATATTTTAAGAGAATATGTGGACTGCACAATTCTTTTACATGTACTAATAGCAAAGTTCCTTTCTGATTTCTCTCATGTGGAATTGGAATTGCAGATTATAGCTCATCCTGGTCACTTGAACGACAAAGGTAAATTGTGACAATCTTTCTTTTTAAAAAATTGTATTCTCATGTTATTTTATCACTGGAGGCCGTGCTGGCTCTCAATTTATGATCCCCATCTTATATGTTTATGTTGTAGCAGTGATACTTGATTGCCTTTTACTATTAGATTGATACTAATAGTGTGAAAGATAATAAGAAAATTGATCTGCAGTCAGATTATATTATCTAAGAATTCAAGTGAAGGAAAGGTAGGAATGGATCCTGTGATGGAAATATCTATTGAGGGATCAAATTTTATTTGAAGCTCTCACAAGGAATGTTTATGACATTTTCCTCATGGGAAAATAGCCATGTATATTGATGACTTCCTTCTTACTTGAGTATTTCCAGTCCTTTTTTTTTTTGTTTTGTTAAGAGAGTACTTCCAGTGCCTGATTCCATTATTTCCTGAGCCATGCTTGAACTTTACAATTTCTTTTACTTCTTGGTTTGAAGATCTTATGTGGCATATTAGTATGGCCTGATTTCATTCTTCTACACCCAATTTTTTTTTTCATGAGTACTGGAACAAAAAATTGCTTCGTAGTAATTTTCTGGTGAATATCTTGAAATGGTACACATTCTTGTCTCAGTGATCGTAAAAGAAGGAAAAAAGAATATGGGGATTCCCGTAATGATTATTCTGGGAATTTGAGGATTTCAGATAGGAGTGAAGAACGAGATAGAGAAGGGAAAATCTCATCTGCTTCTAGAGACACTCTTGAGGGGCAGGTAAACAAAACGATGGTTTCTAACTTTTATCTTCTATTTAAAAAACAGTTACGCTCTGGAAATCTTGTTTGATCGTCTTTGAATCACACAGTTAAACAAGATGCAGGCAGATATTGAGATGGCTGAGCACCACAAACACCAAGCTGAGGTAAGTATTGTCTCTTTATTTTCTGATTCCTGCCTCAAATTCTGTGCAGGCAGTAGGCAATAGGCAATACATATTTCCTTTTCTCTATCCATGCAATAGGCAATACAATCTAACTCTTTTTAAAAATTTCCTTTATGTATTTTCCCAAATGTATTCTTGGGAGTGGTTTGAGATGTATGGTCCCTTGTTAGGTTCTATGTTTCTATCTAGGCTTCAGTGATGTGACCTTTTTGTAATTACCCTCTAGGTCTTATTTTACTGGATTGGAGTCCGTCCTTGTAGTTTGGCTCCCGTGTGTGGGCTTTATTTTTGTATTTCTTGTATTCTTTTATTTTTTCCTCGATGAAAATTCTTTTATTCCTAAAAAAAAGAAAAGAAAAGGTCCACTCATTTTTAATGAAGTTTTAAGCTTCAGTTTTTGTTTTTATTGGACAACAAATTGGCATGAGAGCAAATTAATAGGGATTGGATCCGAAACTTAGGCAAGAGTAGCTGAAGTTTATGAGACGGTGAAAGGGTGGAAGATGATTTAGCATCAATGAAGTCAGTGAAGAGCCATTAGTCTGACCGCTAAACCTTTTTTTTTTTTTTTTTTTTTTTTTTTAACAAGAAATAGAACTTTTAATTGATTGAATGAGAAGATTAATGCTCAT
AAGACGTGAACTCCAAAATTTTTCATTGTAATTCGCTTTTGGTGGACTAATGCTCAAGGGCCCAAAAGCTAACTACAAAAAAGGCTCCAATTTGTAAGGACAAATCCAAGACCATAAATACAAAAAGGTCAAGTGACTGATACCCACAAGGAAGCATTAAACCTCACAAGCTCCTGAACCTCCTCACTCGACTTCTCTTCGTCTCTAAAAATTCTATTAGGAGAATGCAAGAACACCTTTTTCTACCACCAAAAGAACTCATCCACCAATCCCACATGAAGTTAGCAAACTGGCCCTCTCACAGTAAATGGGACCAATGCTCTTCATTCCTCCTACAGAGAAGGCACCATTGCAGATAAAATATTGAGGTAGAGTGCCTTTAGATGTGATTTTGAGTGTTAACTTCCCCAAGTAAAACTTGGAACGCTAAAAAGTTCACCTTCTTAGAATATTAACCTTCAAGAAAGAAGCAAAAATAGAAGCCTCAGGATCAGTTAACTAGGCAAGTGAGGAAGACAAAGAACAAAAAAAGGAATGACTAGAAAAAGCACCTAAAAGGTTTGGGGTTTAGACTCTAGTACCTCTCCTCCCCAAAATAATAACCTATTCTCAAAGGGTATAAAGAAGGCCACCTTATCAATTGCCTCCATACTAATTAGAGAATGATAAAACCCTAAGGAGATCTAACAAGAGAGATTATGAGAAGACAACACTAATGCCATTGAATGCAATCTCTTATTTGAAAGGAAATAGAGACGAGGAAAAAGGATGCACAAAGATCCTCTAAAAAAAAAATAAACTCTTGAAGTTTTCCCACATTGCATTGACGTTTGAAATTTACAAAAGGGATGCATTATCCATGATTTTGACAAAAAGTCCTTCCCAATTTACAAAGAGGGAGGTACGACTATAGGAAGCAAAGATATTAGACAATTGACACCAAGATATAATGTGGTAAACAATATTGTTGAAGAGTTTTGTATAAGTTTGTGTCTTTTCTGTGAATACTCTTTGATTTCCTTCTTTCCACATATTCCAAAAGAAACCAATGATGAGGTTCTTCCATAGTATGGTTTTTGTATTCTTGAAGGGGTGGTACGTTATGGCCGTGTCCAATAAATTCTTTACCTTCCTAGGAAAAGTGAGATGCCATCCGAAGGTATTAAGAATTTTTGTCCAAAAATTTTAAGCTTACGTGCAATGCATAAATAAATGCATTCGTTATTCGTTTTCTTTCTTACCTAATAGGCACCAATTTGGGAATAGAGTCATGTTTTTAAAGATTTTCAGTTGTAATAATGGCTTTATGCACTATTTCCCAAAGGAAATATTTTAACTTTTTAAGATGATACTTTTTAAGATGATTTCCTTTCCATATTATTTTTGCTAGCACAAGATTTATTGCTTCTACTTTTTTCCCTGTCTGTCATCAAAGATTTTGTAGAGGACTTTGTCAGTGATGGGAAGTCATGTCAATGAGTCTGCTTCGTTAGACCATACAACTGGGGCAAGCTTGAGGCTTAATTTGGCTCATTTTGTAGTGTCGTTATCCTTTAGGTTTCTACCAAGCTTCAGGTCCCAAAAATGGTTGACATTATTCCATGTTTCTTTCATCATTCCATGTGGTGGTTAGTGTTGATACCAGCTTATCGCTTTGGTCGGCCTGTAGGTGACTGAGTGAGCTTGTTGGTCAGTATGGGGTTGAAATAATCAGCCTAGGGTCGGGGATTTGTGGGGTAAGTAGGGTATGGGAACAGTTATGACTGAACTAAGCATCTTGAATGTATAATAAACATTTGTATTTTCCTCTTTCACAGTAATGTAGATTCATTCCTAACGGAGATTATTTTAATGTCTACTGGTTGCAAACTTTGCAACCTGGTTGACCCAAAATTTGCCCCTAGAGTGCGTAGTGGTCACTTAATTCTTATCATATGGTTGAAACTGGAACTCCCTTAACTGGGAAGGCGCTTTTATGCTGTTTCAAGTGTCATATGCTA
TTGATTGGTATTTTGACTGAGCAATATTTTCAAGATAATGTATATTTCATTCGCTCGGTGAAATGATAAATATATATTGAAAATATTAGCAAATTGTACTTCATTTTTGAAGGTCTATCTGGATGAGAGGATCCAAGAAGTGGATAGTTTAACTTCTAGAATTCAGGAGCTAGAATCCCAATTATATAAAGAGAGGGAGGATTCTAGAAGGTACTCATAATCGTCTTTTAAGTACACGTTGACATGTTGCCTGCTGAAAGTGGACACCTACCTCTTATTCCTATTTTTCTTGGATCTTTAATATTCAGGATCAAATCAAAAATCAAGAAATTTGTCAAAGCACATAATCGTTACTCAAGGATACAAGATGAACTGAAGCGGCAAGTATACTTGTCCAATTCTTTTGGTTGCGCTTTCTAATGCTTATGACTTTCTGAAAATTATGATAACTTTTCTGAACTTGTAGTTCGCAAGTCCGACTTGAACAGTTGGGGGATCAGCTGGGGTCGGATGTTAATAAAATTGGTGCCAACGAAGAAGACTCGAGCATTAATATTGTGAGCGATGGAGAGGACCCTGGTTTTCATGCAGTTAGCCCTCTTCATGACCTGCAAAAAGACAATTCTGCAAGCAAGAAGAAGCATATTGTTCAAGATATTGCGGAAAACTCAAAACGAGGTGCCTTTTGAGAACTAACTTCTTTACATTGATGTGTCATTTAAAAAAAAAATCTTTTATTTGGTTTGAAATCTCTTAATATGCCTTCAGCCCTAGTGGGCTGTGATAATTTGTAATTATGATGTTTTTCCGTCTTGGACCTTTATTTCCTTCATACTTGGAATAAAATTTGTGCATGCTACATTAGCTGATTTAAACAAAGGAAGTAAAGAGGCAGTGGGCAGATTGAGAAGGTTCTCTCGGTGGAATGCCCATCCTTCTCAATCGGTATATAGCAAAATTGAGGCAGTTGGCAATGAAGTCAATGCTTTGATACCTACAGCAAATGACAGCAAGCAGAAAAGAGGAAGGACATCTACTGCTGTTTCTTCTGCGGACAAGGTGACTTTGATATTTTTATATTCAATTTTATTTGTTTGTACAATGTACAACGTGCTGTGCATTTGAGGTTTGGATGTCTGATTTGTTATCGACATATGGAACACGAAGCCTATAGGTTTTGGCAAGCATGAAGTGTGCTACTCTCTAGCTATATAATTCATCTTTTCGTAGTTCTTGTTGTTTTCAGGACCTAGGATAACATTCTCCCTCTGTCTTGCAAGATTAATATGAAGCTTTCCTTGTATGCTTTGCAAATGACATCGTTAAATTCTTTGTACGTCTATGCACTCAATTTCTCAACTCAAATCTGCGCAGGTTAGGGGTCTGGAATCAGGTGTTGTGCCATTAACCAGCATGGCTGCCCATGCAGTTGATGAAGAAGTTGATATTGAATTGGACGTCAACAATAAAGTGAACGAAACGAGGGAAAATACTAAGGAAGCTTCGTTCGGTAGCCTGCCCTTTCCGCCCCCTCCACCTCCAATCCGTGAAATTAACCATTCAAGGGTTAGCAAGTTCTCTGAATTTCACGACTTTCACTTTTTTCGTTTAATCCATATACAATTGGAACTTTTCTACTTTGTTTCTTCTATAAGTTTGAACCTAGAAAGGTAGATATGAGATCCCACATTGATTATCGATTAGAGAAGGGAATGAGTGCCAGTGTTTGAAGGGAGGTGGATTGTGAGATCCCACATCGGCTGGAGAGGAGAACGAAGCAGGGTGTGGAAACCTCTTCCTATCAGACATATTTTAAAAACCTTGAGGGAAACCCCTGCTAGCAGTGGGCTTGAGCTGTTGGAGAACGAAGCACGGTGTGGAAACCTCTTCCTATCAGACATATTTTAAAAACCTTGAGGGAAATCCCTGCTAGCGGTGGGCTTGAGCTGTTACCGTAGAAAGTCGAGAGTGTTATTCAATATGGTTTTATTTCTTTAACA
AAGAATAAAGCTATTGTCACCTCTTACTTGTAGCATCTAACATAATATTTGATTTGCAGTATGAAAGTGAAGATCAAAATGTAGATGTGGTGGCGCTCGATGACGAAAGGGCACGCTTGGACAATGCCTAACATTGAATCCTGTTGTAGAGATTAAATGAATTCTTAGCAAAGTAATTAGTTTTAGTTTCATAGAGTTGCGCACAGGTTAGGGTTTTATATGTTCTTGTTTAATACACTTGTAATTTCTATATGTTCTTGTTTAATACACTTGTAATTTCGAGTTTGTGATTTAGCATATCAGATACCGATGACCTCTCCATTCTATTTTTTCATGGTAGATTTTGGTTGCTTTTTTTGAATTATCATGGCGAAGATATTCTCTCTGAACTAGTACAAGCATTTCAATGCACTAATTTACTTTGTGATTCGTCACAAATTTC
mRNA sequence
TAGCCGACAATTAGCTTTGAGAAGAATCGGCGCCATTGCTTAAAACGCGGGGAATGAAATGAAACCCTGAGAATCATTTTCATCACGGCGTTCGTCTCAGCCGTCGCGTTTTGCTTTGGATAGCTCAGGAAGTTCTGCTATCGCGATTTACCGCCGTCGAATATGTTCTATTGGACTACTGTTTCCATATTTTCTGATTCTTTGTAATCTATAGAACCTGATTTTAGCGGCTGAACTACTTCGATTCGGTTCTACATCTATTATTTCTAAATCGTCGCTGCGGAGCGTTTTCAGCCGGCGATTGCTGCGAATTTGACTATATACTTATTTTCAGTTACTGTTTATGTTCAACCATGGTGGTTGGGAGCAAACTCTTCAAAACGAAGTTATGCGTTCTTTACCAGAAAGGCCGCTGTTCTCGGCCAAGTTGTTCGTTCGCGCATGGAACCGCTGAACTCAGGCGATTTGCTGGTGCTCACGACGGTAGACGGGAATATAGAGGAAATGATTTAAGGCATAAGCTTGACAGCAGGCATTCTCCTTTGCAGGAAAGGGATTCCAGGGGGCGACATGGACCTCGTGATTATAGCTCATCCTGGTCACTTGAACGACAAAGTGATCGTAAAAGAAGGAAAAAAGAATATGGGGATTCCCGTAATGATTATTCTGGGAATTTGAGGATTTCAGATAGGAGTGAAGAACGAGATAGAGAAGGGAAAATCTCATCTGCTTCTAGAGACACTCTTGAGGGGCAGTTAAACAAGATGCAGGCAGATATTGAGATGGCTGAGCACCACAAACACCAAGCTGAGGTCTATCTGGATGAGAGGATCCAAGAAGTGGATAGTTTAACTTCTAGAATTCAGGAGCTAGAATCCCAATTATATAAAGAGAGGGAGGATTCTAGAAGGATCAAATCAAAAATCAAGAAATTTGTCAAAGCACATAATCGTTACTCAAGGATACAAGATGAACTGAAGCGGCAAGTATACTTTTCGCAAGTCCGACTTGAACAGTTGGGGGATCAGCTGGGGTCGGATGTTAATAAAATTGGTGCCAACGAAGAAGACTCGAGCATTAATATTGTGAGCGATGGAGAGGACCCTGGTTTTCATGCAGTTAGCCCTCTTCATGACCTGCAAAAAGACAATTCTGCAAGCAAGAAGAAGCATATTGTTCAAGATATTGCGGAAAACTCAAAACGAGCTGATTTAAACAAAGGAAGTAAAGAGGCAGTGGGCAGATTGAGAAGGTTCTCTCGGTGGAATGCCCATCCTTCTCAATCGGTATATAGCAAAATTGAGGCAGTTGGCAATGAAGTCAATGCTTTGATACCTACAGCAAATGACAGCAAGCAGAAAAGAGGAAGGACATCTACTGCTGTTTCTTCTGCGGACAAGGTTAGGGGTCTGGAATCAGGTGTTGTGCCATTAACCAGCATGGCTGCCCATGCAGTTGATGAAGAAGTTGATATTGAATTGGACGTCAACAATAAAGTGAACGAAACGAGGGAAAATACTAAGGAAGCTTCGTTCGGTAGCCTGCCCTTTCCGCCCCCTCCACCTCCAATCCGTGAAATTAACCATTCAAGGTATGAAAGTGAAGATCAAAATGTAGATGTGGTGGCGCTCGATGACGAAAGGGCACGCTTGGACAATGCCTAACATTGAATCCTGTTGTAGAGATTAAATGAATTCTTAGCAAAGTAATTAGTTTTAGTTTCATAGAGTTGCGCACAGGTTAGGGTTTTATATGTTCTTGTTTAATACACTTGTAATTTCTATATGTTCTTGTTTAATACACTTGTAATTTCGAGTTTGTGATTTAGCATATCAGATACCGATGACCTCTCCATTCTATTTTTTCATGGTAGATTTTGGTTGCTTTTTTTGAATTATCATGGCGAAGATATTCTCTCTGAACTAGTACAAGCATTTCAATGCACTAATTTACTTTGTGATTCGTCACAAATTTC
Coding sequence (CDS)
ATGGTGGTTGGGAGCAAACTCTTCAAAACGAAGTTATGCGTTCTTTACCAGAAAGGCCGCTGTTCTCGGCCAAGTTGTTCGTTCGCGCATGGAACCGCTGAACTCAGGCGATTTGCTGGTGCTCACGACGGTAGACGGGAATATAGAGGAAATGATTTAAGGCATAAGCTTGACAGCAGGCATTCTCCTTTGCAGGAAAGGGATTCCAGGGGGCGACATGGACCTCGTGATTATAGCTCATCCTGGTCACTTGAACGACAAAGTGATCGTAAAAGAAGGAAAAAAGAATATGGGGATTCCCGTAATGATTATTCTGGGAATTTGAGGATTTCAGATAGGAGTGAAGAACGAGATAGAGAAGGGAAAATCTCATCTGCTTCTAGAGACACTCTTGAGGGGCAGTTAAACAAGATGCAGGCAGATATTGAGATGGCTGAGCACCACAAACACCAAGCTGAGGTCTATCTGGATGAGAGGATCCAAGAAGTGGATAGTTTAACTTCTAGAATTCAGGAGCTAGAATCCCAATTATATAAAGAGAGGGAGGATTCTAGAAGGATCAAATCAAAAATCAAGAAATTTGTCAAAGCACATAATCGTTACTCAAGGATACAAGATGAACTGAAGCGGCAAGTATACTTTTCGCAAGTCCGACTTGAACAGTTGGGGGATCAGCTGGGGTCGGATGTTAATAAAATTGGTGCCAACGAAGAAGACTCGAGCATTAATATTGTGAGCGATGGAGAGGACCCTGGTTTTCATGCAGTTAGCCCTCTTCATGACCTGCAAAAAGACAATTCTGCAAGCAAGAAGAAGCATATTGTTCAAGATATTGCGGAAAACTCAAAACGAGCTGATTTAAACAAAGGAAGTAAAGAGGCAGTGGGCAGATTGAGAAGGTTCTCTCGGTGGAATGCCCATCCTTCTCAATCGGTATATAGCAAAATTGAGGCAGTTGGCAATGAAGTCAATGCTTTGATACCTACAGCAAATGACAGCAAGCAGAAAAGAGGAAGGACATCTACTGCTGTTTCTTCTGCGGACAAGGTTAGGGGTCTGGAATCAGGTGTTGTGCCATTAACCAGCATGGCTGCCCATGCAGTTGATGAAGAAGTTGATATTGAATTGGACGTCAACAATAAAGTGAACGAAACGAGGGAAAATACTAAGGAAGCTTCGTTCGGTAGCCTGCCCTTTCCGCCCCCTCCACCTCCAATCCGTGAAATTAACCATTCAAGGTATGAAAGTGAAGATCAAAATGTAGATGTGGTGGCGCTCGATGACGAAAGGGCACGCTTGGACAATGCCTAA
Protein sequence
MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQLGDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENSKRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTSTAVSSADKVRGLESGVVPLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPPPPPIREINHSRYESEDQNVDVVALDDERARLDNA
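The relationship between the coding sequence and the protein sequence listed above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the variables `cds_start` and `cds_end` are illustrative and not part of the database record. Only the first 30 and last 12 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above (assumed in frame).
cds_start = "ATGGTGGTTGGGAGCAAACTCTTCAAAACG"
cds_end = "GACAATGCCTAA"

print(translate(cds_start))  # MVVGSKLFKT
print(translate(cds_end))    # DNA
```

The two outputs agree with the first ten residues (MVVGSKLFKT) and the last three residues (DNA) of the protein sequence; passing the full CDS string to `translate` should reproduce the whole polypeptide.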
Homology
BLAST of Cp4.1LG04g03520 vs. ExPASy Swiss-Prot
Match:
Q93XW7 (Zinc finger CCCH domain-containing protein 40 OS=Arabidopsis thaliana OX=3702 GN=At3g21810 PE=1 SV=1)
HSP 1 Score: 189.1 bits (479), Expect = 1.0e-46
Identity = 164/457 (35.89%), Positives = 239/457 (52.30%), Query Frame = 0
Query: 4 GSKLFKTKLCVLYQK-GRCSRPSCSFAHGTAELRR-----FAGAHDG-------RREYRG 63
GS ++KTKLC+L+ K G CSRP+C+FAHG AELRR F G RR
Sbjct: 3 GSSMYKTKLCILFNKTGDCSRPNCTFAHGNAELRRPGESSFTGRRHNMDSDLRDRRHNMD 62
Query: 64 NDLRHKLDSRHSPLQERDSRGRHGPR-----DYSSSWSLERQSDRKRRKKEYGDSRNDYS 123
+DLR +L + SP + R S R G R + +S E + D+ R+ D R DY+
Sbjct: 63 SDLRDRLGRQFSP-ERRPSLDRSGRRVQRFSGHDNSMPFENRRDKDYRENRRFDERRDYA 122
Query: 124 GNLRISDRSEERDREGKIS-SASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVD 183
G L++ +R E+R +G+ + LE QL ++ D++M K + E ++ + EVD
Sbjct: 123 GGLKVGNRIEDRAEDGRNKFHGYNNVLEEQLKDVEMDVKMLTDDKLRLEASVERKAHEVD 182
Query: 184 SLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQLGD 243
LTSRIQELE+QL +E+++ RRI S KKFVK +NR+ R QD+LKR S+ RL++LG+
Sbjct: 183 ILTSRIQELETQLDREKDECRRITSSSKKFVKEYNRFLRAQDDLKR----SEARLQKLGN 242
Query: 244 QLGSDVNKIGANEEDSSINIVSDGEDPGFH---AVSPLHDLQKDNSASKKKHIVQDIAEN 303
QL + + N D ++IVSD E G + A P ++LQ +S S+KKH V
Sbjct: 243 QLSTYLAGSEGNNRDVGLDIVSDEETNGRNLRTACDPHNELQNTSSLSRKKHYVDQYTTK 302
Query: 304 SKRAD--LNKGSKEAV-GRLRRFSRWNAHPSQSVYSKIEAVGNEVNAL-IPTANDSKQKR 363
D + +G +E V +R WN S+S + N+ + + ++ + KR
Sbjct: 303 EPVEDGLIGRGEEEKVENEKKRPPCWNMLSSKSYSEEESGAWNDEDTINRSSSKEDNWKR 362
Query: 364 GRTSTAVSSADKVRGLESGVVPLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGS- 423
R S S+ DK V+ TSMAA D+ V E+ E EA+ GS
Sbjct: 363 RRFSIGTSATDK-------VILSTSMAAREFDD-----------VAESEEENPEAANGSP 422
Query: 424 LPFPPPPPPIREINHSRYESEDQNVDVV----ALDDE 430
L PPPPP R+ H + + +D N DV+ A DD+
Sbjct: 423 LISLPPPPPFRDA-HVQRDEDDVNGDVMEQKKAYDDD 435
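The percentages reported on each HSP summary line are the identical and positive-scoring column counts divided by the alignment length. The sketch below, which assumes the summary-line layout shown on this page, recomputes them from the raw counts; the pattern name `HSP_STATS` and the function `hsp_percentages` are hypothetical helpers, not part of any BLAST tool.

```python
import re

# Matches the counts on an HSP summary line; tolerates the "Postives"
# spelling that some report pages use for "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 164/457 (35.89%), Positives = 239/457 (52.30%), Query Frame = 0"
print(hsp_percentages(line))  # (35.89, 52.3)
```

For the Q93XW7 hit above, 164 identical columns out of 457 aligned columns gives 35.89% identity, and 239 positive columns gives 52.30%, matching the reported values.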
BLAST of Cp4.1LG04g03520 vs. ExPASy Swiss-Prot
Match:
Q6H7U2 (Zinc finger CCCH domain-containing protein 13 OS=Oryza sativa subsp. japonica OX=39947 GN=Os02g0161200 PE=2 SV=1)
HSP 1 Score: 184.1 bits (466), Expect = 3.4e-45
Identity = 155/441 (35.15%), Positives = 237/441 (53.74%), Query Frame = 0
Query: 8 FKTKLCVLYQKGRCSRPSCSFAHGTAELRR---FAGA---HDGRREYRGNDLRHKLDSRH 67
+KTKLC L+Q+G C+R +CSFAHG ++RR GA H GRR+YR D R ++D R
Sbjct: 11 YKTKLCALWQRGNCNRDTCSFAHGHGDIRRPPSSRGAFTHHPGRRDYRAGDFRGRIDRRF 70
Query: 68 SPLQE----RDSRGRHGP----------RDYSSSWSLERQSDRKRRKKEYGDSRNDYSGN 127
SP + R+SRG H P RD S S S R+S+R+ KK D + S +
Sbjct: 71 SPRRRHSPGRESRG-HRPLYDRRPSSRERDSSYSRSPSRKSERRHEKKT-DDGETNSSRS 130
Query: 128 LRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLT 187
L +SD ++E+ ++ S ++ E QL +++ D+E K Q EV LDE+I EV ++
Sbjct: 131 LSLSDNNDEKKKDKFSSGDEKEDHEKQLKQIRLDMEALRDDKTQMEVILDEKIDEVRKIS 190
Query: 188 SRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQLGDQLG 247
S++ +LE QL +E+++ R+ SK+KKF+KAH R+ + Q+E+KR SQ R E+LGD L
Sbjct: 191 SKVNDLEVQLRREKDECHRMTSKMKKFIKAHARFLKAQEEVKR----SQARFERLGDLLA 250
Query: 248 SDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENSKRADL 307
SD+ K GANEE SS+N ED L++ + +A+KK+ I +E +K +
Sbjct: 251 SDILKRGANEEGSSVN-----ED--------LNERSPNTAATKKRSIPYSTSEEAKA--V 310
Query: 308 NKGSKEAVGRLRRFSRWNAHPSQ-SVYSKIEAVGNEVNALIPTANDSKQKRGRTSTAVSS 367
K + + R ++ + + SK + D K K G A
Sbjct: 311 KKRRERDSDTMTRSDKYRSDVTDFDKTSKGTEATKSLYLKKKLWEDEKSKLG----ANIF 370
Query: 368 ADKVRGLE-SGVVPLTSMAAHAVD---EEVDIELDVNNKVNETRENTKEASFGSLPFPPP 424
+KV+G V+P T MAAHA+D E +++E D + ++ EN + S P
Sbjct: 371 TEKVKGSPVRHVLPSTGMAAHAIDDLNEAIELE-DRHESIDALLENDADDKTRSPAIPLQ 425
BLAST of Cp4.1LG04g03520 vs. NCBI nr
Match:
XP_023530203.1 (zinc finger CCCH domain-containing protein 40-like isoform X4 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 820 bits (2117), Expect = 9.83e-299
Identity = 432/436 (99.08%), Positives = 432/436 (99.08%), Query Frame = 0
Query: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR
Sbjct: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
Query: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE
Sbjct: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
Query: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE
Sbjct: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
Query: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQLGDQLGSDVNKIGANEEDS 240
REDSRRIKSKIKKFVKAHNRYSRIQDELKR SQVRLEQLGDQLGSDVNKIGANEEDS
Sbjct: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKR----SQVRLEQLGDQLGSDVNKIGANEEDS 240
Query: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENSKRADLNKGSKEAVGRLRR 300
SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENSKRADLNKGSKEAVGRLRR
Sbjct: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENSKRADLNKGSKEAVGRLRR 300
Query: 301 FSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTSTAVSSADKVRGLESGVVPL 360
FSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTSTAVSSADKVRGLESGVVPL
Sbjct: 301 FSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTSTAVSSADKVRGLESGVVPL 360
Query: 361 TSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPPPPPIREINHSRYESEDQN 420
TSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPPPPPIREINHSRYESEDQN
Sbjct: 361 TSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPPPPPIREINHSRYESEDQN 420
Query: 421 VDVVALDDERARLDNA 436
VDVVALDDERARLDNA
Sbjct: 421 VDVVALDDERARLDNA 432
BLAST of Cp4.1LG04g03520 vs. NCBI nr
Match:
XP_022930759.1 (zinc finger CCCH domain-containing protein 40-like isoform X2 [Cucurbita moschata])
HSP 1 Score: 801 bits (2068), Expect = 2.99e-291
Identity = 423/437 (96.80%), Positives = 429/437 (98.17%), Query Frame = 0
Query: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR
Sbjct: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
Query: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE
Sbjct: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
Query: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE
Sbjct: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
Query: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQLGDQLGSDVNKIGANEEDS 240
REDSRRIKSKIKKFVKAHNRYSRIQDELKR SQVRL+QLGDQLGSDVNKIGANEEDS
Sbjct: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKR----SQVRLQQLGDQLGSDVNKIGANEEDS 240
Query: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENSKRADLNKGSKEAVGRLRR 300
SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIA NSKRADLNKGSKEAVGRLRR
Sbjct: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAGNSKRADLNKGSKEAVGRLRR 300
Query: 301 FSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTSTAVSSADKVRGLESGVV-P 360
FSRWNAHPSQSVYSKIEAVGNEVN LIPTAN+SKQKRGRTST VSSADKVRGLESGVV P
Sbjct: 301 FSRWNAHPSQSVYSKIEAVGNEVNDLIPTANESKQKRGRTSTTVSSADKVRGLESGVVVP 360
Query: 361 LTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPPPPPIREINHSRYESEDQ 420
LTSMAAHAVDEEVDIEL++N+KVNETRENTKEASFGSLPFPPPPPPIREINHS+YESEDQ
Sbjct: 361 LTSMAAHAVDEEVDIELEINSKVNETRENTKEASFGSLPFPPPPPPIREINHSKYESEDQ 420
Query: 421 NVDVVALDDERARLDNA 436
NVDVVALDDERARLDNA
Sbjct: 421 NVDVVALDDERARLDNA 433
BLAST of Cp4.1LG04g03520 vs. NCBI nr
Match:
XP_022988786.1 (zinc finger CCCH domain-containing protein 40-like isoform X1 [Cucurbita maxima])
HSP 1 Score: 798 bits (2062), Expect = 2.55e-290
Identity = 423/438 (96.58%), Positives = 428/438 (97.72%), Query Frame = 0
Query: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR
Sbjct: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
Query: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE
Sbjct: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
Query: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE
Sbjct: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
Query: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQLGDQLGSDVNKIGANEEDS 240
REDSRRIKSKIKKFVKAHNRYSRIQDELKR SQ RL+QLGDQLGSDVNKIGANEEDS
Sbjct: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKR----SQARLQQLGDQLGSDVNKIGANEEDS 240
Query: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENSKRADLNKGSKEAVGRLRR 300
SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKH+VQDIAENSKRADLNKGSKEAVGRLRR
Sbjct: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHVVQDIAENSKRADLNKGSKEAVGRLRR 300
Query: 301 FSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTSTAVSSADKVRGLESGVV-P 360
FSRWNAHPSQSVYSKIEAVGNEVN LIPTAN+SKQKRGRTST VSSADKVRGLESGVV P
Sbjct: 301 FSRWNAHPSQSVYSKIEAVGNEVNDLIPTANESKQKRGRTSTTVSSADKVRGLESGVVVP 360
Query: 361 LTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPPPPP-IREINHSRYESED 420
LTSMAAHAVDEEVDIEL++NNKVNETRENTKEASFGSLPFPPPPPP IREINHSRYESED
Sbjct: 361 LTSMAAHAVDEEVDIELEINNKVNETRENTKEASFGSLPFPPPPPPPIREINHSRYESED 420
Query: 421 QNVDVVALDDERARLDNA 436
QNVDVV LDDERARLDNA
Sbjct: 421 QNVDVVTLDDERARLDNA 434
BLAST of Cp4.1LG04g03520 vs. NCBI nr
Match:
XP_023530200.1 (zinc finger CCCH domain-containing protein 40-like isoform X3 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 732 bits (1889), Expect = 5.76e-264
Identity = 389/394 (98.73%), Positives = 390/394 (98.98%), Query Frame = 0
Query: 43 DGRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN 102
+GRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN
Sbjct: 45 NGRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN 104
Query: 103 DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE 162
DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE
Sbjct: 105 DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE 164
Query: 163 VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQL 222
VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKR SQVRLEQL
Sbjct: 165 VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKR----SQVRLEQL 224
Query: 223 GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENS 282
GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENS
Sbjct: 225 GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENS 284
Query: 283 KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTST 342
KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTST
Sbjct: 285 KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTST 344
Query: 343 AVSSADKVRGLESGVVPLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPP 402
AVSSADKVRGLESGVVPLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPP
Sbjct: 345 AVSSADKVRGLESGVVPLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPP 404
Query: 403 PPPIREINHSRYESEDQNVDVVALDDERARLDNA 436
PPPIREINHSRYESEDQNVDVVALDDERARLDNA
Sbjct: 405 PPPIREINHSRYESEDQNVDVVALDDERARLDNA 434
BLAST of Cp4.1LG04g03520 vs. NCBI nr
Match:
XP_023530198.1 (zinc finger CCCH domain-containing protein 13-like isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 732 bits (1889), Expect = 9.02e-264
Identity = 389/394 (98.73%), Positives = 390/394 (98.98%), Query Frame = 0
Query: 43 DGRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN 102
+GRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN
Sbjct: 57 NGRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN 116
Query: 103 DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE 162
DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE
Sbjct: 117 DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE 176
Query: 163 VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQL 222
VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKR SQVRLEQL
Sbjct: 177 VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKR----SQVRLEQL 236
Query: 223 GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENS 282
GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENS
Sbjct: 237 GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENS 296
Query: 283 KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTST 342
KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTST
Sbjct: 297 KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTST 356
Query: 343 AVSSADKVRGLESGVVPLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPP 402
AVSSADKVRGLESGVVPLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPP
Sbjct: 357 AVSSADKVRGLESGVVPLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPP 416
Query: 403 PPPIREINHSRYESEDQNVDVVALDDERARLDNA 436
PPPIREINHSRYESEDQNVDVVALDDERARLDNA
Sbjct: 417 PPPIREINHSRYESEDQNVDVVALDDERARLDNA 446
BLAST of Cp4.1LG04g03520 vs. ExPASy TrEMBL
Match:
A0A6J1ESE6 (zinc finger CCCH domain-containing protein 40-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111437138 PE=4 SV=1)
HSP 1 Score: 801 bits (2068), Expect = 1.45e-291
Identity = 423/437 (96.80%), Positives = 429/437 (98.17%), Query Frame = 0
Query: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR
Sbjct: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
Query: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE
Sbjct: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
Query: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE
Sbjct: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
Query: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQLGDQLGSDVNKIGANEEDS 240
REDSRRIKSKIKKFVKAHNRYSRIQDELKR SQVRL+QLGDQLGSDVNKIGANEEDS
Sbjct: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKR----SQVRLQQLGDQLGSDVNKIGANEEDS 240
Query: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENSKRADLNKGSKEAVGRLRR 300
SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIA NSKRADLNKGSKEAVGRLRR
Sbjct: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAGNSKRADLNKGSKEAVGRLRR 300
Query: 301 FSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTSTAVSSADKVRGLESGVV-P 360
FSRWNAHPSQSVYSKIEAVGNEVN LIPTAN+SKQKRGRTST VSSADKVRGLESGVV P
Sbjct: 301 FSRWNAHPSQSVYSKIEAVGNEVNDLIPTANESKQKRGRTSTTVSSADKVRGLESGVVVP 360
Query: 361 LTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPPPPPIREINHSRYESEDQ 420
LTSMAAHAVDEEVDIEL++N+KVNETRENTKEASFGSLPFPPPPPPIREINHS+YESEDQ
Sbjct: 361 LTSMAAHAVDEEVDIELEINSKVNETRENTKEASFGSLPFPPPPPPIREINHSKYESEDQ 420
Query: 421 NVDVVALDDERARLDNA 436
NVDVVALDDERARLDNA
Sbjct: 421 NVDVVALDDERARLDNA 433
BLAST of Cp4.1LG04g03520 vs. ExPASy TrEMBL
Match:
A0A6J1JI85 (zinc finger CCCH domain-containing protein 40-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111486025 PE=4 SV=1)
HSP 1 Score: 798 bits (2062), Expect = 1.23e-290
Identity = 423/438 (96.58%), Positives = 428/438 (97.72%), Query Frame = 0
Query: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR
Sbjct: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
Query: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE
Sbjct: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
Query: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE
Sbjct: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
Query: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQLGDQLGSDVNKIGANEEDS 240
REDSRRIKSKIKKFVKAHNRYSRIQDELKR SQ RL+QLGDQLGSDVNKIGANEEDS
Sbjct: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKR----SQARLQQLGDQLGSDVNKIGANEEDS 240
Query: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENSKRADLNKGSKEAVGRLRR 300
SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKH+VQDIAENSKRADLNKGSKEAVGRLRR
Sbjct: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHVVQDIAENSKRADLNKGSKEAVGRLRR 300
Query: 301 FSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTSTAVSSADKVRGLESGVV-P 360
FSRWNAHPSQSVYSKIEAVGNEVN LIPTAN+SKQKRGRTST VSSADKVRGLESGVV P
Sbjct: 301 FSRWNAHPSQSVYSKIEAVGNEVNDLIPTANESKQKRGRTSTTVSSADKVRGLESGVVVP 360
Query: 361 LTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPPPPP-IREINHSRYESED 420
LTSMAAHAVDEEVDIEL++NNKVNETRENTKEASFGSLPFPPPPPP IREINHSRYESED
Sbjct: 361 LTSMAAHAVDEEVDIELEINNKVNETRENTKEASFGSLPFPPPPPPPIREINHSRYESED 420
Query: 421 QNVDVVALDDERARLDNA 436
QNVDVV LDDERARLDNA
Sbjct: 421 QNVDVVTLDDERARLDNA 434
BLAST of Cp4.1LG04g03520 vs. ExPASy TrEMBL
Match:
A0A6J1EWA4 (zinc finger CCCH domain-containing protein 40-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111437138 PE=4 SV=1)
HSP 1 Score: 713 bits (1840), Expect = 8.39e-257
Identity = 380/395 (96.20%), Positives = 387/395 (97.97%), Query Frame = 0
Query: 43 DGRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN 102
+GRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN
Sbjct: 45 NGRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN 104
Query: 103 DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE 162
DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE
Sbjct: 105 DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE 164
Query: 163 VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQL 222
VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKR SQVRL+QL
Sbjct: 165 VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKR----SQVRLQQL 224
Query: 223 GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENS 282
GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIA NS
Sbjct: 225 GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAGNS 284
Query: 283 KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTST 342
KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVN LIPTAN+SKQKRGRTST
Sbjct: 285 KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNDLIPTANESKQKRGRTST 344
Query: 343 AVSSADKVRGLESGVV-PLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPP 402
VSSADKVRGLESGVV PLTSMAAHAVDEEVDIEL++N+KVNETRENTKEASFGSLPFPP
Sbjct: 345 TVSSADKVRGLESGVVVPLTSMAAHAVDEEVDIELEINSKVNETRENTKEASFGSLPFPP 404
Query: 403 PPPPIREINHSRYESEDQNVDVVALDDERARLDNA 436
PPPPIREINHS+YESEDQNVDVVALDDERARLDNA
Sbjct: 405 PPPPIREINHSKYESEDQNVDVVALDDERARLDNA 435
BLAST of Cp4.1LG04g03520 vs. ExPASy TrEMBL
Match:
A0A6J1JMK2 (zinc finger CCCH domain-containing protein 40-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111486025 PE=4 SV=1)
HSP 1 Score: 711 bits (1834), Expect = 3.27e-256
Identity = 380/396 (95.96%), Positives = 386/396 (97.47%), Query Frame = 0
Query: 43 DGRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN 102
+GRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN
Sbjct: 24 NGRREYRGNDLRHKLDSRHSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRN 83
Query: 103 DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE 162
DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE
Sbjct: 84 DYSGNLRISDRSEERDREGKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQE 143
Query: 163 VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQL 222
VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKR SQ RL+QL
Sbjct: 144 VDSLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKR----SQARLQQL 203
Query: 223 GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENS 282
GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKH+VQDIAENS
Sbjct: 204 GDQLGSDVNKIGANEEDSSINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHVVQDIAENS 263
Query: 283 KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTST 342
KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVN LIPTAN+SKQKRGRTST
Sbjct: 264 KRADLNKGSKEAVGRLRRFSRWNAHPSQSVYSKIEAVGNEVNDLIPTANESKQKRGRTST 323
Query: 343 AVSSADKVRGLESGVV-PLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPP 402
VSSADKVRGLESGVV PLTSMAAHAVDEEVDIEL++NNKVNETRENTKEASFGSLPFPP
Sbjct: 324 TVSSADKVRGLESGVVVPLTSMAAHAVDEEVDIELEINNKVNETRENTKEASFGSLPFPP 383
Query: 403 PPPP-IREINHSRYESEDQNVDVVALDDERARLDNA 436
PPPP IREINHSRYESEDQNVDVV LDDERARLDNA
Sbjct: 384 PPPPPIREINHSRYESEDQNVDVVTLDDERARLDNA 415
BLAST of Cp4.1LG04g03520 vs. ExPASy TrEMBL
Match:
A0A6J1JKJ8 (zinc finger CCCH domain-containing protein 13-like isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111486025 PE=4 SV=1)
HSP 1 Score: 690 bits (1781), Expect = 1.46e-248
Identity = 379/438 (86.53%), Positives = 384/438 (87.67%), Query Frame = 0
Query: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR
Sbjct: 1 MVVGSKLFKTKLCVLYQKGRCSRPSCSFAHGTAELRRFAGAHDGRREYRGNDLRHKLDSR 60
Query: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE
Sbjct: 61 HSPLQERDSRGRHGPRDYSSSWSLERQSDRKRRKKEYGDSRNDYSGNLRISDRSEERDRE 120
Query: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE
Sbjct: 121 GKISSASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVDSLTSRIQELESQLYKE 180
Query: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQLGDQLGSDVNKIGANEEDS 240
REDSRRIKSKIKKFVKAHNRYSRIQDELKR SQ RL+QLGDQLGSDVNKIGANEEDS
Sbjct: 181 REDSRRIKSKIKKFVKAHNRYSRIQDELKR----SQARLQQLGDQLGSDVNKIGANEEDS 240
Query: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHIVQDIAENSKRADLNKGSKEAVGRLRR 300
SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKH+VQDIAENSKRA
Sbjct: 241 SINIVSDGEDPGFHAVSPLHDLQKDNSASKKKHVVQDIAENSKRA--------------- 300
Query: 301 FSRWNAHPSQSVYSKIEAVGNEVNALIPTANDSKQKRGRTSTAVSSADKVRGLESGVV-P 360
N+SKQKRGRTST VSSADKVRGLESGVV P
Sbjct: 301 ------------------------------NESKQKRGRTSTTVSSADKVRGLESGVVVP 360
Query: 361 LTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGSLPFPPPPPP-IREINHSRYESED 420
LTSMAAHAVDEEVDIEL++NNKVNETRENTKEASFGSLPFPPPPPP IREINHSRYESED
Sbjct: 361 LTSMAAHAVDEEVDIELEINNKVNETRENTKEASFGSLPFPPPPPPPIREINHSRYESED 389
Query: 421 QNVDVVALDDERARLDNA 436
QNVDVV LDDERARLDNA
Sbjct: 421 QNVDVVTLDDERARLDNA 389
BLAST of Cp4.1LG04g03520 vs. TAIR 10
Match:
AT3G21810.1 (Zinc finger C-x8-C-x5-C-x3-H type family protein)
HSP 1 Score: 189.1 bits (479), Expect = 7.4e-48
Identity = 164/457 (35.89%), Positives = 239/457 (52.30%), Query Frame = 0
Query: 4 GSKLFKTKLCVLYQK-GRCSRPSCSFAHGTAELRR-----FAGAHDG-------RREYRG 63
GS ++KTKLC+L+ K G CSRP+C+FAHG AELRR F G RR
Sbjct: 3 GSSMYKTKLCILFNKTGDCSRPNCTFAHGNAELRRPGESSFTGRRHNMDSDLRDRRHNMD 62
Query: 64 NDLRHKLDSRHSPLQERDSRGRHGPR-----DYSSSWSLERQSDRKRRKKEYGDSRNDYS 123
+DLR +L + SP + R S R G R + +S E + D+ R+ D R DY+
Sbjct: 63 SDLRDRLGRQFSP-ERRPSLDRSGRRVQRFSGHDNSMPFENRRDKDYRENRRFDERRDYA 122
Query: 124 GNLRISDRSEERDREGKIS-SASRDTLEGQLNKMQADIEMAEHHKHQAEVYLDERIQEVD 183
G L++ +R E+R +G+ + LE QL ++ D++M K + E ++ + EVD
Sbjct: 123 GGLKVGNRIEDRAEDGRNKFHGYNNVLEEQLKDVEMDVKMLTDDKLRLEASVERKAHEVD 182
Query: 184 SLTSRIQELESQLYKEREDSRRIKSKIKKFVKAHNRYSRIQDELKRQVYFSQVRLEQLGD 243
LTSRIQELE+QL +E+++ RRI S KKFVK +NR+ R QD+LKR S+ RL++LG+
Sbjct: 183 ILTSRIQELETQLDREKDECRRITSSSKKFVKEYNRFLRAQDDLKR----SEARLQKLGN 242
Query: 244 QLGSDVNKIGANEEDSSINIVSDGEDPGFH---AVSPLHDLQKDNSASKKKHIVQDIAEN 303
QL + + N D ++IVSD E G + A P ++LQ +S S+KKH V
Sbjct: 243 QLSTYLAGSEGNNRDVGLDIVSDEETNGRNLRTACDPHNELQNTSSLSRKKHYVDQYTTK 302
Query: 304 SKRAD--LNKGSKEAV-GRLRRFSRWNAHPSQSVYSKIEAVGNEVNAL-IPTANDSKQKR 363
D + +G +E V +R WN S+S + N+ + + ++ + KR
Sbjct: 303 EPVEDGLIGRGEEEKVENEKKRPPCWNMLSSKSYSEEESGAWNDEDTINRSSSKEDNWKR 362
Query: 364 GRTSTAVSSADKVRGLESGVVPLTSMAAHAVDEEVDIELDVNNKVNETRENTKEASFGS- 423
R S S+ DK V+ TSMAA D+ V E+ E EA+ GS
Sbjct: 363 RRFSIGTSATDK-------VILSTSMAAREFDD-----------VAESEEENPEAANGSP 422
Query: 424 LPFPPPPPPIREINHSRYESEDQNVDVV----ALDDE 430
L PPPPP R+ H + + +D N DV+ A DD+
Sbjct: 423 LISLPPPPPFRDA-HVQRDEDDVNGDVMEQKKAYDDD 435
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q93XW7 | 1.0e-46 | 35.89 | Zinc finger CCCH domain-containing protein 40 OS=Arabidopsis thaliana OX=3702 GN...
Q6H7U2 | 3.4e-45 | 35.15 | Zinc finger CCCH domain-containing protein 13 OS=Oryza sativa subsp. japonica OX...
Match Name | E-value | Identity (%) | Description
XP_023530203.1 | 9.83e-299 | 99.08 | zinc finger CCCH domain-containing protein 40-like isoform X4 [Cucurbita pepo su...
XP_022930759.1 | 2.99e-291 | 96.80 | zinc finger CCCH domain-containing protein 40-like isoform X2 [Cucurbita moschat...
XP_022988786.1 | 2.55e-290 | 96.58 | zinc finger CCCH domain-containing protein 40-like isoform X1 [Cucurbita maxima]
XP_023530200.1 | 5.76e-264 | 98.73 | zinc finger CCCH domain-containing protein 40-like isoform X3 [Cucurbita pepo su...
XP_023530198.1 | 9.02e-264 | 98.73 | zinc finger CCCH domain-containing protein 13-like isoform X1 [Cucurbita pepo su...
Match Name | E-value | Identity (%) | Description
A0A6J1ESE6 | 1.45e-291 | 96.80 | zinc finger CCCH domain-containing protein 40-like isoform X2 OS=Cucurbita mosch...
A0A6J1JI85 | 1.23e-290 | 96.58 | zinc finger CCCH domain-containing protein 40-like isoform X1 OS=Cucurbita maxim...
A0A6J1EWA4 | 8.39e-257 | 96.20 | zinc finger CCCH domain-containing protein 40-like isoform X1 OS=Cucurbita mosch...
A0A6J1JMK2 | 3.27e-256 | 95.96 | zinc finger CCCH domain-containing protein 40-like isoform X2 OS=Cucurbita maxim...
A0A6J1JKJ8 | 1.46e-248 | 86.53 | zinc finger CCCH domain-containing protein 13-like isoform X3 OS=Cucurbita maxim...
Match Name | E-value | Identity (%) | Description
AT3G21810.1 | 7.4e-48 | 35.89 | Zinc finger C-x8-C-x5-C-x3-H type family protein
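The Identity column in the summary tables appears to be the matched/aligned ratio from each hit's HSP statistics line, expressed as a percentage (for AT3G21810.1, 164/457 gives 35.89). The sketch below shows one way to recompute it from a statistics line. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced here for illustration and are not part of any BLAST tool; the pattern assumes the line layout shown in the alignments above.

```python
import re

# Matches the statistics line that follows each "HSP n Score" line.
# "Posi?tives" also accepts the misspelling "Postives" found in some reports.
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract the identity and positives counts and the reported percentages."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + repr(line))
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len)),
        "identity_pct": float(ident_pct),
        "positives": (int(pos), int(pos_len)),
        "positives_pct": float(pos_pct),
    }


def percent(matched, aligned):
    """Recompute a percentage from a matched/aligned pair, to two decimals."""
    return round(100.0 * matched / aligned, 2)


if __name__ == "__main__":
    line = "Identity = 164/457 (35.89%), Positives = 239/457 (52.30%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    # Compare the recomputed value with the percentage reported on the line.
    print(percent(*stats["identity"]), stats["identity_pct"])
```

Comparing the recomputed value with the reported one is a cheap consistency check when these reports are scraped from a web page, where counts or coordinates can be damaged during extraction.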