Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAGAATTGAGAAAAAGAAAAAAATAATGAAAGAGGAAACAAGAGGGCAGGGATCCCGGAGAGTAAACTGCAGTTGGTAGCTGACTACAAGCAACAGAAATGGCGCGAGGATCGTCTTCAAAGAAGGACGAAGCAAAAGGAGAAATCAACCCAGAGATTGCAGAGCGAAAGCGGCTCAAGAAGCTCGCATTCTCCAATCACATACTTTCAGAAACCCAGGCAAGGCCCCAGGCTTATCTGAGCCCTTCAGCCACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCACAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTTGCTCCCGTTAGTGGTGGCAAGATTGGCGAGCTCAAGGATTTATCAACCAAGAATCCCATTCTCTATCTCGATTTTCCTCAGGTTCGTCCGTTCTTCCTTTCTCTCAATCGCCTTTCTGTTTTTCACTCCTCTGAATTTCTTCAGTTTAGTGTCCGGGTTCGTCTCTTCTCCTTTAGAATAATTTTTGTAATTGTTTGTCATCTTCATCCAGTCCTAAGTATTTGATAGATAAACCCATATCACACTCGAGGTTTTTTTTCTATGTTTTTTATTTACATACTTCTTGCCTTCTGGATTCTACAAGATAGACTGAGAATAGTACTTATATATTTTTTTGTGGGGAATACAACTTGGAAGGATAGATCTCAGAAAAGACTAGGACAGTATTGGATGAGATAACAGCAACCCAATGGCTGGTTTTTTAAGGGTAAACCAATATATTTGTGGGGGTGTGCTGCTCGGGCTTTTTTTGGGGAGGTCTCGCTGGAAAAGAACTGTAGAACCTTCGTAGAGGAAGCTGCAAATTTTATTTCCACTTGGTGGTGGCTTAGCAACACCATTCTTTGTAATTGTTTAGGGTTTAGGGGTTTGTCAATTGTGCAAAATTAAAGGACAAGCTTTCCACCACAAAAGAACCAAGGCTAAATGATTTTTGTTTTGATATATAAATAATTTGGACTCTGTGAATTTTGTTTTTTAATAGACAAGGTATGATTAATATTTGAGTTTTTTGGTTGATTTGGAGTCCATGGAGCTGTTATCAACACCCATCAATCATTTATCATCATTATATAGTTATTAGAAGTTGATGAAATTAATGTACTATCTAATTAATTATTATAGTTTCTACTAAACTCATATGTCCAGTGATATTTTTTCTCATTAAATGAGATGCTTTGCTTAGTATTTGAGTGCATAGCATTAGAGCGAGTTAAATACGTGGAAATTGTTATTGTTATTGGCTTATTATTTATTTCTACCTGGAAAATGGATGCGTAAGCATATTGTTTGTATGCACAAGCTAACCTGGACACCCACAAATTATTTTCATTCCCCGAATGCCAATGTTGATTTTATGTGGTCATGAAGCATCTAGTGGTGGAATTTTTCTGATTGCTATTTTGTTTGCTTTCCAGGGGCGTATGAAGTTGTTTGGAACTATCATGTATCCGAAGAACAGATATTTAACTTTGCAGTTTTCTAGAGGTGGAAAGAATGTGACGTGTGAAGATTGTTTTGATAATATGGTTAGTCCTTTTGCTTCTGTGTTTCATTCCTTCTATCTTGGGGACTTTATTTTTGTTCATTTCATCTAGTATTTCATTAAAAGTGCAGTTATGACTTGTAAAACCCGTAGCACAATAAATGATGTTTGTAGCTGGTTTCAATTCTTAGTATGATCTGAATGCTTCTTTATGTACTGCTTCCGAAATTCATCTTTGCCGGTTTAAGCCGTCATTTTTTGTGATTTTTCTTTCAAAGATAAAGCAAAATCTATGGTGCAATACAAATTGAGGTTTTCTAAGCCTATTGTCGGAGACCTGTATTTGGGCCGATCCAAAATGATTCTATCATTTTTGTATCTGCAATTCAATATATATGAGCATAGATAGATACATAGACATATGTACTTACCCTTCTTTCTGTTAATTTTTTTAGTAAAACTTTTTATACTTAAATGGTCGATTCTTTTATTCGTGTTATTTTCCACCTTTCAGAATGAAAACAGAGAAATGCTTCATTATCATCTCAATTGCATTTTGAGCAAACAAGTCAAGAATTGGCCACAGCTTGTGAACATCTAGCAAGGTCTTGTCTTAAATGATATGTTTGCCCGATAAAAGTGTGGCAAGAACAATTATCCACCAACAGGAAAAGGAATAACAATCTGTTTAAGTTATCCTGGTGTGTTTGTTAAATGTGATTCATGAAGATAGATACTAGATGGTGTCATTTTCTGTTCCCTGCCAATATTGATCATCTTGAAATTTAAGAACATTTGTTCAGATTCACTATGTTGAATGATGTTCTTAATTTAATCTCTGTACTGTTCTTGGCACTTCTTGTTACTTAAGACATATGACCTTTAAGCATGTACATTATGAATCAACTATACTTTTCTCTGACTGCAGATTGTCTTTTCTGATGCATGGTGGATTGGAACAAAAGATGAAAATCCAGAGGAGGCGTGTCTTGATTTTCCTAAAGATTTGACCATGGTGAGTTTCTTGATAAATTTTAGGCCCTTAAAATCTATGTATTTTTTCTCTAGTACCTCTACATTTGTATGTATAATTCCTGCGAGCATCTCTTTTCTCAAGATCATTTATGAAGCAGAATAATGGAAATGCATCCCTCTAGTCATAGTCTAATTTATGTTGTTAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGACTTTGCTAACTCCCAGTGGTTGCATGTCAGGGACAATGTGGAGAATATGACTTTAACGGTGGTGCTGGTGTTACTAGTACGAGTGGGGTTGCTGGTGTTACCAGTACGAGTAAGCAGAGTGTTCAAAGGAAGGGAATCAATCCTGCTGCAGAAAATTCCTTTAAAGGAGAGCATGGAGATGATTTAGTGGGCCTTGAAGCCAGCGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCAGAAAAGTATTCAAGTATTACACTTTGCATGTACTGTTACCATGTAGTTCTTATCTTATTCTCAACAAGGCAGAAACCAATAGGGTTTCTGGTCAAATAGTATTTTGCATCAATTGTCTTATTTATCAACAATCCGTACCTTGATTGGTAGAAAAATTTGCTGCATGACCATGAGTATAGATACATATTTAACACTTGATTTATGATTTTAAAAGTTTTAACTTTAAAATTTGTTTTTCAAAGAATCCACACGTGTTTAGCTTGTCTGGGGTTTTCATAGCCATAGGTGGCCTCTGTTTCAGAGAGAGCATTCTGTGAGTTTGTGTTGCTGTCTTCTGAGAGGTTAGCTTGGGAAAAAATTTTCTTCAAGACATGGAGAACTCTTGAAACGAAGGTGATGGTTCTTAAGGAAAGTACAAAAAATTTACCATGCGTTCGAGAGGCTCACTTTACTCTCTGTACCATTTCTAAAATATGTCTATCCCCATGTCACCCAATCTATATTATTTATGAACTCTATACCCTAAAAAAGTTCATTAACCAATTATTAGTCTAACATTACTAATTTTCCTAACAGCTTATTTGTTAAATATTTTTTCAATTATTTTATTTTCTGTGATTTTAGCTAATTGGGGAGCCTATTGTAACTTTGACTTGTGGGAAGCTTTTGCCTTGGGTTATGCATTTTGTTTATGAATGAAATTAGTTTCCGTAAAAAGAAATGTCCACATTCTTGGAGAAAGTGCCACCAGTTTGTACCATTGCGGTGTCATTACATCTGTATCTACTGTCTTCTATTATTCCCAAACATATCAGGATCAATTTACTTTATGGAAATTGGCAGAAGCAAACTAGATGTTAGGAGGCCATCAGTACAAATACCATTTTCAATGATGGCATCGCCCCTCTCAAGTAAAAACAATATACATATATTGTTAGTAGTAACATTTTGAATTAATGAAACCAGCACCGAATTTGTCCTAGATGTTTTTCTAGCTCAACTACCTAAAGTCCATCCGGAATTTCAGATTCTTTCACTTGATCAGCTTGGCCGTGGTAGTTTCTATACAGTACATTGGTTTTTATTTATAATTCTAATAACCGGTGCAAACATGAAACAAACAACGGAACATTGAGTCCATCTTTTGTCATGTTGGTTGACAAATAGTCTCTCAACAAATTCAGATCAAATGTGAGTTAACGCAACTTCTTTAATCTCATATATTCCCATTCATGTTGAGGCTTATGTCAGTTAACTTTATTGTGATCTGTCCCGAAGAAATAATTTTATTATTATTATTTTTTTTTTCTTGTTAAGTTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACGGACGCTGATTTGTCTGAAGGAGAAGAAAAAAATATTGTCATACATGAGCCTTCAATTGGAGATCATGCTAGTGAAAATATCCCATCTATGATTCTTGAGTAGTTTGTTTTAATTCATCCTCCGCAATTTTCTATTATTATTGATTGTCTTAAAATTTGTCTGCTGCTGTATGTTTCAATTCCTTAATTGATTACTACAGACGGAAGATATCTCTGTTGAATCTATAGATGAAGATGCTGTGAAAATTAAACCTCCTTTTCTTGAAGGAAATCAGACATCAATTTCTAAGGAAAAGAAAAGTTTTCGGGCTAAGGGAAGTGCTCAGAGTGATACTCGTGGACTTGTCCAGCCTACTTTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGGTAACATCCGAATACCTGTACTTAATATGTGGAAATAGGTTGCTGGCTTGGAATCCATTTACTGTTACATTGAGGGGCTTATTAAAACTAGTAGTTATTTTTCTTGATGCAAAAGGAAATGATCATTTAAAAACCAATGAAAGAACAAGCAAGAGCTGTTACAACGTAGCTATTGAGGAACCAAAAAAAAAAAATGCTCCTCAAACTATAAAACACTTGGCGAAGCAAAGGAAAGACATTTACCTCCCCTTTGGTTGTTAACAACGGTAATGATGTTTTACAAAAGTCAGTTAAAAAATGATCGTAATTCTAGAGAAAAGTATAAAGTGTGTTAGTTACTTACCTTTTACAGAGAATGTGAGACAACATGCTCTTAGAGAAGACAAAAAACATAGTAAAGAGATGCTATTAAAAAGAAAAGACCATCCGAGATTTTTTGAAAACACGATTGTTTCTTTCCATCTAAATTACCCAACAAATTACTGAAACTGCAAATTCCAAAAAGCTCTCAATCTTATACTAGCTAGTAACTCCTCCAAATTGTCCTATCCGAACAATGAACTTTATATCTTTAAGGCTTACATATATTTGAAATGAATGCTTATTGTTTTCAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAGCTCCCAAAGGTTCTAATATTTTCCATTGATTTTATCATCACGACACGAGAATACTAAAGTAACACCAAATCTGCAAAATTGGTTGCCTACAGTTTCTACCCAAAAGATGCAGCTGTCTGGTTCAAAGCAAAAGATTGACCAGGTGATGACAAATCTTCATTTTTCAATCACGCCTTTTCTCCTGAAAACGACTAACAGATTATCATGTACTTTCAGGATGAAGGATCAAAGAAAAGGAGGGTTGTCCGGGGACAAGGTGATGCAATCTTTTATGATCTCTCCATTTACTTACTGCTTGTTGAAATCTCTAACAGCTTAGATATGACAATATTGGCTTGCTGAATTTGTCATCTCTCTTGTACCCTGGGTTCTCATGGAAATTATATCAGGATTGCTCGAGAAATCATACAACTTCTGTAGTAGATCTAATAGAATATTGACTCCCATTTTGCAAGATACATGTGCATTAGTATTTGGACTAATTTTCTGTGTGCTATATTTGTAGTCGGTTTATGCAAATTTGGGTTCTGGATTACCTGGTGTAGCTTTGTAATCATGTTCAGTTGACTGTGTTTATATGGTATGTTTACATGGTTGGTTGATGCATATTAATATTAGCTTGAACGTGCAGGAGGAAAAGCCCAGAAGAAGGATACAGAATATGAGGTACTGGTACCATTTTTACAAATTGTGCTTTTAAAAGATTTGCTGCTTTATTAACCTCAGGATTTCTTATGTGCACTATTCCAACTTGGAGTGGACATGCTTCCCAAAGTTATTATATAACTGGACAAATTTCCATTACTTATAGCTCCAACAGCTAGAGGTATCTTAAAAGCAATCAACTTGAGCATGTTTGGTATACCAACTGCAAAGTGCTTTAAAATGCACTTTTAAACATTTTAAGCACTTGAAAGTCATTCCAAAAACTTTTAGCTGCGGATGAAAATGTTTCATCACAAGTAATACTTTTTGCTTCCTAAGTCTCAAAGTCTTACGTTTTTTTAAAAAAATAATATTTTGTTATTTATTTTTTCAATGAAATTTTATGTGTTAACACATGATATTTCTGAAATGGTCATCGTTAGTTTTTGCTTGGTGACACTAACAAAAAACCTTAAATAGTTTTATGTTAATTATAAAAAGGCCCTTGGCATTCTTCAAACTTGAAAGAATTTCAACCAGCAATTCCTTCTCATGTGTCTCATCTTACAAGTGAGACCTTTTAGAATTTAGTTTAAAAGGATGTTTGCTTCTGTGTTTTGGGTCTCATCCAAATTTAGTTCATGACTTTCAAATGTTTTAATAGACCATAAATTAATTCTGATTGTTAGTTTCAGACTGATATTTATTCAATTCGGTTCCATAATAATGTGAATATGTTTCCAATAGAGAGGAAAGACTAATTAAGGGGGTTTTTTCCCTTCGGAAGTAACAATAAACTAACTGTAGGCTAATTTAAGATTTATTGGACATTTCTAACTGGTGAGATTAAAATAAAACGAGGACCAAAATATTGTGGTGAACTAGTATTATAACCTAAATTTTTATATGCTCTGAATTGCAAGTACATGTAGTTTGTTTCTATTTACTACTCTCAAGTCACGATTCAGTGCGGGATGAAATGGTTGACGGTAGATATCACTGTAGACTTTATTTATCTAAATCTTGTAGGTTGAAGATGAGATCGAAGATTTGTCGAGCTCTCAAGAGGTGAGTGTTCAATGGCTATTTTATAACCTACATACTTCCATGATTTCTTAGGATATTGGACTGGAAGTTAGTGGGCCCTAGGTTTGTAGTCATTTTGGAGTCTGATGAAAATATTTGACTGAACTACTCCTTTGCTATTACATATAATCAATGTCTCATTTTCCTTTGACTATTAATTTTTTGGCTTAGTTACCCTTTAAAAACTCTCTCTCCCTTGTGCCCTACACCATGGTTGTTAGAAAGTTGTCATTTACTGTAATTTTTTATTTCTATGTAATATTAAGAAAAAAACAGGCTTTCCATTTTTTTTAGAAACAACAGCTTTCATGGAGAAAGAACGAAAGTATACATGAGCATATGAAAAAGAAATAACTCAGCCCACAAAAAAAGCAATACCCGCTCCAAGGAGTTCTTAACTTTGCAAGTTAACACCTATAAAATAGCTAAAACCGAAGTGTACGAGGAAACATGGAAGCAAACAAGGGACCAAATTTCACTATGTTCCTTTTCCACCCTTCTAAAGATCCTACCATTCTGCTCTGCCCATAAAACCCACGAGATCCCACACACCTCACCTAACCTCAAAAGTGATCTTTTTCCCCACGAGGCGAATTGAGGAGGAACGAGATAATGCAATCCTAATCCTCAACTCCTCAATGATTCTTATCAAACTCTGTATGGAATATCATGATTCTTTATTAACTCCTCAATCATAGCACTAATGGTCCCTCTAGCAAACAAATGAACAACTAAACATAGTTCTTGATCGTCATCACCAATAAGCGTTCACATTGTCAAACTTTTCAAATGCAGGTGATTAAATTAAAGCGTTTGAATAATTCAGCAATTAAACTTTAAATGGGAGTCAAAAGATGAAATCAAAGTGAACTTATCATTTAGGGACCAGATGGATAGTTTAACTAGAACACATTCCTTTTAGCTTTCATAAATATTGATTATGATCTTCCACATTTTTGCTCACAAGTTCATTTGTCATCCAGGACACTGATGAAGATTGGACAAGTTGAGGTTGTTACCATTCTAACAATGGTAAGCCACACCAGCAGCTCGAATTGCATCATTGCAGGGATTATATTCAGAATTCAGAATGCTCGAATTGCTCTACCAGGTTCTTTGATGCCGCTGGCTAATTGAAATTAAAGAAATTTGACCATAAGATGATATTGAAAAGTTTTTATCATAAAGTTAGGCAATGTAAAATTATGTTTTTTATATACTTGAGAGATCAACTGTTACACTTCCTAGAAACCAGACCAAGTTTAAGGTTAAAAATATTCACAAGATTTTATGCAGATTGTTGTGCCTTTGGA
mRNA sequence
GAAGAATTGAGAAAAAGAAAAAAATAATGAAAGAGGAAACAAGAGGGCAGGGATCCCGGAGAGTAAACTGCAGTTGGTAGCTGACTACAAGCAACAGAAATGGCGCGAGGATCGTCTTCAAAGAAGGACGAAGCAAAAGGAGAAATCAACCCAGAGATTGCAGAGCGAAAGCGGCTCAAGAAGCTCGCATTCTCCAATCACATACTTTCAGAAACCCAGGCAAGGCCCCAGGCTTATCTGAGCCCTTCAGCCACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCACAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTTGCTCCCGTTAGTGGTGGCAAGATTGGCGAGCTCAAGGATTTATCAACCAAGAATCCCATTCTCTATCTCGATTTTCCTCAGGGGCGTATGAAGTTGTTTGGAACTATCATGTATCCGAAGAACAGATATTTAACTTTGCAGTTTTCTAGAGGTGGAAAGAATGTGACGTGTGAAGATTGTTTTGATAATATGATTGTCTTTTCTGATGCATGGTGGATTGGAACAAAAGATGAAAATCCAGAGGAGGCGTGTCTTGATTTTCCTAAAGATTTGACCATGGGACAATGTGGAGAATATGACTTTAACGGTGGTGCTGGTGTTACTAGTACGAGTGGGGTTGCTGGTGTTACCAGTACGAGTAAGCAGAGTGTTCAAAGGAAGGGAATCAATCCTGCTGCAGAAAATTCCTTTAAAGGAGAGCATGGAGATGATTTAGTGGGCCTTGAAGCCAGCGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCAGAAAAGTATTCAATTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACGGACGCTGATTTGTCTGAAGGAGAAGAAAAAAATATTGTCATACATGAGCCTTCAATTGGAGATCATGCTAGTGAAAAGACGGAAGATATCTCTGTTGAATCTATAGATGAAGATGCTGTGAAAATTAAACCTCCTTTTCTTGAAGGAAATCAGACATCAATTTCTAAGGAAAAGAAAAGTTTTCGGGCTAAGGGAAGTGCTCAGAGTGATACTCGTGGACTTGTCCAGCCTACTTTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAGCTCCCAAAGTTTCTACCCAAAAGATGCAGCTGTCTGGTTCAAAGCAAAAGATTGACCAGGATGAAGGATCAAAGAAAAGGAGGGTTGTCCGGGGACAAGGAGGAAAAGCCCAGAAGAAGGATACAGAATATGAGGTTGAAGATGAGATCGAAGATTTGTCGAGCTCTCAAGAGGACACTGATGAAGATTGGACAAGTTGAGGTTGTTACCATTCTAACAATGGTAAGCCACACCAGCAGCTCGAATTGCATCATTGCAGGGATTATATTCAGAATTCAGAATGCTCGAATTGCTCTACCAGGTTCTTTGATGCCGCTGGCTAATTGAAATTAAAGAAATTTGACCATAAGATGATATTGAAAAGTTTTTATCATAAAGTTAGGCAATGTAAAATTATGTTTTTTATATACTTGAGAGATCAACTGTTACACTTCCTAGAAACCAGACCAAGTTTAAGGTTAAAAATATTCACAAGATTTTATGCAGATTGTTGTGCCTTTGGA
Coding sequence (CDS)
ATGGCGCGAGGATCGTCTTCAAAGAAGGACGAAGCAAAAGGAGAAATCAACCCAGAGATTGCAGAGCGAAAGCGGCTCAAGAAGCTCGCATTCTCCAATCACATACTTTCAGAAACCCAGGCAAGGCCCCAGGCTTATCTGAGCCCTTCAGCCACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCACAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTTGCTCCCGTTAGTGGTGGCAAGATTGGCGAGCTCAAGGATTTATCAACCAAGAATCCCATTCTCTATCTCGATTTTCCTCAGGGGCGTATGAAGTTGTTTGGAACTATCATGTATCCGAAGAACAGATATTTAACTTTGCAGTTTTCTAGAGGTGGAAAGAATGTGACGTGTGAAGATTGTTTTGATAATATGATTGTCTTTTCTGATGCATGGTGGATTGGAACAAAAGATGAAAATCCAGAGGAGGCGTGTCTTGATTTTCCTAAAGATTTGACCATGGGACAATGTGGAGAATATGACTTTAACGGTGGTGCTGGTGTTACTAGTACGAGTGGGGTTGCTGGTGTTACCAGTACGAGTAAGCAGAGTGTTCAAAGGAAGGGAATCAATCCTGCTGCAGAAAATTCCTTTAAAGGAGAGCATGGAGATGATTTAGTGGGCCTTGAAGCCAGCGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCAGAAAAGTATTCAATTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACGGACGCTGATTTGTCTGAAGGAGAAGAAAAAAATATTGTCATACATGAGCCTTCAATTGGAGATCATGCTAGTGAAAAGACGGAAGATATCTCTGTTGAATCTATAGATGAAGATGCTGTGAAAATTAAACCTCCTTTTCTTGAAGGAAATCAGACATCAATTTCTAAGGAAAAGAAAAGTTTTCGGGCTAAGGGAAGTGCTCAGAGTGATACTCGTGGACTTGTCCAGCCTACTTTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAGCTCCCAAAGTTTCTACCCAAAAGATGCAGCTGTCTGGTTCAAAGCAAAAGATTGACCAGGATGAAGGATCAAAGAAAAGGAGGGTTGTCCGGGGACAAGGAGGAAAAGCCCAGAAGAAGGATACAGAATATGAGGTTGAAGATGAGATCGAAGATTTGTCGAGCTCTCAAGAGGACACTGATGAAGATTGGACAAGTTGA
Protein sequence
MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESIDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSSKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSSQEDTDEDWTS*
Homology
BLAST of CSPI01G07540 vs. ExPASy Swiss-Prot
Match:
O81242 (DNA-binding protein RHL1 OS=Arabidopsis thaliana OX=3702 GN=RHL1 PE=1 SV=1)
HSP 1 Score: 302.4 bits (773), Expect = 8.4e-81
Identity = 183/373 (49.06%), Postives = 235/373 (63.00%), Query Frame = 0
Query: 1 MARGSSSKKDEAKG--EINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHG 60
M R SSSKK +KG + + E +RKRLK LA N +LS++ A+ + L PS VLKHHG
Sbjct: 1 MVRASSSKKGGSKGGDKDDAESKQRKRLKTLALDNQLLSDSPAKSHSSLKPSKQVLKHHG 60
Query: 61 KDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYP 120
DI++KSQRKNRFLFSF GLLAP+S IG+L LSTKNP+LYL+FPQGRMKLFGTI+YP
Sbjct: 61 TDIIRKSQRKNRFLFSFPGLLAPISAATIGDLDRLSTKNPVLYLNFPQGRMKLFGTILYP 120
Query: 121 KNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEY 180
KNRYLTLQFSRGGKNV C+D FDNMIVFS++WWIGTK+ENPEEA LDFPK+L + E+
Sbjct: 121 KNRYLTLQFSRGGKNVLCDDYFDNMIVFSESWWIGTKEENPEEARLDFPKELAQAENTEF 180
Query: 181 DFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAEN----SFKGEHGDDLVGLEASV--T 240
DF GGAG + V + S S + +P +N S GE DD + + V T
Sbjct: 181 DFQGGAG--GAASVKKLASPEIGSQPTETDSPEVDNEDVLSEDGEFLDDKIQVTPPVQLT 240
Query: 241 NSIKTTPVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKT 300
++ TPVR S+R++ K FNFAE SSE S ++ + S+ +EK ++ EP + E++
Sbjct: 241 PPVQVTPVRQSQRNSGKKFNFAETSSEASSGESEGNTSDEDEKPLL--EPESSTRSREES 300
Query: 301 EDISVESIDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVE 360
+D I A K+ + SK+ K LVQ TL +LFKK E
Sbjct: 301 QD--GNGITASASKLPEELPAKREKLKSKDSK--------------LVQATLSNLFKKAE 353
Query: 361 EKRTPRSSKRSSA 366
EK S +SS+
Sbjct: 361 EKTAGTSKAKSSS 353
BLAST of CSPI01G07540 vs. ExPASy TrEMBL
Match:
A0A0A0LVZ6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043140 PE=4 SV=1)
HSP 1 Score: 825.9 bits (2132), Expect = 8.0e-236
Identity = 430/430 (100.00%), Postives = 430/430 (100.00%), Query Frame = 0
Query: 1 MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD 60
MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD
Sbjct: 1 MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD 60
Query: 61 IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN 120
IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN
Sbjct: 61 IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN 120
Query: 121 RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF 180
RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF
Sbjct: 121 RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF 180
Query: 181 NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV 240
NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV
Sbjct: 181 NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV 240
Query: 241 RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI 300
RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI
Sbjct: 241 RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI 300
Query: 301 DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS 360
DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS
Sbjct: 301 DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS 360
Query: 361 KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS 420
KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS
Sbjct: 361 KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS 420
Query: 421 QEDTDEDWTS 431
QEDTDEDWTS
Sbjct: 421 QEDTDEDWTS 430
BLAST of CSPI01G07540 vs. ExPASy TrEMBL
Match:
A0A1S3CTA5 (DNA-binding protein RHL1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103504698 PE=4 SV=1)
HSP 1 Score: 764.6 bits (1973), Expect = 2.2e-217
Identity = 399/431 (92.58%), Postives = 412/431 (95.59%), Query Frame = 0
Query: 1 MARG-SSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
MARG SSSKKDEAKGEINPEI ERKRLKKLAFSN+ILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKKDEAKGEINPEIGERKRLKKLAFSNNILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL+TKNP+LYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLATKNPVLYLDFPQGRMKLFGTIMYPK 120
Query: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
NRYLTLQFS+GGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPK+LT+GQCGEYD
Sbjct: 121 NRYLTLQFSKGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKELTLGQCGEYD 180
Query: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
FNGG AGVTSTSKQSVQ+KGINPA ENSFKGEHGDDLVGLEASVTNS+KT P
Sbjct: 181 FNGG---------AGVTSTSKQSVQKKGINPATENSFKGEHGDDLVGLEASVTNSVKTMP 240
Query: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
VRHSERSARKVFNFAEASSEDES GTD DLSEGEEKNIVIHEPSIGDHASEKTEDISVES
Sbjct: 241 VRHSERSARKVFNFAEASSEDESTGTDTDLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
Query: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
IDEDAV+IKP FLEGNQTSISKEKK+ RAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS
Sbjct: 301 IDEDAVEIKPSFLEGNQTSISKEKKNSRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
Query: 361 SKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSS 420
SKRSS PKVSTQKMQLSGSKQKIDQDEGSKKRR VRGQGGKAQ+KDTEYEVEDEIE+LSS
Sbjct: 361 SKRSSVPKVSTQKMQLSGSKQKIDQDEGSKKRRAVRGQGGKAQRKDTEYEVEDEIEELSS 420
Query: 421 SQEDTDEDWTS 431
SQEDTDEDWTS
Sbjct: 421 SQEDTDEDWTS 422
BLAST of CSPI01G07540 vs. ExPASy TrEMBL
Match:
A0A5D3BIX4 (DNA-binding protein RHL1 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G002710 PE=4 SV=1)
HSP 1 Score: 696.4 bits (1796), Expect = 7.3e-197
Identity = 364/399 (91.23%), Postives = 376/399 (94.24%), Query Frame = 0
Query: 1 MARG-SSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
MARG SSSKKDEAKGEINPEI ERKRLKKLAFSN+ILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKKDEAKGEINPEIGERKRLKKLAFSNNILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL+TKNP+LYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLATKNPVLYLDFPQGRMKLFGTIMYPK 120
Query: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
NRYLTLQFS+GGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPK+LT+GQCGEYD
Sbjct: 121 NRYLTLQFSKGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKELTLGQCGEYD 180
Query: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
FNGG AGVTSTSKQSVQ+KGINPA ENSFKGEHGDDLVGLEASVTNS+KT P
Sbjct: 181 FNGG---------AGVTSTSKQSVQKKGINPATENSFKGEHGDDLVGLEASVTNSVKTMP 240
Query: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
VRHSERSARKVFNFAEASSEDES GTD DLSEGEEKNIVIHEPSIGDHA +DISVES
Sbjct: 241 VRHSERSARKVFNFAEASSEDESTGTDTDLSEGEEKNIVIHEPSIGDHA----KDISVES 300
Query: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
IDEDAV+IKP FLEGNQTSISKEKK+ RAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS
Sbjct: 301 IDEDAVEIKPSFLEGNQTSISKEKKNSRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
Query: 361 SKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQG 399
SKRSS PKVSTQKMQLSGSKQKIDQDEGSKKRR VRGQG
Sbjct: 361 SKRSSVPKVSTQKMQLSGSKQKIDQDEGSKKRRAVRGQG 386
BLAST of CSPI01G07540 vs. ExPASy TrEMBL
Match:
A0A6J1E8G5 (DNA-binding protein RHL1-like OS=Cucurbita moschata OX=3662 GN=LOC111431599 PE=4 SV=1)
HSP 1 Score: 661.4 bits (1705), Expect = 2.6e-186
Identity = 357/434 (82.26%), Postives = 386/434 (88.94%), Query Frame = 0
Query: 1 MARG-SSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
MARG SSSK+DEAKGE+ PEIA RKRLKKLAF+N+ILSETQA+PQAY SPSATVLKHHGK
Sbjct: 1 MARGSSSSKRDEAKGEMEPEIAARKRLKKLAFTNNILSETQAKPQAYPSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPILYLDFPQGR+KLFGTI+YPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRLKLFGTIVYPK 120
Query: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
NRYLTLQFSRGGKNV CEDCFDNMIVFSDAWWIGTKDENPEE LDFPK++TMG+CGEYD
Sbjct: 121 NRYLTLQFSRGGKNVMCEDCFDNMIVFSDAWWIGTKDENPEEDRLDFPKEMTMGKCGEYD 180
Query: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
FNGG AGV STSKQSVQ+KGIN A E S KGEHGDDLV LE ++TNS+KTTP
Sbjct: 181 FNGG---------AGVASTSKQSVQKKGINRAEEKSLKGEHGDDLVDLEDNMTNSMKTTP 240
Query: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
VRHSERSA KVFNFA+A S++ESAGT AD SEGEEKNIVIHEPSIGDHASEKTE +SV+S
Sbjct: 241 VRHSERSAGKVFNFAQAFSKEESAGTYADFSEGEEKNIVIHEPSIGDHASEKTEVVSVDS 300
Query: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRG-LVQPTLLSLFKKVEEKRTPR 360
D+DAV+ +P FLEGN+T ISK K RAKG+AQS RG LVQPTL SLFKKVEEKRTPR
Sbjct: 301 EDKDAVE-RPRFLEGNKTPISKSKNGSRAKGNAQSGNRGLLVQPTLPSLFKKVEEKRTPR 360
Query: 361 SSKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQ--GGKAQKKDTEYEVEDEIED 420
SSKRSS PKVS QKMQLSGSKQKIDQDEG KKRRVV+GQ GGK ++KDTEYE ED+IE+
Sbjct: 361 SSKRSSTPKVSAQKMQLSGSKQKIDQDEGLKKRRVVQGQDDGGKFRRKDTEYEDEDDIEE 420
Query: 421 LSSSQEDTDEDWTS 431
LSSSQEDTDEDWTS
Sbjct: 421 LSSSQEDTDEDWTS 424
BLAST of CSPI01G07540 vs. ExPASy TrEMBL
Match:
A0A6J1KHM2 (DNA-binding protein RHL1-like OS=Cucurbita maxima OX=3661 GN=LOC111495823 PE=4 SV=1)
HSP 1 Score: 649.0 bits (1673), Expect = 1.3e-182
Identity = 352/434 (81.11%), Postives = 383/434 (88.25%), Query Frame = 0
Query: 1 MARG-SSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
MARG SSSK+DEAKGE++PEIA RKRLKKLAF+N+ILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKRDEAKGEMDPEIAARKRLKKLAFTNNILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPILYLDFPQGR+KLFGTI+YPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRLKLFGTIVYPK 120
Query: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
NRYLTLQFSRGGKNV CEDCFDNMIVFSDAWWIGTKDENPEE LDFPK++TMG+CGEYD
Sbjct: 121 NRYLTLQFSRGGKNVMCEDCFDNMIVFSDAWWIGTKDENPEEDRLDFPKEMTMGKCGEYD 180
Query: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
FNGG AGV STSKQSVQ+KGI+ A ENS K EHGDDLV LE ++TNS+KTTP
Sbjct: 181 FNGG---------AGVASTSKQSVQKKGIDRAEENSLKEEHGDDLVDLEDNMTNSMKTTP 240
Query: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
VRHSERS KVFNFA+A S++ESAGT AD SEGEEKNIVI+EPSIGDHASEKTE +SV+S
Sbjct: 241 VRHSERSGGKVFNFAQAFSKEESAGTLADFSEGEEKNIVIYEPSIGDHASEKTEVVSVDS 300
Query: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRG-LVQPTLLSLFKKVEEKRTPR 360
D+DAV+ +P FLEGN+T ISK K RAKG+AQS RG LVQPTL SLFKKVEEKRTPR
Sbjct: 301 EDKDAVE-RPRFLEGNKTPISKSKNGSRAKGNAQSGNRGLLVQPTLPSLFKKVEEKRTPR 360
Query: 361 SSKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQ--GGKAQKKDTEYEVEDEIED 420
SSKRSS PKVS QK QLSGSKQKIDQDEG KKR VV+GQ GGK +KDTEYE ED+IE+
Sbjct: 361 SSKRSSTPKVSAQKKQLSGSKQKIDQDEGLKKRGVVQGQDDGGKFGRKDTEYEDEDDIEE 420
Query: 421 LSSSQEDTDEDWTS 431
L SSQEDTDEDWTS
Sbjct: 421 LLSSQEDTDEDWTS 424
BLAST of CSPI01G07540 vs. NCBI nr
Match:
XP_004137530.1 (DNA-binding protein RHL1 isoform X2 [Cucumis sativus] >KGN64211.1 hypothetical protein Csa_014154 [Cucumis sativus])
HSP 1 Score: 825.9 bits (2132), Expect = 1.7e-235
Identity = 430/430 (100.00%), Postives = 430/430 (100.00%), Query Frame = 0
Query: 1 MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD 60
MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD
Sbjct: 1 MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD 60
Query: 61 IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN 120
IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN
Sbjct: 61 IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN 120
Query: 121 RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF 180
RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF
Sbjct: 121 RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF 180
Query: 181 NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV 240
NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV
Sbjct: 181 NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV 240
Query: 241 RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI 300
RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI
Sbjct: 241 RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI 300
Query: 301 DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS 360
DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS
Sbjct: 301 DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS 360
Query: 361 KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS 420
KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS
Sbjct: 361 KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS 420
Query: 421 QEDTDEDWTS 431
QEDTDEDWTS
Sbjct: 421 QEDTDEDWTS 430
BLAST of CSPI01G07540 vs. NCBI nr
Match:
XP_031744387.1 (DNA-binding protein RHL1 isoform X1 [Cucumis sativus])
HSP 1 Score: 819.7 bits (2116), Expect = 1.2e-233
Identity = 430/435 (98.85%), Postives = 430/435 (98.85%), Query Frame = 0
Query: 1 MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD 60
MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD
Sbjct: 1 MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD 60
Query: 61 IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN 120
IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN
Sbjct: 61 IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN 120
Query: 121 RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF 180
RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF
Sbjct: 121 RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF 180
Query: 181 NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV 240
NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV
Sbjct: 181 NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV 240
Query: 241 RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI 300
RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI
Sbjct: 241 RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI 300
Query: 301 DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS 360
DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS
Sbjct: 301 DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS 360
Query: 361 KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEY-----EVEDEIE 420
KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEY EVEDEIE
Sbjct: 361 KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYELQQLEVEDEIE 420
Query: 421 DLSSSQEDTDEDWTS 431
DLSSSQEDTDEDWTS
Sbjct: 421 DLSSSQEDTDEDWTS 435
BLAST of CSPI01G07540 vs. NCBI nr
Match:
XP_008467323.1 (PREDICTED: DNA-binding protein RHL1 isoform X2 [Cucumis melo])
HSP 1 Score: 764.6 bits (1973), Expect = 4.5e-217
Identity = 399/431 (92.58%), Postives = 412/431 (95.59%), Query Frame = 0
Query: 1 MARG-SSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
MARG SSSKKDEAKGEINPEI ERKRLKKLAFSN+ILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKKDEAKGEINPEIGERKRLKKLAFSNNILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL+TKNP+LYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLATKNPVLYLDFPQGRMKLFGTIMYPK 120
Query: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
NRYLTLQFS+GGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPK+LT+GQCGEYD
Sbjct: 121 NRYLTLQFSKGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKELTLGQCGEYD 180
Query: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
FNGG AGVTSTSKQSVQ+KGINPA ENSFKGEHGDDLVGLEASVTNS+KT P
Sbjct: 181 FNGG---------AGVTSTSKQSVQKKGINPATENSFKGEHGDDLVGLEASVTNSVKTMP 240
Query: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
VRHSERSARKVFNFAEASSEDES GTD DLSEGEEKNIVIHEPSIGDHASEKTEDISVES
Sbjct: 241 VRHSERSARKVFNFAEASSEDESTGTDTDLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
Query: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
IDEDAV+IKP FLEGNQTSISKEKK+ RAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS
Sbjct: 301 IDEDAVEIKPSFLEGNQTSISKEKKNSRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
Query: 361 SKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSS 420
SKRSS PKVSTQKMQLSGSKQKIDQDEGSKKRR VRGQGGKAQ+KDTEYEVEDEIE+LSS
Sbjct: 361 SKRSSVPKVSTQKMQLSGSKQKIDQDEGSKKRRAVRGQGGKAQRKDTEYEVEDEIEELSS 420
Query: 421 SQEDTDEDWTS 431
SQEDTDEDWTS
Sbjct: 421 SQEDTDEDWTS 422
BLAST of CSPI01G07540 vs. NCBI nr
Match:
XP_038893754.1 (DNA-binding protein RHL1 isoform X1 [Benincasa hispida])
HSP 1 Score: 697.6 bits (1799), Expect = 6.8e-197
Identity = 374/433 (86.37%), Postives = 391/433 (90.30%), Query Frame = 0
Query: 1 MARG-SSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
MARG SSSK+DEAKGEI+P IA RKRLKKLAFSNHILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKRDEAKGEIDPGIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPILYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLRTKNPILYLDFPQGRMKLFGTIMYPK 120
Query: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
NRYLTLQFSRGGKNV CED FDNMIVFSDAWWIGTKDENPEEA LDFP +LT GQCGE D
Sbjct: 121 NRYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEAHLDFPIELTTGQCGECD 180
Query: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
FNGG AGVT SKQSVQ+KGINPA ENS KGEHGDDLV L+ +VTNSIKTTP
Sbjct: 181 FNGG---------AGVTGLSKQSVQKKGINPAVENSLKGEHGDDLVDLKDNVTNSIKTTP 240
Query: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
VRHSERSARKVFNFAE SSEDES T ADLSEGEEKNIVIHEPSIGDHA EKTED+SV+S
Sbjct: 241 VRHSERSARKVFNFAEVSSEDESTSTYADLSEGEEKNIVIHEPSIGDHAREKTEDLSVDS 300
Query: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
+DEDA +I+PPFLEGNQTSIS EKKS AKGSAQSDTRGLVQPTLLSLFKKVEEKRT RS
Sbjct: 301 MDEDAGEIRPPFLEGNQTSISTEKKSSLAKGSAQSDTRGLVQPTLLSLFKKVEEKRTSRS 360
Query: 361 SKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQ--GGKAQKKDTEYEVEDEIEDL 420
SKRSS PKVS QKMQLSGSK+KIDQDEG +KRR VRGQ GGK QKKDTEYEV+D+IE+L
Sbjct: 361 SKRSSTPKVSVQKMQLSGSKRKIDQDEGLRKRRAVRGQDDGGKIQKKDTEYEVKDDIEEL 420
Query: 421 SSSQEDTDEDWTS 431
SSSQEDTDEDWTS
Sbjct: 421 SSSQEDTDEDWTS 424
BLAST of CSPI01G07540 vs. NCBI nr
Match:
TYJ99067.1 (DNA-binding protein RHL1 isoform X2 [Cucumis melo var. makuwa])
HSP 1 Score: 696.4 bits (1796), Expect = 1.5e-196
Identity = 364/399 (91.23%), Postives = 376/399 (94.24%), Query Frame = 0
Query: 1 MARG-SSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
MARG SSSKKDEAKGEINPEI ERKRLKKLAFSN+ILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKKDEAKGEINPEIGERKRLKKLAFSNNILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL+TKNP+LYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLATKNPVLYLDFPQGRMKLFGTIMYPK 120
Query: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
NRYLTLQFS+GGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPK+LT+GQCGEYD
Sbjct: 121 NRYLTLQFSKGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKELTLGQCGEYD 180
Query: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
FNGG AGVTSTSKQSVQ+KGINPA ENSFKGEHGDDLVGLEASVTNS+KT P
Sbjct: 181 FNGG---------AGVTSTSKQSVQKKGINPATENSFKGEHGDDLVGLEASVTNSVKTMP 240
Query: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
VRHSERSARKVFNFAEASSEDES GTD DLSEGEEKNIVIHEPSIGDHA +DISVES
Sbjct: 241 VRHSERSARKVFNFAEASSEDESTGTDTDLSEGEEKNIVIHEPSIGDHA----KDISVES 300
Query: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
IDEDAV+IKP FLEGNQTSISKEKK+ RAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS
Sbjct: 301 IDEDAVEIKPSFLEGNQTSISKEKKNSRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
Query: 361 SKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQG 399
SKRSS PKVSTQKMQLSGSKQKIDQDEGSKKRR VRGQG
Sbjct: 361 SKRSSVPKVSTQKMQLSGSKQKIDQDEGSKKRRAVRGQG 386
BLAST of CSPI01G07540 vs. TAIR 10
Match:
AT1G48380.1 (root hair initiation protein root hairless 1 (RHL1) )
HSP 1 Score: 302.4 bits (773), Expect = 5.9e-82
Identity = 183/373 (49.06%), Postives = 235/373 (63.00%), Query Frame = 0
Query: 1 MARGSSSKKDEAKG--EINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHG 60
M R SSSKK +KG + + E +RKRLK LA N +LS++ A+ + L PS VLKHHG
Sbjct: 1 MVRASSSKKGGSKGGDKDDAESKQRKRLKTLALDNQLLSDSPAKSHSSLKPSKQVLKHHG 60
Query: 61 KDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYP 120
DI++KSQRKNRFLFSF GLLAP+S IG+L LSTKNP+LYL+FPQGRMKLFGTI+YP
Sbjct: 61 TDIIRKSQRKNRFLFSFPGLLAPISAATIGDLDRLSTKNPVLYLNFPQGRMKLFGTILYP 120
Query: 121 KNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEY 180
KNRYLTLQFSRGGKNV C+D FDNMIVFS++WWIGTK+ENPEEA LDFPK+L + E+
Sbjct: 121 KNRYLTLQFSRGGKNVLCDDYFDNMIVFSESWWIGTKEENPEEARLDFPKELAQAENTEF 180
Query: 181 DFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAEN----SFKGEHGDDLVGLEASV--T 240
DF GGAG + V + S S + +P +N S GE DD + + V T
Sbjct: 181 DFQGGAG--GAASVKKLASPEIGSQPTETDSPEVDNEDVLSEDGEFLDDKIQVTPPVQLT 240
Query: 241 NSIKTTPVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKT 300
++ TPVR S+R++ K FNFAE SSE S ++ + S+ +EK ++ EP + E++
Sbjct: 241 PPVQVTPVRQSQRNSGKKFNFAETSSEASSGESEGNTSDEDEKPLL--EPESSTRSREES 300
Query: 301 EDISVESIDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVE 360
+D I A K+ + SK+ K LVQ TL +LFKK E
Sbjct: 301 QD--GNGITASASKLPEELPAKREKLKSKDSK--------------LVQATLSNLFKKAE 353
Query: 361 EKRTPRSSKRSSA 366
EK S +SS+
Sbjct: 361 EKTAGTSKAKSSS 353
BLAST of CSPI01G07540 vs. TAIR 10
Match:
AT1G48380.2 (root hair initiation protein root hairless 1 (RHL1) )
HSP 1 Score: 286.2 bits (731), Expect = 4.4e-77
Identity = 183/404 (45.30%), Postives = 235/404 (58.17%), Query Frame = 0
Query: 1 MARGSSSKKDEAKG--EINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHG 60
M R SSSKK +KG + + E +RKRLK LA N +LS++ A+ + L PS VLKHHG
Sbjct: 1 MVRASSSKKGGSKGGDKDDAESKQRKRLKTLALDNQLLSDSPAKSHSSLKPSKQVLKHHG 60
Query: 61 KDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYP 120
DI++KSQRKNRFLFSF GLLAP+S IG+L LSTKNP+LYL+FPQGRMKLFGTI+YP
Sbjct: 61 TDIIRKSQRKNRFLFSFPGLLAPISAATIGDLDRLSTKNPVLYLNFPQGRMKLFGTILYP 120
Query: 121 KNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLT------- 180
KNRYLTLQFSRGGKNV C+D FDNMIVFS++WWIGTK+ENPEEA LDFPK+L
Sbjct: 121 KNRYLTLQFSRGGKNVLCDDYFDNMIVFSESWWIGTKEENPEEARLDFPKELAQVDTFHL 180
Query: 181 ------------------------MGQCGEYDFNGGAGVTSTSGVAGVTSTSKQSVQRKG 240
+ E+DF GGAG + V + S S +
Sbjct: 181 FLHFLFKTMVATEMFNMIRRILWFQAENTEFDFQGGAG--GAASVKKLASPEIGSQPTET 240
Query: 241 INPAAEN----SFKGEHGDDLVGLEASV--TNSIKTTPVRHSERSARKVFNFAEASSEDE 300
+P +N S GE DD + + V T ++ TPVR S+R++ K FNFAE SSE
Sbjct: 241 DSPEVDNEDVLSEDGEFLDDKIQVTPPVQLTPPVQVTPVRQSQRNSGKKFNFAETSSEAS 300
Query: 301 SAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESIDEDAVKIKPPFLEGNQTSISK 360
S ++ + S+ +EK ++ EP + E+++D I A K+ + SK
Sbjct: 301 SGESEGNTSDEDEKPLL--EPESSTRSREESQD--GNGITASASKLPEELPAKREKLKSK 360
Query: 361 EKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSSKRSSA 366
+ K LVQ TL +LFKK EEK S +SS+
Sbjct: 361 DSK--------------LVQATLSNLFKKAEEKTAGTSKAKSSS 384
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
O81242 | 8.4e-81 | 49.06 | DNA-binding protein RHL1 OS=Arabidopsis thaliana OX=3702 GN=RHL1 PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LVZ6 | 8.0e-236 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043140 PE=4 SV=1 | [more] |
A0A1S3CTA5 | 2.2e-217 | 92.58 | DNA-binding protein RHL1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103504698 PE=4... | [more] |
A0A5D3BIX4 | 7.3e-197 | 91.23 | DNA-binding protein RHL1 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5... | [more] |
A0A6J1E8G5 | 2.6e-186 | 82.26 | DNA-binding protein RHL1-like OS=Cucurbita moschata OX=3662 GN=LOC111431599 PE=4... | [more] |
A0A6J1KHM2 | 1.3e-182 | 81.11 | DNA-binding protein RHL1-like OS=Cucurbita maxima OX=3661 GN=LOC111495823 PE=4 S... | [more] |
Match Name | E-value | Identity | Description | |
XP_004137530.1 | 1.7e-235 | 100.00 | DNA-binding protein RHL1 isoform X2 [Cucumis sativus] >KGN64211.1 hypothetical p... | [more] |
XP_031744387.1 | 1.2e-233 | 98.85 | DNA-binding protein RHL1 isoform X1 [Cucumis sativus] | [more] |
XP_008467323.1 | 4.5e-217 | 92.58 | PREDICTED: DNA-binding protein RHL1 isoform X2 [Cucumis melo] | [more] |
XP_038893754.1 | 6.8e-197 | 86.37 | DNA-binding protein RHL1 isoform X1 [Benincasa hispida] | [more] |
TYJ99067.1 | 1.5e-196 | 91.23 | DNA-binding protein RHL1 isoform X2 [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
AT1G48380.1 | 5.9e-82 | 49.06 | root hair initiation protein root hairless 1 (RHL1) | [more] |
AT1G48380.2 | 4.4e-77 | 45.30 | root hair initiation protein root hairless 1 (RHL1) | [more] |