CSPI01G07540 (gene) Wild cucumber (PI 183967)

NameCSPI01G07540
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionDNA-binding protein RHL1, putative
LocationChr1 : 4778243 .. 4786686 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAGAATTGAGAAAAAGAAAAAAATAATGAAAGAGGAAACAAGAGGGCAGGGATCCCGGAGAGTAAACTGCAGTTGGTAGCTGACTACAAGCAACAGAAATGGCGCGAGGATCGTCTTCAAAGAAGGACGAAGCAAAAGGAGAAATCAACCCAGAGATTGCAGAGCGAAAGCGGCTCAAGAAGCTCGCATTCTCCAATCACATACTTTCAGAAACCCAGGCAAGGCCCCAGGCTTATCTGAGCCCTTCAGCCACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCACAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTTGCTCCCGTTAGTGGTGGCAAGATTGGCGAGCTCAAGGATTTATCAACCAAGAATCCCATTCTCTATCTCGATTTTCCTCAGGTTCGTCCGTTCTTCCTTTCTCTCAATCGCCTTTCTGTTTTTCACTCCTCTGAATTTCTTCAGTTTAGTGTCCGGGTTCGTCTCTTCTCCTTTAGAATAATTTTTGTAATTGTTTGTCATCTTCATCCAGTCCTAAGTATTTGATAGATAAACCCATATCACACTCGAGGTTTTTTTTCTATGTTTTTTATTTACATACTTCTTGCCTTCTGGATTCTACAAGATAGACTGAGAATAGTACTTATATATTTTTTTGTGGGGAATACAACTTGGAAGGATAGATCTCAGAAAAGACTAGGACAGTATTGGATGAGATAACAGCAACCCAATGGCTGGTTTTTTAAGGGTAAACCAATATATTTGTGGGGGTGTGCTGCTCGGGCTTTTTTTGGGGAGGTCTCGCTGGAAAAGAACTGTAGAACCTTCGTAGAGGAAGCTGCAAATTTTATTTCCACTTGGTGGTGGCTTAGCAACACCATTCTTTGTAATTGTTTAGGGTTTAGGGGTTTGTCAATTGTGCAAAATTAAAGGACAAGCTTTCCACCACAAAAGAACCAAGGCTAAATGATTTTTGTTTTGATATATAAATAATTTGGACTCTGTGAATTTTGTTTTTTAATAGACAAGGTATGATTAATATTTGAGTTTTTTGGTTGATTTGGAGTCCATGGAGCTGTTATCAACACCCATCAATCATTTATCATCATTATATAGTTATTAGAAGTTGATGAAATTAATGTACTATCTAATTAATTATTATAGTTTCTACTAAACTCATATGTCCAGTGATATTTTTTCTCATTAAATGAGATGCTTTGCTTAGTATTTGAGTGCATAGCATTAGAGCGAGTTAAATACGTGGAAATTGTTATTGTTATTGGCTTATTATTTATTTCTACCTGGAAAATGGATGCGTAAGCATATTGTTTGTATGCACAAGCTAACCTGGACACCCACAAATTATTTTCATTCCCCGAATGCCAATGTTGATTTTATGTGGTCATGAAGCATCTAGTGGTGGAATTTTTCTGATTGCTATTTTGTTTGCTTTCCAGGGGCGTATGAAGTTGTTTGGAACTATCATGTATCCGAAGAACAGATATTTAACTTTGCAGTTTTCTAGAGGTGGAAAGAATGTGACGTGTGAAGATTGTTTTGATAATATGGTTAGTCCTTTTGCTTCTGTGTTTCATTCCTTCTATCTTGGGGACTTTATTTTTGTTCATTTCATCTAGTATTTCATTAAAAGTGCAGTTATGACTTGTAAAACCCGTAGCACAATAAATGATGTTTGTAGCTGGTTTCAATTCTTAGTATGATCTGAATGCTTCTTTATGTACTGCTTCCGAAATTCATCTTTGCCGGTTTAAGCCGTCATTTTTTGTGATTTTTCTTTCAAAGATAAAGCAAAATCTATGGTGCAATACAAATTGAGGTTTTCTAAGCCTATTGTCGGAGACCTGTATTTGGGCCGATCCAAAATGATTCTATCATTTTTGTATCTGCAATTCAATATATATGAGCATAGATAGATACATAGACATATGTACTTACCCTTCTTTCTGTTAATTTTTTTAGTAAAACTTTTTATACTTAAATGGTCGATTCTTTTATTCGTGTTATTTTCCACCTTTCAGAATGAAAACAGAGAAATGCTTCATTATCATCTCAATTGCATTTTGAGCAAACAAGTCAAGAATTGGCCACAGCTTGTGAACATCTAGCAAGGTCTTGTCTTAAATGATATGTTTGCCCGATAAAAGTGTGGCAAGAACAATTATCCACCAACAGGAAAAGGAATAACAATCTGTTTAAGTTATCCTGGTGTGTTTGTTAAATGTGATTCATGAAGATAGATACTAGATGGTGTCATTTTCTGTTCCCTGCCAATATTGATCATCTTGAAATTTAAGAACATTTGTTCAGATTCACTATGTTGAATGATGTTCTTAATTTAATCTCTGTACTGTTCTTGGCACTTCTTGTTACTTAAGACATATGACCTTTAAGCATGTACATTATGAATCAACTATACTTTTCTCTGACTGCAGATTGTCTTTTCTGATGCATGGTGGATTGGAACAAAAGATGAAAATCCAGAGGAGGCGTGTCTTGATTTTCCTAAAGATTTGACCATGGTGAGTTTCTTGATAAATTTTAGGCCCTTAAAATCTATGTATTTTTTCTCTAGTACCTCTACATTTGTATGTATAATTCCTGCGAGCATCTCTTTTCTCAAGATCATTTATGAAGCAGAATAATGGAAATGCATCCCTCTAGTCATAGTCTAATTTATGTTGTTAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGACTTTGCTAACTCCCAGTGGTTGCATGTCAGGGACAATGTGGAGAATATGACTTTAACGGTGGTGCTGGTGTTACTAGTACGAGTGGGGTTGCTGGTGTTACCAGTACGAGTAAGCAGAGTGTTCAAAGGAAGGGAATCAATCCTGCTGCAGAAAATTCCTTTAAAGGAGAGCATGGAGATGATTTAGTGGGCCTTGAAGCCAGCGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCAGAAAAGTATTCAAGTATTACACTTTGCATGTACTGTTACCATGTAGTTCTTATCTTATTCTCAACAAGGCAGAAACCAATAGGGTTTCTGGTCAAATAGTATTTTGCATCAATTGTCTTATTTATCAACAATCCGTACCTTGATTGGTAGAAAAATTTGCTGCATGACCATGAGTATAGATACATATTTAACACTTGATTTATGATTTTAAAAGTTTTAACTTTAAAATTTGTTTTTCAAAGAATCCACACGTGTTTAGCTTGTCTGGGGTTTTCATAGCCATAGGTGGCCTCTGTTTCAGAGAGAGCATTCTGTGAGTTTGTGTTGCTGTCTTCTGAGAGGTTAGCTTGGGAAAAAATTTTCTTCAAGACATGGAGAACTCTTGAAACGAAGGTGATGGTTCTTAAGGAAAGTACAAAAAATTTACCATGCGTTCGAGAGGCTCACTTTACTCTCTGTACCATTTCTAAAATATGTCTATCCCCATGTCACCCAATCTATATTATTTATGAACTCTATACCCTAAAAAAGTTCATTAACCAATTATTAGTCTAACATTACTAATTTTCCTAACAGCTTATTTGTTAAATATTTTTTCAATTATTTTATTTTCTGTGATTTTAGCTAATTGGGGAGCCTATTGTAACTTTGACTTGTGGGAAGCTTTTGCCTTGGGTTATGCATTTTGTTTATGAATGAAATTAGTTTCCGTAAAAAGAAATGTCCACATTCTTGGAGAAAGTGCCACCAGTTTGTACCATTGCGGTGTCATTACATCTGTATCTACTGTCTTCTATTATTCCCAAACATATCAGGATCAATTTACTTTATGGAAATTGGCAGAAGCAAACTAGATGTTAGGAGGCCATCAGTACAAATACCATTTTCAATGATGGCATCGCCCCTCTCAAGTAAAAACAATATACATATATTGTTAGTAGTAACATTTTGAATTAATGAAACCAGCACCGAATTTGTCCTAGATGTTTTTCTAGCTCAACTACCTAAAGTCCATCCGGAATTTCAGATTCTTTCACTTGATCAGCTTGGCCGTGGTAGTTTCTATACAGTACATTGGTTTTTATTTATAATTCTAATAACCGGTGCAAACATGAAACAAACAACGGAACATTGAGTCCATCTTTTGTCATGTTGGTTGACAAATAGTCTCTCAACAAATTCAGATCAAATGTGAGTTAACGCAACTTCTTTAATCTCATATATTCCCATTCATGTTGAGGCTTATGTCAGTTAACTTTATTGTGATCTGTCCCGAAGAAATAATTTTATTATTATTATTTTTTTTTTCTTGTTAAGTTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACGGACGCTGATTTGTCTGAAGGAGAAGAAAAAAATATTGTCATACATGAGCCTTCAATTGGAGATCATGCTAGTGAAAATATCCCATCTATGATTCTTGAGTAGTTTGTTTTAATTCATCCTCCGCAATTTTCTATTATTATTGATTGTCTTAAAATTTGTCTGCTGCTGTATGTTTCAATTCCTTAATTGATTACTACAGACGGAAGATATCTCTGTTGAATCTATAGATGAAGATGCTGTGAAAATTAAACCTCCTTTTCTTGAAGGAAATCAGACATCAATTTCTAAGGAAAAGAAAAGTTTTCGGGCTAAGGGAAGTGCTCAGAGTGATACTCGTGGACTTGTCCAGCCTACTTTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGGTAACATCCGAATACCTGTACTTAATATGTGGAAATAGGTTGCTGGCTTGGAATCCATTTACTGTTACATTGAGGGGCTTATTAAAACTAGTAGTTATTTTTCTTGATGCAAAAGGAAATGATCATTTAAAAACCAATGAAAGAACAAGCAAGAGCTGTTACAACGTAGCTATTGAGGAACCAAAAAAAAAAAATGCTCCTCAAACTATAAAACACTTGGCGAAGCAAAGGAAAGACATTTACCTCCCCTTTGGTTGTTAACAACGGTAATGATGTTTTACAAAAGTCAGTTAAAAAATGATCGTAATTCTAGAGAAAAGTATAAAGTGTGTTAGTTACTTACCTTTTACAGAGAATGTGAGACAACATGCTCTTAGAGAAGACAAAAAACATAGTAAAGAGATGCTATTAAAAAGAAAAGACCATCCGAGATTTTTTGAAAACACGATTGTTTCTTTCCATCTAAATTACCCAACAAATTACTGAAACTGCAAATTCCAAAAAGCTCTCAATCTTATACTAGCTAGTAACTCCTCCAAATTGTCCTATCCGAACAATGAACTTTATATCTTTAAGGCTTACATATATTTGAAATGAATGCTTATTGTTTTCAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAGCTCCCAAAGGTTCTAATATTTTCCATTGATTTTATCATCACGACACGAGAATACTAAAGTAACACCAAATCTGCAAAATTGGTTGCCTACAGTTTCTACCCAAAAGATGCAGCTGTCTGGTTCAAAGCAAAAGATTGACCAGGTGATGACAAATCTTCATTTTTCAATCACGCCTTTTCTCCTGAAAACGACTAACAGATTATCATGTACTTTCAGGATGAAGGATCAAAGAAAAGGAGGGTTGTCCGGGGACAAGGTGATGCAATCTTTTATGATCTCTCCATTTACTTACTGCTTGTTGAAATCTCTAACAGCTTAGATATGACAATATTGGCTTGCTGAATTTGTCATCTCTCTTGTACCCTGGGTTCTCATGGAAATTATATCAGGATTGCTCGAGAAATCATACAACTTCTGTAGTAGATCTAATAGAATATTGACTCCCATTTTGCAAGATACATGTGCATTAGTATTTGGACTAATTTTCTGTGTGCTATATTTGTAGTCGGTTTATGCAAATTTGGGTTCTGGATTACCTGGTGTAGCTTTGTAATCATGTTCAGTTGACTGTGTTTATATGGTATGTTTACATGGTTGGTTGATGCATATTAATATTAGCTTGAACGTGCAGGAGGAAAAGCCCAGAAGAAGGATACAGAATATGAGGTACTGGTACCATTTTTACAAATTGTGCTTTTAAAAGATTTGCTGCTTTATTAACCTCAGGATTTCTTATGTGCACTATTCCAACTTGGAGTGGACATGCTTCCCAAAGTTATTATATAACTGGACAAATTTCCATTACTTATAGCTCCAACAGCTAGAGGTATCTTAAAAGCAATCAACTTGAGCATGTTTGGTATACCAACTGCAAAGTGCTTTAAAATGCACTTTTAAACATTTTAAGCACTTGAAAGTCATTCCAAAAACTTTTAGCTGCGGATGAAAATGTTTCATCACAAGTAATACTTTTTGCTTCCTAAGTCTCAAAGTCTTACGTTTTTTTAAAAAAATAATATTTTGTTATTTATTTTTTCAATGAAATTTTATGTGTTAACACATGATATTTCTGAAATGGTCATCGTTAGTTTTTGCTTGGTGACACTAACAAAAAACCTTAAATAGTTTTATGTTAATTATAAAAAGGCCCTTGGCATTCTTCAAACTTGAAAGAATTTCAACCAGCAATTCCTTCTCATGTGTCTCATCTTACAAGTGAGACCTTTTAGAATTTAGTTTAAAAGGATGTTTGCTTCTGTGTTTTGGGTCTCATCCAAATTTAGTTCATGACTTTCAAATGTTTTAATAGACCATAAATTAATTCTGATTGTTAGTTTCAGACTGATATTTATTCAATTCGGTTCCATAATAATGTGAATATGTTTCCAATAGAGAGGAAAGACTAATTAAGGGGGTTTTTTCCCTTCGGAAGTAACAATAAACTAACTGTAGGCTAATTTAAGATTTATTGGACATTTCTAACTGGTGAGATTAAAATAAAACGAGGACCAAAATATTGTGGTGAACTAGTATTATAACCTAAATTTTTATATGCTCTGAATTGCAAGTACATGTAGTTTGTTTCTATTTACTACTCTCAAGTCACGATTCAGTGCGGGATGAAATGGTTGACGGTAGATATCACTGTAGACTTTATTTATCTAAATCTTGTAGGTTGAAGATGAGATCGAAGATTTGTCGAGCTCTCAAGAGGTGAGTGTTCAATGGCTATTTTATAACCTACATACTTCCATGATTTCTTAGGATATTGGACTGGAAGTTAGTGGGCCCTAGGTTTGTAGTCATTTTGGAGTCTGATGAAAATATTTGACTGAACTACTCCTTTGCTATTACATATAATCAATGTCTCATTTTCCTTTGACTATTAATTTTTTGGCTTAGTTACCCTTTAAAAACTCTCTCTCCCTTGTGCCCTACACCATGGTTGTTAGAAAGTTGTCATTTACTGTAATTTTTTATTTCTATGTAATATTAAGAAAAAAACAGGCTTTCCATTTTTTTTAGAAACAACAGCTTTCATGGAGAAAGAACGAAAGTATACATGAGCATATGAAAAAGAAATAACTCAGCCCACAAAAAAAGCAATACCCGCTCCAAGGAGTTCTTAACTTTGCAAGTTAACACCTATAAAATAGCTAAAACCGAAGTGTACGAGGAAACATGGAAGCAAACAAGGGACCAAATTTCACTATGTTCCTTTTCCACCCTTCTAAAGATCCTACCATTCTGCTCTGCCCATAAAACCCACGAGATCCCACACACCTCACCTAACCTCAAAAGTGATCTTTTTCCCCACGAGGCGAATTGAGGAGGAACGAGATAATGCAATCCTAATCCTCAACTCCTCAATGATTCTTATCAAACTCTGTATGGAATATCATGATTCTTTATTAACTCCTCAATCATAGCACTAATGGTCCCTCTAGCAAACAAATGAACAACTAAACATAGTTCTTGATCGTCATCACCAATAAGCGTTCACATTGTCAAACTTTTCAAATGCAGGTGATTAAATTAAAGCGTTTGAATAATTCAGCAATTAAACTTTAAATGGGAGTCAAAAGATGAAATCAAAGTGAACTTATCATTTAGGGACCAGATGGATAGTTTAACTAGAACACATTCCTTTTAGCTTTCATAAATATTGATTATGATCTTCCACATTTTTGCTCACAAGTTCATTTGTCATCCAGGACACTGATGAAGATTGGACAAGTTGAGGTTGTTACCATTCTAACAATGGTAAGCCACACCAGCAGCTCGAATTGCATCATTGCAGGGATTATATTCAGAATTCAGAATGCTCGAATTGCTCTACCAGGTTCTTTGATGCCGCTGGCTAATTGAAATTAAAGAAATTTGACCATAAGATGATATTGAAAAGTTTTTATCATAAAGTTAGGCAATGTAAAATTATGTTTTTTATATACTTGAGAGATCAACTGTTACACTTCCTAGAAACCAGACCAAGTTTAAGGTTAAAAATATTCACAAGATTTTATGCAGATTGTTGTGCCTTTGGA

mRNA sequence

ATGGCGCGAGGATCGTCTTCAAAGAAGGACGAAGCAAAAGGAGAAATCAACCCAGAGATTGCAGAGCGAAAGCGGCTCAAGAAGCTCGCATTCTCCAATCACATACTTTCAGAAACCCAGGCAAGGCCCCAGGCTTATCTGAGCCCTTCAGCCACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCACAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTTGCTCCCGTTAGTGGTGGCAAGATTGGCGAGCTCAAGGATTTATCAACCAAGAATCCCATTCTCTATCTCGATTTTCCTCAGGGGCGTATGAAGTTGTTTGGAACTATCATGTATCCGAAGAACAGATATTTAACTTTGCAGTTTTCTAGAGGTGGAAAGAATGTGACGTGTGAAGATTGTTTTGATAATATGATTGTCTTTTCTGATGCATGGTGGATTGGAACAAAAGATGAAAATCCAGAGGAGGCGTGTCTTGATTTTCCTAAAGATTTGACCATGGGACAATGTGGAGAATATGACTTTAACGGTGGTGCTGGTGTTACTAGTACGAGTGGGGTTGCTGGTGTTACCAGTACGAGTAAGCAGAGTGTTCAAAGGAAGGGAATCAATCCTGCTGCAGAAAATTCCTTTAAAGGAGAGCATGGAGATGATTTAGTGGGCCTTGAAGCCAGCGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCAGAAAAGTATTCAATTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACGGACGCTGATTTGTCTGAAGGAGAAGAAAAAAATATTGTCATACATGAGCCTTCAATTGGAGATCATGCTAGTGAAAAGACGGAAGATATCTCTGTTGAATCTATAGATGAAGATGCTGTGAAAATTAAACCTCCTTTTCTTGAAGGAAATCAGACATCAATTTCTAAGGAAAAGAAAAGTTTTCGGGCTAAGGGAAGTGCTCAGAGTGATACTCGTGGACTTGTCCAGCCTACTTTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAGCTCCCAAAGTTTCTACCCAAAAGATGCAGCTGTCTGGTTCAAAGCAAAAGATTGACCAGGATGAAGGATCAAAGAAAAGGAGGGTTGTCCGGGGACAAGGAGGAAAAGCCCAGAAGAAGGATACAGAATATGAGGTTGAAGATGAGATCGAAGATTTGTCGAGCTCTCAAGAGGACACTGATGAAGATTGGACAAGTTGA

Coding sequence (CDS)

ATGGCGCGAGGATCGTCTTCAAAGAAGGACGAAGCAAAAGGAGAAATCAACCCAGAGATTGCAGAGCGAAAGCGGCTCAAGAAGCTCGCATTCTCCAATCACATACTTTCAGAAACCCAGGCAAGGCCCCAGGCTTATCTGAGCCCTTCAGCCACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCACAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTTGCTCCCGTTAGTGGTGGCAAGATTGGCGAGCTCAAGGATTTATCAACCAAGAATCCCATTCTCTATCTCGATTTTCCTCAGGGGCGTATGAAGTTGTTTGGAACTATCATGTATCCGAAGAACAGATATTTAACTTTGCAGTTTTCTAGAGGTGGAAAGAATGTGACGTGTGAAGATTGTTTTGATAATATGATTGTCTTTTCTGATGCATGGTGGATTGGAACAAAAGATGAAAATCCAGAGGAGGCGTGTCTTGATTTTCCTAAAGATTTGACCATGGGACAATGTGGAGAATATGACTTTAACGGTGGTGCTGGTGTTACTAGTACGAGTGGGGTTGCTGGTGTTACCAGTACGAGTAAGCAGAGTGTTCAAAGGAAGGGAATCAATCCTGCTGCAGAAAATTCCTTTAAAGGAGAGCATGGAGATGATTTAGTGGGCCTTGAAGCCAGCGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCAGAAAAGTATTCAATTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACGGACGCTGATTTGTCTGAAGGAGAAGAAAAAAATATTGTCATACATGAGCCTTCAATTGGAGATCATGCTAGTGAAAAGACGGAAGATATCTCTGTTGAATCTATAGATGAAGATGCTGTGAAAATTAAACCTCCTTTTCTTGAAGGAAATCAGACATCAATTTCTAAGGAAAAGAAAAGTTTTCGGGCTAAGGGAAGTGCTCAGAGTGATACTCGTGGACTTGTCCAGCCTACTTTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAGCTCCCAAAGTTTCTACCCAAAAGATGCAGCTGTCTGGTTCAAAGCAAAAGATTGACCAGGATGAAGGATCAAAGAAAAGGAGGGTTGTCCGGGGACAAGGAGGAAAAGCCCAGAAGAAGGATACAGAATATGAGGTTGAAGATGAGATCGAAGATTTGTCGAGCTCTCAAGAGGACACTGATGAAGATTGGACAAGTTGA
BLAST of CSPI01G07540 vs. Swiss-Prot
Match: RHL1_ARATH (DNA-binding protein RHL1 OS=Arabidopsis thaliana GN=RHL1 PE=1 SV=1)

HSP 1 Score: 302.8 bits (774), Expect = 6.2e-81
Identity = 183/373 (49.06%), Postives = 236/373 (63.27%), Query Frame = 1

Query: 1   MARGSSSKKDEAKG--EINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHG 60
           M R SSSKK  +KG  + + E  +RKRLK LA  N +LS++ A+  + L PS  VLKHHG
Sbjct: 1   MVRASSSKKGGSKGGDKDDAESKQRKRLKTLALDNQLLSDSPAKSHSSLKPSKQVLKHHG 60

Query: 61  KDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYP 120
            DI++KSQRKNRFLFSF GLLAP+S   IG+L  LSTKNP+LYL+FPQGRMKLFGTI+YP
Sbjct: 61  TDIIRKSQRKNRFLFSFPGLLAPISAATIGDLDRLSTKNPVLYLNFPQGRMKLFGTILYP 120

Query: 121 KNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEY 180
           KNRYLTLQFSRGGKNV C+D FDNMIVFS++WWIGTK+ENPEEA LDFPK+L   +  E+
Sbjct: 121 KNRYLTLQFSRGGKNVLCDDYFDNMIVFSESWWIGTKEENPEEARLDFPKELAQAENTEF 180

Query: 181 DFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAEN----SFKGEHGDDLVGLEASV--T 240
           DF GGAG    + V  + S    S   +  +P  +N    S  GE  DD + +   V  T
Sbjct: 181 DFQGGAG--GAASVKKLASPEIGSQPTETDSPEVDNEDVLSEDGEFLDDKIQVTPPVQLT 240

Query: 241 NSIKTTPVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKT 300
             ++ TPVR S+R++ K FNFAE SSE  S  ++ + S+ +EK ++  EP     + E++
Sbjct: 241 PPVQVTPVRQSQRNSGKKFNFAETSSEASSGESEGNTSDEDEKPLL--EPESSTRSREES 300

Query: 301 EDISVESIDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVE 360
           +D +   I   A K+        +   SK+ K              LVQ TL +LFKK E
Sbjct: 301 QDGN--GITASASKLPEELPAKREKLKSKDSK--------------LVQATLSNLFKKAE 353

Query: 361 EKRTPRSSKRSSA 366
           EK    S  +SS+
Sbjct: 361 EKTAGTSKAKSSS 353

BLAST of CSPI01G07540 vs. TrEMBL
Match: A0A0A0LVZ6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G043140 PE=4 SV=1)

HSP 1 Score: 825.9 bits (2132), Expect = 2.3e-236
Identity = 430/430 (100.00%), Postives = 430/430 (100.00%), Query Frame = 1

Query: 1   MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD 60
           MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD
Sbjct: 1   MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD 60

Query: 61  IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN 120
           IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN
Sbjct: 61  IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN 120

Query: 121 RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF 180
           RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF
Sbjct: 121 RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF 180

Query: 181 NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV 240
           NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV
Sbjct: 181 NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV 240

Query: 241 RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI 300
           RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI
Sbjct: 241 RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI 300

Query: 301 DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS 360
           DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS
Sbjct: 301 DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS 360

Query: 361 KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS 420
           KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS
Sbjct: 361 KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS 420

Query: 421 QEDTDEDWTS 431
           QEDTDEDWTS
Sbjct: 421 QEDTDEDWTS 430

BLAST of CSPI01G07540 vs. TrEMBL
Match: A0A061EXP0_THECC (Root hair initiation protein root hairless 1, putative isoform 2 OS=Theobroma cacao GN=TCM_025225 PE=4 SV=1)

HSP 1 Score: 389.0 bits (998), Expect = 7.3e-105
Identity = 234/450 (52.00%), Postives = 300/450 (66.67%), Query Frame = 1

Query: 1   MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAY--LSPSATVLKHHG 60
           M R SSSKK        PE  ERKRLKKLA  N++LS+T A P++Y  LSPS  V+KHHG
Sbjct: 1   MVRTSSSKKPPIAE--TPEATERKRLKKLALKNNLLSDTPATPKSYVPLSPSKLVMKHHG 60

Query: 61  KDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYP 120
           KDI++KSQRKNRFLFSF GLLAP+SGGKIGELK+L +KNPILYLDFPQG+MKLFGTI+YP
Sbjct: 61  KDILRKSQRKNRFLFSFPGLLAPISGGKIGELKNLGSKNPILYLDFPQGQMKLFGTIVYP 120

Query: 121 KNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEY 180
           KNRYLTL FSRGGKNV CED FDNMIVFSDAWWIG KDENPEEA LDFPK+L  GQ  EY
Sbjct: 121 KNRYLTLLFSRGGKNVMCEDYFDNMIVFSDAWWIGKKDENPEEARLDFPKELCQGQQMEY 180

Query: 181 DFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTT 240
           DF GGAGV S +         KQ   R  I      S   E GD L   +  +T  ++ T
Sbjct: 181 DFKGGAGVESVN---------KQDTPRTEIKQVEIESLDNESGDALSDDDNDLTAKMEVT 240

Query: 241 PVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNI-----VIHEPSIGDHASEKTE 300
           P RHS R+A K F FAEASSED+   +DA+ S+GEEK +     +    +IG   S  + 
Sbjct: 241 PTRHSARNAGKRFKFAEASSEDDPVRSDAEPSDGEEKKVGKKLHLTENDTIGKTISSASL 300

Query: 301 DISVESIDEDAVKIKPPFLEGNQTSISKEKK-------SFRAKGSAQSDTRGLVQPTLLS 360
            +  ++ ++  +   P  ++ + TS+SK +K         ++K +++++   LVQPT+ +
Sbjct: 301 VLKSDAAEDSQI---PEQIQTSLTSVSKSRKISKSTVTVTKSKENSKANRGSLVQPTIST 360

Query: 361 LFKKVEEKRTPRSSKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQ---GGKAQK 420
           LFKKV EK+ PR S +SS+ KV  +K+Q +  K+KIDQ EGS K+  V  +   G   ++
Sbjct: 361 LFKKVGEKKGPRGSDKSSSTKVLGKKLQSNNYKRKIDQTEGSSKKGKVNEEKTTGTGIKR 420

Query: 421 KDTEYEVEDEIEDLSSSQED---TDEDWTS 431
           K  E E E++IE++SS+ ED   +DEDWT+
Sbjct: 421 KKKESEDEEDIEEISSTSEDANGSDEDWTA 436

BLAST of CSPI01G07540 vs. TrEMBL
Match: I1L3V3_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_09G164600 PE=4 SV=2)

HSP 1 Score: 387.5 bits (994), Expect = 2.1e-104
Identity = 236/452 (52.21%), Postives = 297/452 (65.71%), Query Frame = 1

Query: 1   MARGSSSKKDEAKGE--INPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHG 60
           MARG + KK+E +     NPE  ERKRLK LA SN ILSET AR   +L+PS+ V KHHG
Sbjct: 1   MARGKAKKKEEGEDTDVANPETLERKRLKSLAISNKILSETPARSSVHLNPSSVVAKHHG 60

Query: 61  KDIVKKSQRKN-RFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMY 120
           KDI+KKSQRK+ R+LFSF GL+AP++GGKIG+LKDL TKNP+LYLDFPQG+MKLFGTI+Y
Sbjct: 61  KDIIKKSQRKSCRYLFSFPGLIAPIAGGKIGDLKDLGTKNPVLYLDFPQGQMKLFGTIVY 120

Query: 121 PKNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGE 180
           PKNRYLTLQF +GGK+V CED FDNMIVFSDAWWIG KDENPEE+ L+FP +L  G   E
Sbjct: 121 PKNRYLTLQFPKGGKSVMCEDYFDNMIVFSDAWWIGRKDENPEESKLEFPNELYEGHQAE 180

Query: 181 YDFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKT 240
           YDF GGAG       AG  S   Q V R  I    + S K    DDL   E ++ ++ + 
Sbjct: 181 YDFKGGAG-------AGAASVVNQGVPRTNIQRVEQESPKTPTEDDLSDNEINLKDTKEL 240

Query: 241 TPVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISV 300
            PVRHS R+A+K + FAE SS D+S     +LS+ EEK +V  + ++ DH S K + +  
Sbjct: 241 VPVRHSTRTAKKSYKFAEISSGDDSGENSPELSDDEEK-VVEVDTAVNDHNSSKKKTVVF 300

Query: 301 ESIDEDAVKI-KPPFLEGNQTSISKEKK-----SFRAKGSAQSDTRG-LVQPTLLSLFKK 360
           +  DED   + +P  +     S SK K+     S  A    +S  RG LVQ T+ +LFKK
Sbjct: 301 DLDDEDDAPVDQPAKINTESASRSKSKEVSQSASASASTEVKSSNRGSLVQATISTLFKK 360

Query: 361 VEEKRTPRSSKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKR---RVVRGQGGKAQKKDTE 420
           VEEK TPRSS++S + K   QK Q +GSK+KID DEGSKKR      +  G K + K  E
Sbjct: 361 VEEKTTPRSSRKSPSSKAYGQKSQPAGSKRKIDLDEGSKKRARKTTDKDPGKKIKAKSKE 420

Query: 421 YEVE------DEIEDLSSSQED---TDEDWTS 431
            +VE      D+IE+ S++ ED   +DEDWT+
Sbjct: 421 DDVEDGDDDGDDIEEFSNASEDANESDEDWTA 444

BLAST of CSPI01G07540 vs. TrEMBL
Match: G7KKB7_MEDTR (DNA-binding protein RHL1, putative OS=Medicago truncatula GN=MTR_6g077860 PE=4 SV=2)

HSP 1 Score: 385.2 bits (988), Expect = 1.1e-103
Identity = 231/440 (52.50%), Postives = 291/440 (66.14%), Query Frame = 1

Query: 1   MARGSSSKK---DEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHH 60
           MAR  + KK   DE     NPE  ERKRLK LAFSN++LSET+AR   +L+PS+ V KHH
Sbjct: 1   MARPKTKKKTRSDEEADATNPETIERKRLKSLAFSNNVLSETKARSSIHLNPSSIVAKHH 60

Query: 61  GKDIVKKSQRKN-RFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIM 120
           GKDI+KKSQRK+ R+LFSF GL AP+ GGKIG+LKDL TKNPILYLDFPQGRMKLFGTI+
Sbjct: 61  GKDIIKKSQRKSSRYLFSFPGLFAPIGGGKIGDLKDLGTKNPILYLDFPQGRMKLFGTIL 120

Query: 121 YPKNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCG 180
           YPKNRYLTLQFS+GGK+V CED FDNMIVFSDAWWIGTKDENPEEA L+FPK+L  G+  
Sbjct: 121 YPKNRYLTLQFSKGGKSVMCEDYFDNMIVFSDAWWIGTKDENPEEAKLEFPKELYEGKQT 180

Query: 181 EYDFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIK 240
           E+DF GGAG  + +G A V +      + K   P +  +   E   DL   E  + ++ +
Sbjct: 181 EHDFKGGAGAGAGAGAASVVNHGVSKTKIKRPEPESPETPLEE---DLSDSEIELKDTTE 240

Query: 241 TTPVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDIS 300
             PVR S R+ +K + FAE SS D+S  +  DLSE EEK + + +    DH S K E   
Sbjct: 241 LVPVRQSARTVKKSYKFAEISSGDDSGKSSPDLSEHEEKAVEV-DTDANDHTSSKKETAV 300

Query: 301 VESIDE-DAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKR 360
           ++  DE DA K + P       S+SK KK              LVQ T+ SLFKKVE K+
Sbjct: 301 IDIDDEDDAPKDQLPVENKEPASVSKAKKGL------------LVQATISSLFKKVEVKK 360

Query: 361 TPRSSKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKR-RVVR----GQGGKAQKKDTEYEV 420
              + K+S + K S QK Q +GSK+KI+ DEG KKR R  +    G+  KA+ KD+E E 
Sbjct: 361 AAANPKKSPSSKASGQKSQPAGSKRKIELDEGPKKRARKTKDKNPGEKKKAKSKDSEVED 420

Query: 421 EDEIEDLSSSQEDTDEDWTS 431
           +D+IE+ S++ ED+DEDW +
Sbjct: 421 DDDIEEFSNASEDSDEDWAA 424

BLAST of CSPI01G07540 vs. TrEMBL
Match: A0A061EYH8_THECC (Root hair initiation protein root hairless 1, putative isoform 3 OS=Theobroma cacao GN=TCM_025225 PE=4 SV=1)

HSP 1 Score: 384.0 bits (985), Expect = 2.4e-103
Identity = 234/450 (52.00%), Postives = 299/450 (66.44%), Query Frame = 1

Query: 1   MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAY--LSPSATVLKHHG 60
           M R SSSKK        PE  ERKRLKKLA  N++LS+T A P++Y  LSPS  V+KHHG
Sbjct: 1   MVRTSSSKKPPIAE--TPEATERKRLKKLALKNNLLSDTPATPKSYVPLSPSKLVMKHHG 60

Query: 61  KDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYP 120
           KDI++KSQRKNRFLFSF GLLAP+SGGKIGELK+L +KNPILYLDFPQG+MKLFGTI+YP
Sbjct: 61  KDILRKSQRKNRFLFSFPGLLAPISGGKIGELKNLGSKNPILYLDFPQGQMKLFGTIVYP 120

Query: 121 KNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEY 180
           KNRYLTL FSRGGKNV CED FDNMIVFSDAWWIG KDENPEEA LDFPK+L  GQ  EY
Sbjct: 121 KNRYLTLLFSRGGKNVMCEDYFDNMIVFSDAWWIGKKDENPEEARLDFPKELCQGQQMEY 180

Query: 181 DFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTT 240
           DF GGAGV S +         KQ   R  I      S   E GD L   +  +T  ++ T
Sbjct: 181 DFKGGAGVESVN---------KQDTPRTEIKQVEIESLDNESGDALSDDDNDLTAKMEVT 240

Query: 241 PVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNI-----VIHEPSIGDHASEKTE 300
           P RHS R+A K F FAEASSED+   +DA+ S+GEEK +     +    +IG   S  + 
Sbjct: 241 PTRHSARNAGKRFKFAEASSEDDPVRSDAEPSDGEEKKVGKKLHLTENDTIGKTISSASL 300

Query: 301 DISVESIDEDAVKIKPPFLEGNQTSISKEKK-------SFRAKGSAQSDTRGLVQPTLLS 360
            +  ++ ++  +   P  ++ + TS+SK +K         ++K +++++   LVQPT+ +
Sbjct: 301 VLKSDAAEDSQI---PEQIQTSLTSVSKSRKISKSTVTVTKSKENSKANRGSLVQPTIST 360

Query: 361 LFKKVEEKRTPRSSKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQ---GGKAQK 420
           LFKKV EK  PR S +SS+ KV  +K+Q +  K+KIDQ EGS K+  V  +   G   ++
Sbjct: 361 LFKKVGEK-GPRGSDKSSSTKVLGKKLQSNNYKRKIDQTEGSSKKGKVNEEKTTGTGIKR 420

Query: 421 KDTEYEVEDEIEDLSSSQED---TDEDWTS 431
           K  E E E++IE++SS+ ED   +DEDWT+
Sbjct: 421 KKKESEDEEDIEEISSTSEDANGSDEDWTA 435

BLAST of CSPI01G07540 vs. TAIR10
Match: AT1G48380.2 (AT1G48380.2 root hair initiation protein root hairless 1 (RHL1))

HSP 1 Score: 248.1 bits (632), Expect = 1.0e-65
Identity = 120/172 (69.77%), Postives = 141/172 (81.98%), Query Frame = 1

Query: 1   MARGSSSKKDEAKG--EINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHG 60
           M R SSSKK  +KG  + + E  +RKRLK LA  N +LS++ A+  + L PS  VLKHHG
Sbjct: 1   MVRASSSKKGGSKGGDKDDAESKQRKRLKTLALDNQLLSDSPAKSHSSLKPSKQVLKHHG 60

Query: 61  KDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYP 120
            DI++KSQRKNRFLFSF GLLAP+S   IG+L  LSTKNP+LYL+FPQGRMKLFGTI+YP
Sbjct: 61  TDIIRKSQRKNRFLFSFPGLLAPISAATIGDLDRLSTKNPVLYLNFPQGRMKLFGTILYP 120

Query: 121 KNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDL 171
           KNRYLTLQFSRGGKNV C+D FDNMIVFS++WWIGTK+ENPEEA LDFPK+L
Sbjct: 121 KNRYLTLQFSRGGKNVLCDDYFDNMIVFSESWWIGTKEENPEEARLDFPKEL 172

BLAST of CSPI01G07540 vs. NCBI nr
Match: gi|449439513|ref|XP_004137530.1| (PREDICTED: DNA-binding protein RHL1 [Cucumis sativus])

HSP 1 Score: 825.9 bits (2132), Expect = 3.4e-236
Identity = 430/430 (100.00%), Postives = 430/430 (100.00%), Query Frame = 1

Query: 1   MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD 60
           MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD
Sbjct: 1   MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGKD 60

Query: 61  IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN 120
           IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN
Sbjct: 61  IVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPKN 120

Query: 121 RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF 180
           RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF
Sbjct: 121 RYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYDF 180

Query: 181 NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV 240
           NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV
Sbjct: 181 NGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTPV 240

Query: 241 RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI 300
           RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI
Sbjct: 241 RHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVESI 300

Query: 301 DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS 360
           DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS
Sbjct: 301 DEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRSS 360

Query: 361 KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS 420
           KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS
Sbjct: 361 KRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSSS 420

Query: 421 QEDTDEDWTS 431
           QEDTDEDWTS
Sbjct: 421 QEDTDEDWTS 430

BLAST of CSPI01G07540 vs. NCBI nr
Match: gi|659066963|ref|XP_008467323.1| (PREDICTED: DNA-binding protein RHL1 [Cucumis melo])

HSP 1 Score: 765.0 bits (1974), Expect = 7.0e-218
Identity = 399/431 (92.58%), Postives = 412/431 (95.59%), Query Frame = 1

Query: 1   MARGSSS-KKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
           MARGSSS KKDEAKGEINPEI ERKRLKKLAFSN+ILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1   MARGSSSSKKDEAKGEINPEIGERKRLKKLAFSNNILSETQAKPQAYLSPSATVLKHHGK 60

Query: 61  DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
           DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL+TKNP+LYLDFPQGRMKLFGTIMYPK
Sbjct: 61  DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLATKNPVLYLDFPQGRMKLFGTIMYPK 120

Query: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
           NRYLTLQFS+GGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPK+LT+GQCGEYD
Sbjct: 121 NRYLTLQFSKGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKELTLGQCGEYD 180

Query: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
           FNGGA         GVTSTSKQSVQ+KGINPA ENSFKGEHGDDLVGLEASVTNS+KT P
Sbjct: 181 FNGGA---------GVTSTSKQSVQKKGINPATENSFKGEHGDDLVGLEASVTNSVKTMP 240

Query: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
           VRHSERSARKVFNFAEASSEDES GTD DLSEGEEKNIVIHEPSIGDHASEKTEDISVES
Sbjct: 241 VRHSERSARKVFNFAEASSEDESTGTDTDLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300

Query: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
           IDEDAV+IKP FLEGNQTSISKEKK+ RAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS
Sbjct: 301 IDEDAVEIKPSFLEGNQTSISKEKKNSRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360

Query: 361 SKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQGGKAQKKDTEYEVEDEIEDLSS 420
           SKRSS PKVSTQKMQLSGSKQKIDQDEGSKKRR VRGQGGKAQ+KDTEYEVEDEIE+LSS
Sbjct: 361 SKRSSVPKVSTQKMQLSGSKQKIDQDEGSKKRRAVRGQGGKAQRKDTEYEVEDEIEELSS 420

Query: 421 SQEDTDEDWTS 431
           SQEDTDEDWTS
Sbjct: 421 SQEDTDEDWTS 422

BLAST of CSPI01G07540 vs. NCBI nr
Match: gi|590638295|ref|XP_007029353.1| (Root hair initiation protein root hairless 1, putative isoform 2 [Theobroma cacao])

HSP 1 Score: 389.0 bits (998), Expect = 1.0e-104
Identity = 234/450 (52.00%), Postives = 300/450 (66.67%), Query Frame = 1

Query: 1   MARGSSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAY--LSPSATVLKHHG 60
           M R SSSKK        PE  ERKRLKKLA  N++LS+T A P++Y  LSPS  V+KHHG
Sbjct: 1   MVRTSSSKKPPIAE--TPEATERKRLKKLALKNNLLSDTPATPKSYVPLSPSKLVMKHHG 60

Query: 61  KDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYP 120
           KDI++KSQRKNRFLFSF GLLAP+SGGKIGELK+L +KNPILYLDFPQG+MKLFGTI+YP
Sbjct: 61  KDILRKSQRKNRFLFSFPGLLAPISGGKIGELKNLGSKNPILYLDFPQGQMKLFGTIVYP 120

Query: 121 KNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEY 180
           KNRYLTL FSRGGKNV CED FDNMIVFSDAWWIG KDENPEEA LDFPK+L  GQ  EY
Sbjct: 121 KNRYLTLLFSRGGKNVMCEDYFDNMIVFSDAWWIGKKDENPEEARLDFPKELCQGQQMEY 180

Query: 181 DFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTT 240
           DF GGAGV S +         KQ   R  I      S   E GD L   +  +T  ++ T
Sbjct: 181 DFKGGAGVESVN---------KQDTPRTEIKQVEIESLDNESGDALSDDDNDLTAKMEVT 240

Query: 241 PVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNI-----VIHEPSIGDHASEKTE 300
           P RHS R+A K F FAEASSED+   +DA+ S+GEEK +     +    +IG   S  + 
Sbjct: 241 PTRHSARNAGKRFKFAEASSEDDPVRSDAEPSDGEEKKVGKKLHLTENDTIGKTISSASL 300

Query: 301 DISVESIDEDAVKIKPPFLEGNQTSISKEKK-------SFRAKGSAQSDTRGLVQPTLLS 360
            +  ++ ++  +   P  ++ + TS+SK +K         ++K +++++   LVQPT+ +
Sbjct: 301 VLKSDAAEDSQI---PEQIQTSLTSVSKSRKISKSTVTVTKSKENSKANRGSLVQPTIST 360

Query: 361 LFKKVEEKRTPRSSKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQ---GGKAQK 420
           LFKKV EK+ PR S +SS+ KV  +K+Q +  K+KIDQ EGS K+  V  +   G   ++
Sbjct: 361 LFKKVGEKKGPRGSDKSSSTKVLGKKLQSNNYKRKIDQTEGSSKKGKVNEEKTTGTGIKR 420

Query: 421 KDTEYEVEDEIEDLSSSQED---TDEDWTS 431
           K  E E E++IE++SS+ ED   +DEDWT+
Sbjct: 421 KKKESEDEEDIEEISSTSEDANGSDEDWTA 436

BLAST of CSPI01G07540 vs. NCBI nr
Match: gi|571477955|ref|XP_006587422.1| (PREDICTED: DNA-binding protein RHL1-like isoform X1 [Glycine max])

HSP 1 Score: 387.5 bits (994), Expect = 3.1e-104
Identity = 236/452 (52.21%), Postives = 297/452 (65.71%), Query Frame = 1

Query: 1   MARGSSSKKDEAKGE--INPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHG 60
           MARG + KK+E +     NPE  ERKRLK LA SN ILSET AR   +L+PS+ V KHHG
Sbjct: 1   MARGKAKKKEEGEDTDVANPETLERKRLKSLAISNKILSETPARSSVHLNPSSVVAKHHG 60

Query: 61  KDIVKKSQRKN-RFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMY 120
           KDI+KKSQRK+ R+LFSF GL+AP++GGKIG+LKDL TKNP+LYLDFPQG+MKLFGTI+Y
Sbjct: 61  KDIIKKSQRKSCRYLFSFPGLIAPIAGGKIGDLKDLGTKNPVLYLDFPQGQMKLFGTIVY 120

Query: 121 PKNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGE 180
           PKNRYLTLQF +GGK+V CED FDNMIVFSDAWWIG KDENPEE+ L+FP +L  G   E
Sbjct: 121 PKNRYLTLQFPKGGKSVMCEDYFDNMIVFSDAWWIGRKDENPEESKLEFPNELYEGHQAE 180

Query: 181 YDFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKT 240
           YDF GGAG       AG  S   Q V R  I    + S K    DDL   E ++ ++ + 
Sbjct: 181 YDFKGGAG-------AGAASVVNQGVPRTNIQRVEQESPKTPTEDDLSDNEINLKDTKEL 240

Query: 241 TPVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISV 300
            PVRHS R+A+K + FAE SS D+S     +LS+ EEK +V  + ++ DH S K + +  
Sbjct: 241 VPVRHSTRTAKKSYKFAEISSGDDSGENSPELSDDEEK-VVEVDTAVNDHNSSKKKTVVF 300

Query: 301 ESIDEDAVKI-KPPFLEGNQTSISKEKK-----SFRAKGSAQSDTRG-LVQPTLLSLFKK 360
           +  DED   + +P  +     S SK K+     S  A    +S  RG LVQ T+ +LFKK
Sbjct: 301 DLDDEDDAPVDQPAKINTESASRSKSKEVSQSASASASTEVKSSNRGSLVQATISTLFKK 360

Query: 361 VEEKRTPRSSKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKR---RVVRGQGGKAQKKDTE 420
           VEEK TPRSS++S + K   QK Q +GSK+KID DEGSKKR      +  G K + K  E
Sbjct: 361 VEEKTTPRSSRKSPSSKAYGQKSQPAGSKRKIDLDEGSKKRARKTTDKDPGKKIKAKSKE 420

Query: 421 YEVE------DEIEDLSSSQED---TDEDWTS 431
            +VE      D+IE+ S++ ED   +DEDWT+
Sbjct: 421 DDVEDGDDDGDDIEEFSNASEDANESDEDWTA 444

BLAST of CSPI01G07540 vs. NCBI nr
Match: gi|955377296|ref|XP_014624727.1| (PREDICTED: DNA-binding protein RHL1-like isoform X1 [Glycine max])

HSP 1 Score: 385.2 bits (988), Expect = 1.5e-103
Identity = 233/456 (51.10%), Postives = 294/456 (64.47%), Query Frame = 1

Query: 1   MARGSSSKKDEAKGE----INPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKH 60
           MARG  +KK E   E     NPE  ERKRLK LA S++ILSET AR    L+PS+ V KH
Sbjct: 1   MARGGKAKKKEEGEEEANLTNPETIERKRLKSLAISHNILSETPARSSVQLNPSSVVAKH 60

Query: 61  HGKDIVKKSQRKN-RFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTI 120
           HGKDI+KKSQRK+ R+LFSF GL+AP++GGKIG+LKDL TKNP+LYLDFPQG+MKLFGTI
Sbjct: 61  HGKDIIKKSQRKSSRYLFSFPGLIAPIAGGKIGDLKDLGTKNPVLYLDFPQGQMKLFGTI 120

Query: 121 MYPKNRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQC 180
           +YPKNRYLTLQF +GGK+V CED FDNMIVFSDAWWIG K+ENPEEA L+FPK+L  G  
Sbjct: 121 VYPKNRYLTLQFPKGGKSVMCEDYFDNMIVFSDAWWIGRKEENPEEAKLEFPKELYEGHQ 180

Query: 181 GEYDFNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSI 240
            EYDF GGAG       AG  S   Q V R  I    + S K    DDL   E ++ ++ 
Sbjct: 181 SEYDFKGGAG-------AGAASVVNQGVPRTKIQRVEQESPKTPTEDDLSDSEINLEDTK 240

Query: 241 KTTPVRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDI 300
           +  PVRHS R+A+K + FAE SS D+S     DLS+ EEK +   + ++ DH S K + +
Sbjct: 241 ELVPVRHSTRTAKKSYKFAEISSGDDSGENSPDLSDHEEK-VAEVDTAVNDHNSSKKKTV 300

Query: 301 SVESIDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTR-------GLVQPTLLSLF 360
             +  DED   +  P  + N+++   + K      SA + T         LVQ T+ +LF
Sbjct: 301 VFDLDDEDDAPVDQPAKKNNESASRSKCKEVSQSASASASTEVKSSNRGSLVQATISTLF 360

Query: 361 KKVEEKRTPRSSKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKR---RVVRGQGGKAQKKD 420
           KKVEEK+TPRSS++S + K S QK Q +GSK+K D DEGSKKR      +  G K + K 
Sbjct: 361 KKVEEKKTPRSSRKSPSSKPSGQKSQPAGSKRKTDLDEGSKKRARKTKDKDPGKKIKAKS 420

Query: 421 TEYEVE--------DEIEDLSSSQEDT---DEDWTS 431
            +  VE        D+IE+ S++ EDT   DEDWT+
Sbjct: 421 KKDNVEDDDDDDDGDDIEEFSNASEDTNESDEDWTA 448

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
RHL1_ARATH6.2e-8149.06DNA-binding protein RHL1 OS=Arabidopsis thaliana GN=RHL1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LVZ6_CUCSA2.3e-236100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_1G043140 PE=4 SV=1[more]
A0A061EXP0_THECC7.3e-10552.00Root hair initiation protein root hairless 1, putative isoform 2 OS=Theobroma ca... [more]
I1L3V3_SOYBN2.1e-10452.21Uncharacterized protein OS=Glycine max GN=GLYMA_09G164600 PE=4 SV=2[more]
G7KKB7_MEDTR1.1e-10352.50DNA-binding protein RHL1, putative OS=Medicago truncatula GN=MTR_6g077860 PE=4 S... [more]
A0A061EYH8_THECC2.4e-10352.00Root hair initiation protein root hairless 1, putative isoform 3 OS=Theobroma ca... [more]
Match NameE-valueIdentityDescription
AT1G48380.21.0e-6569.77 root hair initiation protein root hairless 1 (RHL1)[more]
Match NameE-valueIdentityDescription
gi|449439513|ref|XP_004137530.1|3.4e-236100.00PREDICTED: DNA-binding protein RHL1 [Cucumis sativus][more]
gi|659066963|ref|XP_008467323.1|7.0e-21892.58PREDICTED: DNA-binding protein RHL1 [Cucumis melo][more]
gi|590638295|ref|XP_007029353.1|1.0e-10452.00Root hair initiation protein root hairless 1, putative isoform 2 [Theobroma caca... [more]
gi|571477955|ref|XP_006587422.1|3.1e-10452.21PREDICTED: DNA-binding protein RHL1-like isoform X1 [Glycine max][more]
gi|955377296|ref|XP_014624727.1|1.5e-10351.10PREDICTED: DNA-binding protein RHL1-like isoform X1 [Glycine max][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0042023 DNA endoreduplication
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G07540.1CSPI01G07540.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR35698FAMILY NOT NAMEDcoord: 2..428
score: 1.2E