CSPI06G01170.1 (mRNA) Wild cucumber (PI 183967)

Name: CSPI06G01170.1
Type: mRNA
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: DNA binding protein
Location: Chr6 : 855861 .. 859053 (+)
Sequence length: 2259
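
The location and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (a common convention for genome annotation, not stated on this page). Under that assumption the feature spans 3193 bp on Chr6, and a 2259 nt sequence divides into 753 codons, which would be consistent with the 752 residues compared in the self-hit BLAST report plus one stop codon.

```python
import re


def parse_location(text):
    """Parse a string like 'Chr6 : 855861 .. 859053 (+)'.

    Returns (chromosome, start, end, strand). The field layout is taken
    from this record; other records may be formatted differently.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr6 : 855861 .. 859053 (+)")
genomic_span = span_length(start, end)   # bases covered on the chromosome

sequence_length = 2259                   # "Sequence length" field above
codons = sequence_length // 3            # complete codons in the sequence
residues = codons - 1                    # assumes the last codon is a stop

print(chrom, strand, genomic_span, codons, residues)
```

The difference between the genomic span and the sequence length would then be accounted for by introns and untranslated regions, which is why the gene sequence listed below is longer than the CDS.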
The following sequences are available for this feature:

Gene sequence (with intron)

TGTTTTTATTTTTTATTTTTATGAACAACCTTGAGATTGAACGTCTTCGTTTGAAAAAAATTGCGGTTTCATTTCTTGATTTTCTGGCAACGGAAATTTATTCAACCCGTGTTTTAAATTTCTGGAAGGTGTCTGTCTGTTGAGAGTGCAAAAATGAGTTCTGGGTTGCAACCAGTTCCTATAACGCCCCAAAAACATGACCCTGCGTGGAAGCACTGTCAAATGTTTAAGAATGGGGATAGAGTACAGCTTAAATGTCTATACTGCCACAAACTTTTTAAGGGTGGTGGGATTCATAGAATAAAAGAACATCTCGCTGGTCAAAAGGGTAATGCTTCTACTTGTCATAGTGTTCCTCCTGAGGTTCAGAATATAATGCAAGAGAGTTTAGATGGGGTAATGATGAAGAAGAGGAAAAGACAGAAGCTTGATGAAGAGATGACTAATGTGAATACTATGACTGGTGAGGTAGATGGAATTTCTAATCATATGGATATGGATTCTAGTATTCATTTGATTGAAGTTGCTGAGCCACTTGAGACCAATTCAGTTTTGTTACTAACTCATGAGGAAGGAACAAGTAATAAAGTGGGAAGGAAAAAGGGTAGTAAAGGTAAGAGTTCTTCTTGCCTGGAAAGAGAGATGATTGTTATTCCAAATGGTGGTGGTATATTAGATTCTAATAGGGATCGTAACCAAGTGCATATGGCAGTTGGGCGATTTTTGTATGACATTGGAGCATCTCTAGAAGCAGTAAACTCAGCCTATTTCCAACCTATGATAGAATCAATTGCTTTAGCAGGCACTGGGATCATACCACCTTCATACCATGATATTCGGGGTTGGATATTGAAGAACTCAATGGAAGAAGTGAGGAGCGATTTTGACAGATGCAAAGCCACATGGGGAATAACTGGTTGTTCTGTCATGGTTGATCAGTGGTGTACTGAAGCAGGTCGGACCATGCTCAACTTTTTGGTATATTGCCCTAAGGGAACAGTGTTTTTGGAATCTGTGGATGCATCTGGGATTATGGATTCCCCAGATTTGCTTTACGAATTACTCAAAAAGGTGGTTGAACAAGTAGGGGTGAAGCATGTAGTGCAGGTGATTACTAGGTTTGAAGAAAATTTTGCTATTGCTGGTAGAAAGCTTTCGGATACGTATCCAACCCTCTATTGGACCCCATGTGCAGCTAGTTGTGTGGATTTGATTCTCGGGGATATTGGAAACATAGAGGGTGTAAATACTGTAATTGAGCAGGCTCGATCAATTACAAGATTTGTCTATAACAATAGTATGGTTTTAAACATGGTCAGGAAATGTACATTTGGAAACGATATTGTAGAACCTTGTCTGACAAGATCTGCCACAAACTTCGCCACATTGAATCGGATGGTTGATCTGAAACGATGTCTGCAGAACATGGTTACTTCTCAAGAATGGATGGACAGTCCATATTCAAAGAGGCCAGGGGGACTGGAAATGTTGGATTTAATCAGCAGTGAATCATTTTGGTCCTCATGCAATTCAATTATTAGTTTGACAAACCCTCTCTTGAGAGTTTTAAGGATAGTAGGTAGTGGGAAGAGACCTGCGATGGGATACGTTTATGCAGCAATGTATAATGCTAAACTAGCAATTAAGACAGAACTTATTAACAGAGATCGTTACATGGTGTACTGGAACATCATAGATCAGAGATGGGAACACCATTGGCGTCATCCTCTTTATGCTGCTGGATTCTACCTGAACCCCAAGTACTTTTATAGCATTGAAGGAGATATGCATGGTGAAATCCTATCAGGGATGTTTGATTGCATTGAAAGACTGGTTTCTGATACAAATGTTCAAGATAAAATAATTAAAGAAATAACCTCGTACAAGAATGCTAGTGGAGATTTTGCAAGGAAGACGGCTATTAGAGCAAGAGGGACACTGCTTCCAGGTGAGGGTCTAATCTACTACTGATTTTTAGTTTCCATTATGCACCTTTGCAC
TTCATCAACTTGTTTATATGGACTGACTTGCGTTTCTGAAAAGTGCTTCGAACCTAGAACAAGGCAGTGGCAGATACATGGCCCATTGCACATTCTAAGATTGTTAATACAACTTGCATGCTAGGGACTTGTGAAGCATCTATGTTTAGATAATGTGTGTGAGTGTGTTTTAAATTATACTGACCATTAAATTTTCTTCTGCATGACTCTCATGTCCCACTCAAATAAACTTTTAATTTGGAGCATATTTGTTTCTTAGTTATCTTCGACGCATGTCTAGAGCAAGACTTTATCTTCACAAATATATTCTTTTTGGTTTCTTAGTTGAGTGATCTTTTTGGTTTCTTGTAGCTGAGTGGTGGTCAACATGTGGAGAAGGAGGCTGCCCAAATTTAACTCGCTTGGCCACTCGAATTCTGAGTCAGACCTGCTCCTCAGTGGGGTTCAAGCAAAATGATGCCCTTTTTGATAAGTTACATGACACTAGGAATCACATTGAACATCAACGTCTTAGTGACCTTGTATTTGTGCGCTCCAACTTGCAACTTAAACAAATGTAAGTCAAATTAAAAAGTCACTACACCATGAGTCTCATGTTTGAAAGTTGGGGATATTCCTGAAGTTTTTTCCAATTAATTTATTGGCAGGGCCACTAATGTCAACGAACATTATCCAACTGACCCTCTTTCCTTTGATGAGCTCGGTATTGTTGACGACTGGGTTTGGAAAAAGGATTTAAGTGCAGAGGATTGTGGGAATCTGGAATGGACAGTACTTGATAATCCTCCCTTCAGTCCCCCTATGCGTTTACCTCAGAGTGATGGCTATGATGACTTGGTTGCAGGTATTACTCTAGAATAGCTTTGTTGGTTACCTGACCGTGATACCACAATCCCTACTAAAGTTGTCTCTCAAATGTGCAGGGTTTGATGATTTGGAGGTTTTTAAAAGGCAAAGGGAGAGTGAAGATGACAATATTTCATAAGACGAAGCTAGCAATGCCTGCAAGCAAGTCACACAATTTCTATTGGTGGGTGGTCTTACTTGTGTCTATTTACCTTGTAGATGGTATGAATTTTTCAAGCTCTTCAAGAGCAGATTGAGTGTTGTATGCCTCTAGTAGAATAGATCACCAGGTCCGTGTATCTTATAATTAGGCAAATTAGTTTTGTAGATTACAAGGCTTTTAGGTC

mRNA sequence

ATGAGTTCTGGGTTGCAACCAGTTCCTATAACGCCCCAAAAACATGACCCTGCGTGGAAGCACTGTCAAATGTTTAAGAATGGGGATAGAGTACAGCTTAAATGTCTATACTGCCACAAACTTTTTAAGGGTGGTGGGATTCATAGAATAAAAGAACATCTCGCTGGTCAAAAGGGTAATGCTTCTACTTGTCATAGTGTTCCTCCTGAGGTTCAGAATATAATGCAAGAGAGTTTAGATGGGGTAATGATGAAGAAGAGGAAAAGACAGAAGCTTGATGAAGAGATGACTAATGTGAATACTATGACTGGTGAGGTAGATGGAATTTCTAATCATATGGATATGGATTCTAGTATTCATTTGATTGAAGTTGCTGAGCCACTTGAGACCAATTCAGTTTTGTTACTAACTCATGAGGAAGGAACAAGTAATAAAGTGGGAAGGAAAAAGGGTAGTAAAGGTAAGAGTTCTTCTTGCCTGGAAAGAGAGATGATTGTTATTCCAAATGGTGGTGGTATATTAGATTCTAATAGGGATCGTAACCAAGTGCATATGGCAGTTGGGCGATTTTTGTATGACATTGGAGCATCTCTAGAAGCAGTAAACTCAGCCTATTTCCAACCTATGATAGAATCAATTGCTTTAGCAGGCACTGGGATCATACCACCTTCATACCATGATATTCGGGGTTGGATATTGAAGAACTCAATGGAAGAAGTGAGGAGCGATTTTGACAGATGCAAAGCCACATGGGGAATAACTGGTTGTTCTGTCATGGTTGATCAGTGGTGTACTGAAGCAGGTCGGACCATGCTCAACTTTTTGGTATATTGCCCTAAGGGAACAGTGTTTTTGGAATCTGTGGATGCATCTGGGATTATGGATTCCCCAGATTTGCTTTACGAATTACTCAAAAAGGTGGTTGAACAAGTAGGGGTGAAGCATGTAGTGCAGGTGATTACTAGGTTTGAAGAAAATTTTGCTATTGCTGGTAGAAAGCTTTCGGATACGTATCCAACCCTCTATTGGACCCCATGTGCAGCTAGTTGTGTGGATTTGATTCTCGGGGATATTGGAAACATAGAGGGTGTAAATACTGTAATTGAGCAGGCTCGATCAATTACAAGATTTGTCTATAACAATAGTATGGTTTTAAACATGGTCAGGAAATGTACATTTGGAAACGATATTGTAGAACCTTGTCTGACAAGATCTGCCACAAACTTCGCCACATTGAATCGGATGGTTGATCTGAAACGATGTCTGCAGAACATGGTTACTTCTCAAGAATGGATGGACAGTCCATATTCAAAGAGGCCAGGGGGACTGGAAATGTTGGATTTAATCAGCAGTGAATCATTTTGGTCCTCATGCAATTCAATTATTAGTTTGACAAACCCTCTCTTGAGAGTTTTAAGGATAGTAGGTAGTGGGAAGAGACCTGCGATGGGATACGTTTATGCAGCAATGTATAATGCTAAACTAGCAATTAAGACAGAACTTATTAACAGAGATCGTTACATGGTGTACTGGAACATCATAGATCAGAGATGGGAACACCATTGGCGTCATCCTCTTTATGCTGCTGGATTCTACCTGAACCCCAAGTACTTTTATAGCATTGAAGGAGATATGCATGGTGAAATCCTATCAGGGATGTTTGATTGCATTGAAAGACTGGTTTCTGATACAAATGTTCAAGATAAAATAATTAAAGAAATAACCTCGTACAAGAATGCTAGTGGAGATTTTGCAAGGAAGACGGCTATTAGAGCAAGAGGGACACTGCTTCCAGCTGAGTGGTGGTCAACATGTGGAGAAGGAGGCTGCCCAAATTTAACTCGCTTGGCCACTCGAATTCTGAGTCAGACCTGCTCCTCAGTGGGGTTCAAGCAAAATGATGCCCTTTTTGATAAGTTACATGACACTAGGAATCACATTGAACATCAACGTCTTAGTGACCTTGTATTTGTGCGCTCCAACTTGCAACTTAAACAAAT
GGCCACTAATGTCAACGAACATTATCCAACTGACCCTCTTTCCTTTGATGAGCTCGGTATTGTTGACGACTGGGTTTGGAAAAAGGATTTAAGTGCAGAGGATTGTGGGAATCTGGAATGGACAGTACTTGATAATCCTCCCTTCAGTCCCCCTATGCGTTTACCTCAGAGTGATGGCTATGATGACTTGGTTGCAGGGTTTGATGATTTGGAGGTTTTTAAAAGGCAAAGGGAGAGTGAAGATGACAATATTTCATAA

Coding sequence (CDS)

ATGAGTTCTGGGTTGCAACCAGTTCCTATAACGCCCCAAAAACATGACCCTGCGTGGAAGCACTGTCAAATGTTTAAGAATGGGGATAGAGTACAGCTTAAATGTCTATACTGCCACAAACTTTTTAAGGGTGGTGGGATTCATAGAATAAAAGAACATCTCGCTGGTCAAAAGGGTAATGCTTCTACTTGTCATAGTGTTCCTCCTGAGGTTCAGAATATAATGCAAGAGAGTTTAGATGGGGTAATGATGAAGAAGAGGAAAAGACAGAAGCTTGATGAAGAGATGACTAATGTGAATACTATGACTGGTGAGGTAGATGGAATTTCTAATCATATGGATATGGATTCTAGTATTCATTTGATTGAAGTTGCTGAGCCACTTGAGACCAATTCAGTTTTGTTACTAACTCATGAGGAAGGAACAAGTAATAAAGTGGGAAGGAAAAAGGGTAGTAAAGGTAAGAGTTCTTCTTGCCTGGAAAGAGAGATGATTGTTATTCCAAATGGTGGTGGTATATTAGATTCTAATAGGGATCGTAACCAAGTGCATATGGCAGTTGGGCGATTTTTGTATGACATTGGAGCATCTCTAGAAGCAGTAAACTCAGCCTATTTCCAACCTATGATAGAATCAATTGCTTTAGCAGGCACTGGGATCATACCACCTTCATACCATGATATTCGGGGTTGGATATTGAAGAACTCAATGGAAGAAGTGAGGAGCGATTTTGACAGATGCAAAGCCACATGGGGAATAACTGGTTGTTCTGTCATGGTTGATCAGTGGTGTACTGAAGCAGGTCGGACCATGCTCAACTTTTTGGTATATTGCCCTAAGGGAACAGTGTTTTTGGAATCTGTGGATGCATCTGGGATTATGGATTCCCCAGATTTGCTTTACGAATTACTCAAAAAGGTGGTTGAACAAGTAGGGGTGAAGCATGTAGTGCAGGTGATTACTAGGTTTGAAGAAAATTTTGCTATTGCTGGTAGAAAGCTTTCGGATACGTATCCAACCCTCTATTGGACCCCATGTGCAGCTAGTTGTGTGGATTTGATTCTCGGGGATATTGGAAACATAGAGGGTGTAAATACTGTAATTGAGCAGGCTCGATCAATTACAAGATTTGTCTATAACAATAGTATGGTTTTAAACATGGTCAGGAAATGTACATTTGGAAACGATATTGTAGAACCTTGTCTGACAAGATCTGCCACAAACTTCGCCACATTGAATCGGATGGTTGATCTGAAACGATGTCTGCAGAACATGGTTACTTCTCAAGAATGGATGGACAGTCCATATTCAAAGAGGCCAGGGGGACTGGAAATGTTGGATTTAATCAGCAGTGAATCATTTTGGTCCTCATGCAATTCAATTATTAGTTTGACAAACCCTCTCTTGAGAGTTTTAAGGATAGTAGGTAGTGGGAAGAGACCTGCGATGGGATACGTTTATGCAGCAATGTATAATGCTAAACTAGCAATTAAGACAGAACTTATTAACAGAGATCGTTACATGGTGTACTGGAACATCATAGATCAGAGATGGGAACACCATTGGCGTCATCCTCTTTATGCTGCTGGATTCTACCTGAACCCCAAGTACTTTTATAGCATTGAAGGAGATATGCATGGTGAAATCCTATCAGGGATGTTTGATTGCATTGAAAGACTGGTTTCTGATACAAATGTTCAAGATAAAATAATTAAAGAAATAACCTCGTACAAGAATGCTAGTGGAGATTTTGCAAGGAAGACGGCTATTAGAGCAAGAGGGACACTGCTTCCAGCTGAGTGGTGGTCAACATGTGGAGAAGGAGGCTGCCCAAATTTAACTCGCTTGGCCACTCGAATTCTGAGTCAGACCTGCTCCTCAGTGGGGTTCAAGCAAAATGATGCCCTTTTTGATAAGTTACATGACACTAGGAATCACATTGAACATCAACGTCTTAGTGACCTTGTATTTGTGCGCTCCAACTTGCAACTTAAACAAAT
GGCCACTAATGTCAACGAACATTATCCAACTGACCCTCTTTCCTTTGATGAGCTCGGTATTGTTGACGACTGGGTTTGGAAAAAGGATTTAAGTGCAGAGGATTGTGGGAATCTGGAATGGACAGTACTTGATAATCCTCCCTTCAGTCCCCCTATGCGTTTACCTCAGAGTGATGGCTATGATGACTTGGTTGCAGGGTTTGATGATTTGGAGGTTTTTAAAAGGCAAAGGGAGAGTGAAGATGACAATATTTCATAA
BLAST of CSPI06G01170.1 vs. TrEMBL
Match: A0A0A0KD75_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G008670 PE=4 SV=1)

HSP 1 Score: 1550.8 bits (4014), Expect = 0.0e+00
Identity = 752/752 (100.00%), Positives = 752/752 (100.00%), Query Frame = 1

Query: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60
           MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN
Sbjct: 63  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 122

Query: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH 120
           ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH
Sbjct: 123 ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH 182

Query: 121 LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR 180
           LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR
Sbjct: 183 LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR 242

Query: 181 NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV 240
           NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV
Sbjct: 243 NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV 302

Query: 241 RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL 300
           RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL
Sbjct: 303 RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL 362

Query: 301 YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN 360
           YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN
Sbjct: 363 YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN 422

Query: 361 IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR 420
           IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR
Sbjct: 423 IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR 482

Query: 421 CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR 480
           CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR
Sbjct: 483 CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR 542

Query: 481 PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS 540
           PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS
Sbjct: 543 PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS 602

Query: 541 IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE 600
           IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Sbjct: 603 IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE 662

Query: 601 WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS 660
           WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS
Sbjct: 663 WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS 722

Query: 661 NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR 720
           NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR
Sbjct: 723 NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR 782

Query: 721 LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS 753
           LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS
Sbjct: 783 LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS 814

BLAST of CSPI06G01170.1 vs. TrEMBL
Match: E5GC76_CUCME (DNA binding protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 1514.6 bits (3920), Expect = 0.0e+00
Identity = 732/752 (97.34%), Positives = 739/752 (98.27%), Query Frame = 1

Query: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60
           MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH 120
           ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN MT EVD ISNHMDMDSSIH
Sbjct: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNAMTAEVDAISNHMDMDSSIH 120

Query: 121 LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR 180
           LIEVAEPL+TNS LLLTHEEGTSNKVGRKKGSKGKSSSCL+REMIVIPNGGGILDSNRDR
Sbjct: 121 LIEVAEPLDTNSALLLTHEEGTSNKVGRKKGSKGKSSSCLDREMIVIPNGGGILDSNRDR 180

Query: 181 NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV 240
           NQVHMA+GRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNS+EEV
Sbjct: 181 NQVHMAIGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSVEEV 240

Query: 241 RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL 300
           R DFDRCKATWG+TGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL
Sbjct: 241 RGDFDRCKATWGMTGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL 300

Query: 301 YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN 360
           YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLIL DIGN
Sbjct: 301 YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILADIGN 360

Query: 361 IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR 420
           IE VNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR
Sbjct: 361 IEDVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR 420

Query: 421 CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR 480
           CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSII LTNPLLRVLRIVGSGKR
Sbjct: 421 CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIIRLTNPLLRVLRIVGSGKR 480

Query: 481 PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS 540
           PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPL AAGFYLNPKYFYS
Sbjct: 481 PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLCAAGFYLNPKYFYS 540

Query: 541 IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE 600
           IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Sbjct: 541 IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE 600

Query: 601 WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS 660
           WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQN   FDKLHDTRNHIEHQRLSDLVFVRS
Sbjct: 601 WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQVFFDKLHDTRNHIEHQRLSDLVFVRS 660

Query: 661 NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR 720
           NLQLKQMATNVNEHYPTDPLSFD LGIVDDWVWKKDLSAEDCGNLEWTVL+NPPFSPPMR
Sbjct: 661 NLQLKQMATNVNEHYPTDPLSFDGLGIVDDWVWKKDLSAEDCGNLEWTVLENPPFSPPMR 720

Query: 721 LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS 753
           LPQ+DGYDDLVAGFDDLEVFKRQRESEDDNIS
Sbjct: 721 LPQNDGYDDLVAGFDDLEVFKRQRESEDDNIS 752

BLAST of CSPI06G01170.1 vs. TrEMBL
Match: A0A061FL79_THECC (HAT and BED zinc finger domain-containing protein, putative OS=Theobroma cacao GN=TCM_042727 PE=4 SV=1)

HSP 1 Score: 940.3 bits (2429), Expect = 1.5e-270
Identity = 456/754 (60.48%), Positives = 575/754 (76.26%), Query Frame = 1

Query: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60
           M+S L+P+PIT QKHDPAWKHCQMF+NG+RVQLKC+YC K+F+GGGIHRIKEHLAGQKGN
Sbjct: 1   MASNLEPIPITSQKHDPAWKHCQMFRNGERVQLKCIYCGKIFRGGGIHRIKEHLAGQKGN 60

Query: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH 120
           ASTC  VP +V+ +M+ESLDGV +KKRK+QK+ EEM+N N ++ E+D   N +D ++ + 
Sbjct: 61  ASTCFHVPSDVRLLMRESLDGVEVKKRKKQKIAEEMSNANQVSSEIDTYDNQVDTNTGLL 120

Query: 121 LIEVAEPLETNSVLLLTHEEGTSNKVG-RKKGSKGKSSSCLEREMIVIPNGGGILDSNRD 180
           +IE  + L+ +S LL+ + EGTSN  G R+K  KGKSS+     ++V   G   L + R 
Sbjct: 121 MIEGPDTLQPSSSLLV-NREGTSNVSGDRRKRGKGKSSAAESNALVVNTVG---LGAKRV 180

Query: 181 RNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEE 240
            N VH+A+GRFL+DIGA L+AVNS YFQPM+++I   G+G++ PS  D++GWILK S+EE
Sbjct: 181 NNHVHVAIGRFLFDIGAPLDAVNSVYFQPMVDAIISGGSGVLMPSCSDLQGWILKKSVEE 240

Query: 241 VRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDL 300
           V+SD D+  A W  TGCS++V+QW T+ GR +LNFLVYCP+GTVFL+SVDAS +++S D 
Sbjct: 241 VKSDNDKVTAAWVRTGCSILVNQWNTQTGRILLNFLVYCPEGTVFLKSVDASSVINSSDA 300

Query: 301 LYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIG 360
           LYELLK+VVE+VG KHV+QVIT  EE + +AGR+L++T+PTLYWTPCAA C++LIL D  
Sbjct: 301 LYELLKQVVEEVGSKHVLQVITNAEEQYIVAGRRLAETFPTLYWTPCAAHCINLILEDFA 360

Query: 361 NIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLK 420
            +E +N +IEQARSITRFVYN+S+VLNMVR+ T GNDIVEP +T SATNF TL +M+DLK
Sbjct: 361 KLEWINVIIEQARSITRFVYNHSVVLNMVRRYTLGNDIVEPAVTCSATNFTTLKQMIDLK 420

Query: 421 RCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGK 480
             LQ MVTSQEWMD PYSK+PGGLEMLDL+S+ SFWSS   I  LTNPLLRVLR+VGS K
Sbjct: 421 NNLQAMVTSQEWMDCPYSKKPGGLEMLDLVSNPSFWSSSVLITQLTNPLLRVLRMVGSKK 480

Query: 481 RPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFY 540
           RPAMGYVYA MY AK  IK EL+ R+ YM+YWNIID  WE  W HPL+ AGFYLNPK+FY
Sbjct: 481 RPAMGYVYAGMYRAKETIKKELVKRNEYMIYWNIIDHWWEQQWHHPLHGAGFYLNPKFFY 540

Query: 541 SIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPA 600
           S+EGDM  E+LSGM DCIE+LV D  VQDKI KEI SYKN  GDF RK A+RAR TLLPA
Sbjct: 541 SMEGDMPNEMLSGMLDCIEKLVPDVKVQDKISKEINSYKNTVGDFGRKMAVRARDTLLPA 600

Query: 601 EWWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVR 660
           EWWST G G CPNL RLA  +LSQTCS++G KQN   F+KLH+TRN +E QR  DL+FV+
Sbjct: 601 EWWSTYG-GSCPNLARLAIHVLSQTCSTLGLKQNSIPFEKLHETRNFLEQQRFRDLIFVQ 660

Query: 661 SNLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPM 720
            NLQL+Q+     E     P+SFD    ++DWV   D   E+  + +WT LD    +  +
Sbjct: 661 CNLQLRQIGCESKEQVSMQPMSFD--ATIEDWVMGNDAFLENYTHSDWTALDPLSVNTML 720

Query: 721 RLPQSDGYDDLVAGFDDLEVFK--RQRESEDDNI 752
             P SD  ++L AGFDD E+F   +++E+ +DN+
Sbjct: 721 LGPSSDEVEELGAGFDDYEIFNGVKEQENAEDNV 747

BLAST of CSPI06G01170.1 vs. TrEMBL
Match: A0A067LKQ6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01424 PE=4 SV=1)

HSP 1 Score: 930.6 bits (2404), Expect = 1.2e-267
Identity = 460/755 (60.93%), Positives = 582/755 (77.09%), Query Frame = 1

Query: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60
           M + L+P+PIT QKHDPAWKHCQMFKNG+RVQLKC+YC K+FKGGGIHRIKEHLAGQKGN
Sbjct: 1   MDANLEPIPITSQKHDPAWKHCQMFKNGERVQLKCIYCSKIFKGGGIHRIKEHLAGQKGN 60

Query: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN-TMTGEVDGI-SNHMDMDSS 120
           ASTC  VPP+V+ +MQ+SLDGV++KKRK+QK+ EE+T++N  +  E++G  +NH++ ++ 
Sbjct: 61  ASTCLRVPPDVRLMMQQSLDGVVVKKRKKQKIVEEITDLNPVVVNEIEGFGNNHIEANNG 120

Query: 121 IHLIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIV-IPNGGGILDSN 180
           + LI V+  +E +S LL+  EE T +K G +K  +G+S   +  E      N      + 
Sbjct: 121 MDLIGVSNVIEPSSSLLVVQEERTISKGGERK-KRGRSKGSVANESAAGTMNNRLASGAK 180

Query: 181 RDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSM 240
           R +   HMA+GRFLYDIGA L+AVNS YF PM+ +IA  G+    PSYHD+RGWILKNS+
Sbjct: 181 RSKEHAHMAIGRFLYDIGAPLDAVNSVYFLPMVNAIASGGSEDGMPSYHDLRGWILKNSV 240

Query: 241 EEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSP 300
           EEV++D D+  ATW  TGCS++VDQW T  GRT+L+FLVYCP+G VFL+SVDAS I++S 
Sbjct: 241 EEVKTDMDKYMATWARTGCSILVDQWTTSIGRTLLSFLVYCPEGVVFLKSVDASDIINSS 300

Query: 301 DLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGD 360
           D LYELLK+VVE+VG +HV+QVITR E+ + ++G++LS+T+PTLYW PCAA CVDLIL D
Sbjct: 301 DALYELLKQVVEEVGFRHVLQVITRMEDQYIVSGKRLSNTFPTLYWAPCAAHCVDLILED 360

Query: 361 IGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVD 420
              +E +NTVIEQARSITRFVYN+S+VLNM+R+ T GNDIVEP LT SA NFATL RMV+
Sbjct: 361 FSKLEWINTVIEQARSITRFVYNHSVVLNMMRRYTRGNDIVEPGLTSSAANFATLKRMVE 420

Query: 421 LKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGS 480
           LK  L+ MV SQEW+D PYSK+PGGLEMLDL+S++SFWSSC+ I  LT P LR+L IV  
Sbjct: 421 LKHALEVMVFSQEWVDCPYSKKPGGLEMLDLVSNQSFWSSCDLIAHLTYPFLRLLIIVSC 480

Query: 481 GKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKY 540
            KRPAMGYVY  MY AK AIK +LI R+ YMVYWNIID+ WE     PL+AAGF+LNPK+
Sbjct: 481 HKRPAMGYVYVGMYRAKEAIKKKLIKREDYMVYWNIIDRWWEKQSNLPLHAAGFFLNPKF 540

Query: 541 FYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLL 600
           FYSIEGD+H +ILSGM DCIERLV D ++QDKI KEI SYK+A+GDF RK A+RAR TLL
Sbjct: 541 FYSIEGDIHNDILSGMIDCIERLVPDADIQDKITKEIHSYKSAAGDFGRKMAVRARDTLL 600

Query: 601 PAEWWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVF 660
           PAEWWST G G CPNL RLA RILSQTCSS+ +KQN    +++HDTRN +E QRLSDLVF
Sbjct: 601 PAEWWSTYG-GSCPNLVRLAIRILSQTCSSIVYKQNQIPVEQIHDTRNCLERQRLSDLVF 660

Query: 661 VRSNLQLKQMATNVN-EHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFS 720
           V+ NLQLKQM    N E    DP+S D +  +++W+ +KD S+ED  NL+W  LD P  S
Sbjct: 661 VQYNLQLKQMTGVKNKEQDSIDPISVDSISTLENWIREKDSSSEDYANLDWMALDPP--S 720

Query: 721 PPMRLPQSDGYDDLVAGFDDLEVFKRQRESEDDNI 752
              RL   D  ++L +GFDD E+FKR ++++++N+
Sbjct: 721 SNTRL--HDEVEELGSGFDDYEIFKRIKDTKEENV 749

BLAST of CSPI06G01170.1 vs. TrEMBL
Match: B9SDY5_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_1480090 PE=4 SV=1)

HSP 1 Score: 929.9 bits (2402), Expect = 2.0e-267
Identity = 464/753 (61.62%), Positives = 574/753 (76.23%), Query Frame = 1

Query: 2   SSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGNA 61
           S  L+P+PIT QKHDPAWKHCQMFKNG+RVQLKC+YC K+FKGGGIHRIKEHLAGQKGNA
Sbjct: 3   SDDLEPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKGNA 62

Query: 62  STCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTG--EVDGISN-HMDMDSS 121
           STC  VP +V+ IMQ+SLDGV++KKRK+QK+ EE+TN+N + G  E++  +N  +++ + 
Sbjct: 63  STCLQVPTDVKLIMQQSLDGVVVKKRKKQKIAEEITNLNPVIGGGEIEVFANDQIEVSTG 122

Query: 122 IHLIEVAEPLETNSVLLLTHEEGTSNKVG-RKKGSKGKSSSCLEREMIVIPNGGGILDSN 181
           + LI V+  +E +S LL++ +EG +NK G R+K  + K S      ++ + +    L + 
Sbjct: 123 MELIGVSNVIEPSSSLLISGQEGKANKGGERRKRGRSKGSGANANAIVSMNSNRMALGAK 182

Query: 182 RDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSM 241
           R  + VHMA+GRFLYDIGA L+AVNS YFQPM+++IA  G  +  PS HD+RGWILKNS+
Sbjct: 183 RVNDHVHMAIGRFLYDIGAPLDAVNSVYFQPMVDAIASGGLDVGMPSCHDLRGWILKNSV 242

Query: 242 EEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSP 301
           EEV+++ D+  ATW  TGCSV+VDQW T  GRT+L+FLVYC +G VFL+SVDAS I++S 
Sbjct: 243 EEVKTEVDKHMATWARTGCSVLVDQWNTLMGRTLLSFLVYCSEGVVFLKSVDASDIINSS 302

Query: 302 DLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGD 361
           D LYEL+KKVVE+VGV+HV+QVIT  EE + + GR+L+DT+PTLY  PCAA C+DLIL D
Sbjct: 303 DALYELIKKVVEEVGVRHVLQVITSMEEQYIVVGRRLTDTFPTLYRAPCAAHCIDLILED 362

Query: 362 IGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVD 421
              +E ++TVI QARSITRFVYN+S+VLNMV++ TFG++IV   LT  ATNF TL RMVD
Sbjct: 363 FAKLEWISTVILQARSITRFVYNHSVVLNMVKRYTFGSEIVATGLTHFATNFETLKRMVD 422

Query: 422 LKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGS 481
           LK  LQ MVTSQEWMD PYSK+P GLEMLDL+S++SFWSSC  I +LTNPLLR+LRIV S
Sbjct: 423 LKHTLQTMVTSQEWMDCPYSKKPRGLEMLDLLSNQSFWSSCVLITNLTNPLLRLLRIVSS 482

Query: 482 GKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKY 541
            KRP MGYVYA +Y AK AIK EL+ R  YMVYWNIID  WE     PL+AAGF+LNPK 
Sbjct: 483 KKRPPMGYVYAGIYRAKEAIKKELVKRKDYMVYWNIIDHWWEQQSNLPLHAAGFFLNPKV 542

Query: 542 FYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLL 601
            YSIEGD+H EILSGMFDCIE+LV D  VQDKI KEI SYKNASGDF RK A+RAR TLL
Sbjct: 543 LYSIEGDLHNEILSGMFDCIEKLVPDVTVQDKITKEINSYKNASGDFGRKMAVRARETLL 602

Query: 602 PAEWWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVF 661
           PAEWWST G G CPNL RLA R+LSQ CSS G+K N    +++HDT+N +E QRLSDLVF
Sbjct: 603 PAEWWSTYG-GSCPNLARLAIRVLSQPCSSFGYKLNHISLEQIHDTKNCLERQRLSDLVF 662

Query: 662 VRSNLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSP 721
           V+ NL+LKQM     E    DPLSFD + I++DW+ +KD+S ED  N +W  LD P  S 
Sbjct: 663 VQYNLRLKQMVGKSEEQDSVDPLSFDCISILEDWIKEKDISTEDYANSDWMALDPP--SV 722

Query: 722 PMRLPQSDGYDDLVAGFDDLEVFKRQRESEDDN 751
             R P  D  D+L AGF D E+F R +++EDDN
Sbjct: 723 NTRQPH-DEVDELGAGFHDYEIFNRVKDTEDDN 751

BLAST of CSPI06G01170.1 vs. TAIR10
Match: AT3G22220.1 (AT3G22220.1 hAT transposon superfamily)

HSP 1 Score: 708.8 bits (1828), Expect = 3.7e-204
Identity = 364/758 (48.02%), Positives = 503/758 (66.36%), Query Frame = 1

Query: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60
           M S L+PV +TPQK D AWKHC+++K GDRVQ++CLYC K+FKGGGI R+KEHLAG+KG 
Sbjct: 1   MDSDLEPVALTPQKQDSAWKHCEVYKYGDRVQMRCLYCRKMFKGGGITRVKEHLAGKKGQ 60

Query: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMT--------NVNTMTGEVDGISNH 120
            + C  VP EV+  +Q+ +DG + ++RKR+K   E           V T       ++N 
Sbjct: 61  GTICDQVPDEVRLFLQQCIDGTVRRQRKRRKSSPEPLPIAYFPPCEVETQVAASSDVNNG 120

Query: 121 MDMDSSIHLIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREM-IVIPNGG 180
               SS    +V     T      T+    +N   R   +  +    ++  + + I +  
Sbjct: 121 FKSPSS----DVVVGQSTGRTKQRTYRSRKNNAFERNDLANVEVDRDMDNLIPVAISSVK 180

Query: 181 GILD-SNRDRNQ-VHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIR 240
            I+  ++++R + VHMA+GRFL+DIGA  +A NS   QP I++I   G G+  P++ D+R
Sbjct: 181 NIVHPTSKEREKTVHMAMGRFLFDIGADFDAANSVNVQPFIDAIVSGGFGVSIPTHEDLR 240

Query: 241 GWILKNSMEEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVD 300
           GWILK+ +EEV+ + D CK  W  TGCSV+V +  +  G  +L FLVYCP+  VFL+SVD
Sbjct: 241 GWILKSCVEEVKKEIDECKTLWKRTGCSVLVQELNSNEGPLILKFLVYCPEKVVFLKSVD 300

Query: 301 ASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAAS 360
           AS I+DS D LYELLK+VVE++G  +VVQVIT+ E+++A AG+KL D YP+LYW PCAA 
Sbjct: 301 ASEILDSEDKLYELLKEVVEEIGDTNVVQVITKCEDHYAAAGKKLMDVYPSLYWVPCAAH 360

Query: 361 CVDLILGDIGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNF 420
           C+D +L + G ++ +  +IEQAR++TR +YN+S VLN++RK TFGNDIV+P  T SATNF
Sbjct: 361 CIDKMLEEFGKMDWIREIIEQARTVTRIIYNHSGVLNLMRKFTFGNDIVQPVCTSSATNF 420

Query: 421 ATLNRMVDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLL 480
            T+ R+ DLK  LQ MVTS EW D  YSK  GGL M + I+ E FW +      +T P+L
Sbjct: 421 TTMGRIADLKPYLQAMVTSSEWNDCSYSKEAGGLAMTETINDEDFWKALTLANHITAPIL 480

Query: 481 RVLRIVGSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAA 540
           RVLRIV S ++PAMGYVYAAMY AK AIKT L +R+ Y+VYW IID+ W    + PLYAA
Sbjct: 481 RVLRIVCSERKPAMGYVYAAMYRAKEAIKTNLAHREEYIVYWKIIDRWW---LQQPLYAA 540

Query: 541 GFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTA 600
           GFYLNPK+FYSI+ +M  EI   + DCIE+LV D N+QD +IK+I SYKNA G F R  A
Sbjct: 541 GFYLNPKFFYSIDEEMRSEIHLAVVDCIEKLVPDVNIQDIVIKDINSYKNAVGIFGRNLA 600

Query: 601 IRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC-SSVGFKQNDALFDKLHDTRNHIE 660
           IRAR T+LPAEWWST GE  C NL+R A RILSQTC SS+G  +N     ++++++N IE
Sbjct: 601 IRARDTMLPAEWWSTYGES-CLNLSRFAIRILSQTCSSSIGSVRNLTSISQIYESKNSIE 660

Query: 661 HQRLSDLVFVRSNLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWT 720
            QRL+DLVFV+ N++L+++ +  +     DPLS   + +++DWV +  +  E  G+ +W 
Sbjct: 661 RQRLNDLVFVQYNMRLRRIGSESSGDDTVDPLSHSNMEVLEDWVSRNQVCIEGNGSSDWK 720

Query: 721 VLDNPPFSPPMRLPQSDGYDDLVAGFDDLEVFKRQRES 747
            L+    S  + +   D  +DL +GFDD E+FK ++E+
Sbjct: 721 SLEFIKRSEEVAV-VIDETEDLGSGFDDAEIFKGEKEA 749

BLAST of CSPI06G01170.1 vs. TAIR10
Match: AT4G15020.1 (AT4G15020.1 hAT transposon superfamily)

HSP 1 Score: 689.1 bits (1777), Expect = 3.0e-198
Identity = 358/761 (47.04%), Positives = 505/761 (66.36%), Query Frame = 1

Query: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60
           M + L+PV +TPQK D AWKHC+++K GDR+Q++CLYC K+FKGGGI R+KEHLAG+KG 
Sbjct: 1   MDAELEPVALTPQKQDNAWKHCEIYKYGDRLQMRCLYCRKMFKGGGITRVKEHLAGKKGQ 60

Query: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTG-EVDGISNHMDMDSSI 120
            + C  VP +V+  +Q+ +DG + ++RKR K   E  +V ++   E D +    D++   
Sbjct: 61  GTICDQVPEDVRLFLQQCIDGTVRRQRKRHKSSSEPLSVASLPPIEGDMMVVQPDVNDGF 120

Query: 121 HLIEVAEPLETNSVLLLTHEEGT---SNKVGRKKGSKGKSSSCLEREM-----IVIPNGG 180
                ++ +  N  LL    +     S K   + GS   +   + R+M     + I +  
Sbjct: 121 KSPGSSDVVVQNESLLSGRTKQRTYRSKKNAFENGSASNNVDLIGRDMDNLIPVAISSVK 180

Query: 181 GILD-SNRDR-NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIR 240
            I+  S RDR N +HMA+GRFL+ IGA  +AVNS  FQPMI++IA  G G+  P++ D+R
Sbjct: 181 NIVHPSFRDRENTIHMAIGRFLFGIGADFDAVNSVNFQPMIDAIASGGFGVSAPTHDDLR 240

Query: 241 GWILKNSMEEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVD 300
           GWILKN +EE+  + D CKA W  TGCS++V++  ++ G  +LNFLVYCP+  VFL+SVD
Sbjct: 241 GWILKNCVEEMAKEIDECKAMWKRTGCSILVEELNSDKGFKVLNFLVYCPEKVVFLKSVD 300

Query: 301 ASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAAS 360
           AS ++ S D L+ELL ++VE+VG  +VVQVIT+ ++ +  AG++L   YP+LYW PCAA 
Sbjct: 301 ASEVLSSADKLFELLSELVEEVGSTNVVQVITKCDDYYVDAGKRLMLVYPSLYWVPCAAH 360

Query: 361 CVDLILGDIGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNF 420
           C+D +L + G +  ++  IEQA++ITRFVYN+S VLN++ K T GNDI+ P  + SATNF
Sbjct: 361 CIDQMLEEFGKLGWISETIEQAQAITRFVYNHSGVLNLMWKFTSGNDILLPAFSSSATNF 420

Query: 421 ATLNRMVDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLL 480
           ATL R+ +LK  LQ MVTS EW +  YS+ P GL +++ ++ E+FW +   +  LT+PLL
Sbjct: 421 ATLGRIAELKSNLQAMVTSAEWNECSYSEEPSGL-VMNALTDEAFWKAVALVNHLTSPLL 480

Query: 481 RVLRIVGSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAA 540
           R LRIV S KRPAMGYVYAA+Y AK AIKT L+NR+ Y++YW IID+ WE     PL AA
Sbjct: 481 RALRIVCSEKRPAMGYVYAALYRAKDAIKTHLVNREDYIIYWKIIDRWWEQQQHIPLLAA 540

Query: 541 GFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTA 600
           GF+LNPK FY+   ++  E++  + DCIERLV D  +QDKIIKE+TSYK A G F R  A
Sbjct: 541 GFFLNPKLFYNTNEEIRSELILSVLDCIERLVPDDKIQDKIIKELTSYKTAGGVFGRNLA 600

Query: 601 IRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC-SSVGFKQNDALFDKLHDTRNHIE 660
           IRAR T+LPAEWWST GE  C NL+R A RILSQTC SSV  ++N    + ++ ++N IE
Sbjct: 601 IRARDTMLPAEWWSTYGE-SCLNLSRFAIRILSQTCSSSVSCRRNQIPVEHIYQSKNSIE 660

Query: 661 HQRLSDLVFVRSNLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWT 720
            +RLSDLVFV+ N++L+Q+     +    DPLS + + ++ +WV       E  G+ +W 
Sbjct: 661 QKRLSDLVFVQYNMRLRQLGPGSGDD-TLDPLSHNRIDVLKEWVSGDQACVEGNGSADWK 720

Query: 721 VLDNPPFSPPMRLPQSDGYDDLVAGFDDLEVFKRQRESEDD 750
            L++         P  D  +DL +GFDD+E+FK ++E  D+
Sbjct: 721 SLES--IHRNQVAPIIDDTEDLGSGFDDIEIFKVEKEVRDE 756

BLAST of CSPI06G01170.1 vs. TAIR10
Match: AT3G17450.1 (AT3G17450.1 hAT dimerisation domain-containing protein)

HSP 1 Score: 354.0 bits (907), Expect = 2.3e-97
Identity = 219/722 (30.33%), Positives = 362/722 (50.14%), Query Frame = 1

Query: 16  DPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIM 75
           DP W+H  + ++  + ++KC YC+K+  GG I+R K+HLA   G  + C + P EV   +
Sbjct: 133 DPGWEH-GIAQDERKKKVKCNYCNKIVSGG-INRFKQHLARIPGEVAPCKTAPEEVYVKI 192

Query: 76  QESLDGVMMKKRKRQKLDE-------------------------EMTNVNTMTGEVDGIS 135
           +E++      KR+ +  DE                           +    M G      
Sbjct: 193 KENMKWHRAGKRQNRPDDEMGALTFRTVSQDPDQEEDREDHDFYPTSQDRLMLGNGRFSK 252

Query: 136 NHMDMDSSIHLIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNG 195
           +      S ++  V+E  +T    ++  +  +S+K       +   SSC  R +      
Sbjct: 253 DKRKSFDSTNMRSVSEA-KTKRARMIPFQSPSSSK------QRKLYSSCSNRVV------ 312

Query: 196 GGILDSNRDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRG 255
                    R  V  ++ +FL+ +G   EA NS YFQ MIE I + G G + PS     G
Sbjct: 313 --------SRKDVTSSISKFLHHVGVPTEAANSLYFQKMIELIGMYGEGFVVPSSQLFSG 372

Query: 256 WILKNSMEEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDA 315
            +L+  M  ++S     +++W +TGCS+M D W    G+ M++FLV CP+G  F  S+DA
Sbjct: 373 RLLQEEMSTIKSYLREYRSSWVVTGCSIMADTWTNTEGKKMISFLVSCPRGVYFHSSIDA 432

Query: 316 SGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASC 375
           + I++    L++ L K+V+ +G ++VVQVIT+    F  AG+ L +    LYWTPCA  C
Sbjct: 433 TDIVEDALSLFKCLDKLVDDIGEENVVQVITQNTAIFRSAGKLLEEKRKNLYWTPCAIHC 492

Query: 376 VDLILGDIGNIEGVNTVIEQARSITRFVYNNSMVLNMVR-KCTFGNDIVEPCLTRSATNF 435
            +L+L D   +E V+  +E+A+ ITRF+YN + +LN+++ + T G D++ P + R A+ F
Sbjct: 493 TELVLEDFSKLEFVSECLEKAQRITRFIYNQTWLLNLMKNEFTQGLDLLRPAVMRHASGF 552

Query: 436 ATLNRMVDLKRCLQNMVTSQEW-MDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPL 495
            TL  ++D K  L+ +  S  W +    +K   G E+  ++ S  FW     ++   +P+
Sbjct: 553 TTLQSLMDHKASLRGLFQSDGWILSQTAAKSEEGREVEKMVLSAVFWKKVQYVLKSVDPV 612

Query: 496 LRVLRIVG-SGKRPAMGYVYAAMYNAKLAIKTELINRD---RYMVYWNIIDQRWEHHWRH 555
           ++V+ ++   G R +M Y Y  M  AK+AIK+  I+ D   +Y  +W +I+ RW   + H
Sbjct: 613 MQVIHMINDGGDRLSMPYAYGYMCCAKMAIKS--IHSDDARKYGPFWRVIEYRWNPLFHH 672

Query: 556 PLYAAGFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDF 615
           PLY A ++ NP Y Y  +     E++ G+ +CI RL  D   +   + +I  Y  A  DF
Sbjct: 673 PLYVAAYFFNPAYKYRPDFMAQSEVVRGVNECIVRLEPDNTRRITALMQIPDYTCAKADF 732

Query: 616 ARKTAIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLH-DT 675
               AI  R  L P+ WW   G   C  L R+A RILS TCSSVG +   +++D+++   
Sbjct: 733 GTDIAIGTRTELDPSAWWQQHGI-SCLELQRVAVRILSHTCSSVGCEPKWSVYDQVNSQC 792

Query: 676 RNHIEHQRLSDLVFVRSNLQLKQMATNVNEHYPTDPLSFDELGIVD----DWVWKKDLSA 702
           ++    +   DL +V  NL+L++       HY  +P       ++D    DW+   +   
Sbjct: 793 QSQFGKKSTKDLTYVHYNLRLREKQLKQRLHYEDEPPPTLNHALLDRLLPDWLVTSEKEE 828

BLAST of CSPI06G01170.1 vs. TAIR10
Match: AT1G79740.1 (AT1G79740.1 hAT transposon superfamily)

HSP 1 Score: 282.0 bits (720), Expect = 1.1e-75
Identity = 158/515 (30.68%), Positives = 281/515 (54.56%), Query Frame = 1

Query: 186 AVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEVRSDFD 245
           ++  F ++         S  +  M++++A  G G + PS      W+     + V+SD  
Sbjct: 111 SISLFFFENKIDFAVARSPSYHHMLDAVAKCGPGFVAPSPKT--EWL-----DRVKSDIS 170

Query: 246 -RCKAT---WGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLLY 305
            + K T   W  TGC+++ + W     R ++NF V  P    F +SVDAS    +   L 
Sbjct: 171 LQLKDTEKEWVTTGCTIIAEAWTDNKSRALINFSVSSPSRIFFHKSVDASSYFKNSKCLA 230

Query: 306 ELLKKVVEQVGVKHVVQVITRFEENFAIAG--RKLSDTYPTLYWTPCAASCVDLILGDIG 365
           +L   V++ +G +H+VQ+I   + +F   G    L   Y T++ +PCA+ C+++IL +  
Sbjct: 231 DLFDSVIQDIGQEHIVQII--MDNSFCYTGISNHLLQNYATIFVSPCASQCLNIILEEFS 290

Query: 366 NIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLK 425
            ++ VN  I QA+ I++FVYNNS VL+++RK T G DI+   +TRS +NF +L  M+  K
Sbjct: 291 KVDWVNQCISQAQVISKFVYNNSPVLDLLRKLTGGQDIIRSGVTRSVSNFLSLQSMMKQK 350

Query: 426 RCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGK 485
             L++M    E+  +  + +P  +  ++++    FW +    ++++ P+L+VLR V +GK
Sbjct: 351 ARLKHMFNCPEYTTN--TNKPQSISCVNILEDNDFWRAVEESVAISEPILKVLREVSTGK 410

Query: 486 RPAMGYVYAAMYNAKLAIKT-ELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYF 545
            PA+G +Y  M  AK +I+T  +++ +++ V+ +I+D  W  H   PL+AA  +LNP   
Sbjct: 411 -PAVGSIYELMSKAKESIRTYYIMDENKHKVFSDIVDTNWCEHLHSPLHAAAAFLNPSIQ 470

Query: 546 YSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLP 605
           Y+ E      +    F  +E+L+  ++++  I  +I ++  A G F    A+ AR ++ P
Sbjct: 471 YNPEIKFLTSLKEDFFKVLEKLLPTSDLRRDITNQIFTFTRAKGMFGCNLAMEARDSVSP 530

Query: 606 AEWWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLH-DTRNHIEHQRLSDLVF 665
             WW   G+   P L R+A RILSQ CS    ++  + F ++H + RN I+ + L+ L +
Sbjct: 531 GLWWEQFGD-SAPVLQRVAIRILSQVCSGYNLERQWSTFQQMHWERRNKIDREILNKLAY 590

Query: 666 VRSNLQLKQMATNVNEHYPTDPLSFDELGIVDDWV 693
           V  NL+L +M T       TDP++ +++ ++ +WV
Sbjct: 591 VNQNLKLGRMIT-----LETDPIALEDIDMMSEWV 607

BLAST of CSPI06G01170.1 vs. TAIR10
Match: AT3G13030.1 (AT3G13030.1 hAT transposon superfamily protein)

HSP 1 Score: 245.0 bits (624), Expect = 1.5e-64
Identity = 147/483 (30.43%), Positives = 244/483 (50.52%), Query Frame = 1

Query: 197 SLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEVRSDFDRCKATWGITGC 256
           +L AV++  F+ M+ ++     G+     HD+ GW L++++EEV+   ++ K +W ITGC
Sbjct: 63  NLSAVDAPCFKEMM-TVDGGQMGLESSDCHDLNGWRLQDALEEVQDRVEKIKESWAITGC 122

Query: 257 SVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLLYELLKKVVEQVGVKHV 316
           S+++D W  + GR ++ F+  CP G V+L S D S   D    L  L+  +VE+VGV++V
Sbjct: 123 SILLDAWVDQKGRDLVTFVADCPAGLVYLISFDVSDFKDDVTALLSLVNGLVEEVGVRNV 182

Query: 317 VQVITRFEENF-AIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTVIEQARSIT 376
            Q+I      +    G   +     ++W+   + C +L+L  I  I     + ++  +I 
Sbjct: 183 TQIIACSTSGWVGELGELFAGHDREVFWSVSVSHCFELMLVKISKIRSFGDIFDKVNNIW 242

Query: 377 RFVYNNSMVLNMVRKCTFGNDI-VEPCLTRSATNFATLNRMVDLKRCLQNMVTSQEWMDS 436
            F+ NN  VLN+ R    G DI V        T +  L  +   K+ L  M  S  W + 
Sbjct: 243 LFINNNPSVLNIFRDQCHGIDITVSSSEFEFVTPYLILESIFKAKKNLTAMFASSNWNNE 302

Query: 437 PYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKRPAMGYVYAAMYNAK 496
                   + + +L+S  SFW +  S++  T+PL+  L +  +     +GYVY  M + K
Sbjct: 303 QC------IAISNLVSDSSFWETVESVLKCTSPLIHGLLLFSTANNQHLGYVYDTMDSIK 362

Query: 497 LAIKTELINRDR-YMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYSIEGDMHGEILSGM 556
            +I  E  ++ + Y   W++ID  W  H  +PL+AAG++LNP  FYS    +  E+++G+
Sbjct: 363 ESIAREFNHKPQFYKPLWDVIDDVWNKHLHNPLHAAGYFLNPTAFYSTNFHLDIEVVTGL 422

Query: 557 FDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEWWSTCGEGGCPNL 616
              +  +V D +VQ KI  +I  Y+     F   +       + PAEWW+       P L
Sbjct: 423 ISSLIHMVEDCHVQFKISTQIDMYRLGKDCFNEASQADQITGISPAEWWAH-KASQYPEL 482

Query: 617 TRLATRILSQTCSSVG-FKQNDALFDK--LHDTRNHIEHQRLSDLVFVRSNLQLKQMATN 674
             LA +ILSQTC     +K   +L +K  L +  ++ E Q L +LVFV+ NL L+     
Sbjct: 483 QSLAIKILSQTCEGASKYKLKRSLAEKLLLSEGMSNRERQHLDELVFVQYNLHLQSYKAK 537

BLAST of CSPI06G01170.1 vs. NCBI nr
Match: gi|778709347|ref|XP_004138492.2| (PREDICTED: uncharacterized protein LOC101220029 [Cucumis sativus])

HSP 1 Score: 1550.8 bits (4014), Expect = 0.0e+00
Identity = 752/752 (100.00%), Positives = 752/752 (100.00%), Query Frame = 1

Query: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60
           MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH 120
           ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH
Sbjct: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH 120

Query: 121 LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR 180
           LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR
Sbjct: 121 LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR 180

Query: 181 NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV 240
           NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV
Sbjct: 181 NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV 240

Query: 241 RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL 300
           RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL
Sbjct: 241 RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL 300

Query: 301 YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN 360
           YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN
Sbjct: 301 YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN 360

Query: 361 IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR 420
           IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR
Sbjct: 361 IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR 420

Query: 421 CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR 480
           CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR
Sbjct: 421 CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR 480

Query: 481 PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS 540
           PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS
Sbjct: 481 PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS 540

Query: 541 IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE 600
           IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Sbjct: 541 IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE 600

Query: 601 WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS 660
           WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS
Sbjct: 601 WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS 660

Query: 661 NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR 720
           NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR
Sbjct: 661 NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR 720

Query: 721 LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS 753
           LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS
Sbjct: 721 LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS 752

BLAST of CSPI06G01170.1 vs. NCBI nr
Match: gi|700190532|gb|KGN45736.1| (hypothetical protein Csa_6G008670 [Cucumis sativus])

HSP 1 Score: 1550.8 bits (4014), Expect = 0.0e+00
Identity = 752/752 (100.00%), Positives = 752/752 (100.00%), Query Frame = 1

Query: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60
           MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN
Sbjct: 63  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 122

Query: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH 120
           ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH
Sbjct: 123 ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH 182

Query: 121 LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR 180
           LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR
Sbjct: 183 LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR 242

Query: 181 NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV 240
           NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV
Sbjct: 243 NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV 302

Query: 241 RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL 300
           RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL
Sbjct: 303 RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL 362

Query: 301 YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN 360
           YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN
Sbjct: 363 YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN 422

Query: 361 IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR 420
           IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR
Sbjct: 423 IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR 482

Query: 421 CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR 480
           CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR
Sbjct: 483 CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR 542

Query: 481 PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS 540
           PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS
Sbjct: 543 PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS 602

Query: 541 IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE 600
           IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Sbjct: 603 IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE 662

Query: 601 WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS 660
           WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS
Sbjct: 663 WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS 722

Query: 661 NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR 720
           NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR
Sbjct: 723 NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR 782

Query: 721 LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS 753
           LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS
Sbjct: 783 LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS 814

BLAST of CSPI06G01170.1 vs. NCBI nr
Match: gi|659081544|ref|XP_008441385.1| (PREDICTED: uncharacterized protein LOC103485517 [Cucumis melo])

HSP 1 Score: 1514.6 bits (3920), Expect = 0.0e+00
Identity = 732/752 (97.34%), Positives = 739/752 (98.27%), Query Frame = 1

Query: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60
           MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSSIH 120
           ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN MT EVD ISNHMDMDSSIH
Sbjct: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNAMTAEVDAISNHMDMDSSIH 120

Query: 121 LIEVAEPLETNSVLLLTHEEGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNRDR 180
           LIEVAEPL+TNS LLLTHEEGTSNKVGRKKGSKGKSSSCL+REMIVIPNGGGILDSNRDR
Sbjct: 121 LIEVAEPLDTNSALLLTHEEGTSNKVGRKKGSKGKSSSCLDREMIVIPNGGGILDSNRDR 180

Query: 181 NQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEV 240
           NQVHMA+GRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNS+EEV
Sbjct: 181 NQVHMAIGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNSVEEV 240

Query: 241 RSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL 300
           R DFDRCKATWG+TGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL
Sbjct: 241 RGDFDRCKATWGMTGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLL 300

Query: 301 YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGN 360
           YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLIL DIGN
Sbjct: 301 YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILADIGN 360

Query: 361 IEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR 420
           IE VNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR
Sbjct: 361 IEDVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKR 420

Query: 421 CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKR 480
           CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSII LTNPLLRVLRIVGSGKR
Sbjct: 421 CLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIIRLTNPLLRVLRIVGSGKR 480

Query: 481 PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYS 540
           PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPL AAGFYLNPKYFYS
Sbjct: 481 PAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLCAAGFYLNPKYFYS 540

Query: 541 IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE 600
           IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Sbjct: 541 IEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE 600

Query: 601 WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRS 660
           WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQN   FDKLHDTRNHIEHQRLSDLVFVRS
Sbjct: 601 WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQVFFDKLHDTRNHIEHQRLSDLVFVRS 660

Query: 661 NLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMR 720
           NLQLKQMATNVNEHYPTDPLSFD LGIVDDWVWKKDLSAEDCGNLEWTVL+NPPFSPPMR
Sbjct: 661 NLQLKQMATNVNEHYPTDPLSFDGLGIVDDWVWKKDLSAEDCGNLEWTVLENPPFSPPMR 720

Query: 721 LPQSDGYDDLVAGFDDLEVFKRQRESEDDNIS 753
           LPQ+DGYDDLVAGFDDLEVFKRQRESEDDNIS
Sbjct: 721 LPQNDGYDDLVAGFDDLEVFKRQRESEDDNIS 752

BLAST of CSPI06G01170.1 vs. NCBI nr
Match: gi|657944000|ref|XP_008371908.1| (PREDICTED: uncharacterized protein LOC103435298 [Malus domestica])

HSP 1 Score: 989.6 bits (2557), Expect = 3.1e-285
Identity = 491/756 (64.95%), Positives = 597/756 (78.97%), Query Frame = 1

Query: 1   MSSGL--QPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQK 60
           M+SGL  +PVPIT QKHDPAWKHCQMFK G+RVQLKC+YC+KLFKGGGIHRIKEHLAGQK
Sbjct: 1   MASGLVMEPVPITSQKHDPAWKHCQMFKIGERVQLKCIYCNKLFKGGGIHRIKEHLAGQK 60

Query: 61  GNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMT-GEVDGISNHMDMDS 120
           GNASTC  VPP+V+  MQ+SLDGV++KKR RQKLDEE+TN+N    GE + I+   D+ +
Sbjct: 61  GNASTCLRVPPDVRAQMQQSLDGVVVKKRNRQKLDEEITNINPSPHGEGELIAVQNDVSN 120

Query: 121 SIHLIEVAEPLETNSVLLLTHEEGTSN--KVGRKKGSKGKSSSCLEREMIVIPNGGGILD 180
            + LI V EPLE     LL ++EG ++   + R+K  +GKSS C     +V+ N    L 
Sbjct: 121 GVQLIGVPEPLEHKG--LLGNQEGMTSGRSLERRKRGRGKSS-CAGHSALVVSNSVA-LG 180

Query: 181 SNRDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKN 240
             +  N VH A+GRFLYDIGA  +AVNSAYFQPMI++IA  G+G++PP+YHDIR WILKN
Sbjct: 181 PPKVNNFVHEAIGRFLYDIGAPPDAVNSAYFQPMIDAIASGGSGVVPPTYHDIRSWILKN 240

Query: 241 SMEEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMD 300
           S+EEVR++ D+ + TWG TGCSV+VDQW TE+G+ +L+FLVYCP+GTVF ESVDAS I++
Sbjct: 241 SVEEVRNNIDKHRETWGRTGCSVLVDQWNTESGKVLLSFLVYCPEGTVFWESVDASDIIN 300

Query: 301 SPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLIL 360
           S D LYELL++VVE+VGVK V+QVIT  EE   +AGR+L+DT+PTLYWTPCAA C+DL+L
Sbjct: 301 SSDALYELLRRVVEEVGVKDVLQVITSGEEQCMVAGRRLTDTFPTLYWTPCAARCLDLML 360

Query: 361 GDIGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRM 420
            D GNIE +NTVIEQARSIT+FVYN+S+VLNMVR+ TFGNDIVEP  TR +TNF TL R+
Sbjct: 361 EDFGNIEWINTVIEQARSITKFVYNHSVVLNMVRRSTFGNDIVEPGATRFSTNFTTLKRL 420

Query: 421 VDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIV 480
           VDLK CLQ MVTSQEWMDS YSK PGGLEMLDLISS+SFWSSC  I+ LTNPLLRVLR+V
Sbjct: 421 VDLKHCLQVMVTSQEWMDSLYSKEPGGLEMLDLISSQSFWSSCILIVGLTNPLLRVLRMV 480

Query: 481 GSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNP 540
           GS KRPAMGYVYA MY AK  IK EL+ R+ YM+YWNIIDQRWE  WR PL+AAGFYLNP
Sbjct: 481 GSEKRPAMGYVYAGMYRAKETIKKELVKREEYMIYWNIIDQRWEQQWRSPLHAAGFYLNP 540

Query: 541 KYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGT 600
           K FYS EGDMHG+ILS MFDCIERLV DT VQDKIIKE+  YK+A+GDF RK AIRA+ T
Sbjct: 541 KIFYSFEGDMHGDILSHMFDCIERLVPDTKVQDKIIKELNLYKSAAGDFRRKMAIRAKDT 600

Query: 601 LLPAEWWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDL 660
           LLPAEWWST G GGCPNLTRLA RILSQTCSS+G ++N+  F++ H+TRN +E QRLSDL
Sbjct: 601 LLPAEWWSTYG-GGCPNLTRLAIRILSQTCSSIGCRRNEIPFERAHNTRNCLERQRLSDL 660

Query: 661 VFVRSNLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPF 720
           VFV+ NL+LKQM    +E    DP+SF+ + + +DWV  KD+  +D G+ +W  LD+   
Sbjct: 661 VFVQYNLRLKQMVDKNSEQDVMDPISFENISMTEDWVTGKDMCLDDNGSFDWMELDSTSA 720

Query: 721 SPPMRLPQSDGYDDLVAGFDDLEVFKRQRESEDDNI 752
           S  +  P +D  DDL +GF D E+F R +  E++N+
Sbjct: 721 STMLLGPSNDDADDLGSGFYDYEIFSRAKHGEEENV 751

BLAST of CSPI06G01170.1 vs. NCBI nr
Match: gi|694320495|ref|XP_009351432.1| (PREDICTED: uncharacterized protein LOC103942959 [Pyrus x bretschneideri])

HSP 1 Score: 983.0 bits (2540), Expect = 2.9e-283
Identity = 486/755 (64.37%), Positives = 593/755 (78.54%), Query Frame = 1

Query: 1   MSSGL--QPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQK 60
           M+SGL  +PVPIT QKHDPAWKHCQMFK G+RVQLKC+YC+KLFKGGGIHRIKEHLAGQK
Sbjct: 1   MASGLVMEPVPITSQKHDPAWKHCQMFKIGERVQLKCIYCNKLFKGGGIHRIKEHLAGQK 60

Query: 61  GNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDGISNHMDMDSS 120
           GNASTC  VP +V+  MQ+SLDGV++KKR RQKLDEE+TN+N   GE + I+   D+ + 
Sbjct: 61  GNASTCLRVPQDVRAQMQQSLDGVVVKKRNRQKLDEEITNINPSPGEGELIAVQNDVSNG 120

Query: 121 IHLIEVAEPLETNSVLLLTHEEGTSN--KVGRKKGSKGKSSSCLEREMIVIPNGGGILDS 180
           + LI V E LE     LL ++EG ++   + R+K  +GKSS C     +V+ N    L  
Sbjct: 121 VQLIGVPETLEHKG--LLGNQEGMTSGRSLERRKRGRGKSS-CAGHSALVVSNSVA-LGP 180

Query: 181 NRDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILKNS 240
            +  N VH A+GRFLYDIGA  +AVNSAYFQPMI++IA  G+G++PP+YHDIR WILKNS
Sbjct: 181 PKVNNFVHEAIGRFLYDIGAPPDAVNSAYFQPMIDAIASGGSGVVPPTYHDIRSWILKNS 240

Query: 241 MEEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDS 300
           +EEVR++ D+ + TWG TGCSV+VDQW TE+G+ +L+FLVYCP+GTVF ESVDAS +++S
Sbjct: 241 VEEVRNNIDKHRETWGRTGCSVLVDQWNTESGKVLLSFLVYCPEGTVFWESVDASDVINS 300

Query: 301 PDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILG 360
            D LYELL++VVE+VGVK V+QVIT  EE   +AGR+L+DT+P LYWTPCAA C+DL+L 
Sbjct: 301 SDALYELLRRVVEEVGVKDVLQVITSGEEQCMVAGRRLTDTFPXLYWTPCAAQCLDLMLE 360

Query: 361 DIGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMV 420
           D GNIE +NTVIEQARSIT+FVYN+S+VLNMVR+ TFGNDIVEP  TR +TNF TL R+V
Sbjct: 361 DFGNIEWINTVIEQARSITKFVYNHSVVLNMVRRSTFGNDIVEPGATRFSTNFTTLKRLV 420

Query: 421 DLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVG 480
           DLK CLQ MVTSQEWMDS YSK PGGLEMLDLIS++SFWSSC  I+ LTNPLLRVLR+VG
Sbjct: 421 DLKHCLQVMVTSQEWMDSLYSKEPGGLEMLDLISNQSFWSSCILIVGLTNPLLRVLRMVG 480

Query: 481 SGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPK 540
           S KRPAMGYVYA M  AK  IK EL+ R+ YM+YWNIIDQRWE HW  PL+AAGFYLNPK
Sbjct: 481 SEKRPAMGYVYAGMCRAKETIKKELVKREEYMIYWNIIDQRWEQHWCSPLHAAGFYLNPK 540

Query: 541 YFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTL 600
            FYS EGDMHG+ILS MFDCIERLV DT VQDKIIKE+  YK+A+GDF RK AIRA+ TL
Sbjct: 541 IFYSFEGDMHGDILSHMFDCIERLVPDTKVQDKIIKELNLYKSAAGDFKRKMAIRAKDTL 600

Query: 601 LPAEWWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSDLV 660
           LPAEWWST G GGCPNLTRLA RILSQTCSS+G ++N+  F+K H+TRN +E QRLSDLV
Sbjct: 601 LPAEWWSTYG-GGCPNLTRLAIRILSQTCSSIGCRRNEIPFEKAHNTRNCLERQRLSDLV 660

Query: 661 FVRSNLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFS 720
           FV+ NL+LKQM    +E    DP+SF+ + + +DWV  KD+  +D G+ +W  LD+   S
Sbjct: 661 FVQYNLRLKQMVDKNSEQDVMDPISFENISVTEDWVTGKDMCLDDNGSFDWMELDSTSAS 720

Query: 721 PPMRLPQSDGYDDLVAGFDDLEVFKRQRESEDDNI 752
             +  P +D  DDL +GF D E+F R +  E++N+
Sbjct: 721 TVLLGPSNDDADDLGSGFYDYEIFSRAKHGEEENV 750

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
A0A0A0KD75_CUCSA  0.0e+00   100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_6G008670 PE=4 SV=1
E5GC76_CUCME      0.0e+00   97.34         DNA binding protein OS=Cucumis melo subsp. melo PE=4 SV=1
A0A061FL79_THECC  1.5e-270  60.48         HAT and BED zinc finger domain-containing protein, putative OS=Theobroma cacao G...
A0A067LKQ6_JATCU  1.2e-267  60.93         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01424 PE=4 SV=1
B9SDY5_RICCO      2.0e-267  61.62         DNA binding protein, putative OS=Ricinus communis GN=RCOM_1480090 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT3G22220.1  3.7e-204  48.02         hAT transposon superfamily
AT4G15020.1  3.0e-198  47.04         hAT transposon superfamily
AT3G17450.1  2.3e-97   30.33         hAT dimerisation domain-containing protein
AT1G79740.1  1.1e-75   30.68         hAT transposon superfamily
AT3G13030.1  1.5e-64   30.43         hAT transposon superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|778709347|ref|XP_004138492.2|  0.0e+00   100.00        PREDICTED: uncharacterized protein LOC101220029 [Cucumis sativus]
gi|700190532|gb|KGN45736.1|       0.0e+00   100.00        hypothetical protein Csa_6G008670 [Cucumis sativus]
gi|659081544|ref|XP_008441385.1|  0.0e+00   97.34         PREDICTED: uncharacterized protein LOC103485517 [Cucumis melo]
gi|657944000|ref|XP_008371908.1|  3.1e-285  64.95         PREDICTED: uncharacterized protein LOC103435298 [Malus domestica]
gi|694320495|ref|XP_009351432.1|  2.9e-283  64.37         PREDICTED: uncharacterized protein LOC103942959 [Pyrus x bretschneideri]

The following terms have been associated with this mRNA:

Vocabulary: INTERPRO
Term       Definition
IPR003656  Znf_BED
IPR007021  DUF659
IPR008906  HATC_C_dom
IPR012337  RNaseH-like_sf

Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO:0046983  protein dimerization activity
GO:0003676  nucleic acid binding

GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0046983      protein dimerization activity

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name   Type
CSPI06G01170  CSPI06G01170  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name    Unique Name             Type
CSPI06G01170.1  CSPI06G01170.1-protein  polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
CSPI06G01170.1.utr5p1  CSPI06G01170.1.utr5p1  five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name         Unique Name          Type
CSPI06G01170.1.cds1  CSPI06G01170.1.cds1  CDS
CSPI06G01170.1.cds2  CSPI06G01170.1.cds2  CDS
CSPI06G01170.1.cds3  CSPI06G01170.1.cds3  CDS
CSPI06G01170.1.cds4  CSPI06G01170.1.cds4  CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
CSPI06G01170.1.utr3p1  CSPI06G01170.1.utr3p1  three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                      Source   Source Term     Source Description                                  Alignment
IPR003656  Zinc finger, BED-type                PFAM     PF02892         zf-BED                                              coord: 16..65; score: 1.
IPR003656  Zinc finger, BED-type                PROFILE  PS50808         ZF_BED                                              coord: 13..71; score: 10
IPR007021  Domain of unknown function DUF659    PFAM     PF04937         DUF659                                              coord: 223..374; score: 3.5
IPR008906  HAT, C-terminal dimerisation domain  PFAM     PF05699         Dimer_Tnp_hAT                                       coord: 582..662; score: 8.
IPR012337  Ribonuclease H-like domain           unknown  SSF53098        Ribonuclease H-like                                 coord: 255..664; score: 3.03
None       No IPR available                     PANTHER  PTHR32166       FAMILY NOT NAMED                                    coord: 1..749; score:
None       No IPR available                     PANTHER  PTHR32166:SF27  HAT DIMERIZATION DOMAIN-CONTAINING PROTEIN-RELATED  coord: 1..749; score:
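
The `coord: start..end` values in the alignment column can be turned into domain lengths and pairwise overlaps. The sketch below assumes the coordinates are 1-based, inclusive residue positions on the protein, which this record does not state explicitly; `parse_coord`, `span_length`, and `overlap` are hypothetical helper names.

```python
import re

# Assumption: "coord: A..B" gives 1-based, inclusive protein positions.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a string containing 'coord: start..end'."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span):
    """Number of residues covered by an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def overlap(a, b):
    """Number of residues shared by two inclusive spans (0 if disjoint)."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


if __name__ == "__main__":
    # Coordinates copied from the InterPro table in this record.
    domains = {
        "zf-BED (PF02892)": parse_coord("coord: 16..65"),
        "DUF659 (PF04937)": parse_coord("coord: 223..374"),
        "Dimer_Tnp_hAT (PF05699)": parse_coord("coord: 582..662"),
        "Ribonuclease H-like (SSF53098)": parse_coord("coord: 255..664"),
    }
    for name, span in domains.items():
        print(name, span, span_length(span))
    print(overlap(domains["DUF659 (PF04937)"],
                  domains["Ribonuclease H-like (SSF53098)"]))
```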