CmoCh01G005250 (gene) Cucurbita moschata (Rifu)

Name: CmoCh01G005250
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: MuDR family transposase
Location: Cmo_Chr01 : 2553026 .. 2554813 (-)
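The location above gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch, the span length can be derived from it as follows. The function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a string like 'Cmo_Chr01 : 2553026 .. 2554813 (-)'.

    Returns (seqid, start, end, strand). The layout of the string is
    taken from this record; other pages may format it differently.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr01 : 2553026 .. 2554813 (-)")
length = span_length(start, end)
print(seqid, strand, length)  # Cmo_Chr01 - 1788
```

Under that assumption the feature spans 1788 bp, which is a multiple of three (596 codons).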
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGATCATTCTTTAGTTGTATCTGAAACTGCACTTAGTCTAGTAGACCACACCCTGGTTATTGGACAAGAATTTCCCGATGTTGAAACCTGCCGGAGAATGTTGAAAGATATTGCTATAGCCTTGCATTTTGATATTCGAATTGTTAAATCTGATCGTAGTCGATTTATAGCCAAGTGTTCCAAGGAAGGTTGCCCATGGCGTGTGCATGTAGCAAAATGCCCTGGAGTTCCAACCTTTACAGTTAGAACCCTACATGGTGAGCATACTTGTGAAGGTGTTCATAATCTTCATCATCAGCAAGCCTCTGTGGGATGGGTTGCCAGATCTGTATCAGCACAAGTAAGAGATAATCCACAGTACAAACCCAAGGAAATTCTCCGGGATATTCGTGATCAGCATGGAGTCGCTGTATCGTACATGCAAGCTTGGCGTGGGAAAGAGCGTAGCATGGCTGCACTTCATGGAACCTTTGAAGAAGGGTATCGCCTTCTTCCTGCTTATTGTGAACAAGTAAGGAAAACAAACCCTGGAAGCATTGCATCAGTTTTTGCAATTGGACAAGAAAATTGCTTCCAGCGCCTGTTTATTTCGTATCGTGCTTCAATATATGGGTTTATAAATGCCTGTAGGCCACTTCTTGAACTTGACAAAGCACATCTGAAAGGAAAATACTTGGGAGCCTTACTGTGTGCTGCTGCTGTTGATGCGGATGATTCATTGTTCCCATTGGCCATTGCAGTTGTTGATGTGGAGAGTGATGAAAATTGGATGTGGTTCATGTCAGAGTTGCGCAAGCTTCTTGGGGTAAATACTGATAGCATGCCTAGACTAACAATACTATCTGAAAGACAAAGAGGCATTGTGGAGGCAGTCGAAACCCATTTTCCGACTGCCTTCCATGGATTCTGTCTGCGCTATGTAAGCGAAAATTTTCGTGATACATTTAAAAACACAAAGTTGGTCAATATTTTTTGGAATGCTGTTTATGCTCTCACTGCAGCTGAATTCGATAGCAAAATCGCGGAGATGGTGGAGATCTCACAAGAAGTAATAACGTGGTTTCAGCATTTCCCTCCCCAGTTATGGGCTGTAGCATATTTTGAAGGTGTGCGATATGGCCATTTTACTTTGGGGGTTACAGAGTTGTTGTATAATTGGGCACTCGAGTGTCACGAGCTCCCCATTGTGCAGATGATGGAACATATTCGTAATGAGATGGCATCTTGGTTTAACGATCGGCGTGAAATGGCAATGAGATGGACCTCCATTCTCGTACCCTCTGCTGAGAAGCGAATTGCCGAGGCAATTGCAGATGCTCATTGTTATCAAGTACTTCGTGCAAATGAAGTTGAGTTTGAAATCGTCTCAACCGAGCGGACAAATATTGTGGAGATACACAGTCGTGTGTGCTCTTGTCGTCGTTGGCAGCTATATGGTCTGCCTTGTGCTCATGCTGCAGCTGCTCTAATGTCCTGTGGGCAGAATGCTCAAGTATTTGCTGAGCAATGTTTCACCGTTGATAGTTTTCGCCAAACTTATTCACAAATGATATTCCCAATCCCTGATAAGAGCCTGTGGAAGGAACCGGGCGAGGGAGCGGAGGGCGGAGGAGGGGCAAAGGTTGACATCACAATACGCCCTCCCAAAGTTCGTCGCCCACCTGGAAGGCCGAAAAAGAAAGTTCTAAGAGTCGAAAATTTAAAACGCCCGAAAAGGATTGTACAATGTGGTCGCTGTCATTTGTTGGGACACTCTCAAAAGAAGTGCACCATGCCAATGTGA

mRNA sequence

ATGGCTGATCATTCTTTAGTTGTATCTGAAACTGCACTTAGTCTAGTAGACCACACCCTGGTTATTGGACAAGAATTTCCCGATGTTGAAACCTGCCGGAGAATGTTGAAAGATATTGCTATAGCCTTGCATTTTGATATTCGAATTGTTAAATCTGATCGTAGTCGATTTATAGCCAAGTGTTCCAAGGAAGGTTGCCCATGGCGTGTGCATGTAGCAAAATGCCCTGGAGTTCCAACCTTTACAGTTAGAACCCTACATGGTGAGCATACTTGTGAAGGTGTTCATAATCTTCATCATCAGCAAGCCTCTGTGGGATGGGTTGCCAGATCTGTATCAGCACAAGTAAGAGATAATCCACAGTACAAACCCAAGGAAATTCTCCGGGATATTCGTGATCAGCATGGAGTCGCTGTATCGTACATGCAAGCTTGGCGTGGGAAAGAGCGTAGCATGGCTGCACTTCATGGAACCTTTGAAGAAGGGTATCGCCTTCTTCCTGCTTATTGTGAACAAGTAAGGAAAACAAACCCTGGAAGCATTGCATCAGTTTTTGCAATTGGACAAGAAAATTGCTTCCAGCGCCTGTTTATTTCGTATCGTGCTTCAATATATGGGTTTATAAATGCCTGTAGGCCACTTCTTGAACTTGACAAAGCACATCTGAAAGGAAAATACTTGGGAGCCTTACTGTGTGCTGCTGCTGTTGATGCGGATGATTCATTGTTCCCATTGGCCATTGCAGTTGTTGATGTGGAGAGTGATGAAAATTGGATGTGGTTCATGTCAGAGTTGCGCAAGCTTCTTGGGGTAAATACTGATAGCATGCCTAGACTAACAATACTATCTGAAAGACAAAGAGGCATTGTGGAGGCAGTCGAAACCCATTTTCCGACTGCCTTCCATGGATTCTGTCTGCGCTATGTAAGCGAAAATTTTCGTGATACATTTAAAAACACAAAGTTGGTCAATATTTTTTGGAATGCTGTTTATGCTCTCACTGCAGCTGAATTCGATAGCAAAATCGCGGAGATGGTGGAGATCTCACAAGAAGTAATAACGTGGTTTCAGCATTTCCCTCCCCAGTTATGGGCTGTAGCATATTTTGAAGGTGTGCGATATGGCCATTTTACTTTGGGGGTTACAGAGTTGTTGTATAATTGGGCACTCGAGTGTCACGAGCTCCCCATTGTGCAGATGATGGAACATATTCGTAATGAGATGGCATCTTGGTTTAACGATCGGCGTGAAATGGCAATGAGATGGACCTCCATTCTCGTACCCTCTGCTGAGAAGCGAATTGCCGAGGCAATTGCAGATGCTCATTGTTATCAAGTACTTCGTGCAAATGAAGTTGAGTTTGAAATCGTCTCAACCGAGCGGACAAATATTGTGGAGATACACAGTCGTGTGTGCTCTTGTCGTCGTTGGCAGCTATATGGTCTGCCTTGTGCTCATGCTGCAGCTGCTCTAATGTCCTGTGGGCAGAATGCTCAAGTATTTGCTGAGCAATGTTTCACCGTTGATAGTTTTCGCCAAACTTATTCACAAATGATATTCCCAATCCCTGATAAGAGCCTGTGGAAGGAACCGGGCGAGGGAGCGGAGGGCGGAGGAGGGGCAAAGGTTGACATCACAATACGCCCTCCCAAAGTTCGTCGCCCACCTGGAAGGCCGAAAAAGAAAGTTCTAAGAGTCGAAAATTTAAAACGCCCGAAAAGGATTGTACAATGTGGTCGCTGTCATTTGTTGGGACACTCTCAAAAGAAGTGCACCATGCCAATGTGA

Coding sequence (CDS)

ATGGCTGATCATTCTTTAGTTGTATCTGAAACTGCACTTAGTCTAGTAGACCACACCCTGGTTATTGGACAAGAATTTCCCGATGTTGAAACCTGCCGGAGAATGTTGAAAGATATTGCTATAGCCTTGCATTTTGATATTCGAATTGTTAAATCTGATCGTAGTCGATTTATAGCCAAGTGTTCCAAGGAAGGTTGCCCATGGCGTGTGCATGTAGCAAAATGCCCTGGAGTTCCAACCTTTACAGTTAGAACCCTACATGGTGAGCATACTTGTGAAGGTGTTCATAATCTTCATCATCAGCAAGCCTCTGTGGGATGGGTTGCCAGATCTGTATCAGCACAAGTAAGAGATAATCCACAGTACAAACCCAAGGAAATTCTCCGGGATATTCGTGATCAGCATGGAGTCGCTGTATCGTACATGCAAGCTTGGCGTGGGAAAGAGCGTAGCATGGCTGCACTTCATGGAACCTTTGAAGAAGGGTATCGCCTTCTTCCTGCTTATTGTGAACAAGTAAGGAAAACAAACCCTGGAAGCATTGCATCAGTTTTTGCAATTGGACAAGAAAATTGCTTCCAGCGCCTGTTTATTTCGTATCGTGCTTCAATATATGGGTTTATAAATGCCTGTAGGCCACTTCTTGAACTTGACAAAGCACATCTGAAAGGAAAATACTTGGGAGCCTTACTGTGTGCTGCTGCTGTTGATGCGGATGATTCATTGTTCCCATTGGCCATTGCAGTTGTTGATGTGGAGAGTGATGAAAATTGGATGTGGTTCATGTCAGAGTTGCGCAAGCTTCTTGGGGTAAATACTGATAGCATGCCTAGACTAACAATACTATCTGAAAGACAAAGAGGCATTGTGGAGGCAGTCGAAACCCATTTTCCGACTGCCTTCCATGGATTCTGTCTGCGCTATGTAAGCGAAAATTTTCGTGATACATTTAAAAACACAAAGTTGGTCAATATTTTTTGGAATGCTGTTTATGCTCTCACTGCAGCTGAATTCGATAGCAAAATCGCGGAGATGGTGGAGATCTCACAAGAAGTAATAACGTGGTTTCAGCATTTCCCTCCCCAGTTATGGGCTGTAGCATATTTTGAAGGTGTGCGATATGGCCATTTTACTTTGGGGGTTACAGAGTTGTTGTATAATTGGGCACTCGAGTGTCACGAGCTCCCCATTGTGCAGATGATGGAACATATTCGTAATGAGATGGCATCTTGGTTTAACGATCGGCGTGAAATGGCAATGAGATGGACCTCCATTCTCGTACCCTCTGCTGAGAAGCGAATTGCCGAGGCAATTGCAGATGCTCATTGTTATCAAGTACTTCGTGCAAATGAAGTTGAGTTTGAAATCGTCTCAACCGAGCGGACAAATATTGTGGAGATACACAGTCGTGTGTGCTCTTGTCGTCGTTGGCAGCTATATGGTCTGCCTTGTGCTCATGCTGCAGCTGCTCTAATGTCCTGTGGGCAGAATGCTCAAGTATTTGCTGAGCAATGTTTCACCGTTGATAGTTTTCGCCAAACTTATTCACAAATGATATTCCCAATCCCTGATAAGAGCCTGTGGAAGGAACCGGGCGAGGGAGCGGAGGGCGGAGGAGGGGCAAAGGTTGACATCACAATACGCCCTCCCAAAGTTCGTCGCCCACCTGGAAGGCCGAAAAAGAAAGTTCTAAGAGTCGAAAATTTAAAACGCCCGAAAAGGATTGTACAATGTGGTCGCTGTCATTTGTTGGGACACTCTCAAAAGAAGTGCACCATGCCAATGTGA
BLAST of CmoCh01G005250 vs. TrEMBL
Match: A0A0A0LN02_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G094910 PE=4 SV=1)

HSP 1 Score: 1180.2 bits (3052), Expect = 0.0e+00
Identity = 571/595 (95.97%), Positives = 583/595 (97.98%), Query Frame = 1

Query: 1   MADHSLVVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRFIAK 60
           MADHSL+VSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIA+HFDIRIVKSDRSRFIAK
Sbjct: 1   MADHSLIVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIAMHFDIRIVKSDRSRFIAK 60

Query: 61  CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVRDNP 120
           CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGV NLHHQQASVGWVARSV+AQVRDNP
Sbjct: 61  CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVRNLHHQQASVGWVARSVAAQVRDNP 120

Query: 121 QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQVRKTNPGS 180
           QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQ+ KTNPGS
Sbjct: 121 QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQISKTNPGS 180

Query: 181 IASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVDADD 240
           IASVFA GQENCFQRLFISYRASIYGFINACRPLLELD+AHLKGKYLGALLCAA VDADD
Sbjct: 181 IASVFATGQENCFQRLFISYRASIYGFINACRPLLELDRAHLKGKYLGALLCAAVVDADD 240

Query: 241 SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFPTA 300
           SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFP+A
Sbjct: 241 SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFPSA 300

Query: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP 360
           FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP
Sbjct: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP 360

Query: 361 PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNDRREMAM 420
           PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFN+RREM M
Sbjct: 361 PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNERREMGM 420

Query: 421 RWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY 480
           RWTSILVPSAEKRIAEAIADA CYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY
Sbjct: 421 RWTSILVPSAEKRIAEAIADARCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY 480

Query: 481 GLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKEPGEGAEGGGG 540
           GLPCAHAAAALMSCGQNA +FAE CFTV S+R+TYSQMI+PI DKSLWKEPGEGAE GG 
Sbjct: 481 GLPCAHAAAALMSCGQNAHLFAEPCFTVTSYRETYSQMIYPILDKSLWKEPGEGAE-GGV 540

Query: 541 AKVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 596
           AKVDITIRPPK+RRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM
Sbjct: 541 AKVDITIRPPKIRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 594
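The percentages in the HSP summary line for this hit are ratios of matching columns to alignment length. The sketch below recomputes them from the raw counts. The helper names `parse_hsp_stats` and `percent` are hypothetical, and the regular expression assumes the summary-line layout used on this page (it tolerates either spelling of the positives label).

```python
import re

def parse_hsp_stats(line):
    """Return {'identity': (matches, length), 'positives': (matches, length)}.

    Only the 'label = n/m' pairs are read; the printed percentages are ignored
    so they can be recomputed independently.
    """
    pattern = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")
    stats = {}
    for label, num, den in pattern.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(num), int(den))
    return stats

def percent(num, den):
    # Percentage rounded to two decimals, as shown in the summary lines.
    return round(100.0 * num / den, 2)

line = "Identity = 571/595 (95.97%), Postives = 583/595 (97.98%), Query Frame = 1"
stats = parse_hsp_stats(line)
for key, (num, den) in stats.items():
    print(key, percent(num, den))
```

For the counts in this hit, 571/595 and 583/595, the recomputed values agree with the reported 95.97% and 97.98%.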

BLAST of CmoCh01G005250 vs. TrEMBL
Match: A0A067JKI4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23146 PE=4 SV=1)

HSP 1 Score: 1122.1 bits (2901), Expect = 0.0e+00
Identity = 533/595 (89.58%), Positives = 567/595 (95.29%), Query Frame = 1

Query: 1   MADHSLVVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRFIAK 60
           MAD +L+V + +L+L + +LVIGQEFP+VETCRR LKDIAIALHFD+RIVKSDRSRFIAK
Sbjct: 1   MADRALIVPDASLALAEQSLVIGQEFPNVETCRRTLKDIAIALHFDLRIVKSDRSRFIAK 60

Query: 61  CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVRDNP 120
           CSKEGCPWRVHVAKCPGVPTFT+RTLHGEHTCEGV NLHHQQASVGWVARSV A++RDNP
Sbjct: 61  CSKEGCPWRVHVAKCPGVPTFTIRTLHGEHTCEGVRNLHHQQASVGWVARSVEARIRDNP 120

Query: 121 QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQVRKTNPGS 180
           QYKPKEIL+DIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQ+RKTNPGS
Sbjct: 121 QYKPKEILQDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQIRKTNPGS 180

Query: 181 IASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVDADD 240
           IASVFA GQEN FQRLFISYRA IYGFINACRPLLELDKAHLKGKYLG LLCAAAVDADD
Sbjct: 181 IASVFATGQENSFQRLFISYRACIYGFINACRPLLELDKAHLKGKYLGTLLCAAAVDADD 240

Query: 241 SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFPTA 300
            LFPLAIA+VD ESDENWMWFMSELRKLLGVNTD+MPRLT+LSERQRGIVEAVETHFP+A
Sbjct: 241 VLFPLAIAIVDTESDENWMWFMSELRKLLGVNTDNMPRLTVLSERQRGIVEAVETHFPSA 300

Query: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP 360
           FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALT AEF+SKI+EMVEISQ+VITWFQHFP
Sbjct: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTTAEFESKISEMVEISQDVITWFQHFP 360

Query: 361 PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNDRREMAM 420
           PQLWAVAYFEG+RYGHFTLGVTELLYNWALECHELPIVQMMEHIRN++ASWFNDRR++ M
Sbjct: 361 PQLWAVAYFEGMRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNQLASWFNDRRDIGM 420

Query: 421 RWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY 480
           RWTSILVPSAEKRI EAIADA CYQVLRANE+EFEIVSTERTNIV+I SRVCSCRRWQLY
Sbjct: 421 RWTSILVPSAEKRILEAIADARCYQVLRANEIEFEIVSTERTNIVDIRSRVCSCRRWQLY 480

Query: 481 GLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKEPGEGAEGGGG 540
           GLPCAHAAAAL+SCGQNA +FAE CFTV S+R+TYSQMI PIPDKSLWKEPGEG E GGG
Sbjct: 481 GLPCAHAAAALISCGQNAHLFAEPCFTVVSYRETYSQMINPIPDKSLWKEPGEGIE-GGG 540

Query: 541 AKVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 596
           AKVDI IRPPK RRPPGRPKKKVLRVEN KRPKR+VQCGRCHLLGHSQKKCTMP+
Sbjct: 541 AKVDIVIRPPKTRRPPGRPKKKVLRVENFKRPKRVVQCGRCHLLGHSQKKCTMPI 594

BLAST of CmoCh01G005250 vs. TrEMBL
Match: A0A0D2U3A4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G180100 PE=4 SV=1)

HSP 1 Score: 1117.8 bits (2890), Expect = 0.0e+00
Identity = 530/595 (89.08%), Positives = 572/595 (96.13%), Query Frame = 1

Query: 1   MADHSLVVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRFIAK 60
           MADH+LVV++T+ +LV+H+LVIGQEFPDVETCRR LKDIAIALHFD+RIVKSDRSRFIAK
Sbjct: 1   MADHALVVADTSHTLVEHSLVIGQEFPDVETCRRTLKDIAIALHFDLRIVKSDRSRFIAK 60

Query: 61  CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVRDNP 120
           CSKEGCPWRVHVAKCPGVPTF++RTLHGEHTCEGV NLHHQQASVGWVARSV A++RDNP
Sbjct: 61  CSKEGCPWRVHVAKCPGVPTFSIRTLHGEHTCEGVRNLHHQQASVGWVARSVEARIRDNP 120

Query: 121 QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQVRKTNPGS 180
           QYKPKEIL+DIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQ+RKTNPGS
Sbjct: 121 QYKPKEILQDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQIRKTNPGS 180

Query: 181 IASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVDADD 240
           +ASVFA GQENCFQRLFISYRASIYGFI ACRPLLELDKA LKGKYLGALLCAAAVDADD
Sbjct: 181 VASVFATGQENCFQRLFISYRASIYGFITACRPLLELDKADLKGKYLGALLCAAAVDADD 240

Query: 241 SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFPTA 300
           +LFPLAIA+VDVESDENWMWFMSELRKLLGVNT++MPRLTILSERQRG+V+AVETHFP+A
Sbjct: 241 ALFPLAIAIVDVESDENWMWFMSELRKLLGVNTENMPRLTILSERQRGMVDAVETHFPSA 300

Query: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP 360
           FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALT  EF+SKIAEMVEISQ+VI WFQ FP
Sbjct: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTTVEFESKIAEMVEISQDVIQWFQLFP 360

Query: 361 PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNDRREMAM 420
           P+LWAVAYFEGVRYGHFTLGVTE+LYNWALECHELPIVQMMEHIR+++ +WF +RREM M
Sbjct: 361 PRLWAVAYFEGVRYGHFTLGVTEMLYNWALECHELPIVQMMEHIRHQLTTWFTNRREMGM 420

Query: 421 RWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY 480
           RWTSILVPSAEKRI+EAIADA CYQVLRANEVEFEIVSTERTNIV+I SRVCSCRRWQLY
Sbjct: 421 RWTSILVPSAEKRISEAIADARCYQVLRANEVEFEIVSTERTNIVDIRSRVCSCRRWQLY 480

Query: 481 GLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKEPGEGAEGGGG 540
           GLPCAHAAAAL+SCGQNA +FAE CFTV S+R+TYSQMI PIPDKS+WKE GEGAE GG 
Sbjct: 481 GLPCAHAAAALISCGQNAHMFAEPCFTVGSYRETYSQMIHPIPDKSIWKELGEGAE-GGA 540

Query: 541 AKVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 596
           AK+DITIRPPK+RRPPGRPKKKVLRVENLKRPKR+VQCGRCHLLGHSQKKCTMP+
Sbjct: 541 AKLDITIRPPKIRRPPGRPKKKVLRVENLKRPKRVVQCGRCHLLGHSQKKCTMPI 594

BLAST of CmoCh01G005250 vs. TrEMBL
Match: A0A061FG29_THECC (MuDR family transposase isoform 1 OS=Theobroma cacao GN=TCM_035107 PE=4 SV=1)

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 528/605 (87.27%), Positives = 567/605 (93.72%), Query Frame = 1

Query: 3   DHSLVVSETALSLVDHTL------------VIGQEFPDVETCRRMLKDIAIALHFDIRIV 62
           DH+LVV++T+ SLV+HTL            VIGQEFPDVETCRR LKDIAIALHFD+RIV
Sbjct: 42  DHALVVADTSHSLVEHTLADTSRALVEQTLVIGQEFPDVETCRRTLKDIAIALHFDLRIV 101

Query: 63  KSDRSRFIAKCSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVAR 122
           KSDRSRFIAKCSKEGCPWRVHVAKCPGVPTF++RTLHGEHTCEGV NLHHQQASVGWVAR
Sbjct: 102 KSDRSRFIAKCSKEGCPWRVHVAKCPGVPTFSIRTLHGEHTCEGVRNLHHQQASVGWVAR 161

Query: 123 SVSAQVRDNPQYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYC 182
           SV A+VRDNPQYKPKEIL+DIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYC
Sbjct: 162 SVEARVRDNPQYKPKEILQDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYC 221

Query: 183 EQVRKTNPGSIASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGAL 242
           EQ+RKTNPGS+ASVFA GQENCFQRLFISYRASIYGFINACRPLLELDKA LKGKYLG L
Sbjct: 222 EQIRKTNPGSVASVFATGQENCFQRLFISYRASIYGFINACRPLLELDKADLKGKYLGTL 281

Query: 243 LCAAAVDADDSLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIV 302
           LCAAAVDADD+LFPLAIA+VD+ESDENWMWFMSELRKLLGVNT++MPRLTILSER++ IV
Sbjct: 282 LCAAAVDADDALFPLAIAIVDLESDENWMWFMSELRKLLGVNTENMPRLTILSERRQSIV 341

Query: 303 EAVETHFPTAFHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQ 362
           +AVETHFP+AFHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALT  EF+SKI+EMVEISQ
Sbjct: 342 DAVETHFPSAFHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTTVEFESKISEMVEISQ 401

Query: 363 EVITWFQHFPPQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMAS 422
           +VI WFQHFPPQLWAVAYFEGVRYGHF+LGVTELLYNWALECHELP+VQMMEHIR+++ S
Sbjct: 402 DVIQWFQHFPPQLWAVAYFEGVRYGHFSLGVTELLYNWALECHELPVVQMMEHIRHQLTS 461

Query: 423 WFNDRREMAMRWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEIVSTERTNIVEIHSR 482
           WFN+RREM MRWTS LVPSAEKRI EAIADA CYQVLRANE+EFEIVSTERTNIV+I SR
Sbjct: 462 WFNNRREMGMRWTSSLVPSAEKRILEAIADARCYQVLRANEIEFEIVSTERTNIVDIRSR 521

Query: 483 VCSCRRWQLYGLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKE 542
           VCSCRRWQLYGLPCAHAAAAL+SCGQNA +FAE CFTV S+R+TYSQMI PIPDKS WKE
Sbjct: 522 VCSCRRWQLYGLPCAHAAAALISCGQNAHLFAEPCFTVASYRETYSQMINPIPDKSTWKE 581

Query: 543 PGEGAEGGGGAKVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKK 596
            GEGAE GG AK+DITIRPPK RRPPGRPKKKVLRVENLKRPKR+VQCGRCHLLGHSQKK
Sbjct: 582 QGEGAE-GGAAKLDITIRPPKYRRPPGRPKKKVLRVENLKRPKRVVQCGRCHLLGHSQKK 641

BLAST of CmoCh01G005250 vs. TrEMBL
Match: A0A151TXU4_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_011143 PE=4 SV=1)

HSP 1 Score: 1098.6 bits (2840), Expect = 0.0e+00
Identity = 521/598 (87.12%), Positives = 559/598 (93.48%), Query Frame = 1

Query: 1   MADHSLVVSETALSLV---DHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRF 60
           MA+HSLV++ T++  V   +  LVIGQEFPDVETCRR LKDIAIA+HFD+RIVKSDRSRF
Sbjct: 1   MANHSLVLNNTSVGTVTVAEQPLVIGQEFPDVETCRRTLKDIAIAMHFDLRIVKSDRSRF 60

Query: 61  IAKCSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVR 120
           IAKCSKEGCPWRVHVAKCPGVPTFTVRTL GEHTCEGV NLHHQQASVGWVARSV A++R
Sbjct: 61  IAKCSKEGCPWRVHVAKCPGVPTFTVRTLQGEHTCEGVQNLHHQQASVGWVARSVEARIR 120

Query: 121 DNPQYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQVRKTN 180
           DNPQYKP+EIL+DIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLP YCEQ+RKTN
Sbjct: 121 DNPQYKPREILQDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPGYCEQIRKTN 180

Query: 181 PGSIASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVD 240
           PGSI SV A GQENCFQRLFISYRASIYGFINACRPLLELD+AHLKGKYLG LLCAAAVD
Sbjct: 181 PGSITSVVAAGQENCFQRLFISYRASIYGFINACRPLLELDRAHLKGKYLGTLLCAAAVD 240

Query: 241 ADDSLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHF 300
           ADD+LFPLAIAVVD ESDENWMWFMSELRKLLGVNTD+MPRLTILSERQRG+VEAVETHF
Sbjct: 241 ADDALFPLAIAVVDAESDENWMWFMSELRKLLGVNTDNMPRLTILSERQRGLVEAVETHF 300

Query: 301 PTAFHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQ 360
           P+A HGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEF+SKI EM+EISQ+VI+WFQ
Sbjct: 301 PSASHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFESKITEMMEISQDVISWFQ 360

Query: 361 HFPPQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNDRRE 420
            FPP LWAVAYF+GVRYGHFTLGVTELLYNWALECHELPIVQMMEHIR +M SWFNDR++
Sbjct: 361 QFPPYLWAVAYFDGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRQQMVSWFNDRQD 420

Query: 421 MAMRWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRW 480
           M MRWTSILVPSAEKRI EAIADAHCYQVLRANEVEFEIVSTERTNIV+I SR CSCRRW
Sbjct: 421 MGMRWTSILVPSAEKRILEAIADAHCYQVLRANEVEFEIVSTERTNIVDIRSRECSCRRW 480

Query: 481 QLYGLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKEPGEGAEG 540
           QLYGLPCAHAAAAL+SCG NA +FAE CFTV S+R TYSQ+I P+PDKS W+E GEGAEG
Sbjct: 481 QLYGLPCAHAAAALISCGHNAHMFAEPCFTVQSYRMTYSQIINPVPDKSQWREQGEGAEG 540

Query: 541 GGGAKVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 596
           GGGA+VDI IRPPK RRPPGRPKKKVLRVEN KRPK++VQCGRCH+LGHSQKKCTMP+
Sbjct: 541 GGGARVDIMIRPPKTRRPPGRPKKKVLRVENFKRPKKVVQCGRCHMLGHSQKKCTMPI 598

BLAST of CmoCh01G005250 vs. TAIR10
Match: AT1G64260.1 (AT1G64260.1 MuDR family transposase)

HSP 1 Score: 176.4 bits (446), Expect = 5.2e-44
Identity = 131/533 (24.58%), Positives = 234/533 (43.90%), Query Frame = 1

Query: 17  DHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRFIAKCSKEGCPWRVHVAKCP 76
           DH + +G  F D +  ++ +    I    +  + ++++  +  +C +  C W +  A+  
Sbjct: 182 DHDMHLGLCFKDRDELKKAVDWWCIRRRRNCIVRETEKEMYTFECVRWKCKWSLRAARME 241

Query: 77  GVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVRDNPQYKPKEILRDIRDQHG 136
                 +    G HTC   H   +   S  + A  +   VR  P     E+ +  +++ G
Sbjct: 242 EHGLVEITKYTGPHTCS--HEYPNDFESE-FAADEIERVVRIQPTLSIAELKKWWKEKTG 301

Query: 137 VAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQVRKTNPGSIA---SVFAIGQENCF 196
             +   +   GK   +  + G  ++ +R++P        +N   +     +F       F
Sbjct: 302 YELQTSKMRDGKLEVIKRVFGDEDQSFRVMPKLISAFHSSNGLLVDWQYDLFPNPDFASF 361

Query: 197 QRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVDADDSLFPLAIAVVDVE 256
           + +F S+  SI GF + CRPL+ +D   L GKY   L+ A+ VDA +  FPLA AV    
Sbjct: 362 RGVFWSFSQSIEGFQH-CRPLIVVDTKSLNGKYQLKLMIASGVDAANKFFPLAFAVTKEV 421

Query: 257 SDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVET-----HFPTAFHGFCLRY 316
           S ++W WF +++R+ +    D    L ++S   R IV  V         P A H FCL +
Sbjct: 422 STDSWRWFFTKIREKVTQRKD----LCLISSPLRDIVAVVNEPGSLWQEPWAHHKFCLNH 481

Query: 317 VSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFPPQLWAVAY 376
           +   F   F++  L ++   A       EFDS + ++ E + E   W    P   WA+A+
Sbjct: 482 LRSQFLGVFRDYNLESLVEQAGSTNQKEEFDSYMNDIKEKNPEAWKWLDQIPRHKWALAH 541

Query: 377 FEGVRYGHFTLGVTELLYNWALECHELP---------IVQMMEHIRNEMASWFNDRREMA 436
             G+RYG   +   E L+     C   P         ++ M + +R+      +      
Sbjct: 542 DSGLRYGIIEID-REALF---AVCRGFPYCTVAMTGGVMLMFDELRSSFDKSLSSIYSSL 601

Query: 437 MRWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEI-VSTERTN-IVEIHSRVCSCRRW 496
            R      P  +K + E + D+  Y + +     F++  S+E+   IV+++   C+CR++
Sbjct: 602 NRGVVYTEPFMDK-LEEFMTDSIPYVITQLERDSFKVSESSEKEEWIVQLNVSTCTCRKF 661

Query: 497 QLYGLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKE 531
           Q Y  PC HA A       N   + ++C+TV+ + +TY+    P+PD + W E
Sbjct: 662 QSYKFPCLHALAVFEKLKINPLQYVDECYTVEQYCKTYAATFSPVPDVAAWPE 701

BLAST of CmoCh01G005250 vs. TAIR10
Match: AT1G49920.1 (AT1G49920.1 MuDR family transposase)

HSP 1 Score: 149.8 bits (377), Expect = 5.2e-36
Identity = 135/600 (22.50%), Positives = 253/600 (42.17%), Query Frame = 1

Query: 11  TALSLVDHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRFIAKCSKEGCPWRV 70
           + L L   T+ +G  F D+   ++ +   +I       + ++++  ++ +C +  C W +
Sbjct: 171 SGLWLEGDTMRVGLCFKDLAEMKKAVDWCSIKRRQKCLLRETEKDVYVVECERWHCKWSI 230

Query: 71  HVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVRDNPQYKPKEILRD 130
             ++      F +    G H C   +  H        +   +   VR  P     E+ + 
Sbjct: 231 CASRREEDGLFEITECSGPHDC---YPEHLNDFDAECIPFQIERVVRVQPTLSTAELDKW 290

Query: 131 IRDQHGVAVSYMQAW-------RGKERSMAALHGTFEEGYRLLPAYCEQVRKTN----PG 190
              + G A+  +            K +++    G +++ +RL+P     +  +N      
Sbjct: 291 WEKKFGFALDQVVEHCSEGLVEDAKVKAIKRFFGDWDQSFRLIPKLMSVLHSSNGLLVDW 350

Query: 191 SIASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVDAD 250
              S+    +   F+ LF ++  SI GF + CRPL+ +D  +L GKY   L+ A+A DA 
Sbjct: 351 QYDSLTHDPEHASFRGLFWAFSQSIQGFQH-CRPLIVVDTKNLGGKYKMKLMIASAFDAT 410

Query: 251 DSLFPLAIAVVDVESDENWMWFMSELRKLL----GVNTDSMPRLTILSERQRGIVEAVET 310
           +  FPLA AV    S ++W WF++ +R+ +    G+   S P   IL+       +  E 
Sbjct: 411 NQYFPLAFAVTKEVSVDSWRWFLTRIREKVTQRQGICLISSPDPDILAVINEPGSQWKE- 470

Query: 311 HFPTAFHGFCLRYVSENFRDTFK--NTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVI 370
             P A+H FCL ++           +  +  +   A  +    EFDS + E+ E + E  
Sbjct: 471 --PWAYHRFCLYHLCSKLCSVSPGFDYNMHFLVDEAGSSSQKEEFDSYMKEIKERNPEAW 530

Query: 371 TWFQHFPPQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELP----IVQMMEHIRNEMA 430
            W   FPP  WA+A+ +G RYG   +  TE L+       ++     ++ +   +++  A
Sbjct: 531 KWLDQFPPHQWALAHDDGRRYGIMRID-TEALFAVCKRFRKVAMAGGVMLLFGQLKDAFA 590

Query: 431 SWFNDRREMAMRWTSILVPSAEKRIAEAIADA------------HCYQVLRANEVEFEIV 490
             F   R  +++   +      +++ E   D+              YQV  A + +  ++
Sbjct: 591 ESFKLSRG-SLKHGDVYTEHVMEKLEEFETDSDTWVITITPLERDAYQVSMAPKKKTRLM 650

Query: 491 ---STERTNIVEIHSRVCSCRRWQLYGLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQT 550
              +   + IV+++   C+C  +Q    PC HA A       N   + + C+TV+ + +T
Sbjct: 651 GQSNDSTSGIVQLNDTTCTCGEFQKNKFPCLHALAVCDELKINPLQYVDDCYTVERYHKT 710

Query: 551 YSQMIFPIPDKSLWKEPGEGAEGGGGAKVDITIRPPKVRRPP----GRPKKKVLRVENLK 571
           YS    P+P+ S W E          A    T+ PP +  PP    G+ K+K    ++L+
Sbjct: 711 YSAKFSPVPELSAWPE----------AYGVPTLIPPVIEPPPPKVSGKGKEKDTEDDHLE 751

BLAST of CmoCh01G005250 vs. NCBI nr
Match: gi|449442265|ref|XP_004138902.1| (PREDICTED: uncharacterized protein LOC101220272 [Cucumis sativus])

HSP 1 Score: 1180.2 bits (3052), Expect = 0.0e+00
Identity = 571/595 (95.97%), Positives = 583/595 (97.98%), Query Frame = 1

Query: 1   MADHSLVVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRFIAK 60
           MADHSL+VSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIA+HFDIRIVKSDRSRFIAK
Sbjct: 1   MADHSLIVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIAMHFDIRIVKSDRSRFIAK 60

Query: 61  CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVRDNP 120
           CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGV NLHHQQASVGWVARSV+AQVRDNP
Sbjct: 61  CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVRNLHHQQASVGWVARSVAAQVRDNP 120

Query: 121 QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQVRKTNPGS 180
           QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQ+ KTNPGS
Sbjct: 121 QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQISKTNPGS 180

Query: 181 IASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVDADD 240
           IASVFA GQENCFQRLFISYRASIYGFINACRPLLELD+AHLKGKYLGALLCAA VDADD
Sbjct: 181 IASVFATGQENCFQRLFISYRASIYGFINACRPLLELDRAHLKGKYLGALLCAAVVDADD 240

Query: 241 SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFPTA 300
           SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFP+A
Sbjct: 241 SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFPSA 300

Query: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP 360
           FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP
Sbjct: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP 360

Query: 361 PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNDRREMAM 420
           PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFN+RREM M
Sbjct: 361 PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNERREMGM 420

Query: 421 RWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY 480
           RWTSILVPSAEKRIAEAIADA CYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY
Sbjct: 421 RWTSILVPSAEKRIAEAIADARCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY 480

Query: 481 GLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKEPGEGAEGGGG 540
           GLPCAHAAAALMSCGQNA +FAE CFTV S+R+TYSQMI+PI DKSLWKEPGEGAE GG 
Sbjct: 481 GLPCAHAAAALMSCGQNAHLFAEPCFTVTSYRETYSQMIYPILDKSLWKEPGEGAE-GGV 540

Query: 541 AKVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 596
           AKVDITIRPPK+RRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM
Sbjct: 541 AKVDITIRPPKIRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 594

BLAST of CmoCh01G005250 vs. NCBI nr
Match: gi|659082246|ref|XP_008441740.1| (PREDICTED: uncharacterized protein LOC103485812 [Cucumis melo])

HSP 1 Score: 1180.2 bits (3052), Expect = 0.0e+00
Identity = 571/595 (95.97%), Positives = 583/595 (97.98%), Query Frame = 1

Query: 1   MADHSLVVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRFIAK 60
           MADHSL+VSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIA+HFDIRIVKSDRSRFIAK
Sbjct: 1   MADHSLIVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIAMHFDIRIVKSDRSRFIAK 60

Query: 61  CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVRDNP 120
           CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGV NLHHQQASVGWVARSV+AQVRDNP
Sbjct: 61  CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVRNLHHQQASVGWVARSVAAQVRDNP 120

Query: 121 QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQVRKTNPGS 180
           QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQ+ KTNPGS
Sbjct: 121 QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQISKTNPGS 180

Query: 181 IASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVDADD 240
           IASVFA GQENCFQRLFISYRASIYGFINACRPLLELD+AHLKGKYLGALLCAA VDADD
Sbjct: 181 IASVFATGQENCFQRLFISYRASIYGFINACRPLLELDRAHLKGKYLGALLCAAVVDADD 240

Query: 241 SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFPTA 300
           SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTD+MPRLTILSERQRGIVEAVETHFP+A
Sbjct: 241 SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDNMPRLTILSERQRGIVEAVETHFPSA 300

Query: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP 360
           FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP
Sbjct: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP 360

Query: 361 PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNDRREMAM 420
           PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFN+RREM M
Sbjct: 361 PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNERREMGM 420

Query: 421 RWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY 480
           RWTSILVPSAEKRIAEAIADA CYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY
Sbjct: 421 RWTSILVPSAEKRIAEAIADARCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY 480

Query: 481 GLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKEPGEGAEGGGG 540
           GLPCAHAAAALMSCGQNA +FAE CFTV S+R+TYSQMI+PI DKSLWKEPGEGAEGG G
Sbjct: 481 GLPCAHAAAALMSCGQNAHLFAEPCFTVTSYRETYSQMIYPILDKSLWKEPGEGAEGGVG 540

Query: 541 AKVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 596
            KVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM
Sbjct: 541 -KVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 594

BLAST of CmoCh01G005250 vs. NCBI nr
Match: gi|802754907|ref|XP_012088790.1| (PREDICTED: uncharacterized protein LOC105647353 [Jatropha curcas])

HSP 1 Score: 1122.1 bits (2901), Expect = 0.0e+00
Identity = 533/595 (89.58%), Positives = 567/595 (95.29%), Query Frame = 1

Query: 1   MADHSLVVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRFIAK 60
           MAD +L+V + +L+L + +LVIGQEFP+VETCRR LKDIAIALHFD+RIVKSDRSRFIAK
Sbjct: 1   MADRALIVPDASLALAEQSLVIGQEFPNVETCRRTLKDIAIALHFDLRIVKSDRSRFIAK 60

Query: 61  CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVRDNP 120
           CSKEGCPWRVHVAKCPGVPTFT+RTLHGEHTCEGV NLHHQQASVGWVARSV A++RDNP
Sbjct: 61  CSKEGCPWRVHVAKCPGVPTFTIRTLHGEHTCEGVRNLHHQQASVGWVARSVEARIRDNP 120

Query: 121 QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQVRKTNPGS 180
           QYKPKEIL+DIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQ+RKTNPGS
Sbjct: 121 QYKPKEILQDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQIRKTNPGS 180

Query: 181 IASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVDADD 240
           IASVFA GQEN FQRLFISYRA IYGFINACRPLLELDKAHLKGKYLG LLCAAAVDADD
Sbjct: 181 IASVFATGQENSFQRLFISYRACIYGFINACRPLLELDKAHLKGKYLGTLLCAAAVDADD 240

Query: 241 SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFPTA 300
            LFPLAIA+VD ESDENWMWFMSELRKLLGVNTD+MPRLT+LSERQRGIVEAVETHFP+A
Sbjct: 241 VLFPLAIAIVDTESDENWMWFMSELRKLLGVNTDNMPRLTVLSERQRGIVEAVETHFPSA 300

Query: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP 360
           FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALT AEF+SKI+EMVEISQ+VITWFQHFP
Sbjct: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTTAEFESKISEMVEISQDVITWFQHFP 360

Query: 361 PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNDRREMAM 420
           PQLWAVAYFEG+RYGHFTLGVTELLYNWALECHELPIVQMMEHIRN++ASWFNDRR++ M
Sbjct: 361 PQLWAVAYFEGMRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNQLASWFNDRRDIGM 420

Query: 421 RWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY 480
           RWTSILVPSAEKRI EAIADA CYQVLRANE+EFEIVSTERTNIV+I SRVCSCRRWQLY
Sbjct: 421 RWTSILVPSAEKRILEAIADARCYQVLRANEIEFEIVSTERTNIVDIRSRVCSCRRWQLY 480

Query: 481 GLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKEPGEGAEGGGG 540
           GLPCAHAAAAL+SCGQNA +FAE CFTV S+R+TYSQMI PIPDKSLWKEPGEG E GGG
Sbjct: 481 GLPCAHAAAALISCGQNAHLFAEPCFTVVSYRETYSQMINPIPDKSLWKEPGEGIE-GGG 540

Query: 541 AKVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 596
           AKVDI IRPPK RRPPGRPKKKVLRVEN KRPKR+VQCGRCHLLGHSQKKCTMP+
Sbjct: 541 AKVDIVIRPPKTRRPPGRPKKKVLRVENFKRPKRVVQCGRCHLLGHSQKKCTMPI 594

BLAST of CmoCh01G005250 vs. NCBI nr
Match: gi|1000981166|ref|XP_015570632.1| (PREDICTED: uncharacterized protein LOC107260770 [Ricinus communis])

HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 532/596 (89.26%), Positives = 570/596 (95.64%), Query Frame = 1

Query: 1   MADHSL-VVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRFIA 60
           MAD++L VV + +L+L + +LVIGQEF DVETCRR LKDIAIALHFD+RIVKSDRSRFIA
Sbjct: 1   MADNALIVVPDGSLALSEQSLVIGQEFADVETCRRTLKDIAIALHFDLRIVKSDRSRFIA 60

Query: 61  KCSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVRDN 120
           KCSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSV A++RDN
Sbjct: 61  KCSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVEARIRDN 120

Query: 121 PQYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQVRKTNPG 180
           PQYKPKEIL+DIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGY LLPAYCEQ+RKTNPG
Sbjct: 121 PQYKPKEILQDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYHLLPAYCEQIRKTNPG 180

Query: 181 SIASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVDAD 240
           SIASVFA GQENCFQRLFISYRA I+GFINACRPLLELD+AHLKGKYLG +LCAAAVDAD
Sbjct: 181 SIASVFATGQENCFQRLFISYRACIFGFINACRPLLELDRAHLKGKYLGTILCAAAVDAD 240

Query: 241 DSLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFPT 300
           D+LFPLAIA+VD ESDENWMWFMSELRKLLGVNTD+MPRLT+LSERQRGIVEAVETHFP+
Sbjct: 241 DALFPLAIAIVDTESDENWMWFMSELRKLLGVNTDNMPRLTVLSERQRGIVEAVETHFPS 300

Query: 301 AFHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHF 360
           AFHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALT AEF+SKI+EMVEISQ+V+TWFQHF
Sbjct: 301 AFHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTTAEFESKISEMVEISQDVLTWFQHF 360

Query: 361 PPQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNDRREMA 420
           PPQLWAVAYFEG+RYGHFTLGVTELLYNWALECHELPIVQMMEHIRN++ SWFN+RR++ 
Sbjct: 361 PPQLWAVAYFEGMRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNQLVSWFNNRRDVG 420

Query: 421 MRWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQL 480
           MRWT ILVPSAEKRI EAIADA CYQVLRANEVEFEIVSTERTNIV+I SRVCSCRRWQL
Sbjct: 421 MRWTLILVPSAEKRILEAIADARCYQVLRANEVEFEIVSTERTNIVDIRSRVCSCRRWQL 480

Query: 481 YGLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKEPGEGAEGGG 540
           YGLPCAHAAAAL+SCGQNAQ+FAE CFTV S+R+TYSQ+I PIPDKSLWKEPGEG E GG
Sbjct: 481 YGLPCAHAAAALISCGQNAQLFAEPCFTVASYRETYSQIISPIPDKSLWKEPGEGTE-GG 540

Query: 541 GAKVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 596
           GAKVDITIRPPK+RRPPGRPKKKVLRVEN KRPKR+VQCGRCHLLGHSQKKCTMPM
Sbjct: 541 GAKVDITIRPPKIRRPPGRPKKKVLRVENFKRPKRVVQCGRCHLLGHSQKKCTMPM 595

BLAST of CmoCh01G005250 vs. NCBI nr
Match: gi|823262209|ref|XP_012463849.1| (PREDICTED: uncharacterized protein LOC105783147 [Gossypium raimondii])

HSP 1 Score: 1117.8 bits (2890), Expect = 0.0e+00
Identity = 530/595 (89.08%), Positives = 572/595 (96.13%), Query Frame = 1

Query: 1   MADHSLVVSETALSLVDHTLVIGQEFPDVETCRRMLKDIAIALHFDIRIVKSDRSRFIAK 60
           MADH+LVV++T+ +LV+H+LVIGQEFPDVETCRR LKDIAIALHFD+RIVKSDRSRFIAK
Sbjct: 1   MADHALVVADTSHTLVEHSLVIGQEFPDVETCRRTLKDIAIALHFDLRIVKSDRSRFIAK 60

Query: 61  CSKEGCPWRVHVAKCPGVPTFTVRTLHGEHTCEGVHNLHHQQASVGWVARSVSAQVRDNP 120
           CSKEGCPWRVHVAKCPGVPTF++RTLHGEHTCEGV NLHHQQASVGWVARSV A++RDNP
Sbjct: 61  CSKEGCPWRVHVAKCPGVPTFSIRTLHGEHTCEGVRNLHHQQASVGWVARSVEARIRDNP 120

Query: 121 QYKPKEILRDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQVRKTNPGS 180
           QYKPKEIL+DIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQ+RKTNPGS
Sbjct: 121 QYKPKEILQDIRDQHGVAVSYMQAWRGKERSMAALHGTFEEGYRLLPAYCEQIRKTNPGS 180

Query: 181 IASVFAIGQENCFQRLFISYRASIYGFINACRPLLELDKAHLKGKYLGALLCAAAVDADD 240
           +ASVFA GQENCFQRLFISYRASIYGFI ACRPLLELDKA LKGKYLGALLCAAAVDADD
Sbjct: 181 VASVFATGQENCFQRLFISYRASIYGFITACRPLLELDKADLKGKYLGALLCAAAVDADD 240

Query: 241 SLFPLAIAVVDVESDENWMWFMSELRKLLGVNTDSMPRLTILSERQRGIVEAVETHFPTA 300
           +LFPLAIA+VDVESDENWMWFMSELRKLLGVNT++MPRLTILSERQRG+V+AVETHFP+A
Sbjct: 241 ALFPLAIAIVDVESDENWMWFMSELRKLLGVNTENMPRLTILSERQRGMVDAVETHFPSA 300

Query: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTAAEFDSKIAEMVEISQEVITWFQHFP 360
           FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALT  EF+SKIAEMVEISQ+VI WFQ FP
Sbjct: 301 FHGFCLRYVSENFRDTFKNTKLVNIFWNAVYALTTVEFESKIAEMVEISQDVIQWFQLFP 360

Query: 361 PQLWAVAYFEGVRYGHFTLGVTELLYNWALECHELPIVQMMEHIRNEMASWFNDRREMAM 420
           P+LWAVAYFEGVRYGHFTLGVTE+LYNWALECHELPIVQMMEHIR+++ +WF +RREM M
Sbjct: 361 PRLWAVAYFEGVRYGHFTLGVTEMLYNWALECHELPIVQMMEHIRHQLTTWFTNRREMGM 420

Query: 421 RWTSILVPSAEKRIAEAIADAHCYQVLRANEVEFEIVSTERTNIVEIHSRVCSCRRWQLY 480
           RWTSILVPSAEKRI+EAIADA CYQVLRANEVEFEIVSTERTNIV+I SRVCSCRRWQLY
Sbjct: 421 RWTSILVPSAEKRISEAIADARCYQVLRANEVEFEIVSTERTNIVDIRSRVCSCRRWQLY 480

Query: 481 GLPCAHAAAALMSCGQNAQVFAEQCFTVDSFRQTYSQMIFPIPDKSLWKEPGEGAEGGGG 540
           GLPCAHAAAAL+SCGQNA +FAE CFTV S+R+TYSQMI PIPDKS+WKE GEGAE GG 
Sbjct: 481 GLPCAHAAAALISCGQNAHMFAEPCFTVGSYRETYSQMIHPIPDKSIWKELGEGAE-GGA 540

Query: 541 AKVDITIRPPKVRRPPGRPKKKVLRVENLKRPKRIVQCGRCHLLGHSQKKCTMPM 596
           AK+DITIRPPK+RRPPGRPKKKVLRVENLKRPKR+VQCGRCHLLGHSQKKCTMP+
Sbjct: 541 AKLDITIRPPKIRRPPGRPKKKVLRVENLKRPKRVVQCGRCHLLGHSQKKCTMPI 594
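
The identity and positives percentages in the HSP summary lines are plain ratios of the counts printed beside them, so they can be rechecked directly. The following is a minimal sketch, not part of the database tooling: the function name `hsp_percentages` and the regular expression are illustrative assumptions, and rounding to two decimals is assumed to match the page.

```python
import re

# Matches the summary line of a BLAST HSP. "Posi?tives" also accepts the
# "Postives" spelling that some exports of these pages contain.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    # Assumption: percentages are rounded to two decimal places.
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    return identity, positives


# Summary line of the Ricinus communis hit, copied from above.
line = "Identity = 532/596 (89.26%), Positives = 570/596 (95.64%), Query Frame = 1"
identity, positives = hsp_percentages(line)
print(identity, positives)
```

The same function applied to the Gossypium raimondii summary line (530/595 and 572/595) gives the values shown for that hit.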

The following BLAST results are available for this feature:
Match Name        E-value  Identity  Description
A0A0A0LN02_CUCSA  0.0e+00  95.97     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G094910 PE=4 SV=1
A0A067JKI4_JATCU  0.0e+00  89.58     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23146 PE=4 SV=1
A0A0D2U3A4_GOSRA  0.0e+00  89.08     Uncharacterized protein OS=Gossypium raimondii GN=B456_013G180100 PE=4 SV=1
A0A061FG29_THECC  0.0e+00  87.27     MuDR family transposase isoform 1 OS=Theobroma cacao GN=TCM_035107 PE=4 SV=1
A0A151TXU4_CAJCA  0.0e+00  87.12     Uncharacterized protein OS=Cajanus cajan GN=KK1_011143 PE=4 SV=1

Match Name   E-value  Identity  Description
AT1G64260.1  5.2e-44  24.58     MuDR family transposase
AT1G49920.1  5.2e-36  22.50     MuDR family transposase

Match Name                         E-value  Identity  Description
gi|449442265|ref|XP_004138902.1|   0.0e+00  95.97     PREDICTED: uncharacterized protein LOC101220272 [Cucumis sativus]
gi|659082246|ref|XP_008441740.1|   0.0e+00  95.97     PREDICTED: uncharacterized protein LOC103485812 [Cucumis melo]
gi|802754907|ref|XP_012088790.1|   0.0e+00  89.58     PREDICTED: uncharacterized protein LOC105647353 [Jatropha curcas]
gi|1000981166|ref|XP_015570632.1|  0.0e+00  89.26     PREDICTED: uncharacterized protein LOC107260770 [Ricinus communis]
gi|823262209|ref|XP_012463849.1|   0.0e+00  89.08     PREDICTED: uncharacterized protein LOC105783147 [Gossypium raimondii]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004332  Transposase_MuDR
IPR006564  Znf_PMZ
IPR007527  Znf_SWIM
IPR018289  MULE_transposase_dom
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh01G005250.1  CmoCh01G005250.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description           Source   Source Term     Source Description   Alignment
IPR004332  Transposase, MuDR, plant  PFAM     PF03108         DBD_Tnp_Mut          coord: 20..83; score: 3.7
IPR006564  Zinc finger, PMZ-type     SMART    SM00575         26again6             coord: 470..497; score: 1.
IPR007527  Zinc finger, SWIM-type    PFAM     PF04434         SWIM                 coord: 472..494; score: 1.
IPR007527  Zinc finger, SWIM-type    PROFILE  PS50966         ZF_SWIM              coord: 454..495; score: 10
IPR018289  MULE transposase domain   PFAM     PF10551         MULE                 coord: 216..312; score: 2.8
None       No IPR available          PANTHER  PTHR31973       FAMILY NOT NAMED     coord: 20..593; score:
None       No IPR available          PANTHER  PTHR31973:SF22  SUBFAMILY NOT NAMED  coord: 20..593; score:
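
The `coord` ranges in the InterPro table can be compared to see which signatures describe the same region of the protein. The sketch below is illustrative only: it assumes the coordinates are 1-based, inclusive amino-acid positions, leaves out the two PANTHER rows (which span nearly the whole protein), and uses the hypothetical helper names `span_length` and `overlaps`.

```python
# Domain hits copied from the table above as (source, source term, start, end).
# Assumption: coordinates are 1-based and inclusive.
hits = [
    ("PFAM", "PF03108", 20, 83),
    ("SMART", "SM00575", 470, 497),
    ("PFAM", "PF04434", 472, 494),
    ("PROFILE", "PS50966", 454, 495),
    ("PFAM", "PF10551", 216, 312),
]


def span_length(start, end):
    """Number of residues covered by an inclusive coordinate range."""
    return end - start + 1


def overlaps(a, b):
    """True if two hits share at least one residue."""
    return a[2] <= b[3] and b[2] <= a[3]


# Residues covered by each signature, keyed by source term.
lengths = {term: span_length(start, end) for _, term, start, end in hits}

# Every pair of signatures whose ranges intersect.
overlapping = [
    (a[1], b[1])
    for i, a in enumerate(hits)
    for b in hits[i + 1:]
    if overlaps(a, b)
]

print(lengths)
print(overlapping)
```

Under these assumptions only the three zinc-finger signatures (SM00575, PF04434, PS50966) overlap one another, which is consistent with the PMZ-type and SWIM-type entries annotating the same C-terminal region.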

The following gene(s) are orthologous to this gene:
Gene            Orthologue      Organism                    Block
CmoCh01G005250  CmaCh01G004910  Cucurbita maxima (Rimu)     cmacmoB468
CmoCh01G005250  Csa2G094910     Cucumber (Chinese Long) v2  cmocuB429
CmoCh01G005250  CSPI02G08050    Wild cucumber (PI 183967)   cmocpiB433
CmoCh01G005250  CsaV3_2G010430  Cucumber (Chinese Long) v3  cmocucB0518
CmoCh01G005250  Bhi06G000808    Wax gourd                   cmowgoB0552
The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                   Block
CmoCh01G005250  CmoCh05G008500  Cucurbita moschata (Rifu)  cmocmoB372