CmaCh04G004830 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G004830
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: PIF / Ping-Pong family of plant transposases
Location: Cma_Chr04 : 2459802 .. 2461106 (-)
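
The location above can be turned into a feature length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name parse_location is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention) and that the string has the form "seqid : start .. end (strand)".

```python
import re

# Location value as shown in the record. The layout
# "<seqid> : <start> .. <end> (<strand>)" is an assumption.
LOCATION = "Cma_Chr04 : 2459802 .. 2461106 (-)"


def parse_location(text):
    """Split a location string into seqid, start, end, strand and length.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }


loc = parse_location(LOCATION)
print(loc["seqid"], loc["strand"], loc["length"])  # Cma_Chr04 - 1305
```

Under these assumptions the feature spans 1305 bases on the minus strand of Cma_Chr04.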
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTCCCGGCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTACTCCTCCTGCTATTGCTTCTCTTCCCTTCCTCCAACCCACATTCTCTTTTGTCCAATTCTTCATCTGATTCCAATTTCTATGCTAATCTCTTTCCTCTCTTCAATCACTTCCTGTTTTCCCAGCAAATTGCTGCATCCCTTTCGTTTCTCTCCGTTTCGCGTAAGAGGAAGAGGACGCATTCGTCGGAGCTGCTTGAATTAGGGCCATCCGATAGCGGTGGTGAGGACGGCGGCCGTGGACGAGTTCATCTGTTGCGGACTCGGAGTCCTGATTCTTTCAGGAATCACTTTCGGATGACCTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCGCTTCTCGAGTGTCGTGACCCGGTAGGTTCGCCTCTTGATCTCTCCGCTGAGATTCGACTCGGTGTTGGCTTGTCCCGGCTGGCCACAGGCTGCGATTTCTCGACGATTTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAGCAATTGTGTCGTGTTCTCTGCACTAATTTTCGCTTTTGGGTTGAATTCCCTTGCCCCAGTGAGCTCGAATTAACATCCTCAGCCTTTGAAGATATTGCTGGGCTTCCGAATTGCTGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATACCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGATAAAGACGACTCCACGGTGCTTATGTCCACAACGCTGTTTAAAGACATTGAAGAAGAAAGGCTACTGGGTTCTCCTCCTGTTTACCTTCATGGGGTGGCTGTGAATCAATACTTGTTTGGACATGGCGACTATCCTTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCATCGCTTGATGTCCATTCCAGCTCTGAAAGCCATCATTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATGCATGAGGAATTCAAAACTGCTGTTGCATACATTGGTGCTTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTACTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTCGATCATAGCTCTCAGTATGTTGGTATTGGATTGAATGAGGATTCACCTGATGAGAAGGCTTCTATGATACAGAAGGCCTTGGCTCTGAGAGCTAGAGAGCTTCACACTTAA

mRNA sequence

ATGGATTCCCGGCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTACTCCTCCTGCTATTGCTTCTCTTCCCTTCCTCCAACCCACATTCTCTTTTGTCCAATTCTTCATCTGATTCCAATTTCTATGCTAATCTCTTTCCTCTCTTCAATCACTTCCTGTTTTCCCAGCAAATTGCTGCATCCCTTTCGTTTCTCTCCGTTTCGCGTAAGAGGAAGAGGACGCATTCGTCGGAGCTGCTTGAATTAGGGCCATCCGATAGCGGTGGTGAGGACGGCGGCCGTGGACGAGTTCATCTGTTGCGGACTCGGAGTCCTGATTCTTTCAGGAATCACTTTCGGATGACCTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCGCTTCTCGAGTGTCGTGACCCGGTAGGTTCGCCTCTTGATCTCTCCGCTGAGATTCGACTCGGTGTTGGCTTGTCCCGGCTGGCCACAGGCTGCGATTTCTCGACGATTTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAGCAATTGTGTCGTGTTCTCTGCACTAATTTTCGCTTTTGGGTTGAATTCCCTTGCCCCAGTGAGCTCGAATTAACATCCTCAGCCTTTGAAGATATTGCTGGGCTTCCGAATTGCTGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATACCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGATAAAGACGACTCCACGGTGCTTATGTCCACAACGCTGTTTAAAGACATTGAAGAAGAAAGGCTACTGGGTTCTCCTCCTGTTTACCTTCATGGGGTGGCTGTGAATCAATACTTGTTTGGACATGGCGACTATCCTTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCATCGCTTGATGTCCATTCCAGCTCTGAAAGCCATCATTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATGCATGAGGAATTCAAAACTGCTGTTGCATACATTGGTGCTTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTACTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTCGATCATAGCTCTCAGTATGTTGGTATTGGATTGAATGAGGATTCACCTGATGAGAAGGCTTCTATGATACAGAAGGCCTTGGCTCTGAGAGCTAGAGAGCTTCACACTTAA

Coding sequence (CDS)

ATGGATTCCCGGCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTACTCCTCCTGCTATTGCTTCTCTTCCCTTCCTCCAACCCACATTCTCTTTTGTCCAATTCTTCATCTGATTCCAATTTCTATGCTAATCTCTTTCCTCTCTTCAATCACTTCCTGTTTTCCCAGCAAATTGCTGCATCCCTTTCGTTTCTCTCCGTTTCGCGTAAGAGGAAGAGGACGCATTCGTCGGAGCTGCTTGAATTAGGGCCATCCGATAGCGGTGGTGAGGACGGCGGCCGTGGACGAGTTCATCTGTTGCGGACTCGGAGTCCTGATTCTTTCAGGAATCACTTTCGGATGACCTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCGCTTCTCGAGTGTCGTGACCCGGTAGGTTCGCCTCTTGATCTCTCCGCTGAGATTCGACTCGGTGTTGGCTTGTCCCGGCTGGCCACAGGCTGCGATTTCTCGACGATTTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAGCAATTGTGTCGTGTTCTCTGCACTAATTTTCGCTTTTGGGTTGAATTCCCTTGCCCCAGTGAGCTCGAATTAACATCCTCAGCCTTTGAAGATATTGCTGGGCTTCCGAATTGCTGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATACCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGATAAAGACGACTCCACGGTGCTTATGTCCACAACGCTGTTTAAAGACATTGAAGAAGAAAGGCTACTGGGTTCTCCTCCTGTTTACCTTCATGGGGTGGCTGTGAATCAATACTTGTTTGGACATGGCGACTATCCTTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCATCGCTTGATGTCCATTCCAGCTCTGAAAGCCATCATTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATGCATGAGGAATTCAAAACTGCTGTTGCATACATTGGTGCTTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTACTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTCGATCATAGCTCTCAGTATGTTGGTATTGGATTGAATGAGGATTCACCTGATGAGAAGGCTTCTATGATACAGAAGGCCTTGGCTCTGAGAGCTAGAGAGCTTCACACTTAA

Protein sequence

MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKASMIQKALALRARELHT
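
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. The following is a minimal sketch; the function name translate is hypothetical, the standard code (NCBI table 1) is assumed to apply, and only the first and last few codons of the CDS are used as examples rather than the full sequence.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last three codons of the CDS shown above.
print(translate("ATGGATTCCCGGCAATTG"))  # MDSRQL
print(translate("CACACTTAA"))           # HT
```

The two fragments translate to the start (MDSRQL) and the end (HT) of the protein sequence listed in the record.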
BLAST of CmaCh04G004830 vs. TrEMBL
Match: A0A0A0LBX6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G202740 PE=4 SV=1)

HSP 1 Score: 732.6 bits (1890), Expect = 2.7e-208
Identity = 373/435 (85.75%), Positives = 400/435 (91.95%), Query Frame = 1

Query: 1   MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQI 60
           MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  
Sbjct: 1   MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDF 60

Query: 61  AASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH-LLRTRSPDSFRNHFRMTSS 120
           AASL FLSVSRKRKRT+ S+ LELG S         GRVH L RTR+PDSFRNHFRMTSS
Sbjct: 61  AASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVHHLFRTRTPDSFRNHFRMTSS 120

Query: 121 TFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARF 180
           TFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARF
Sbjct: 121 TFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARF 180

Query: 181 CAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYE 240
           C+KQLCRVLCTNFRFWVEFPCP+ELELTSSAFED+AGLPNCCGV+SCTRFKIIRN++FYE
Sbjct: 181 CSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYE 240

Query: 241 DSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVN 300
           DS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHGVAVN
Sbjct: 241 DSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN 300

Query: 301 QYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMH 360
           +YLFGHG+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQP+H
Sbjct: 301 KYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIH 360

Query: 361 EEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKAS 420
           EEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKAS
Sbjct: 361 EEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKAS 420

Query: 421 MIQKALALRARELHT 435
           +IQ+ALALRARELH+
Sbjct: 421 VIQRALALRARELHS 424
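
The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them for the hit above; the helper names parse_ratios and percent are hypothetical, and the regular expression only assumes the "Label = n/m (p%)" layout seen in this record.

```python
import re

# HSP summary line for the Cucumis sativus hit shown above.
HSP_LINE = "Identity = 373/435 (85.75%), Positives = 400/435 (91.95%), Query Frame = 1"


def parse_ratios(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    found = re.findall(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line)
    return {label: (int(n), int(m), float(p)) for label, n, m, p in found}


def percent(matches, length):
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


ratios = parse_ratios(HSP_LINE)
for label, (n, m, reported) in ratios.items():
    print(label, percent(n, m), reported)
# Identity 85.75 85.75
# Positives 91.95 91.95
```

The denominator is the number of alignment columns, gaps included, so it can differ from the length of either protein.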

BLAST of CmaCh04G004830 vs. TrEMBL
Match: F6HQ92_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0063g00150 PE=4 SV=1)

HSP 1 Score: 528.5 bits (1360), Expect = 7.8e-147
Identity = 275/443 (62.08%), Positives = 334/443 (75.40%), Query Frame = 1

Query: 1   MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQI 60
           M+SR LAALLSSLIS+LLLLLLLLFPSSNP ++ SNS+S SNFY  +FPL +HFL S ++
Sbjct: 1   MNSRALAALLSSLISELLLLLLLLFPSSNPLTITSNSNSGSNFYETIFPLIHHFLSSAEL 60

Query: 61  AASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPDSFRNHFRMTSST 120
             SLS LS+SRKRKRTH  +L      D  G +  R  + L  T++PDSF+  FRMTSST
Sbjct: 61  VTSLSLLSISRKRKRTHQPDLDNEDEEDEPGSELARFELGL--TQNPDSFKGCFRMTSST 120

Query: 121 FEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFC 180
           FEWLSGLLEPLL+CRDP+GSPL+L+ EIRLG+GL RLATG D+  I+ +FGVSES+ RFC
Sbjct: 121 FEWLSGLLEPLLDCRDPIGSPLNLAPEIRLGIGLFRLATGSDYPEIARRFGVSESITRFC 180

Query: 181 AKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNF--- 240
            KQLCRVLCTNFRFW+ FP P +L+  S++FE + GLPNCCGVI CTRFKI+RN  F   
Sbjct: 181 VKQLCRVLCTNFRFWIAFPSPIDLDSLSTSFEALTGLPNCCGVIDCTRFKIVRNNGFKLS 240

Query: 241 -----YEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVY 300
                 E+SIA Q+VVDSSSRILSIVAGFRGDK +S VL S+TL+KDIE   LL +PPVY
Sbjct: 241 PKEEVREESIAAQIVVDSSSRILSIVAGFRGDKGESRVLKSSTLYKDIEGGSLLNAPPVY 300

Query: 301 LHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWG 360
           ++GV +NQYL G G YPLLPWLMVPF     GS EE+FN AH LM I AL+AI SL++WG
Sbjct: 301 MNGVGINQYLIGDGGYPLLPWLMVPFVDPAPGSYEENFNSAHHLMHISALRAIASLKDWG 360

Query: 361 VLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNED 420
           VL Q +  EFK AVAYIG+C+ILHN LLMR+D++A++D    L     S QY      E+
Sbjct: 361 VLRQTIEGEFKMAVAYIGSCAILHNVLLMRDDYSALSD---GLGDYSQSPQYCRNASLEE 420

Query: 421 SPDEK-ASMIQKALALRARELHT 435
           SP E+ AS+I+ ALA RAR+ H+
Sbjct: 421 SPIERNASVIRNALATRARKFHS 438

BLAST of CmaCh04G004830 vs. TrEMBL
Match: A0A067EX85_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013572mg PE=4 SV=1)

HSP 1 Score: 520.8 bits (1340), Expect = 1.6e-144
Identity = 280/439 (63.78%), Positives = 333/439 (75.85%), Query Frame = 1

Query: 1   MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQI 60
           MDS++L+A LSSL+SQL LLLLLLFP S           D+    NLFPL +HF+ SQQ+
Sbjct: 1   MDSQKLSAFLSSLVSQLFLLLLLLFPDS-----------DATQRTNLFPLISHFISSQQV 60

Query: 61  AASLSFLSVSRKRKRTHSSELLELGPS-DSGGEDGGRGRVHLLRTRSPDSFRNHFRMTSS 120
           AASL+FLS+SRKRKRTHSSE  EL P+ D      G G   L  T+ PDSFRN F+M+SS
Sbjct: 61  AASLTFLSISRKRKRTHSSEE-ELEPTHDDKTSRLGHGLSQLGFTQLPDSFRNSFKMSSS 120

Query: 121 TFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARF 180
           TF WLSGLLEPLL+CRDPVG PL+LSA+IRLG+GL RL  G  +S I+ +F V+ESV RF
Sbjct: 121 TFRWLSGLLEPLLDCRDPVGLPLNLSADIRLGIGLFRLVNGSTYSEIATRFEVTESVTRF 180

Query: 181 CAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIR----NT 240
           C KQLCRVLCTNFRFWV FP P EL L S +FE++ GLPNCCGVI CTRFKII+    N+
Sbjct: 181 CVKQLCRVLCTNFRFWVAFPGPEELGLISKSFEELTGLPNCCGVIDCTRFKIIKIDGSNS 240

Query: 241 NFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHG 300
           +  EDSIA Q+VVDSSSR+LSIVAG RGDK DS VL S+TL+KDIEE++LL S P+ ++G
Sbjct: 241 SKDEDSIAVQIVVDSSSRMLSIVAGIRGDKGDSRVLKSSTLYKDIEEKKLLNSSPICVNG 300

Query: 301 VAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLS 360
           VAV+QYL G G YPLLPWLMVPF  A  GS+EE+FN AH LM +PALKAI SL+NWGVLS
Sbjct: 301 VAVDQYLIGDGGYPLLPWLMVPFVDANPGSSEENFNAAHNLMRVPALKAIASLKNWGVLS 360

Query: 361 QPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVG-IGLNEDSP 420
           +P+ E+FKTAVA IGACSILHNALLMREDF+ + +E    +  D SSQY     L E+S 
Sbjct: 361 RPIDEDFKTAVALIGACSILHNALLMREDFSGLFEELGDYSLHDESSQYYSDASLEENST 420

Query: 421 DEKASMIQKALALRARELH 434
           ++KAS I+ ALA RAR  H
Sbjct: 421 EKKASAIRSALATRARVQH 427

BLAST of CmaCh04G004830 vs. TrEMBL
Match: B9RYF7_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0812810 PE=4 SV=1)

HSP 1 Score: 489.6 bits (1259), Expect = 4.0e-135
Identity = 270/448 (60.27%), Positives = 328/448 (73.21%), Query Frame = 1

Query: 1   MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQI 60
           M+S++LAAL+SSLIS LLLLLLLLFPSSN     SN  + SN YANLFPL +H L SQ+ 
Sbjct: 1   MESKKLAALISSLISDLLLLLLLLFPSSNSFDSSSNHFN-SNCYANLFPLIHHLLSSQET 60

Query: 61  AASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLR-----TRSPDSFRNHFR 120
           AASLS L++S+KRKRTH SE      S+S  ED   G  H L       ++PDSFR  F+
Sbjct: 61  AASLSILNLSKKRKRTHFSE----PDSESTHEDKSHGPFHRLSELARVVQNPDSFRTFFK 120

Query: 121 MTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSES 180
           M +STFEWLSGLLEPLL+CRDP+GSPL LSAE+RLGVGL RLATG ++S I+D+FGV+ES
Sbjct: 121 MKASTFEWLSGLLEPLLDCRDPIGSPLSLSAELRLGVGLFRLATGSNYSEIADRFGVTES 180

Query: 181 VARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIR-- 240
            ARFCAKQLCRVLCTNFRFWV FP P EL+  S+AFE + GLPNCCGVI   RF +++  
Sbjct: 181 AARFCAKQLCRVLCTNFRFWVSFPSPVELQSVSNAFEKLIGLPNCCGVIDSARFNLVKKA 240

Query: 241 ------NTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLG 300
                 N    +D IA Q+VVDSSSRILSIVAGFRG+K +S +L STTL+KDIE  R+L 
Sbjct: 241 DDKLASNGKDQDDMIAAQIVVDSSSRILSIVAGFRGEKGNSRMLKSTTLYKDIEGGRVLN 300

Query: 301 SPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIIS 360
           S P  ++GVA+N+YL G G YPLLPWLMVPF  A+ GS EE FN+A+ LM + +L+AI S
Sbjct: 301 SSPEIVNGVAINRYLIGGGRYPLLPWLMVPFLDALPGSCEEKFNKANDLMRVSSLRAIAS 360

Query: 361 LRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMAD-EWESLASLDHSSQYVG 420
           L+NWGVLS+P+ EEFKTAVA IGACSILHNALLMRED +A+ D    SL +   S  ++ 
Sbjct: 361 LKNWGVLSRPIQEEFKTAVALIGACSILHNALLMREDDSALLDMGGYSLYNQQCSQHFMD 420

Query: 421 IGLNEDSP-DEKASMIQKALALRARELH 434
             + + S  D KAS I+ ALA +    H
Sbjct: 421 AEVEDISRIDGKASEIRNALATKVAVFH 443

BLAST of CmaCh04G004830 vs. TrEMBL
Match: A0A061E009_THECC (PIF / Ping-Pong family of plant transposases OS=Theobroma cacao GN=TCM_007086 PE=4 SV=1)

HSP 1 Score: 487.6 bits (1254), Expect = 1.5e-134
Identity = 264/439 (60.14%), Positives = 315/439 (71.75%), Query Frame = 1

Query: 1   MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQI 60
           MD R+L+AL+SSL+SQLLLLL + F S+N +  +S+         NLF + N+ L SQ+I
Sbjct: 1   MDPRKLSALVSSLVSQLLLLLPIFFNSTNSNDFVSDR--------NLFSVLNYLLSSQEI 60

Query: 61  AASLSFLSVSRKRKRTHSSE-----LLELGPSDSGGEDGGRGRVHLLRTRSPDSFRNHFR 120
           AA+LSF+SVSRKRKRT  SE     ++E    + G   G   RV L  TR PD F+  FR
Sbjct: 61  AATLSFVSVSRKRKRTQCSESDSEPIVEERDQELGHRLGD-DRVRLGLTRDPDLFKACFR 120

Query: 121 MTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSES 180
           M SSTFEWL+GLLEPLLECRDPVGSPL+LSAE+RLG+GL RLATG  +  I+ +FGVSES
Sbjct: 121 MKSSTFEWLAGLLEPLLECRDPVGSPLNLSAELRLGIGLFRLATGSSYPEIAQRFGVSES 180

Query: 181 VARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNT 240
           V RFC K LCRVLCTNFRFWV FP P EL+  S +FE   GLPNCCGVI CTRF I+   
Sbjct: 181 VTRFCTKHLCRVLCTNFRFWVAFPSPEELKSVSLSFEQFTGLPNCCGVIDCTRFNIVNEN 240

Query: 241 NFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHG 300
           N   DS+A Q+VVDSSS+ILSIVAGF+GDK DS VL S+TL+KD+EE RLL S PV ++G
Sbjct: 241 NGSIDSVAAQIVVDSSSKILSIVAGFKGDKGDSRVLKSSTLYKDVEEGRLLNSSPVLVNG 300

Query: 301 VAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLS 360
           VA+NQYL G G YPLLPWLMVPF   V GS+E  FN AHR M + ALK I SL+NWG+L 
Sbjct: 301 VAINQYLVGDGAYPLLPWLMVPFVDVVPGSSEGKFNVAHRAMHVSALKTIASLKNWGILK 360

Query: 361 QPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVG-IGLNEDSP 420
           +PM EE K AVA IGACSILHN LLMRED +A+ +        D SSQ  G   L E+S 
Sbjct: 361 KPMEEELKAAVAIIGACSILHNILLMREDDSALCELVGDYLVHDQSSQCYGEASLEENSI 420

Query: 421 DEKASMIQKALALRARELH 434
            ++AS+I+ ALA  ARE H
Sbjct: 421 GKEASVIRDALATEAREAH 430

BLAST of CmaCh04G004830 vs. TAIR10
Match: AT3G55350.1 (AT3G55350.1 PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 127.9 bits (320), Expect = 1.5e-29
Identity = 89/305 (29.18%), Positives = 146/305 (47.87%), Query Frame = 1

Query: 107 PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGC 166
           P +F + F+++  TF+++  L++     +     D  G+PL L+   R+ V L RL +G 
Sbjct: 69  PKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLND--RVAVALRRLGSGE 128

Query: 167 DFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAG 226
             S I + FG+++S       RF      R +  +   W     PS+L+   S FE I+G
Sbjct: 129 SLSVIGETFGMNQSTVSQITWRFVESMEERAI--HHLSW-----PSKLDEIKSKFEKISG 188

Query: 227 LPNCCGVISCTRFKIIRNTNFYEDS------------IATQLVVDSSSRILSIVAGFRGD 286
           LPNCCG I  T   I+ N    E S            +  Q VVD   R L ++AG+ G 
Sbjct: 189 LPNCCGAIDITH--IVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGWPGS 248

Query: 287 KDDSTVLMSTTLFKDIEE-ERLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVS 346
            +D  VL ++  +K +E+ +RL G          + +Y+ G   +PLLPWL+ P+ G  +
Sbjct: 249 LNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQGKPT 308

Query: 347 GSTEESFNEAHRLMSIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLM 387
              +  FN+ H   +  A  A+  L++ W +++  M    +  +   I  C +LHN ++ 
Sbjct: 309 SLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNIIID 362

BLAST of CmaCh04G004830 vs. TAIR10
Match: AT3G63270.1 (AT3G63270.1 Putative harbinger transposase-derived nuclease (InterPro:IPR006912))

HSP 1 Score: 120.2 bits (300), Expect = 3.2e-27
Identity = 92/342 (26.90%), Positives = 151/342 (44.15%), Query Frame = 1

Query: 67  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSS 126
           L  ++K  +    + +   P D    D        LR  SP        +F++ FR + +
Sbjct: 15  LDKAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVPSDEDYAFKHFFRASKT 74

Query: 127 TFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSES 186
           TF ++  L+   L  R P G        LS E ++ + L RLA+G    ++   FGV +S
Sbjct: 75  TFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQS 134

Query: 187 VARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTR----FKI 246
                  +    L    +  + +P    +E   S FE++ GLPNCCG I  T        
Sbjct: 135 TVSQVTWRFIEALEERAKHHLRWPDSDRIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPA 194

Query: 247 IRNTNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL 306
           ++ ++ + D     S+  Q V D   R L++V G+ G    S +L  +  FK  E  ++L
Sbjct: 195 VQASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQIL 254

Query: 307 -GSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAI 366
            G+P     G  + +Y+ G   YPLLPWL+ P        +  +FNE H  +   A  A 
Sbjct: 255 DGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAF 314

Query: 367 ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF 386
             L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Sbjct: 315 QQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDY 356

BLAST of CmaCh04G004830 vs. TAIR10
Match: AT3G19120.1 (AT3G19120.1 PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 97.4 bits (241), Expect = 2.2e-20
Identity = 102/393 (25.95%), Positives = 167/393 (42.49%), Query Frame = 1

Query: 12  SLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQQIAASLSFLSVS 71
           +++S LL L   L P+S   S  S SS  S   ++L    +   L    +A+ LSFL+V+
Sbjct: 7   AMLSHLLHLQNSLDPTSTLFSSASTSSQSSTTPSSLLSTSSAAPLLFFTLASLLSFLAVN 66

Query: 72  RKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPDS------------FRNHFRMTS 131
           R    + SS      PS       G   V   R  + D             +R+ + ++ 
Sbjct: 67  RSSTESSSSSESP-SPSPPPPLADGDYSVAAFRALTTDHIWSLDAPLRDARWRSLYGLSY 126

Query: 132 STFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVAR 191
             F  +   L+P +       S L L A+  + + LSRLA GC   T++ ++ +   +  
Sbjct: 127 PVFITVVDKLKPFI-----TASNLSLPADYAVAMVLSRLAHGCSAKTLASRYSLDPYLIS 186

Query: 192 FCAKQLCRVLCTN-FRFWVEFPCPSE-LELTSSAFEDIAGLPNCCGVISCTRFKIIRNT- 251
                + R+L T  +  +++ P     L  T+  FE++  LPN CG I  T  K+ R T 
Sbjct: 187 KITNMVTRLLATKLYPEFIKIPVGKRRLIETTQGFEELTSLPNICGAIDSTPVKLRRRTK 246

Query: 252 ----NFYE-----DSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL 311
               N Y      D++  Q+V D       +     G +DDS+    + L+K +    ++
Sbjct: 247 LNPRNIYGCKYGYDAVLLQVVADHKKIFWDVCVKAPGGEDDSSHFRDSLLYKRLTSGDIV 306

Query: 312 GSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAII 371
               + + G  V  Y+ G   YPLL +LM PF+   SG+  E+  +   +     +   I
Sbjct: 307 WEKVINIRGHHVRPYIVGDWCYPLLSFLMTPFSPNGSGTPPENLFDGMLMKGRSVVVEAI 366

Query: 372 SL--RNWGVLSQPMHEEFKTAVAYIGACSILHN 378
            L    W +L Q ++     A   I AC +LHN
Sbjct: 367 GLLKARWKIL-QSLNVGVNHAPQTIVACCVLHN 392

BLAST of CmaCh04G004830 vs. TAIR10
Match: AT4G29780.1 (AT4G29780.1 unknown protein)

HSP 1 Score: 95.9 bits (237), Expect = 6.5e-20
Identity = 80/302 (26.49%), Positives = 128/302 (42.38%), Query Frame = 1

Query: 108 DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSAEIRLGVGLSRLATGCD 167
           D FR  FRM+ STF  +   L+  +       RD + +P       R+GV + RLATG  
Sbjct: 211 DEFRREFRMSKSTFNLICEELDTTVTKKNTMLRDAIPAPK------RVGVCVWRLATGAP 270

Query: 168 FSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEFPCPSELELTSSAFEDIAGLP 227
              +S++FG+  S       ++CR    VL   +  W   P  SE+  T + FE +  +P
Sbjct: 271 LRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLW---PSDSEINSTKAKFESVHKIP 330

Query: 228 NCCGVISCTRFKII------------RNTNFYED---SIATQLVVDSSSRILSIVAGFRG 287
           N  G I  T   II            R+T   +    SI  Q VV++      +  G  G
Sbjct: 331 NVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVVNADGIFTDVCIGNPG 390

Query: 288 DKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVS 347
              D  +L  ++L           S      G+  + ++ G+  +PL  +L+VP+     
Sbjct: 391 SLTDDQILEKSSL-----------SRQRAARGMLRDSWIVGNSGFPLTDYLLVPYTRQNL 450

Query: 348 GSTEESFNEAHRLMSIPALKAIISLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMR 385
             T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR
Sbjct: 451 TWTQHAFNESIGEIQGIATAAFERLKGRWACLQKRTEVKLQDLPYVLGACCVLHNICEMR 492

BLAST of CmaCh04G004830 vs. NCBI nr
Match: gi|778688571|ref|XP_011652780.1| (PREDICTED: uncharacterized protein LOC101203312 [Cucumis sativus])

HSP 1 Score: 732.6 bits (1890), Expect = 3.9e-208
Identity = 373/435 (85.75%), Positives = 400/435 (91.95%), Query Frame = 1

Query: 1   MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQI 60
           MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  
Sbjct: 1   MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDF 60

Query: 61  AASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH-LLRTRSPDSFRNHFRMTSS 120
           AASL FLSVSRKRKRT+ S+ LELG S         GRVH L RTR+PDSFRNHFRMTSS
Sbjct: 61  AASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVHHLFRTRTPDSFRNHFRMTSS 120

Query: 121 TFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARF 180
           TFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARF
Sbjct: 121 TFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARF 180

Query: 181 CAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYE 240
           C+KQLCRVLCTNFRFWVEFPCP+ELELTSSAFED+AGLPNCCGV+SCTRFKIIRN++FYE
Sbjct: 181 CSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYE 240

Query: 241 DSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVN 300
           DS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHGVAVN
Sbjct: 241 DSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN 300

Query: 301 QYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMH 360
           +YLFGHG+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQP+H
Sbjct: 301 KYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIH 360

Query: 361 EEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKAS 420
           EEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKAS
Sbjct: 361 EEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKAS 420

Query: 421 MIQKALALRARELHT 435
           +IQ+ALALRARELH+
Sbjct: 421 VIQRALALRARELHS 424

BLAST of CmaCh04G004830 vs. NCBI nr
Match: gi|659112261|ref|XP_008456140.1| (PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo])

HSP 1 Score: 652.1 bits (1681), Expect = 6.7e-184
Identity = 324/380 (85.26%), Positives = 349/380 (91.84%), Query Frame = 1

Query: 56  FSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH-LLRTRSPDSFRNHF 115
           F +  AASL FLSVSRKRKRT+  + LELG S         GRVH L RTR+PDSFRNHF
Sbjct: 10  FPRIFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVHHLFRTRTPDSFRNHF 69

Query: 116 RMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSE 175
           RMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSE
Sbjct: 70  RMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSE 129

Query: 176 SVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRN 235
           SVARFC+KQLCRVLCTNFRFWVEFPCP+ELELTSSAFED+AGLPNCCGV+SCTRFKIIRN
Sbjct: 130 SVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRN 189

Query: 236 TNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLH 295
           ++FYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLH
Sbjct: 190 SHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLH 249

Query: 296 GVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVL 355
           GVAVN+YLFG G+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVL
Sbjct: 250 GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVL 309

Query: 356 SQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSP 415
           SQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS 
Sbjct: 310 SQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDST 369

Query: 416 DEKASMIQKALALRARELHT 435
           +EKAS+IQ+ALA RARELH+
Sbjct: 370 NEKASVIQRALAQRARELHS 381

BLAST of CmaCh04G004830 vs. NCBI nr
Match: gi|731383279|ref|XP_010647732.1| (PREDICTED: putative nuclease HARBI1 [Vitis vinifera])

HSP 1 Score: 528.5 bits (1360), Expect = 1.1e-146
Identity = 275/443 (62.08%), Positives = 334/443 (75.40%), Query Frame = 1

Query: 1   MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQI 60
           M+SR LAALLSSLIS+LLLLLLLLFPSSNP ++ SNS+S SNFY  +FPL +HFL S ++
Sbjct: 1   MNSRALAALLSSLISELLLLLLLLFPSSNPLTITSNSNSGSNFYETIFPLIHHFLSSAEL 60

Query: 61  AASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPDSFRNHFRMTSST 120
             SLS LS+SRKRKRTH  +L      D  G +  R  + L  T++PDSF+  FRMTSST
Sbjct: 61  VTSLSLLSISRKRKRTHQPDLDNEDEEDEPGSELARFELGL--TQNPDSFKGCFRMTSST 120

Query: 121 FEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFC 180
           FEWLSGLLEPLL+CRDP+GSPL+L+ EIRLG+GL RLATG D+  I+ +FGVSES+ RFC
Sbjct: 121 FEWLSGLLEPLLDCRDPIGSPLNLAPEIRLGIGLFRLATGSDYPEIARRFGVSESITRFC 180

Query: 181 AKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNF--- 240
            KQLCRVLCTNFRFW+ FP P +L+  S++FE + GLPNCCGVI CTRFKI+RN  F   
Sbjct: 181 VKQLCRVLCTNFRFWIAFPSPIDLDSLSTSFEALTGLPNCCGVIDCTRFKIVRNNGFKLS 240

Query: 241 -----YEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVY 300
                 E+SIA Q+VVDSSSRILSIVAGFRGDK +S VL S+TL+KDIE   LL +PPVY
Sbjct: 241 PKEEVREESIAAQIVVDSSSRILSIVAGFRGDKGESRVLKSSTLYKDIEGGSLLNAPPVY 300

Query: 301 LHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWG 360
           ++GV +NQYL G G YPLLPWLMVPF     GS EE+FN AH LM I AL+AI SL++WG
Sbjct: 301 MNGVGINQYLIGDGGYPLLPWLMVPFVDPAPGSYEENFNSAHHLMHISALRAIASLKDWG 360

Query: 361 VLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNED 420
           VL Q +  EFK AVAYIG+C+ILHN LLMR+D++A++D    L     S QY      E+
Sbjct: 361 VLRQTIEGEFKMAVAYIGSCAILHNVLLMRDDYSALSD---GLGDYSQSPQYCRNASLEE 420

Query: 421 SPDEK-ASMIQKALALRARELHT 435
           SP E+ AS+I+ ALA RAR+ H+
Sbjct: 421 SPIERNASVIRNALATRARKFHS 438

BLAST of CmaCh04G004830 vs. NCBI nr
Match: gi|568867441|ref|XP_006487046.1| (PREDICTED: putative nuclease HARBI1 [Citrus sinensis])

HSP 1 Score: 521.9 bits (1343), Expect = 1.0e-144
Identity = 281/439 (64.01%), Positives = 333/439 (75.85%), Query Frame = 1

Query: 1   MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQI 60
           MDS++L+A LSSL+SQL LLLLLLFP S           DS    NLFPL +HF+ SQQ+
Sbjct: 1   MDSQKLSAFLSSLVSQLFLLLLLLFPDS-----------DSTQRTNLFPLISHFISSQQV 60

Query: 61  AASLSFLSVSRKRKRTHSSELLELGPS-DSGGEDGGRGRVHLLRTRSPDSFRNHFRMTSS 120
           AASL+FLS+SRKRKRTHSSE  EL P+ D      G G   L  T+ PDSFRN F+M+SS
Sbjct: 61  AASLTFLSISRKRKRTHSSEE-ELEPTHDDKTSRLGHGLSQLGFTQLPDSFRNSFKMSSS 120

Query: 121 TFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARF 180
           TF WLSGLLEPLL+CRDPVG PL+LSA+IRLG+GL RL  G  +S I+ +F V+ESV RF
Sbjct: 121 TFRWLSGLLEPLLDCRDPVGLPLNLSADIRLGIGLFRLVNGSTYSEIATRFEVTESVTRF 180

Query: 181 CAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIR----NT 240
           C KQLCRVLCTNFRFWV FP P EL L S +FE++ GLPNCCGVI CTRFKII+    N+
Sbjct: 181 CVKQLCRVLCTNFRFWVAFPGPEELGLISKSFEELTGLPNCCGVIDCTRFKIIKIDGSNS 240

Query: 241 NFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHG 300
           +  EDSIA Q+VVDSSSR+LSIVAG RGDK DS VL S+TL+KDIEE++LL S P+ ++G
Sbjct: 241 SKDEDSIAVQIVVDSSSRMLSIVAGIRGDKGDSRVLKSSTLYKDIEEKKLLNSSPICVNG 300

Query: 301 VAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLS 360
           VAV+QYL G G YPLLPWLMVPF  A  GS+EE+FN AH LM +PALKAI SL+NWGVLS
Sbjct: 301 VAVDQYLIGDGGYPLLPWLMVPFVDANPGSSEENFNAAHNLMRVPALKAIASLKNWGVLS 360

Query: 361 QPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVG-IGLNEDSP 420
           +P+ E+FKTAVA IGACSILHNALLMREDF+ + +E    +  D SSQY     L E+S 
Sbjct: 361 RPIDEDFKTAVALIGACSILHNALLMREDFSGLFEELGDYSLHDESSQYYSDASLEENST 420

Query: 421 DEKASMIQKALALRARELH 434
           ++KAS I+ ALA RAR  H
Sbjct: 421 EKKASAIRSALATRARVQH 427

BLAST of CmaCh04G004830 vs. NCBI nr
Match: gi|641840832|gb|KDO59749.1| (hypothetical protein CISIN_1g013572mg [Citrus sinensis])

HSP 1 Score: 520.8 bits (1340), Expect = 2.3e-144
Identity = 280/439 (63.78%), Positives = 333/439 (75.85%), Query Frame = 1

Query: 1   MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQI 60
           MDS++L+A LSSL+SQL LLLLLLFP S           D+    NLFPL +HF+ SQQ+
Sbjct: 1   MDSQKLSAFLSSLVSQLFLLLLLLFPDS-----------DATQRTNLFPLISHFISSQQV 60

Query: 61  AASLSFLSVSRKRKRTHSSELLELGPS-DSGGEDGGRGRVHLLRTRSPDSFRNHFRMTSS 120
           AASL+FLS+SRKRKRTHSSE  EL P+ D      G G   L  T+ PDSFRN F+M+SS
Sbjct: 61  AASLTFLSISRKRKRTHSSEE-ELEPTHDDKTSRLGHGLSQLGFTQLPDSFRNSFKMSSS 120

Query: 121 TFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARF 180
           TF WLSGLLEPLL+CRDPVG PL+LSA+IRLG+GL RL  G  +S I+ +F V+ESV RF
Sbjct: 121 TFRWLSGLLEPLLDCRDPVGLPLNLSADIRLGIGLFRLVNGSTYSEIATRFEVTESVTRF 180

Query: 181 CAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIR----NT 240
           C KQLCRVLCTNFRFWV FP P EL L S +FE++ GLPNCCGVI CTRFKII+    N+
Sbjct: 181 CVKQLCRVLCTNFRFWVAFPGPEELGLISKSFEELTGLPNCCGVIDCTRFKIIKIDGSNS 240

Query: 241 NFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHG 300
           +  EDSIA Q+VVDSSSR+LSIVAG RGDK DS VL S+TL+KDIEE++LL S P+ ++G
Sbjct: 241 SKDEDSIAVQIVVDSSSRMLSIVAGIRGDKGDSRVLKSSTLYKDIEEKKLLNSSPICVNG 300

Query: 301 VAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLS 360
           VAV+QYL G G YPLLPWLMVPF  A  GS+EE+FN AH LM +PALKAI SL+NWGVLS
Sbjct: 301 VAVDQYLIGDGGYPLLPWLMVPFVDANPGSSEENFNAAHNLMRVPALKAIASLKNWGVLS 360

Query: 361 QPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVG-IGLNEDSP 420
           +P+ E+FKTAVA IGACSILHNALLMREDF+ + +E    +  D SSQY     L E+S 
Sbjct: 361 RPIDEDFKTAVALIGACSILHNALLMREDFSGLFEELGDYSLHDESSQYYSDASLEENST 420

Query: 421 DEKASMIQKALALRARELH 434
           ++KAS I+ ALA RAR  H
Sbjct: 421 EKKASAIRSALATRARVQH 427

The following BLAST results are available for this feature:

TrEMBL:
Match Name         E-value    Identity (%)  Description
A0A0A0LBX6_CUCSA   2.7e-208   85.75         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G202740 PE=4 SV=1
F6HQ92_VITVI       7.8e-147   62.08         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0063g00150 PE=4 SV=...
A0A067EX85_CITSI   1.6e-144   63.78         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013572mg PE=4 SV=1
B9RYF7_RICCO       4.0e-135   60.27         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0812810 PE=4 SV=1
A0A061E009_THECC   1.5e-134   60.14         PIF / Ping-Pong family of plant transposases OS=Theobroma cacao GN=TCM_007086 PE...

TAIR10:
Match Name    E-value   Identity (%)  Description
AT3G55350.1   1.5e-29   29.18         PIF / Ping-Pong family of plant transposases
AT3G63270.1   3.2e-27   26.90         Putative harbinger transposase-derived nuclease (InterPro:IPR006912)
AT3G19120.1   2.2e-20   25.95         PIF / Ping-Pong family of plant transposases
AT4G29780.1   6.5e-20   26.49         unknown protein

NCBI nr:
Match Name                         E-value    Identity (%)  Description
gi|778688571|ref|XP_011652780.1|   3.9e-208   85.75         PREDICTED: uncharacterized protein LOC101203312 [Cucumis sativus]
gi|659112261|ref|XP_008456140.1|   6.7e-184   85.26         PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo]
gi|731383279|ref|XP_010647732.1|   1.1e-146   62.08         PREDICTED: putative nuclease HARBI1 [Vitis vinifera]
gi|568867441|ref|XP_006487046.1|   1.0e-144   64.01         PREDICTED: putative nuclease HARBI1 [Citrus sinensis]
gi|641840832|gb|KDO59749.1|        2.3e-144   63.78         hypothetical protein CISIN_1g013572mg [Citrus sinensis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR027806  HARBI1_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh04G004830.1  CmaCh04G004830.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                                Source   Source Term     Source Description   Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM     PF13359         DDE_Tnp_4            coord: 236..377, score: 8.1
None       No IPR available                               PANTHER  PTHR22930       UNCHARACTERIZED      coord: 34..429, score: 8.7E
None       No IPR available                               PANTHER  PTHR22930:SF49  SUBFAMILY NOT NAMED  coord: 34..429, score: 8.7E

The following gene(s) are paralogous to this gene:

None