Cmc12g0322371 (gene) Melon (Charmono) v1.1

Overview
NameCmc12g0322371
Typegene
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionProtein ALP1-like
LocationCMiso1.1chr12: 10194055 .. 10195162 (-)
RNA-Seq ExpressionCmc12g0322371
SyntenyCmc12g0322371
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGCTTGCACAGGATACAATCTTCAAAACAACCATGTAGAACCTCTGCACTAAGAGGACATGATTATGTGATTAAGTTGTTAAATGGAAACGATAGTAGATGTTTTGATTGTTTTAGGATGAAAAGAATTACATTCATAAGGTTTTGTGAAGATTTAAAATCAAAGACGAATCTGAAATCATCTAGATATCTTACCGTTCAAGAAAAAGTTGTTGTATTCTTATTAATCATATCACATAATGAAAGCAATTGTATAGCAGCAGAAAGATTTCAACATTCGGGTCATACATTTTCTCTAGCTTTTAACCTTGTTTTGAGGAAAGTTTGCAAGCTTGGTAGAGAAATTATTCGTCCACCCAATATGGACAATATATCAATGGAGATCGTATCAAATTCAAAATATTACCCTTTCTTTAAGGTAATGCGTTAATGTTTTGATTTATTTTCAGAAATAAAATGTTTTGAAATATATTTGTGTTTTTTATTATCCTTTAGGATTGTATTGGTGCTAGTGATGGCACTCATGTTGTTGCAAGTATTCTCCAAAATGAACAAATATCGTTTCCTGGAAATAAAACTAACACGACATGGAATATAATGTGTGTTTGTTCATTTGATATGTTATTCACGTATGTCATGTCTGGTTGGGAAGGATCAGCCAATGATTCTCGCATACTTCTAGAATGTATCAAGAATCCCGAGAATAAATTTCCTATGTCTAAGAGAGGTAAATAATTATGTATTTTTCAGAACTTTAATTCCATATTTGACTTTATGATAATTTTTTATTTGCTTTATTATATGCAGATCAATACTACCTTGTCGATTTAGGATATTCAAATATGCCAGGATTTTTAGCACCATTTCGAGGTCAAAGATATCATTTACGAGATTTTAGAGAAAGGAGACATCGCCCTCGAAATAGAGAAGAAGTGTTTAATTATCGACATTCTTCACTGCGAAATGTCATTGAACGTTGTTTTGGTGTTCTGAAGGCTCGATTTCCTATTCTTAAGCAAATGTCTCCTTACTCGATCAAGACACAAAAATATATACCGATAGCATGTTGCACAATTCACAATTATATTAGATTGAATGGTCGTTAA

mRNA sequence

ATGAGCTTGCACAGGATACAATCTTCAAAACAACCATGTAGAACCTCTGCACTAAGAGGACATGATTATGTGATTAAGTTGTTAAATGGAAACGATAGTAGATGTTTTGATTGTTTTAGGATGAAAAGAATTACATTCATAAGGTTTTGTGAAGATTTAAAATCAAAGACGAATCTGAAATCATCTAGATATCTTACCGTTCAAGAAAAAGTTGTTGTATTCTTATTAATCATATCACATAATGAAAGCAATTGTATAGCAGCAGAAAGATTTCAACATTCGGGTCATACATTTTCTCTAGCTTTTAACCTTGTTTTGAGGAAAGTTTGCAAGCTTGGTAGAGAAATTATTCGTCCACCCAATATGGACAATATATCAATGGAGATCGTATCAAATTCAAAATATTACCCTTTCTTTAAGGATTGTATTGGTGCTAGTGATGGCACTCATGTTGTTGCAAGTATTCTCCAAAATGAACAAATATCGTTTCCTGGAAATAAAACTAACACGACATGGAATATAATGTGTGTTTGTTCATTTGATATGTTATTCACGTATGTCATGTCTGGTTGGGAAGGATCAGCCAATGATTCTCGCATACTTCTAGAATGTATCAAGAATCCCGAGAATAAATTTCCTATGTCTAAGAGAGATCAATACTACCTTGTCGATTTAGGATATTCAAATATGCCAGGATTTTTAGCACCATTTCGAGGTCAAAGATATCATTTACGAGATTTTAGAGAAAGGAGACATCGCCCTCGAAATAGAGAAGAAGTGTTTAATTATCGACATTCTTCACTGCGAAATGTCATTGAACGTTGTTTTGGTGTTCTGAAGGCTCGATTTCCTATTCTTAAGCAAATGTCTCCTTACTCGATCAAGACACAAAAATATATACCGATAGCATGTTGCACAATTCACAATTATATTAGATTGAATGGTCGTTAA

Coding sequence (CDS)

ATGAGCTTGCACAGGATACAATCTTCAAAACAACCATGTAGAACCTCTGCACTAAGAGGACATGATTATGTGATTAAGTTGTTAAATGGAAACGATAGTAGATGTTTTGATTGTTTTAGGATGAAAAGAATTACATTCATAAGGTTTTGTGAAGATTTAAAATCAAAGACGAATCTGAAATCATCTAGATATCTTACCGTTCAAGAAAAAGTTGTTGTATTCTTATTAATCATATCACATAATGAAAGCAATTGTATAGCAGCAGAAAGATTTCAACATTCGGGTCATACATTTTCTCTAGCTTTTAACCTTGTTTTGAGGAAAGTTTGCAAGCTTGGTAGAGAAATTATTCGTCCACCCAATATGGACAATATATCAATGGAGATCGTATCAAATTCAAAATATTACCCTTTCTTTAAGGATTGTATTGGTGCTAGTGATGGCACTCATGTTGTTGCAAGTATTCTCCAAAATGAACAAATATCGTTTCCTGGAAATAAAACTAACACGACATGGAATATAATGTGTGTTTGTTCATTTGATATGTTATTCACGTATGTCATGTCTGGTTGGGAAGGATCAGCCAATGATTCTCGCATACTTCTAGAATGTATCAAGAATCCCGAGAATAAATTTCCTATGTCTAAGAGAGATCAATACTACCTTGTCGATTTAGGATATTCAAATATGCCAGGATTTTTAGCACCATTTCGAGGTCAAAGATATCATTTACGAGATTTTAGAGAAAGGAGACATCGCCCTCGAAATAGAGAAGAAGTGTTTAATTATCGACATTCTTCACTGCGAAATGTCATTGAACGTTGTTTTGGTGTTCTGAAGGCTCGATTTCCTATTCTTAAGCAAATGTCTCCTTACTCGATCAAGACACAAAAATATATACCGATAGCATGTTGCACAATTCACAATTATATTAGATTGAATGGTCGTTAA

Protein sequence

MSLHRIQSSKQPCRTSALRGHDYVIKLLNGNDSRCFDCFRMKRITFIRFCEDLKSKTNLKSSRYLTVQEKVVVFLLIISHNESNCIAAERFQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDNISMEIVSNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMSGWEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRERRHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPIACCTIHNYIRLNGR
Homology
BLAST of Cmc12g0322371 vs. NCBI nr
Match: TYK06269.1 (protein ALP1-like [Cucumis melo var. makuwa])

HSP 1 Score: 436.8 bits (1122), Expect = 1.6e-118
Identity = 205/229 (89.52%), Postives = 210/229 (91.70%), Query Frame = 0

Query: 88  AERFQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDNISMEIVSNSKYYPFFKDCIGASD 147
           AERFQHSGHT SLAFN VLRKVCKLG EIIRPPNMD ++ +IVSNSKYYPFFKDCIGA D
Sbjct: 21  AERFQHSGHTISLAFNKVLRKVCKLGGEIIRPPNMDTVATKIVSNSKYYPFFKDCIGAID 80

Query: 148 GTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMSGWEGSANDSRILLECIKN 207
           GTHV ASI QNEQI F G KTNTTWNIMCVCSFDMLFTYVMSGWEGSANDSRIL ECIKN
Sbjct: 81  GTHVAASIPQNEQIPFRGRKTNTTWNIMCVCSFDMLFTYVMSGWEGSANDSRILQECIKN 140

Query: 208 PENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRERRHRPRNREEVFNYRHSS 267
           PENKFPM KRDQYYLV+ GYSNMPGFLAPFRGQRYHLRDFRERRHRPR REEVFNYRHSS
Sbjct: 141 PENKFPMPKRDQYYLVNSGYSNMPGFLAPFRGQRYHLRDFRERRHRPRGREEVFNYRHSS 200

Query: 268 LRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPIACCTIHNYIRLNGR 317
           LRNVIERCFGVLKARFPILKQM PY IKTQKYIPI CCT+HNYIRLN R
Sbjct: 201 LRNVIERCFGVLKARFPILKQMPPYPIKTQKYIPITCCTVHNYIRLNDR 249

BLAST of Cmc12g0322371 vs. NCBI nr
Match: KAF7123090.1 (hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii])

HSP 1 Score: 423.7 bits (1088), Expect = 1.4e-114
Identity = 193/304 (63.49%), Postives = 237/304 (77.96%), Query Frame = 0

Query: 11  QPCRTSALRGHDYVIKLLNGNDSRCFDCFRMKRITFIRFCEDLKSKTNLKSSRYLTVQEK 70
           +PCRTS LRGHDYV+++LNG++ RC   FRMK   FI FCE LK   NLK SRYLT+QE+
Sbjct: 301 RPCRTSMLRGHDYVLEVLNGHEDRCHQNFRMKPQVFIAFCEALKLHANLKHSRYLTLQEQ 360

Query: 71  VVVFLLIISHNESNCIAAERFQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDNISMEIV 130
           V +FLL I HNE N +  ERFQHSGHT S+ F+ VL+ VCKLG  II+PP+ D+I  +I 
Sbjct: 361 VCIFLLTIGHNERNRVVQERFQHSGHTISIYFHRVLKAVCKLGVVIIQPPSFDSIPHQIR 420

Query: 131 SNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMSG 190
            N+KY+PFFKDC+GA DGTH+ AS+  ++QI + G  T TT N+M  CSFDM FTYV+SG
Sbjct: 421 RNAKYFPFFKDCVGAIDGTHISASVPASDQIPYRGKHTVTTQNVMAACSFDMRFTYVLSG 480

Query: 191 WEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRER 250
           WEG+ANDSR+ LEC+ NP   FP     +YY+VD GY+NMPGFL+P+RG+RYHL  FR  
Sbjct: 481 WEGTANDSRVFLECVNNPAMGFPKPPEGKYYVVDSGYTNMPGFLSPYRGERYHLNSFRGH 540

Query: 251 RHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPIACCTIHNY 310
             RP+  EE FN+RHSSLRNVIERCFGVLKARFPILKQM PYS++TQKYIP ACCT+HN+
Sbjct: 541 GQRPKKIEEWFNFRHSSLRNVIERCFGVLKARFPILKQMPPYSVRTQKYIPTACCTVHNW 600

Query: 311 IRLN 315
           IR++
Sbjct: 601 IRMH 604

BLAST of Cmc12g0322371 vs. NCBI nr
Match: XP_028067161.1 (uncharacterized protein LOC114269968 [Camellia sinensis])

HSP 1 Score: 418.3 bits (1074), Expect = 5.8e-113
Identity = 187/306 (61.11%), Postives = 241/306 (78.76%), Query Frame = 0

Query: 11  QPCRTSALRGHDYVIKLLNGNDSRCFDCFRMKRITFIRFCEDLKSKTNLKSSRYLTVQEK 70
           +PCRTS L+GHDYV+++LNG++ R  + FRM+   FI  CE LK    L+ SRYLTVQE+
Sbjct: 377 RPCRTSILQGHDYVLEILNGHERRSKENFRMEPDVFINLCEALKVYGKLEHSRYLTVQEQ 436

Query: 71  VVVFLLIISHNESNCIAAERFQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDNISMEIV 130
           V +FLL I HNE N +  ERFQHSG T S  FN VL+ VC+LG+++IRPP+ D +  EI 
Sbjct: 437 VCIFLLTIGHNERNRVVQERFQHSGQTISKYFNRVLKAVCRLGKQVIRPPDFDEVPAEIR 496

Query: 131 SNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMSG 190
            N ++YPFFKDC+GA DGTH+ A +  +EQI + G  T TT N+MCVCSFDM FTYV++G
Sbjct: 497 HNPRFYPFFKDCVGAIDGTHISARVPASEQIPYRGKHTVTTQNVMCVCSFDMRFTYVLAG 556

Query: 191 WEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRER 250
           WEG+ANDSR+ +E + +P N FPM   D+YY+VD GY+NMPGFL+P+RG+RYHL +FR +
Sbjct: 557 WEGTANDSRVFIETVHDPTNLFPMPPTDKYYVVDSGYTNMPGFLSPYRGERYHLNEFRNQ 616

Query: 251 RHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPIACCTIHNY 310
           R +PRN++++FNYRHSSLRNVIERCFGVLKARFPIL+ M PYS KTQ+YI IACCTIHN+
Sbjct: 617 RRQPRNKKQMFNYRHSSLRNVIERCFGVLKARFPILRDMPPYSTKTQRYIHIACCTIHNW 676

Query: 311 IRLNGR 317
           IR++ +
Sbjct: 677 IRIHSQ 682

BLAST of Cmc12g0322371 vs. NCBI nr
Match: XP_028100667.1 (uncharacterized protein LOC114300013 [Camellia sinensis] >XP_028120812.1 uncharacterized protein LOC114318158 [Camellia sinensis])

HSP 1 Score: 417.2 bits (1071), Expect = 1.3e-112
Identity = 187/306 (61.11%), Postives = 240/306 (78.43%), Query Frame = 0

Query: 11  QPCRTSALRGHDYVIKLLNGNDSRCFDCFRMKRITFIRFCEDLKSKTNLKSSRYLTVQEK 70
           +PCRTS L+GHDYV+++LNG++ R  + FRM+   FI  CE LK    L+ SRYLTVQE+
Sbjct: 39  RPCRTSILQGHDYVLEILNGHERRSKENFRMEPDVFINLCEALKVYGKLEHSRYLTVQEQ 98

Query: 71  VVVFLLIISHNESNCIAAERFQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDNISMEIV 130
           V +FLL I HNE N +  ERFQHSG T S  FN VL+ VC+LG+++IRPP+ D +  EI 
Sbjct: 99  VCIFLLTIGHNERNRVVQERFQHSGQTISKYFNRVLKAVCRLGKQVIRPPDFDEVPAEIR 158

Query: 131 SNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMSG 190
            N ++YPFFKDC+GA DGTH+ A +  +EQI + G  T TT N+MCVCSFDM FTYV++G
Sbjct: 159 HNPRFYPFFKDCVGAIDGTHISARVPASEQIPYRGKHTVTTQNVMCVCSFDMRFTYVLAG 218

Query: 191 WEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRER 250
           WEG+ANDSR+ +E + +P N FPM   D+YY+VD GY+NMPGFL+P+RG+RYHL +FR +
Sbjct: 219 WEGTANDSRVFIETVHDPTNLFPMPPTDKYYVVDSGYTNMPGFLSPYRGERYHLNEFRNQ 278

Query: 251 RHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPIACCTIHNY 310
           R +PRN++++FNYRHSSLRNVIERCFGVLKARFPIL+ M PYS KTQ+YI IACCTIHN+
Sbjct: 279 RRQPRNKKQMFNYRHSSLRNVIERCFGVLKARFPILRDMPPYSTKTQRYIHIACCTIHNW 338

Query: 311 IRLNGR 317
           IR + +
Sbjct: 339 IRTHSQ 344

BLAST of Cmc12g0322371 vs. NCBI nr
Match: XP_028094390.1 (uncharacterized protein LOC114294454 [Camellia sinensis])

HSP 1 Score: 412.1 bits (1058), Expect = 4.2e-111
Identity = 185/306 (60.46%), Postives = 237/306 (77.45%), Query Frame = 0

Query: 11  QPCRTSALRGHDYVIKLLNGNDSRCFDCFRMKRITFIRFCEDLKSKTNLKSSRYLTVQEK 70
           +PCRTS L+GHDYV+++LNG++ R  + FRM+   FI  CE LK    L+ SRYLTVQE+
Sbjct: 39  RPCRTSILQGHDYVLEILNGHERRSKENFRMEPDVFINLCESLKVYGKLEHSRYLTVQEQ 98

Query: 71  VVVFLLIISHNESNCIAAERFQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDNISMEIV 130
           V +FLL I HNE N +  ERFQHSG T S  FN VL+ VC+LG+++IRPP+ D +  EI 
Sbjct: 99  VCIFLLTIGHNERNRVVQERFQHSGQTISKYFNRVLKAVCRLGKQVIRPPDFDEVPAEIR 158

Query: 131 SNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMSG 190
            N ++YPFFKDC+GA DGTH+ A +  +EQI + G  T TT N+M VCSFDM FTYV++G
Sbjct: 159 HNPRFYPFFKDCVGAIDGTHISARVPASEQIPYRGKHTVTTQNVMSVCSFDMRFTYVLAG 218

Query: 191 WEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRER 250
           WEG+ANDSR+ +E + +P N FPM   D+YY+VD GY+NMPGFL+P+RG+RYHL +FR +
Sbjct: 219 WEGTANDSRVFIETVHDPTNLFPMPPTDKYYVVDSGYTNMPGFLSPYRGERYHLNEFRNQ 278

Query: 251 RHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPIACCTIHNY 310
           R +PRN+ ++FNYRH S RNVIERCFGVLKARFPIL+ M PYS KTQ+YIPIACCTIHN+
Sbjct: 279 RRQPRNKNQMFNYRHFSPRNVIERCFGVLKARFPILRDMPPYSTKTQRYIPIACCTIHNW 338

Query: 311 IRLNGR 317
           IR + +
Sbjct: 339 IRTHSQ 344

BLAST of Cmc12g0322371 vs. ExPASy TrEMBL
Match: A0A5D3C7F6 (Protein ALP1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold157G00500 PE=3 SV=1)

HSP 1 Score: 436.8 bits (1122), Expect = 7.7e-119
Identity = 205/229 (89.52%), Postives = 210/229 (91.70%), Query Frame = 0

Query: 88  AERFQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDNISMEIVSNSKYYPFFKDCIGASD 147
           AERFQHSGHT SLAFN VLRKVCKLG EIIRPPNMD ++ +IVSNSKYYPFFKDCIGA D
Sbjct: 21  AERFQHSGHTISLAFNKVLRKVCKLGGEIIRPPNMDTVATKIVSNSKYYPFFKDCIGAID 80

Query: 148 GTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMSGWEGSANDSRILLECIKN 207
           GTHV ASI QNEQI F G KTNTTWNIMCVCSFDMLFTYVMSGWEGSANDSRIL ECIKN
Sbjct: 81  GTHVAASIPQNEQIPFRGRKTNTTWNIMCVCSFDMLFTYVMSGWEGSANDSRILQECIKN 140

Query: 208 PENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRERRHRPRNREEVFNYRHSS 267
           PENKFPM KRDQYYLV+ GYSNMPGFLAPFRGQRYHLRDFRERRHRPR REEVFNYRHSS
Sbjct: 141 PENKFPMPKRDQYYLVNSGYSNMPGFLAPFRGQRYHLRDFRERRHRPRGREEVFNYRHSS 200

Query: 268 LRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPIACCTIHNYIRLNGR 317
           LRNVIERCFGVLKARFPILKQM PY IKTQKYIPI CCT+HNYIRLN R
Sbjct: 201 LRNVIERCFGVLKARFPILKQMPPYPIKTQKYIPITCCTVHNYIRLNDR 249

BLAST of Cmc12g0322371 vs. ExPASy TrEMBL
Match: A0A5A7VNL5 (Putative nuclease HARBI1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold238G001450 PE=3 SV=1)

HSP 1 Score: 373.2 bits (957), Expect = 1.0e-99
Identity = 175/195 (89.74%), Postives = 178/195 (91.28%), Query Frame = 0

Query: 122 MDNISMEIVSNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFD 181
           MD ++MEIVSNSKYYP FKDCIGA DGTHV ASI QNEQI F G KTNTTWNIMCVCSFD
Sbjct: 1   MDTVAMEIVSNSKYYPLFKDCIGAIDGTHVAASIPQNEQIPFRGRKTNTTWNIMCVCSFD 60

Query: 182 MLFTYVMSGWEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQR 241
           MLFTYVMSGWEGSANDSRIL ECIKNPENKFPM KRDQYYLVD GYSNMPGFLAPFRGQR
Sbjct: 61  MLFTYVMSGWEGSANDSRILQECIKNPENKFPMPKRDQYYLVDSGYSNMPGFLAPFRGQR 120

Query: 242 YHLRDFRERRHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIP 301
           YHLRDFRERRHRPR REEVFNYRHSSLRNVIERCFGVLKARFPILKQM PY IKTQKYI 
Sbjct: 121 YHLRDFRERRHRPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPIKTQKYIL 180

Query: 302 IACCTIHNYIRLNGR 317
           I CCT+HNYIRLN R
Sbjct: 181 ITCCTVHNYIRLNDR 195

BLAST of Cmc12g0322371 vs. ExPASy TrEMBL
Match: A0A5D3DC11 (Protein ALP1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold392G00620 PE=3 SV=1)

HSP 1 Score: 369.8 bits (948), Expect = 1.2e-98
Identity = 173/195 (88.72%), Postives = 178/195 (91.28%), Query Frame = 0

Query: 122 MDNISMEIVSNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFD 181
           M+ ++ +IVSNSKYYPFFKDCIGA DGTHV ASI QNEQI F G KTNTTWNIMCVCSFD
Sbjct: 1   MNTVATKIVSNSKYYPFFKDCIGAIDGTHVAASIPQNEQIPFRGRKTNTTWNIMCVCSFD 60

Query: 182 MLFTYVMSGWEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQR 241
           MLFTYVMSGWEGSANDSRIL ECIKNPENKFPM KRDQYYLVD GYSNMPGFLAPFRGQR
Sbjct: 61  MLFTYVMSGWEGSANDSRILQECIKNPENKFPMPKRDQYYLVDSGYSNMPGFLAPFRGQR 120

Query: 242 YHLRDFRERRHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIP 301
           YHLRDFRERRHRPR REEVFNYRHSSLRNVIERCFGVLKARFPILKQM PY IKTQKYI 
Sbjct: 121 YHLRDFRERRHRPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPIKTQKYIL 180

Query: 302 IACCTIHNYIRLNGR 317
           I CCT+HNYIRLN R
Sbjct: 181 ITCCTVHNYIRLNDR 195

BLAST of Cmc12g0322371 vs. ExPASy TrEMBL
Match: A0A0B2SJL0 (Putative nuclease HARBI1 (Fragment) OS=Glycine soja OX=3848 GN=glysoja_043532 PE=3 SV=1)

HSP 1 Score: 358.2 bits (918), Expect = 3.5e-95
Identity = 171/304 (56.25%), Postives = 215/304 (70.72%), Query Frame = 0

Query: 10  KQPCRTSALRGHDYVIKLLNGNDSRCFDCFRMKRITFIRFCEDLKSKTNLKSSRYLTVQE 69
           K PCRTS L G  Y IK L G+++RC++ F MK+  F+ FCE LK   NL   + ++++E
Sbjct: 3   KTPCRTSMLTGKMYTIKFLIGHETRCYENFWMKKYVFMNFCETLKEVANLCDGKKVSIKE 62

Query: 70  KVVVFLLIISHNESNCIAAERFQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDNISMEI 129
            + +FL+II HN  + + AERFQHS HT S  F ++L+ VCKLG  II   N  +    I
Sbjct: 63  AIAMFLIIICHNLRHRVVAERFQHSLHTVSKWFRIILKAVCKLGTNIIHQRNQTSTHPHI 122

Query: 130 VSNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMS 189
             N KYYP+FKDCIGA DG HV A    ++Q +F G K   T N++ VC FDMLFT+V S
Sbjct: 123 RGNPKYYPYFKDCIGAIDGMHVSAWASASKQAAFRGRKVLVTQNVLVVCDFDMLFTFVYS 182

Query: 190 GWEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRE 249
           GWEG+ANDSR+ L+ + + EN FP    DQ+YL+D G+SNMPG+LAPFR  +YHL DFRE
Sbjct: 183 GWEGTANDSRVFLDAL-SLENNFPKPNGDQFYLIDSGFSNMPGYLAPFRRNKYHLHDFRE 242

Query: 250 RRHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPIACCTIHN 309
              RPR +EE+FNYRHSSLRNVIERCFGVLKARFPILK M  Y I+ Q+ I IACCTIHN
Sbjct: 243 GGGRPRGKEELFNYRHSSLRNVIERCFGVLKARFPILKLMPSYPIRRQRLILIACCTIHN 302

Query: 310 YIRL 314
           +IR+
Sbjct: 303 FIRM 305

BLAST of Cmc12g0322371 vs. ExPASy TrEMBL
Match: A0A4Y7J673 (Uncharacterized protein OS=Papaver somniferum OX=3469 GN=C5167_014815 PE=3 SV=1)

HSP 1 Score: 355.5 bits (911), Expect = 2.3e-94
Identity = 162/307 (52.77%), Postives = 218/307 (71.01%), Query Frame = 0

Query: 10  KQPCRTSALRGHDYVIKLLNGNDSRCFDCFRMKRITFIRFCEDLKSKTNLKSSRYLTVQE 69
           K P  TS L G +++ +LLNG+  R ++  RM   TF+  C  L++   L+  R ++V+E
Sbjct: 225 KIPMMTSVLSGREFIFELLNGHPRRMYNLMRMDPSTFMLLCSTLRTNDFLQDDRSVSVEE 284

Query: 70  KVVVFLLIISHNESNCIAAERFQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDNISMEI 129
            V +FL  +S +  N + AE FQHS  T    F  VL+ +C+LG  II+PPNMD +  EI
Sbjct: 285 AVGIFLATVSQSMRNRVVAEMFQHSNETVYRHFKKVLKALCRLGCLIIKPPNMDEVPPEI 344

Query: 130 VSNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMS 189
           ++N K+YP+F DC+GA DGTH+ A +  ++QI F G K   T NIMC CSFDMLFT+V +
Sbjct: 345 MTNPKFYPWFVDCVGAIDGTHISACVPASKQIPFRGRKAQITQNIMCACSFDMLFTFVYT 404

Query: 190 GWEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRE 249
           GWEG+AND+R+L++ I N ENKFPM +  +YY+VD  Y+NMPGFL P+RG+RYHLRDFR 
Sbjct: 405 GWEGTANDARVLMDAISNEENKFPMPREGRYYVVDSAYTNMPGFLTPYRGERYHLRDFRG 464

Query: 250 RRHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPIACCTIHN 309
           R  + +   E+FN+RHSSLRNVIERCFGV K+RFPILK M  Y ++ Q+ IP+ACCT+HN
Sbjct: 465 RSRQAKGPMELFNHRHSSLRNVIERCFGVWKSRFPILKCMPNYPLRRQRLIPVACCTLHN 524

Query: 310 YIRLNGR 317
           +IRLN R
Sbjct: 525 FIRLNSR 531

BLAST of Cmc12g0322371 vs. TAIR 10
Match: AT5G41980.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 205.7 bits (522), Expect = 5.5e-53
Identity = 107/304 (35.20%), Postives = 168/304 (55.26%), Query Frame = 0

Query: 10  KQPCRTSALRGHDYVIKLLNGNDSRCFDCFRMKRITFIRFCEDLKSKTNLKSSRYLTVQE 69
           K+  + S   G+ +V ++LNG + +CF+ FRM +  F + C+ L+++  L+ +  + ++ 
Sbjct: 17  KEVSKISISDGNKFVYQILNGPNEQCFENFRMDKPVFYKLCDLLQTRGLLRHTNRIKIEA 76

Query: 70  KVVVFLLIISHNESNCIAAERFQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDNISMEI 129
           ++ +FL II HN       E F +SG T S  FN VL  V  + ++  +P    N + + 
Sbjct: 77  QLAIFLFIIGHNLRTRAVQELFCYSGETISRHFNNVLNAVIAISKDFFQP----NSNSDT 136

Query: 130 VSNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMS 189
           + N    P+FKDC+G  D  H+   +  +EQ  F       T N++   SFD+ F YV++
Sbjct: 137 LENDD--PYFKDCVGVVDSFHIPVMVGVDEQGPFRNGNGLLTQNVLAASSFDLRFNYVLA 196

Query: 190 GWEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRE 249
           GWEGSA+D ++L   +    NK  +  + +YY+VD  Y N+PGF+AP+ G   + R+   
Sbjct: 197 GWEGSASDQQVLNAALTR-RNKLQV-PQGKYYIVDNKYPNLPGFIAPYHGVSTNSRE--- 256

Query: 250 RRHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPIACCTIHN 309
                   +E+FN RH  L   I R FG LK RFPIL    PY ++TQ  + IA C +HN
Sbjct: 257 ------EAKEMFNERHKLLHRAIHRTFGALKERFPILLSAPPYPLQTQVKLVIAACALHN 303

Query: 310 YIRL 314
           Y+RL
Sbjct: 317 YVRL 303

BLAST of Cmc12g0322371 vs. TAIR 10
Match: AT1G43722.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G28730.1); Has 924 Blast hits to 912 proteins in 109 species: Archae - 0; Bacteria - 0; Metazoa - 222; Fungi - 31; Plants - 661; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 152.9 bits (385), Expect = 4.3e-37
Identity = 89/261 (34.10%), Postives = 134/261 (51.34%), Query Frame = 0

Query: 28  LNGNDSRCFDCFRMKRITFIRFCEDLKSKTNLKSSRYLTVQEKVVVFLLIISHNESNCIA 87
           L  + + C    RM    F   C  L++  +L+ +  ++++E V +FL I  HNE     
Sbjct: 59  LQQDAAACLQLLRMSLPCFTTLCNMLQTNYDLQPTLNISIEESVAMFLRICGHNEVYRDV 118

Query: 88  AERFQHSGHTFSLAFNLVLRKVCKLGREIIRPP---NMDNISMEIVSNSKYYPFFKDCIG 147
             RF  +  T    F  VL     L  + IR P    +  I   +  + +Y+P+F   +G
Sbjct: 119 GLRFGRNQETVQRKFREVLTATELLACDYIRTPTRQELYRIPERLQVDQRYWPYFSGFVG 178

Query: 148 ASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMSGWEGSANDSRILLEC 207
           A DGTHV   +  + Q  +     N + NIM +C   MLFTY+ +G  GS  D+ + L+ 
Sbjct: 179 AMDGTHVCVKVKPDLQGMYWNRHDNASLNIMAICDLKMLFTYIWNGAPGSCYDTAV-LQI 238

Query: 208 IKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQ-----RYHLRDFRERRHRPRNREE 267
            +  +++FP+   ++YYLVD GY N  G LAP+R       RYH+  F     RPRN+ E
Sbjct: 239 AQQSDSEFPLPPSEKYYLVDSGYPNKQGLLAPYRSSRNRVVRYHMSQF-YYGPRPRNKHE 298

Query: 268 VFNYRHSSLRNVIERCFGVLK 281
           +FN  H+SLR+VIER F + K
Sbjct: 299 LFNQCHTSLRSVIERTFRIWK 317

BLAST of Cmc12g0322371 vs. TAIR 10
Match: AT5G35695.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41980.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 132.1 bits (331), Expect = 7.8e-31
Identity = 62/130 (47.69%), Postives = 85/130 (65.38%), Query Frame = 0

Query: 183 LFTYVMSGWEGSANDSRILLECIKNPENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRY 242
           +F YV+SGWEGSA+DSR+L + ++            ++YLVD G++N   FLAPFRG RY
Sbjct: 24  IFIYVLSGWEGSAHDSRVLSDALR------------KFYLVDCGFANRLNFLAPFRGVRY 83

Query: 243 HLRDFRERRHRPRNREEVFNYRHSSLRNVIERCFGVLKARFPILKQMSPYSIKTQKYIPI 302
           HL++F  +R  P    E+FN RH SLRNVIER FG+ K+RF I K   P+S K Q  + +
Sbjct: 84  HLQEFAGQRRDPETPHELFNLRHVSLRNVIERIFGIFKSRFAIFKSAPPFSYKKQAGLVL 141

Query: 303 ACCTIHNYIR 313
            C  +HN++R
Sbjct: 144 TCAALHNFLR 141

BLAST of Cmc12g0322371 vs. TAIR 10
Match: AT5G28730.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 496 Blast hits to 496 proteins in 68 species: Archae - 0; Bacteria - 0; Metazoa - 3; Fungi - 23; Plants - 470; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 116.3 bits (290), Expect = 4.4e-26
Identity = 73/244 (29.92%), Postives = 116/244 (47.54%), Query Frame = 0

Query: 31  NDSRCFDCFRMKRITFIRFCEDLKSKTNLKSSRYLTVQEKVVVFLLIISHNESNCIAAER 90
           N+  C    RM    F + CE L  K  L+SS  +++ E V +FL+I + N++    A R
Sbjct: 20  NEVSCQTLIRMSSEAFTQLCEILHGKYGLQSSTNISLDESVAIFLIICASNDTQRDIALR 79

Query: 91  FQHSGHTFSLAFNLVLRKVCKLGREIIRPPNMDN---ISMEIVSNSKYYPFFKDCIGASD 150
           F H+  T    F+ VL+ + +L  E IRP  ++    IS  +  +++Y+PF  D +G + 
Sbjct: 80  FGHAQETIWRKFHDVLKAMERLAVEYIRPRKVEELRAISNRLQDDTRYWPFLMDLLGIA- 139

Query: 151 GTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLFTYVMSGWEGSANDSRILLECIKN 210
                                  ++N++ +C  DMLFTY   G  GS +D+R+L   I +
Sbjct: 140 -----------------------SFNVLAICDLDMLFTYCFVGMAGSTHDARVLSAAISD 199

Query: 211 PENKFPMSKRDQYYLVDLGYSNMPGFLAPFRGQRYHLRDFRERRHRPRNREEVFNYRHSS 270
            +  F +    +YYLVD GY+N  G+LAP+R +    +D         N  E  N +   
Sbjct: 200 -DPLFHVPPDSKYYLVDSGYANKRGYLAPYRREHREAQDIISNNFLTVNLFETHNIKDYD 238

Query: 271 LRNV 272
             NV
Sbjct: 260 FDNV 238

BLAST of Cmc12g0322371 vs. TAIR 10
Match: AT5G28950.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41980.1); Has 448 Blast hits to 446 proteins in 74 species: Archae - 0; Bacteria - 0; Metazoa - 31; Fungi - 21; Plants - 396; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 95.5 bits (236), Expect = 8.1e-20
Identity = 37/95 (38.95%), Postives = 63/95 (66.32%), Query Frame = 0

Query: 125 ISMEIVSNSKYYPFFKDCIGASDGTHVVASILQNEQISFPGNKTNTTWNIMCVCSFDMLF 184
           +  +I  +++ YP+FKDC+GA D TH+ A + Q +  SF   K + + N++  C+FD+ F
Sbjct: 8   VPRKIRESTRLYPYFKDCVGAIDDTHIFAMVSQKKMPSFRNRKGDISQNMLAACNFDVEF 67

Query: 185 TYVMSGWEGSANDSRILLECIKNPENKFPMSKRDQ 220
            YV+SGWEGSA+DS++L + +    N+ P+ + D+
Sbjct: 68  MYVLSGWEGSAHDSKVLNDALTRNSNRLPVPEEDE 102

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
TYK06269.11.6e-11889.52protein ALP1-like [Cucumis melo var. makuwa][more]
KAF7123090.11.4e-11463.49hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii][more]
XP_028067161.15.8e-11361.11uncharacterized protein LOC114269968 [Camellia sinensis][more]
XP_028100667.11.3e-11261.11uncharacterized protein LOC114300013 [Camellia sinensis] >XP_028120812.1 unchara... [more]
XP_028094390.14.2e-11160.46uncharacterized protein LOC114294454 [Camellia sinensis][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D3C7F67.7e-11989.52Protein ALP1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold157G00... [more]
A0A5A7VNL51.0e-9989.74Putative nuclease HARBI1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6... [more]
A0A5D3DC111.2e-9888.72Protein ALP1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold392G00... [more]
A0A0B2SJL03.5e-9556.25Putative nuclease HARBI1 (Fragment) OS=Glycine soja OX=3848 GN=glysoja_043532 PE... [more]
A0A4Y7J6732.3e-9452.77Uncharacterized protein OS=Papaver somniferum OX=3469 GN=C5167_014815 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G41980.15.5e-5335.20CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int... [more]
AT1G43722.14.3e-3734.10unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G35695.17.8e-3147.69CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int... [more]
AT5G28730.14.4e-2629.92unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G28950.18.1e-2038.95unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR027806Harbinger transposase-derived nuclease domainPFAMPF13359DDE_Tnp_4coord: 147..309
e-value: 6.8E-23
score: 81.1
NoneNo IPR availablePANTHERPTHR22930:SF212NUCLEASE HARBI1 ISOFORM X1-RELATEDcoord: 9..312
NoneNo IPR availablePANTHERPTHR22930UNCHARACTERIZEDcoord: 9..312

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cmc12g0322371.1Cmc12g0322371.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005634 nucleus
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0046872 metal ion binding