CaUC04G076540 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC04G076540
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: DDE Tnp4 domain-containing protein
Location: Ciama_Chr04: 25863616 .. 25864995 (-)
RNA-Seq Expression: CaUC04G076540
Synteny: CaUC04G076540
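
The Location field packs the sequence name, start, end, and strand into one string. The following is a minimal sketch of parsing it, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state). The names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

# Pattern for strings like "Ciama_Chr04: 25863616 .. 25864995 (-)".
LOCATION_RE = re.compile(r"^([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Ciama_Chr04: 25863616 .. 25864995 (-)")
gene_length = span_length(start, end)
```

Under that assumption the gene spans 1380 bp on the minus strand of Ciama_Chr04.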
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGAGTTCTGATGATGAAAAGGATGGAACTTATGGGAAATATGTTCCAAGAGAACCGAGTCATAATCTAGTATCTAATGGTGCAAAATTTGTAGATGAAGTACTCAATGGACAAAATGAACGTTGTTTAGAACATTTCCGCATGGACAAGCACATATTTTATAAGTTGTGTGATATTTTGCAAGCCAAAGGCTTACTGCGTCATACAAACCGCATTAAGATTGAAGAGCAACTAGCCATATTCATGTTTATTATTGGTCACAATCTTAGGACACGAGCAGTTCAAGAGTTGTTCAGATATTCAGGAGAAACAATAAGTCGCCATTTTAACAATGTATTGAATGCAATTATGGCAATATCACTGGACTTCTTTCAACCTCCAGGATCTAATGTTCCTCCAGAAATTTCACAAGATCCCAGATTTTATCCCTACTTTAAGGTAGCAGGGTGGTTTTAGTTTGACTAGTGTTTTCCTATAATGTGACTTACCCTCTCATAAACTTATCTTATTTTAGGATTGTGTGGGGGCAATTGATGGCATACACATCCCTGTGATGGTTGGTGTTGATGAGCAAGGGCCTTTTCGAAATAAGAATGGACTACTTTCTCAAATTGTTTTGGCAGCTTGCTCATTTGACCTCAAGTTCCATTACGTTCTAGCAGGATGGGAAGGATCGGCATCCGATTTGCAGGTTCTGAACTCAGCACTTACTAGGCGAAACAAACTACATGTTCCCGAAGGTGAGTGTATTCTAAGAGGATAATCATGAATCTCTTAGTTTTAGTGTGCCTAGTATAGTAGAAATGGTATTTGACTGGTCAGCATGTTGCTCTCTGGACATTTTTGGGCTGATATTGTGGAATTGCAGGTAAATACTACCTTGTGGACCAAAAATATATGAACATGCCTGGTTTTATTGCACCTTATCATGATATCCCCTATCATTCAAAGGAATATCCTGGTGGTTATCATCCGCAAGATGCCAAAGAGCTATTTAATCTACGGCATTCATTGTTGCGCAATGCAACTGATAGAACTTTTGGAGTTCTAAAAGCGCGCTTCCCCATACTATTGTCAGCTCCTCCTTACCCATTACAGACACAAGTTAAGTTGGTCGTTGCGACATGTGCAATTCACAATTACATTCGGAGGGAGAATCCTGACGATTGGCTCTTTAGATTATATGAACATGACCATGTTCCACATATGGAGGATTCATTGCCTCAATTGGACGCAGAACAGTTGACAACACAGATTGAGACTCCAATTGTGGACATTGCTTTTGAGACGGGAGAACTAGAAATTACATCACAGTTACGGGATACTATTGCAGCTGAATTGTGGAGTGACTACATTAATGATATATCACCAATGTAA

mRNA sequence

ATGGAGAGTTCTGATGATGAAAAGGATGGAACTTATGGGAAATATGTTCCAAGAGAACCGAGTCATAATCTAGTATCTAATGGTGCAAAATTTGTAGATGAAGTACTCAATGGACAAAATGAACGTTGTTTAGAACATTTCCGCATGGACAAGCACATATTTTATAAGTTGTGTGATATTTTGCAAGCCAAAGGCTTACTGCGTCATACAAACCGCATTAAGATTGAAGAGCAACTAGCCATATTCATGTTTATTATTGGTCACAATCTTAGGACACGAGCAGTTCAAGAGTTGTTCAGATATTCAGGAGAAACAATAAGTCGCCATTTTAACAATGTATTGAATGCAATTATGGCAATATCACTGGACTTCTTTCAACCTCCAGGATCTAATGTTCCTCCAGAAATTTCACAAGATCCCAGATTTTATCCCTACTTTAAGGATTGTGTGGGGGCAATTGATGGCATACACATCCCTGTGATGGTTGGTGTTGATGAGCAAGGGCCTTTTCGAAATAAGAATGGACTACTTTCTCAAATTGTTTTGGCAGCTTGCTCATTTGACCTCAAGTTCCATTACGTTCTAGCAGGATGGGAAGGATCGGCATCCGATTTGCAGGTTCTGAACTCAGCACTTACTAGGCGAAACAAACTACATGTTCCCGAAGGTAAATACTACCTTGTGGACCAAAAATATATGAACATGCCTGGTTTTATTGCACCTTATCATGATATCCCCTATCATTCAAAGGAATATCCTGGTGGTTATCATCCGCAAGATGCCAAAGAGCTATTTAATCTACGGCATTCATTGTTGCGCAATGCAACTGATAGAACTTTTGGAGTTCTAAAAGCGCGCTTCCCCATACTATTGTCAGCTCCTCCTTACCCATTACAGACACAAGTTAAGTTGGTCGTTGCGACATGTGCAATTCACAATTACATTCGGAGGGAGAATCCTGACGATTGGCTCTTTAGATTATATGAACATGACCATGTTCCACATATGGAGGATTCATTGCCTCAATTGGACGCAGAACAGTTGACAACACAGATTGAGACTCCAATTGTGGACATTGCTTTTGAGACGGGAGAACTAGAAATTACATCACAGTTACGGGATACTATTGCAGCTGAATTGTGGAGTGACTACATTAATGATATATCACCAATGTAA

Coding sequence (CDS)

ATGGAGAGTTCTGATGATGAAAAGGATGGAACTTATGGGAAATATGTTCCAAGAGAACCGAGTCATAATCTAGTATCTAATGGTGCAAAATTTGTAGATGAAGTACTCAATGGACAAAATGAACGTTGTTTAGAACATTTCCGCATGGACAAGCACATATTTTATAAGTTGTGTGATATTTTGCAAGCCAAAGGCTTACTGCGTCATACAAACCGCATTAAGATTGAAGAGCAACTAGCCATATTCATGTTTATTATTGGTCACAATCTTAGGACACGAGCAGTTCAAGAGTTGTTCAGATATTCAGGAGAAACAATAAGTCGCCATTTTAACAATGTATTGAATGCAATTATGGCAATATCACTGGACTTCTTTCAACCTCCAGGATCTAATGTTCCTCCAGAAATTTCACAAGATCCCAGATTTTATCCCTACTTTAAGGATTGTGTGGGGGCAATTGATGGCATACACATCCCTGTGATGGTTGGTGTTGATGAGCAAGGGCCTTTTCGAAATAAGAATGGACTACTTTCTCAAATTGTTTTGGCAGCTTGCTCATTTGACCTCAAGTTCCATTACGTTCTAGCAGGATGGGAAGGATCGGCATCCGATTTGCAGGTTCTGAACTCAGCACTTACTAGGCGAAACAAACTACATGTTCCCGAAGGTAAATACTACCTTGTGGACCAAAAATATATGAACATGCCTGGTTTTATTGCACCTTATCATGATATCCCCTATCATTCAAAGGAATATCCTGGTGGTTATCATCCGCAAGATGCCAAAGAGCTATTTAATCTACGGCATTCATTGTTGCGCAATGCAACTGATAGAACTTTTGGAGTTCTAAAAGCGCGCTTCCCCATACTATTGTCAGCTCCTCCTTACCCATTACAGACACAAGTTAAGTTGGTCGTTGCGACATGTGCAATTCACAATTACATTCGGAGGGAGAATCCTGACGATTGGCTCTTTAGATTATATGAACATGACCATGTTCCACATATGGAGGATTCATTGCCTCAATTGGACGCAGAACAGTTGACAACACAGATTGAGACTCCAATTGTGGACATTGCTTTTGAGACGGGAGAACTAGAAATTACATCACAGTTACGGGATACTATTGCAGCTGAATTGTGGAGTGACTACATTAATGATATATCACCAATGTAA

Protein sequence

MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQIVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFIAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDIAFETGELEITSQLRDTIAAELWSDYINDISPM
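
The protein sequence corresponds to a frame-0 translation of the CDS. The sketch below illustrates this with the standard genetic code, assuming that is the table that applies to this nuclear gene. `translate` is a hypothetical helper, and only the first and last codons of the CDS are used as input.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last five codons of the CDS shown above.
n_term = translate("ATGGAGAGTTCTGATGATGAAAAG")
c_term = translate("ATATCACCAATGTAA")
```

Here `n_term` and `c_term` reproduce the first residues (MESSDDEK) and last residues (ISPM) of the protein sequence.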
Homology
BLAST of CaUC04G076540 vs. NCBI nr
Match: XP_038895429.1 (putative nuclease HARBI1 isoform X1 [Benincasa hispida])

HSP 1 Score: 786.9 bits (2031), Expect = 7.7e-224
Identity = 382/392 (97.45%), Positives = 385/392 (98.21%), Query Frame = 0

Query: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDI 60
           MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCL++FRMDKHIFYKLCDI
Sbjct: 68  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDYFRMDKHIFYKLCDI 127

Query: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120
           LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI
Sbjct: 128 LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 187

Query: 121 SLDFFQPPGSNV-PPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQ 180
           SLDFFQPPGSNV PPEI +DPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNK GLLSQ
Sbjct: 188 SLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKKGLLSQ 247

Query: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI 240
           IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI
Sbjct: 248 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI 307

Query: 241 APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQ 300
           APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFG LKARFPILLSAPPYPLQ
Sbjct: 308 APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQ 367

Query: 301 TQVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDI 360
           TQVKLVVATCAIHNYIRRENPDDWLFRLYE DHVPHMEDSLPQLDAEQLTT IETPIVDI
Sbjct: 368 TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDHVPHMEDSLPQLDAEQLTTHIETPIVDI 427

Query: 361 AFETGELEITSQLRDTIAAELWSDYINDISPM 392
           AFET ELEITSQLRDTIAAELWSDYINDISPM
Sbjct: 428 AFETEELEITSQLRDTIAAELWSDYINDISPM 459
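
In these alignments the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The following is a small sketch of how the Identity and Positives figures relate to that line. `midline_counts` and `percent` are hypothetical helpers, not BLAST functions.

```python
def midline_counts(midline):
    """Return (identities, positives, columns) for one BLAST midline string."""
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, identities + conservative, len(midline)

def percent(part, total):
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * part / total, 2)

# Midline of the first alignment row (query residues 1-60) shown above.
row = "MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCL++FRMDKHIFYKLCDI"
row_counts = midline_counts(row)

# Totals reported in the HSP header: 382 identities, 385 positives, 392 columns.
identity_pct = percent(382, 392)
positive_pct = percent(385, 392)
```

Summing the per-row counts over all rows of an HSP gives the totals in its header. The 392 columns in the denominator include the single gap column in the query.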

BLAST of CaUC04G076540 vs. NCBI nr
Match: XP_038895430.1 (putative nuclease HARBI1 isoform X2 [Benincasa hispida] >XP_038895431.1 putative nuclease HARBI1 isoform X2 [Benincasa hispida] >XP_038895432.1 putative nuclease HARBI1 isoform X2 [Benincasa hispida])

HSP 1 Score: 786.9 bits (2031), Expect = 7.7e-224
Identity = 382/392 (97.45%), Positives = 385/392 (98.21%), Query Frame = 0

Query: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDI 60
           MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCL++FRMDKHIFYKLCDI
Sbjct: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDYFRMDKHIFYKLCDI 60

Query: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120
           LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI
Sbjct: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120

Query: 121 SLDFFQPPGSNV-PPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQ 180
           SLDFFQPPGSNV PPEI +DPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNK GLLSQ
Sbjct: 121 SLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKKGLLSQ 180

Query: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI 240
           IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI
Sbjct: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI 240

Query: 241 APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQ 300
           APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFG LKARFPILLSAPPYPLQ
Sbjct: 241 APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQ 300

Query: 301 TQVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDI 360
           TQVKLVVATCAIHNYIRRENPDDWLFRLYE DHVPHMEDSLPQLDAEQLTT IETPIVDI
Sbjct: 301 TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDHVPHMEDSLPQLDAEQLTTHIETPIVDI 360

Query: 361 AFETGELEITSQLRDTIAAELWSDYINDISPM 392
           AFET ELEITSQLRDTIAAELWSDYINDISPM
Sbjct: 361 AFETEELEITSQLRDTIAAELWSDYINDISPM 392

BLAST of CaUC04G076540 vs. NCBI nr
Match: XP_016899554.1 (PREDICTED: uncharacterized protein LOC103502878 [Cucumis melo])

HSP 1 Score: 773.1 bits (1995), Expect = 1.2e-219
Identity = 372/392 (94.90%), Positives = 380/392 (96.94%), Query Frame = 0

Query: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDI 60
           MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKH+FYKLCDI
Sbjct: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHVFYKLCDI 60

Query: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120
           LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI
Sbjct: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120

Query: 121 SLDFFQPPGSNV-PPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQ 180
           SLDFFQPPGSNV PPEI +DPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRN NG LSQ
Sbjct: 121 SLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNTNGQLSQ 180

Query: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI 240
           IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+
Sbjct: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFV 240

Query: 241 APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQ 300
           APYHDI YHSKEYPGGYHPQDAKELFNLRHSLLRNAT+RTFG LKARFPILLSAPPYPLQ
Sbjct: 241 APYHDITYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKARFPILLSAPPYPLQ 300

Query: 301 TQVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDI 360
           TQVKLVVATCAIHNYIRRENPDDW FRLYE DHVPHMEDSLPQL+AEQLT  IETPIVD+
Sbjct: 301 TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDV 360

Query: 361 AFETGELEITSQLRDTIAAELWSDYINDISPM 392
           AFET ELEI SQLRD+IAAE+WSDYINDISPM
Sbjct: 361 AFETEELEIASQLRDSIAAEIWSDYINDISPM 392

BLAST of CaUC04G076540 vs. NCBI nr
Match: XP_004137507.1 (putative nuclease HARBI1 isoform X1 [Cucumis sativus] >KAE8652559.1 hypothetical protein Csa_014371 [Cucumis sativus])

HSP 1 Score: 768.5 bits (1983), Expect = 2.8e-218
Identity = 370/392 (94.39%), Positives = 379/392 (96.68%), Query Frame = 0

Query: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDI 60
           MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCL+HFRMDKH+FYKLCDI
Sbjct: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDI 60

Query: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120
           LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI
Sbjct: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120

Query: 121 SLDFFQPPGSNV-PPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQ 180
           SLDFFQPPGSNV PPEI +DPRFYPYFKDCVG IDGIHIPVMVGVDEQGPFRNKNG LSQ
Sbjct: 121 SLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQ 180

Query: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI 240
           IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+
Sbjct: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFV 240

Query: 241 APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQ 300
           APYHDI Y SKEYPGGYHPQDAKELFNLRHSLLRNAT+RTF  LKARFPILLSAPPYPLQ
Sbjct: 241 APYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ 300

Query: 301 TQVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDI 360
           TQVKLVVATCAIHNYIRRENPDDW FRLYE DHVPHMEDSLPQL+AEQLT  IETPIVD+
Sbjct: 301 TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDV 360

Query: 361 AFETGELEITSQLRDTIAAELWSDYINDISPM 392
           AFET ELEITSQLRD+IAAE+WSDYINDISPM
Sbjct: 361 AFETEELEITSQLRDSIAAEIWSDYINDISPM 392

BLAST of CaUC04G076540 vs. NCBI nr
Match: XP_022924205.1 (putative nuclease HARBI1 isoform X2 [Cucurbita moschata] >XP_022924206.1 putative nuclease HARBI1 isoform X2 [Cucurbita moschata])

HSP 1 Score: 765.4 bits (1975), Expect = 2.4e-217
Identity = 369/391 (94.37%), Positives = 376/391 (96.16%), Query Frame = 0

Query: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDI 60
           MESSDDEKDG+YGKYVPREPSHNLV+NGAKFVDEVLNGQNERCLE+FRMDKHIFYKLCDI
Sbjct: 1   MESSDDEKDGSYGKYVPREPSHNLVTNGAKFVDEVLNGQNERCLENFRMDKHIFYKLCDI 60

Query: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120
           LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI
Sbjct: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120

Query: 121 SLDFFQPPGSNVPPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQI 180
           SLDFFQPPGSNVPPEI  DPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQ 
Sbjct: 121 SLDFFQPPGSNVPPEILDDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQN 180

Query: 181 VLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFIA 240
           VLAACSFDLKFHYVLAGWEGSA+DLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFIA
Sbjct: 181 VLAACSFDLKFHYVLAGWEGSATDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFIA 240

Query: 241 PYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQT 300
           PYHDIPY S+EY GGYHPQDAKELFNLRHSLLRNATDRTFG LK RFPILLSAPPYPLQT
Sbjct: 241 PYHDIPYQSREYTGGYHPQDAKELFNLRHSLLRNATDRTFGALKVRFPILLSAPPYPLQT 300

Query: 301 QVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDIA 360
           QVKLVVATCAIHNYIRRENPDDWLF+LYE DHV HMEDSLPQL+AEQLT  IETP VDIA
Sbjct: 301 QVKLVVATCAIHNYIRRENPDDWLFKLYEQDHVSHMEDSLPQLEAEQLTAHIETPTVDIA 360

Query: 361 FETGELEITSQLRDTIAAELWSDYINDISPM 392
           FET ELEITSQLRD IA ELWSDYINDISPM
Sbjct: 361 FETEELEITSQLRDAIATELWSDYINDISPM 391

BLAST of CaUC04G076540 vs. ExPASy TrEMBL
Match: A0A1S4DU98 (uncharacterized protein LOC103502878 OS=Cucumis melo OX=3656 GN=LOC103502878 PE=3 SV=1)

HSP 1 Score: 773.1 bits (1995), Expect = 5.6e-220
Identity = 372/392 (94.90%), Positives = 380/392 (96.94%), Query Frame = 0

Query: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDI 60
           MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKH+FYKLCDI
Sbjct: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHVFYKLCDI 60

Query: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120
           LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI
Sbjct: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120

Query: 121 SLDFFQPPGSNV-PPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQ 180
           SLDFFQPPGSNV PPEI +DPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRN NG LSQ
Sbjct: 121 SLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNTNGQLSQ 180

Query: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI 240
           IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+
Sbjct: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFV 240

Query: 241 APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQ 300
           APYHDI YHSKEYPGGYHPQDAKELFNLRHSLLRNAT+RTFG LKARFPILLSAPPYPLQ
Sbjct: 241 APYHDITYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKARFPILLSAPPYPLQ 300

Query: 301 TQVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDI 360
           TQVKLVVATCAIHNYIRRENPDDW FRLYE DHVPHMEDSLPQL+AEQLT  IETPIVD+
Sbjct: 301 TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDV 360

Query: 361 AFETGELEITSQLRDTIAAELWSDYINDISPM 392
           AFET ELEI SQLRD+IAAE+WSDYINDISPM
Sbjct: 361 AFETEELEIASQLRDSIAAEIWSDYINDISPM 392

BLAST of CaUC04G076540 vs. ExPASy TrEMBL
Match: A0A6J1EE58 (putative nuclease HARBI1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111431723 PE=3 SV=1)

HSP 1 Score: 765.4 bits (1975), Expect = 1.2e-217
Identity = 369/391 (94.37%), Positives = 376/391 (96.16%), Query Frame = 0

Query: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDI 60
           MESSDDEKDG+YGKYVPREPSHNLV+NGAKFVDEVLNGQNERCLE+FRMDKHIFYKLCDI
Sbjct: 1   MESSDDEKDGSYGKYVPREPSHNLVTNGAKFVDEVLNGQNERCLENFRMDKHIFYKLCDI 60

Query: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120
           LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI
Sbjct: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120

Query: 121 SLDFFQPPGSNVPPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQI 180
           SLDFFQPPGSNVPPEI  DPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQ 
Sbjct: 121 SLDFFQPPGSNVPPEILDDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQN 180

Query: 181 VLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFIA 240
           VLAACSFDLKFHYVLAGWEGSA+DLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFIA
Sbjct: 181 VLAACSFDLKFHYVLAGWEGSATDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFIA 240

Query: 241 PYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQT 300
           PYHDIPY S+EY GGYHPQDAKELFNLRHSLLRNATDRTFG LK RFPILLSAPPYPLQT
Sbjct: 241 PYHDIPYQSREYTGGYHPQDAKELFNLRHSLLRNATDRTFGALKVRFPILLSAPPYPLQT 300

Query: 301 QVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDIA 360
           QVKLVVATCAIHNYIRRENPDDWLF+LYE DHV HMEDSLPQL+AEQLT  IETP VDIA
Sbjct: 301 QVKLVVATCAIHNYIRRENPDDWLFKLYEQDHVSHMEDSLPQLEAEQLTAHIETPTVDIA 360

Query: 361 FETGELEITSQLRDTIAAELWSDYINDISPM 392
           FET ELEITSQLRD IA ELWSDYINDISPM
Sbjct: 361 FETEELEITSQLRDAIATELWSDYINDISPM 391

BLAST of CaUC04G076540 vs. ExPASy TrEMBL
Match: A0A6J1KJM0 (putative nuclease HARBI1 OS=Cucurbita maxima OX=3661 GN=LOC111495826 PE=3 SV=1)

HSP 1 Score: 760.0 bits (1961), Expect = 4.9e-216
Identity = 368/391 (94.12%), Positives = 374/391 (95.65%), Query Frame = 0

Query: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDI 60
           MESSDDEKDG+YGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLE+FRMDKHIFYKLCDI
Sbjct: 1   MESSDDEKDGSYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLENFRMDKHIFYKLCDI 60

Query: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120
           LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI
Sbjct: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120

Query: 121 SLDFFQPPGSNVPPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQI 180
           SLDFFQPPGSNVPPEI  DPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQ 
Sbjct: 121 SLDFFQPPGSNVPPEILDDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQN 180

Query: 181 VLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFIA 240
           VLAACSFDLKFHYVLAGWEGSA+DLQVLNSALTRRNKLH+PEGKYYLVDQKYMNMPGFIA
Sbjct: 181 VLAACSFDLKFHYVLAGWEGSATDLQVLNSALTRRNKLHIPEGKYYLVDQKYMNMPGFIA 240

Query: 241 PYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQT 300
           PYHDIPY S+EY GGYHPQDAKELFNLRHSLLRNATDRTFG LK RFPILLSAPPYPLQT
Sbjct: 241 PYHDIPYQSREYTGGYHPQDAKELFNLRHSLLRNATDRTFGALKVRFPILLSAPPYPLQT 300

Query: 301 QVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDIA 360
           QVKLVVATCAIHNYIRRENPDD LFRLYE DHV HMEDSLPQL+AEQLT  IETP VDIA
Sbjct: 301 QVKLVVATCAIHNYIRRENPDDCLFRLYEQDHVSHMEDSLPQLEAEQLTAHIETPTVDIA 360

Query: 361 FETGELEITSQLRDTIAAELWSDYINDISPM 392
           FET E EITSQLRD IA ELWSDYINDISPM
Sbjct: 361 FETEEREITSQLRDAIATELWSDYINDISPM 391

BLAST of CaUC04G076540 vs. ExPASy TrEMBL
Match: A0A5D3BLI7 (Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G002420 PE=3 SV=1)

HSP 1 Score: 755.0 bits (1948), Expect = 1.6e-214
Identity = 363/382 (95.03%), Positives = 371/382 (97.12%), Query Frame = 0

Query: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDI 60
           MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKH+FYKLCDI
Sbjct: 1   MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHVFYKLCDI 60

Query: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120
           LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI
Sbjct: 61  LQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAI 120

Query: 121 SLDFFQPPGSNV-PPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQ 180
           SLDFFQPPGSNV PPEI +DPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNG LSQ
Sbjct: 121 SLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQ 180

Query: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI 240
           IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+
Sbjct: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFV 240

Query: 241 APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQ 300
           APYHDI YHSKEYPGGYHPQDAKELFNLRHSLLRNAT+RTFG LKARFPILLSAPPYPLQ
Sbjct: 241 APYHDITYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKARFPILLSAPPYPLQ 300

Query: 301 TQVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDI 360
           TQVKLVVATCAIHNYIRRENPDDW FRLYE DHVPHMEDSLPQL+AEQLT  IETPIVD+
Sbjct: 301 TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDV 360

Query: 361 AFETGELEITSQLRDTIAAELW 382
           AFET ELEI SQLRD+IAAE+W
Sbjct: 361 AFETEELEIASQLRDSIAAEIW 382

BLAST of CaUC04G076540 vs. ExPASy TrEMBL
Match: A0A6J1C7D1 (putative nuclease HARBI1 OS=Momordica charantia OX=3673 GN=LOC111009055 PE=3 SV=1)

HSP 1 Score: 746.1 bits (1925), Expect = 7.3e-212
Identity = 362/393 (92.11%), Positives = 373/393 (94.91%), Query Frame = 0

Query: 1   MESSDDEKDGTYGKYVPREPS-HNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCD 60
           MESSDDEKDGTYGKYVPRE S HNLVSNGAKFVDEVL GQNE CLE+FRMDKHIFYKLCD
Sbjct: 1   MESSDDEKDGTYGKYVPRELSHHNLVSNGAKFVDEVLKGQNELCLENFRMDKHIFYKLCD 60

Query: 61  ILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMA 120
           ILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMA
Sbjct: 61  ILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMA 120

Query: 121 ISLDFFQPPGSNVPPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQ 180
           ISLDFFQPPGSNVP EI +DPRFYPYFKDCVGA+DGIHIPVMVGVDEQGPFRNKNGLLSQ
Sbjct: 121 ISLDFFQPPGSNVPAEILEDPRFYPYFKDCVGAVDGIHIPVMVGVDEQGPFRNKNGLLSQ 180

Query: 181 IVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFI 240
            VLAACSFDL FHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+
Sbjct: 181 NVLAACSFDLMFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFV 240

Query: 241 APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQ 300
           APYHD+PYHSK++PGGYHPQDAK+LFNLRHSLLRNATDRTFG LKARFPILLSAPPYPLQ
Sbjct: 241 APYHDVPYHSKDFPGGYHPQDAKQLFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQ 300

Query: 301 TQVKLVVATCAIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLT-TQIETPIVD 360
           TQVKLVVATCAIHNYIRRE PDDWLFRLYE DH+PHMEDSLP L+A QL  T IETP VD
Sbjct: 301 TQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHLPHMEDSLPPLEAAQLVGTHIETPTVD 360

Query: 361 IAFETGELEITSQLRDTIAAELWSDYINDISPM 392
           IAFET ELEITSQLRD IA ELWSDYIND+SPM
Sbjct: 361 IAFETEELEITSQLRDAIATELWSDYINDVSPM 393

BLAST of CaUC04G076540 vs. TAIR 10
Match: AT5G41980.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 474.9 bits (1221), Expect = 6.1e-134
Identity = 243/384 (63.28%), Positives = 291/384 (75.78%), Query Frame = 0

Query: 7   EKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDILQAKGL 66
           E+D      +P+E S   +S+G KFV ++LNG NE+C E+FRMDK +FYKLCD+LQ +GL
Sbjct: 6   EEDKEEAVTLPKEVSKISISDGNKFVYQILNGPNEQCFENFRMDKPVFYKLCDLLQTRGL 65

Query: 67  LRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAISLDFFQ 126
           LRHTNRIKIE QLAIF+FIIGHNLRTRAVQELF YSGETISRHFNNVLNA++AIS DFFQ
Sbjct: 66  LRHTNRIKIEAQLAIFLFIIGHNLRTRAVQELFCYSGETISRHFNNVLNAVIAISKDFFQ 125

Query: 127 PPGSNVPPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQIVLAACS 186
            P SN     + D    PYFKDCVG +D  HIPVMVGVDEQGPFRN NGLL+Q VLAA S
Sbjct: 126 -PNSNSDTLENDD----PYFKDCVGVVDSFHIPVMVGVDEQGPFRNGNGLLTQNVLAASS 185

Query: 187 FDLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFIAPYHDIP 246
           FDL+F+YVLAGWEGSASD QVLN+ALTRRNKL VP+GKYY+VD KY N+PGFIAPYH + 
Sbjct: 186 FDLRFNYVLAGWEGSASDQQVLNAALTRRNKLQVPQGKYYIVDNKYPNLPGFIAPYHGVS 245

Query: 247 YHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQTQVKLVV 306
            +S+E        +AKE+FN RH LL  A  RTFG LK RFPILLSAPPYPLQTQVKLV+
Sbjct: 246 TNSRE--------EAKEMFNERHKLLHRAIHRTFGALKERFPILLSAPPYPLQTQVKLVI 305

Query: 307 ATCAIHNYIRRENPDDWLFRLYEHDHVPHM-EDSLPQLDAEQLTTQIETPIVDIAFETGE 366
           A CA+HNY+R E PDD +FR++E + +    ED    L+ E    Q+E    +  F   E
Sbjct: 306 AACALHNYVRLEKPDDLVFRMFEEETLAEAGEDREVALEEE----QVEIVGQEHGFRPEE 365

Query: 367 LEITSQLRDTIAAELWSDYINDIS 390
           +E + +LRD IA+ELW+ Y+ ++S
Sbjct: 366 VEDSLRLRDEIASELWNHYVQNMS 372

BLAST of CaUC04G076540 vs. TAIR 10
Match: AT1G43722.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G28730.1); Has 924 Blast hits to 912 proteins in 109 species: Archae - 0; Bacteria - 0; Metazoa - 222; Fungi - 31; Plants - 661; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 157.9 bits (398), Expect = 1.6e-38
Identity = 97/283 (34.28%), Positives = 139/283 (49.12%), Query Frame = 0

Query: 12  YGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLEHFRMDKHIFYKLCDILQAKGLLRHTN 71
           Y +Y  R P       G + +   L      CL+  RM    F  LC++LQ    L+ T 
Sbjct: 35  YDRYFQRAPVQIDRGLGWRNIWRRLQQDAAACLQLLRMSLPCFTTLCNMLQTNYDLQPTL 94

Query: 72  RIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFNNVLNAIMAISLDFFQPPGS- 131
            I IEE +A+F+ I GHN   R V   F  + ET+ R F  VL A   ++ D+ + P   
Sbjct: 95  NISIEESVAMFLRICGHNEVYRDVGLRFGRNQETVQRKFREVLTATELLACDYIRTPTRQ 154

Query: 132 ---NVPPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQIVLAACSF 191
               +P  +  D R++PYF   VGA+DG H+ V V  D QG + N++   S  ++A C  
Sbjct: 155 ELYRIPERLQVDQRYWPYFSGFVGAMDGTHVCVKVKPDLQGMYWNRHDNASLNIMAICDL 214

Query: 192 DLKFHYVLAGWEGSASDLQVLNSALTRRNKLHVPEG-KYYLVDQKYMNMPGFIAPYHD-- 251
            + F Y+  G  GS  D  VL  A    ++  +P   KYYLVD  Y N  G +APY    
Sbjct: 215 KMLFTYIWNGAPGSCYDTAVLQIAQQSDSEFPLPPSEKYYLVDSGYPNKQGLLAPYRSSR 274

Query: 252 ---IPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDRTFGVLK 285
              + YH  ++  G  P++  ELFN  H+ LR+  +RTF + K
Sbjct: 275 NRVVRYHMSQFYYGPRPRNKHELFNQCHTSLRSVIERTFRIWK 317

BLAST of CaUC04G076540 vs. TAIR 10
Match: AT5G35695.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41980.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 115.2 bits (287), Expect = 1.2e-25
Identity = 71/197 (36.04%), Positives = 104/197 (52.79%), Query Frame = 0

Query: 191 FHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFIAPYHDIPYHSK 250
           F YVL+GWEGSA D +VL+ AL           K+YLVD  + N   F+AP+  + YH +
Sbjct: 25  FIYVLSGWEGSAHDSRVLSDALR----------KFYLVDCGFANRLNFLAPFRGVRYHLQ 84

Query: 251 EYPGGYH-PQDAKELFNLRHSLLRNATDRTFGVLKARFPILLSAPPYPLQTQVKLVVATC 310
           E+ G    P+   ELFNLRH  LRN  +R FG+ K+RF I  SAPP+  + Q  LV+   
Sbjct: 85  EFAGQRRDPETPHELFNLRHVSLRNVIERIFGIFKSRFAIFKSAPPFSYKKQAGLVLTCA 144

Query: 311 AIHNYIRRENPDDWLFRLYEHDHVPHMEDSLPQLDAEQLTTQIETPIVDIAFETGELEIT 370
           A+HN++R+E   D        D V +  D +        T +I+     +  +  + E T
Sbjct: 145 ALHNFLRKECRSD---EADFPDEVGNEGDVVNNEGNAMNTNEIDNE-EPLEAQKQDRENT 204

Query: 371 SQLRDTIAAELWSDYIN 387
           +  R ++A ++W D  N
Sbjct: 205 NMWRKSMAEDMWKDATN 207

BLAST of CaUC04G076540 vs. TAIR 10
Match: AT5G28950.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41980.1); Has 448 Blast hits to 446 proteins in 74 species: Archae - 0; Bacteria - 0; Metazoa - 31; Fungi - 21; Plants - 396; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 114.0 bits (284), Expect = 2.7e-25
Identity = 56/96 (58.33%), Positives = 68/96 (70.83%), Query Frame = 0

Query: 128 PGSNVPPEISQDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGLLSQIVLAACSF 187
           P   VP +I +  R YPYFKDCVGAID  HI  MV   +   FRN+ G +SQ +LAAC+F
Sbjct: 4   PEIAVPRKIRESTRLYPYFKDCVGAIDDTHIFAMVSQKKMPSFRNRKGDISQNMLAACNF 63

Query: 188 DLKFHYVLAGWEGSASDLQVLNSALTRR-NKLHVPE 223
           D++F YVL+GWEGSA D +VLN ALTR  N+L VPE
Sbjct: 64  DVEFMYVLSGWEGSAHDSKVLNDALTRNSNRLPVPE 99

BLAST of CaUC04G076540 vs. TAIR 10
Match: AT4G10890.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439 (InterPro:IPR018838); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 62.0 bits (149), Expect = 1.2e-09
Identity = 31/72 (43.06%), Positives = 42/72 (58.33%), Query Frame = 0

Query: 219 HVPEGKYYLVDQKYMNMPGFIAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATDR 278
           H    KYYLV+  Y    G++ P+  I YH  ++  G  P   +ELFN +H  LR+  DR
Sbjct: 89  HPSNRKYYLVNSVYPTTTGYLGPHRRILYHLGQFGRGGPPVTVQELFNRKHLDLRSVIDR 148

Query: 279 TFGVLKARFPIL 291
           TFGV KA++ IL
Sbjct: 149 TFGVWKAKWRIL 160

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038895429.1  7.7e-224  97.45         putative nuclease HARBI1 isoform X1 [Benincasa hispida]
XP_038895430.1  7.7e-224  97.45         putative nuclease HARBI1 isoform X2 [Benincasa hispida] >XP_038895431.1 putative...
XP_016899554.1  1.2e-219  94.90         PREDICTED: uncharacterized protein LOC103502878 [Cucumis melo]
XP_004137507.1  2.8e-218  94.39         putative nuclease HARBI1 isoform X1 [Cucumis sativus] >KAE8652559.1 hypothetical...
XP_022924205.1  2.4e-217  94.37         putative nuclease HARBI1 isoform X2 [Cucurbita moschata] >XP_022924206.1 putativ...

Match Name      E-value   Identity (%)  Description
A0A1S4DU98      5.6e-220  94.90         uncharacterized protein LOC103502878 OS=Cucumis melo OX=3656 GN=LOC103502878 PE=...
A0A6J1EE58      1.2e-217  94.37         putative nuclease HARBI1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC11143172...
A0A6J1KJM0      4.9e-216  94.12         putative nuclease HARBI1 OS=Cucurbita maxima OX=3661 GN=LOC111495826 PE=3 SV=1
A0A5D3BLI7      1.6e-214  95.03         Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...
A0A6J1C7D1      7.3e-212  92.11         putative nuclease HARBI1 OS=Momordica charantia OX=3673 GN=LOC111009055 PE=3 SV=...

Match Name      E-value   Identity (%)  Description
AT5G41980.1     6.1e-134  63.28         CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT1G43722.1     1.6e-38   34.28         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G35695.1     1.2e-25   36.04         CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G28950.1     2.7e-25   58.33         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G10890.1     1.2e-09   43.06         unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439...
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                                Source       Source Term      Source Description            Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM         PF13359          DDE_Tnp_4                     coord: 153..313; e-value: 6.3E-15; score: 55.2
None       No IPR available                               MOBIDB_LITE  mobidb-lite      disorder_prediction           coord: 1..21
None       No IPR available                               MOBIDB_LITE  mobidb-lite      disorder_prediction           coord: 1..15
None       No IPR available                               PANTHER      PTHR22930        UNCHARACTERIZED               coord: 1..391
None       No IPR available                               PANTHER      PTHR22930:SF201  NUCLEASE HARBI1-LIKE PROTEIN  coord: 1..391
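
The `coord` values locate each hit on the protein sequence. The following is a minimal sketch of extracting such a region, assuming the coordinates are 1-based and inclusive (not stated on this page). `slice_1based` is a hypothetical helper.

```python
def slice_1based(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]

# First ten residues of the protein sequence listed above, used as a small demo.
demo = "MESSDDEKDG"
first_three = slice_1based(demo, 1, 3)

# Length implied by the Pfam hit "coord: 153..313" under the same assumption.
domain_length = 313 - 153 + 1
```

Applied to the full 392-residue protein, `slice_1based(protein, 153, 313)` would return the 161 residues covered by the DDE_Tnp_4 hit.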

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CaUC04G076540.1  CaUC04G076540.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0046872      metal ion binding