CaUC02G033680 (gene) Watermelon (USVL246-FR2) v1

Overview
NameCaUC02G033680
Typegene
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionIntegrase catalytic domain-containing protein
LocationCiama_Chr02: 9722209 .. 9724113 (+)
RNA-Seq ExpressionCaUC02G033680
SyntenyCaUC02G033680
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGAGCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGACGCCTTCGGTATCCAGGGGTCGCGGACCGACCATAGAACCCTGTTCAATAAGTGGAACGCATTAGCTGTCCGCTCTCCGGTTGGGCAGTAAGGGTCGGAGAAGGGCAATCACTCATTCTTAAAACCAGCATTCTTAAGACCAAAGAGGCGGGCGGAAAAGGGGGGAAAGCTCTCCGTTCCTGGTTCTCCTGTAGCTGGATTCTCCGGAACCACAAGAATCCTTAGTTAGAATGGGATTCCAACTCAGTACCTTTTGAGATTTTGAGAAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAAGTTGTAAGCTGTGTTCGGGGGGGAGTTATTGTCTATCGTTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGGTCAACCTTTCAAACTGCTGCTGAATCCATGGGCAGGCAAGAGACAACCTGGCGAACTGAAACATCTTAGTAGCCAGAGGAAAAGAAAGCAAAAGCGATTCCCGTAGTAGCGGCGAGCGAAATGGGAGCAGCCTAAACCGTGAAAACGGGGTTGTGGGAGAGCAATACAAGCGTCGTGCTGCTAGGCGAAGCGGTGGAGTGCTGCACCCTAGATGGCGAGAGTCCAGTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGGGCACGTGGAATCCCGTGTGAATCAGCAAGGACCACCTTGCAAGGCTAAATACTCCTGGGTGACCGATAGTGAAGTAGTACCGTGAGGGAAGGGTGAAAAGAACCCCCATCGGGGAGTGAAATAGAACATGAAACCGTAAGCTTCCAAGCAGTGGGAGGAGACCAGGACTCTGACCGCGTGCCTGTTGAAGAATGAGCCGGCGACTCATAGGCAGTGGCTTGGTTAAGGGAACCCACCGGAGCCGTAGCGAAAGCGAGTCTTCATGGGGCAATTGTCACTGCTTATGGACCCGAACCTGGGTGATCTATCCATGACCAGGATGAAGCTTGGGTGAAACTAAGTGGAGGTCCGAACCGACTGATGTTGAAGAATCAGCGGATGAGTTGTGGTTAGGGGTGAAATGCCACTCGAACCCAGAGCTAGCTGGTTCTCCCCGAAATGCGTTGAGGCGCAGCAGTTGACTGGACATCTAGGGGTAAAGCACTGTTTCGGTGCGGGCCGCGAGAGCGGTACCAAATCGAGGCAAACTCTGAATACTAGATATGATCTCAAAATAACAGGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTTCATCGTCGAGAGGGAAACAGCCCGGATCACCAGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAGCCACCCTTGA

mRNA sequence

ATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGAGCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGACGCCTTCGACCAAAGAGGCGGGCGGAAAAGGGGGGAAAGCTCTCCGTTCCTGGTTCTCCTGTAGCTGGATTCTCCGGAACCACAAGAATCCTTAAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTTCATCGTCGAGAGGGAAACAGCCCGGATCACCAGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAGCCACCCTTGA

Coding sequence (CDS)

ATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGAGCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGACGCCTTCGACCAAAGAGGCGGGCGGAAAAGGGGGGAAAGCTCTCCGTTCCTGGTTCTCCTGTAGCTGGATTCTCCGGAACCACAAGAATCCTTAAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTTCATCGTCGAGAGGGAAACAGCCCGGATCACCAGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAGCCACCCTTGA

Protein sequence

MDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRPKRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEVGVQRQPGGLPRSSHP
Homology
BLAST of CaUC02G033680 vs. NCBI nr
Match: CAD5336145.1 (unnamed protein product [Arabidopsis thaliana])

HSP 1 Score: 335.1 bits (858), Expect = 5.1e-88
Identity = 180/258 (69.77%), Postives = 191/258 (74.03%), Query Frame = 0

Query: 21  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP----- 80
           IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR      
Sbjct: 132 IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRSSPFEI 191

Query: 81  -------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGGCQRFESAYLQL 140
                  + + ++  KL   GS     +    +++SV GL GGGLPCGGCQRFESAYLQL
Sbjct: 192 LRRVALWRAQYDESCKLCSGGSSCLSLAS---MVESVRGLIGGGLPCGGCQRFESAYLQL 251

Query: 141 VNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIP 200
           VNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIP
Sbjct: 252 VNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPFSSTLGWHSLKVKGEVQTRKGLRWIP 311

Query: 201 RHPETRKGVGS--------------------------------RSASETMGDKLHRREGN 235
           RHPETRKGV S                                RSASET+GDKLHRREGN
Sbjct: 312 RHPETRKGVVSDEMLRGVENKRRSGDSRIGAAVDWTSRGKALFRSASETVGDKLHRREGN 371

BLAST of CaUC02G033680 vs. NCBI nr
Match: CAD5336141.1 (unnamed protein product [Arabidopsis thaliana])

HSP 1 Score: 335.1 bits (858), Expect = 5.1e-88
Identity = 180/258 (69.77%), Postives = 191/258 (74.03%), Query Frame = 0

Query: 21  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP----- 80
           IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR      
Sbjct: 46  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRSSPFEI 105

Query: 81  -------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGGCQRFESAYLQL 140
                  + + ++  KL   GS     +    +++SV GL GGGLPCGGCQRFESAYLQL
Sbjct: 106 LRRVALWRAQYDESCKLCSGGSSCLSLAS---MVESVRGLIGGGLPCGGCQRFESAYLQL 165

Query: 141 VNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIP 200
           VNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIP
Sbjct: 166 VNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPFSSTLGWHSLKVKGEVQTRKGLRWIP 225

Query: 201 RHPETRKGVGS--------------------------------RSASETMGDKLHRREGN 235
           RHPETRKGV S                                RSASET+GDKLHRREGN
Sbjct: 226 RHPETRKGVVSDEMLRGVENKRRSGDSRIGAAVDWTSRGKALFRSASETVGDKLHRREGN 285

BLAST of CaUC02G033680 vs. NCBI nr
Match: CAD5336140.1 (unnamed protein product [Arabidopsis thaliana] >CAD5336144.1 unnamed protein product [Arabidopsis thaliana])

HSP 1 Score: 335.1 bits (858), Expect = 5.1e-88
Identity = 180/258 (69.77%), Postives = 191/258 (74.03%), Query Frame = 0

Query: 21  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP----- 80
           IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR      
Sbjct: 46  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRSSPFEI 105

Query: 81  -------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGGCQRFESAYLQL 140
                  + + ++  KL   GS     +    +++SV GL GGGLPCGGCQRFESAYLQL
Sbjct: 106 LRRVALWRAQYDESCKLCSGGSSCLSLAS---MVESVRGLIGGGLPCGGCQRFESAYLQL 165

Query: 141 VNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIP 200
           VNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIP
Sbjct: 166 VNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPFSSTLGWHSLKVKGEVQTRKGLRWIP 225

Query: 201 RHPETRKGVGS--------------------------------RSASETMGDKLHRREGN 235
           RHPETRKGV S                                RSASET+GDKLHRREGN
Sbjct: 226 RHPETRKGVVSDEMLRGVENKRRSGDSRIGAAVDWTSRGKALFRSASETVGDKLHRREGN 285

BLAST of CaUC02G033680 vs. NCBI nr
Match: KAD3640919.1 (hypothetical protein E3N88_30142 [Mikania micrantha])

HSP 1 Score: 308.5 bits (789), Expect = 5.1e-80
Identity = 162/217 (74.65%), Postives = 169/217 (77.88%), Query Frame = 0

Query: 1   MDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRL 60
           MDRTWTVVGVGGS RVP SGIPGEEDQVGPCEQLDALSPFNPLSE+RQKEGKSMDRPH L
Sbjct: 56  MDRTWTVVGVGGSPRVPLSGIPGEEDQVGPCEQLDALSPFNPLSEIRQKEGKSMDRPHHL 115

Query: 61  HPVGTTRSPQGRLRPKRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGGCQR 120
           HPVGTTR PQGRLR             PG+               G LRGGGLPCGGCQR
Sbjct: 116 HPVGTTRLPQGRLRH------------PGN-------------QSGDLRGGGLPCGGCQR 175

Query: 121 FESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT 180
           FESAYLQLVNLADTKLYDST FFRFG SIYDLSFMDVDKI PFSSTLGWHSLK+KGEVQT
Sbjct: 176 FESAYLQLVNLADTKLYDSTQFFRFGGSIYDLSFMDVDKIHPFSSTLGWHSLKMKGEVQT 235

Query: 181 KKGLRWIPRHPETRKGVGSRSASETMGDKLHRREGNS 218
           +KGLRWIPRHPETRKGV S      + +K   R G+S
Sbjct: 236 RKGLRWIPRHPETRKGVVSDEMLRGVENK--HRSGDS 245

BLAST of CaUC02G033680 vs. NCBI nr
Match: OVA05688.1 (hypothetical protein BVC80_4285g1 [Macleaya cordata])

HSP 1 Score: 306.6 bits (784), Expect = 2.0e-79
Identity = 173/247 (70.04%), Postives = 185/247 (74.90%), Query Frame = 0

Query: 7   VVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTT 66
           VVGVGGS RVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTT
Sbjct: 258 VVGVGGSPRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTT 317

Query: 67  RSPQGRLR-PKRRAEKGGKLSVPGSPVAGFSGTTRILK-----SVGGLRGGGLPCGGCQR 126
           RSPQGRLR P   + +  +     S  A F    R+       S GGLRGGGLPCGGCQR
Sbjct: 318 RSPQGRLRHPGLDSPEPQESLEWDSNSAPFEILRRVALWRAQISRGGLRGGGLPCGGCQR 377

Query: 127 FESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT 186
           FESAYLQLVNLADTK+YDST FFRFGSSIYDLSFMDVDKIL FSSTLGWHSLKV GEVQT
Sbjct: 378 FESAYLQLVNLADTKVYDSTQFFRFGSSIYDLSFMDVDKILLFSSTLGWHSLKVNGEVQT 437

Query: 187 KKGLRWIPRHPETRKGVGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEVGVQRQP 246
           +KGLRWIPRHPETRKGV S      + +K   R G+S   Q   L    ++     +RQP
Sbjct: 438 RKGLRWIPRHPETRKGVASDEMLRGVENK--HRSGDSRIGQPFEL----LLNPWAGKRQP 497

Query: 247 GGLPRSS 248
           G L   S
Sbjct: 498 GELKHLS 498

BLAST of CaUC02G033680 vs. ExPASy TrEMBL
Match: A0A7G2FKR6 (Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS23352 PE=3 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 2.5e-88
Identity = 180/258 (69.77%), Postives = 191/258 (74.03%), Query Frame = 0

Query: 21  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP----- 80
           IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR      
Sbjct: 46  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRSSPFEI 105

Query: 81  -------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGGCQRFESAYLQL 140
                  + + ++  KL   GS     +    +++SV GL GGGLPCGGCQRFESAYLQL
Sbjct: 106 LRRVALWRAQYDESCKLCSGGSSCLSLAS---MVESVRGLIGGGLPCGGCQRFESAYLQL 165

Query: 141 VNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIP 200
           VNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIP
Sbjct: 166 VNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPFSSTLGWHSLKVKGEVQTRKGLRWIP 225

Query: 201 RHPETRKGVGS--------------------------------RSASETMGDKLHRREGN 235
           RHPETRKGV S                                RSASET+GDKLHRREGN
Sbjct: 226 RHPETRKGVVSDEMLRGVENKRRSGDSRIGAAVDWTSRGKALFRSASETVGDKLHRREGN 285

BLAST of CaUC02G033680 vs. ExPASy TrEMBL
Match: A0A7G2FMH4 (Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS23357 PE=3 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 2.5e-88
Identity = 180/258 (69.77%), Postives = 191/258 (74.03%), Query Frame = 0

Query: 21  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP----- 80
           IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR      
Sbjct: 132 IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRSSPFEI 191

Query: 81  -------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGGCQRFESAYLQL 140
                  + + ++  KL   GS     +    +++SV GL GGGLPCGGCQRFESAYLQL
Sbjct: 192 LRRVALWRAQYDESCKLCSGGSSCLSLAS---MVESVRGLIGGGLPCGGCQRFESAYLQL 251

Query: 141 VNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIP 200
           VNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIP
Sbjct: 252 VNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPFSSTLGWHSLKVKGEVQTRKGLRWIP 311

Query: 201 RHPETRKGVGS--------------------------------RSASETMGDKLHRREGN 235
           RHPETRKGV S                                RSASET+GDKLHRREGN
Sbjct: 312 RHPETRKGVVSDEMLRGVENKRRSGDSRIGAAVDWTSRGKALFRSASETVGDKLHRREGN 371

BLAST of CaUC02G033680 vs. ExPASy TrEMBL
Match: A0A7G2FJL3 (Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS23353 PE=3 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 2.5e-88
Identity = 180/258 (69.77%), Postives = 191/258 (74.03%), Query Frame = 0

Query: 21  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP----- 80
           IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR      
Sbjct: 46  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRSSPFEI 105

Query: 81  -------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGGCQRFESAYLQL 140
                  + + ++  KL   GS     +    +++SV GL GGGLPCGGCQRFESAYLQL
Sbjct: 106 LRRVALWRAQYDESCKLCSGGSSCLSLAS---MVESVRGLIGGGLPCGGCQRFESAYLQL 165

Query: 141 VNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIP 200
           VNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIP
Sbjct: 166 VNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPFSSTLGWHSLKVKGEVQTRKGLRWIP 225

Query: 201 RHPETRKGVGS--------------------------------RSASETMGDKLHRREGN 235
           RHPETRKGV S                                RSASET+GDKLHRREGN
Sbjct: 226 RHPETRKGVVSDEMLRGVENKRRSGDSRIGAAVDWTSRGKALFRSASETVGDKLHRREGN 285

BLAST of CaUC02G033680 vs. ExPASy TrEMBL
Match: A0A5N6MLP8 (Uncharacterized protein ycf68 OS=Mikania micrantha OX=192012 GN=E3N88_30142 PE=3 SV=1)

HSP 1 Score: 308.5 bits (789), Expect = 2.5e-80
Identity = 162/217 (74.65%), Postives = 169/217 (77.88%), Query Frame = 0

Query: 1   MDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRL 60
           MDRTWTVVGVGGS RVP SGIPGEEDQVGPCEQLDALSPFNPLSE+RQKEGKSMDRPH L
Sbjct: 56  MDRTWTVVGVGGSPRVPLSGIPGEEDQVGPCEQLDALSPFNPLSEIRQKEGKSMDRPHHL 115

Query: 61  HPVGTTRSPQGRLRPKRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGGCQR 120
           HPVGTTR PQGRLR             PG+               G LRGGGLPCGGCQR
Sbjct: 116 HPVGTTRLPQGRLRH------------PGN-------------QSGDLRGGGLPCGGCQR 175

Query: 121 FESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT 180
           FESAYLQLVNLADTKLYDST FFRFG SIYDLSFMDVDKI PFSSTLGWHSLK+KGEVQT
Sbjct: 176 FESAYLQLVNLADTKLYDSTQFFRFGGSIYDLSFMDVDKIHPFSSTLGWHSLKMKGEVQT 235

Query: 181 KKGLRWIPRHPETRKGVGSRSASETMGDKLHRREGNS 218
           +KGLRWIPRHPETRKGV S      + +K   R G+S
Sbjct: 236 RKGLRWIPRHPETRKGVVSDEMLRGVENK--HRSGDS 245

BLAST of CaUC02G033680 vs. ExPASy TrEMBL
Match: A0A200Q5G5 (Uncharacterized protein ycf68 OS=Macleaya cordata OX=56857 GN=BVC80_4285g1 PE=3 SV=1)

HSP 1 Score: 306.6 bits (784), Expect = 9.5e-80
Identity = 173/247 (70.04%), Postives = 185/247 (74.90%), Query Frame = 0

Query: 7   VVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTT 66
           VVGVGGS RVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTT
Sbjct: 258 VVGVGGSPRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTT 317

Query: 67  RSPQGRLR-PKRRAEKGGKLSVPGSPVAGFSGTTRILK-----SVGGLRGGGLPCGGCQR 126
           RSPQGRLR P   + +  +     S  A F    R+       S GGLRGGGLPCGGCQR
Sbjct: 318 RSPQGRLRHPGLDSPEPQESLEWDSNSAPFEILRRVALWRAQISRGGLRGGGLPCGGCQR 377

Query: 127 FESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT 186
           FESAYLQLVNLADTK+YDST FFRFGSSIYDLSFMDVDKIL FSSTLGWHSLKV GEVQT
Sbjct: 378 FESAYLQLVNLADTKVYDSTQFFRFGSSIYDLSFMDVDKILLFSSTLGWHSLKVNGEVQT 437

Query: 187 KKGLRWIPRHPETRKGVGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEVGVQRQP 246
           +KGLRWIPRHPETRKGV S      + +K   R G+S   Q   L    ++     +RQP
Sbjct: 438 RKGLRWIPRHPETRKGVASDEMLRGVENK--HRSGDSRIGQPFEL----LLNPWAGKRQP 497

Query: 247 GGLPRSS 248
           G L   S
Sbjct: 498 GELKHLS 498

BLAST of CaUC02G033680 vs. TAIR 10
Match: AT2G07706.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:ATMG00470.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 57.0 bits (136), Expect = 2.5e-08
Identity = 35/62 (56.45%), Postives = 37/62 (59.68%), Query Frame = 0

Query: 203 SETMGDKLHRR----------EGNSPDHQLRPLNDRSVIKEVGVQRQPGGL------PRS 249
           SET G ++ R           EGNSPDHQLRP N RSVIKEVGVQRQP  L      PRS
Sbjct: 36  SETRGTRVKRTLVTRELLSLGEGNSPDHQLRPPNGRSVIKEVGVQRQPRSLLAAPMPPRS 95

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
CAD5336145.15.1e-8869.77unnamed protein product [Arabidopsis thaliana][more]
CAD5336141.15.1e-8869.77unnamed protein product [Arabidopsis thaliana][more]
CAD5336140.15.1e-8869.77unnamed protein product [Arabidopsis thaliana] >CAD5336144.1 unnamed protein pro... [more]
KAD3640919.15.1e-8074.65hypothetical protein E3N88_30142 [Mikania micrantha][more]
OVA05688.12.0e-7970.04hypothetical protein BVC80_4285g1 [Macleaya cordata][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A7G2FKR62.5e-8869.77Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS233... [more]
A0A7G2FMH42.5e-8869.77Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS233... [more]
A0A7G2FJL32.5e-8869.77Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS233... [more]
A0A5N6MLP82.5e-8074.65Uncharacterized protein ycf68 OS=Mikania micrantha OX=192012 GN=E3N88_30142 PE=3... [more]
A0A200Q5G59.5e-8070.04Uncharacterized protein ycf68 OS=Macleaya cordata OX=56857 GN=BVC80_4285g1 PE=3 ... [more]
Match NameE-valueIdentityDescription
AT2G07706.12.5e-0856.45unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 183..197
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 205..227
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 183..249
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..90
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 48..62
NoneNo IPR availablePANTHERPTHR34890:SF13YCF68 PROTEINcoord: 103..184
IPR022546Uncharacterised protein family Ycf68PANTHERPTHR34890FAMILY NOT NAMEDcoord: 103..184

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CaUC02G033680.1CaUC02G033680.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0009536 plastid
molecular_function GO:0003676 nucleic acid binding