Cla97C05G096795 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C05G096795
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Retrotransposon protein
Location: Cla97Chr05: 25962510 .. 25966303 (+)
RNA-Seq Expression: Cla97C05G096795
Synteny: Cla97C05G096795
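
The Location value can be split into chromosome, start, end and strand, which also gives the genomic span of the gene. The sketch below is only illustrative: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Location string copied from the Overview. Treating the coordinates as
# 1-based and inclusive is an assumption, not something the record states.
LOCATION = "Cla97Chr05: 25962510 .. 25966303 (+)"


def parse_location(text):
    """Split 'chrom: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


chrom, start, end, strand = parse_location(LOCATION)
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 3794 bp on the forward strand of Cla97Chr05.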
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGACATGGACATTCAACTTGACGAGTTAATCGCAATACTGACTGCTGTATATGCGGCCACCGTTACAATGGTGAATACTATCACCACTTTGCTGCAATTGGAGGACAATCGAGAGCGACCCTCTCCACTTATTAGACATCAAATTTGACAGTTGAATTTCTTTCGTTTGATTTATGAAGATGATCAGATGTGTCATGAGAACACTCGTATGGATAAGAGAATGTTCACGATTCTATTTCAATTACTTAGGACGATGGGTGGGCTAAGGCCAATAAAATATGTAGACGTGGAGGAAATGGTGGCCATCTTCTTGCACATAGTCGCACACGATGTAAAAAATTAAGTGATGCAACGCCAATTTGCAAGGTCTGGCTAGACAATGTCCAGGCACTTCAAAATATTGTGCTAAATGCAATATTGAGATTGTACGAAGATCTGTTGCGAAAACTAGAGCCAGTCACTAAGAATTGTACAGATGACAGATGGCGCTGGTTTCAGGTACAATCATCATGTTGTTTTAATGGTATTGTTTGCCGGTTTTGAAATTACCTTAAACTTATCATTGAAACAACGCTGTGTCACAGAATTGCTTAGGTGCATTTGACGGAACCCACATAAAGGTAAATGTCAGTGCAGCTGATCGACCACGCTATAGAACTAGAATGGGCGAGATCGCAACGAATGTCCTTATTGTATGCAATCAAAGTTGCGAGTTCATATTCGTTTTCACTGGATGGGAAGGATCAGTTGCTGACTCGAGAGTTCTGCGAGATGTAGTGTCCAGGCCAATTGGATTGAAAGTCCCAAGGGGTTAGTTTATAAATATATGTATACCCTTCCATACAAATGAACAATCATCAAGTGATAACTTGGCCCAATTTCGCAGGATACTATTATCTATGCGATGCCGATTATCCAAACGCCGAGGGATTTTTGGCTTCTTATTAGGGACAAAGATACCATTTGACCGAGTAGCGGGTAGGGAACCCGCCAACAAACCTTAGAGAGATTTTCAATATGATACATTCATCTGCCTGGAACGTCATTGAGAGAGCGTTCGGGATGTTGAAAGGTCGATGGGCAATCCTCCGAGGAAAATCATACTACTCGGTTGATGTGTAGTGCATGACCATACATGCATGTTGTCTACTGAAGGGTTGAAGCAAAACATAAGCACCCAAAAGGTTTAATATGGGAGTAATTAGCTTGTTTCCTATGGAGATGAAAATAAAGGGATCGTCATTTAAGCAAAGGTGATGAAGTCCGATTGATGAGAAAACAAGCTGCCAAAACACATTCTCCCCAAAACTAAATGGGAATATGTGATTGGAAAAGCAAAGAGCGAGCAACATCCAATAAGTGTTGGTGTTTCCTTTTCACTCGGAATTTTGTTTAGGGCAAGCAACACAAGAATACTGATGAATGCTAGCTCAGGGCATTGTTAGAACAAAAAACCTTGATGGAAGTGTTGTATTAATTCTCAACAAGTTTAAAAAATAATGAGACTATGTTTAACGCATCAACTTTTTTACGCATTAGAAACACCCATATGTACCATGAATTATCATCCACTAATGTGAGGAAGAAATGAAAACTAGCATGTGTAGGTGTTGGATATGGGCTTCATACATCACAATGCAATAAATCAAACATATGTTCAGCCATGTGAGATTGTGAGAATGAAATGATAACCTTCGTTGTTTAGCAAGAGGGCAAATAAAAAATGGCTCAAAATGAGAATGTTTATTTAATGGAAAATGTAATACATCATTTAAAACTGACATATGTTCGTATGATGGATGACCAAGTTTATCATGCCATGTATTAAGAGAAACAACATTAGAACACAGATTGGCAAATAGCGTGAAAAGCCACAATCGTGTTTAAAATAGCATTCTCTAATGGAGAAGTCTAGAGCAAATAAAGTCTCTGCCAAGCATTAGCCTTTCCAATTGTCTTGAAAGAGAACTTGTCCTGAATAAGACGTAAATCCCC
GACAAATTGAGCAATCAAAGGTTGGTTAGTAGTGAATGTGATGATCAAAACAAGGTTGAAATGAAAGGATGGAATAAACAACACAATTTGAAGCAAAATATGTGAGTTTAGTCAAATATCATCAACAAATTCAACATTAACATGTGTTTGATTGGCCAGAGAAACAAACATTCTAGAAACAGGTTTAATAGAATCAAACATCTCTCGAACAAAGCATATATATGTAGACACACCATAATCAAGTATCCAACAATGAAGGGAAGAAGCAATATGAGTAATATCATAGCATATTCCGGCTACATGAGCTACAGTGACCTATGAATCAAAATTAATCTGTACTTTAGCAAGATGAATTGCAACATACTCAAAAGGCCCTAGCATTTTTCAGCAGTCAAACTAGATAACAAATCAAAGGGAGTAGCAACTGAAGAAGATGAAGTATTAGACTTCGTATTTAGTGGGGAAGGCTTTTGAGCTCCACTATGTCTCAGATGATATCCATGTAATTTGTAACATCGATCCATAGTGTGTCTAGAGATGTTACAATTAGTACAAATTAGTCGATCTTTCTCCTTATAGTTATTAATATTTGAAGTACGAGAAGAAGAAGTCGATGAATTCTTTACCAACAAAGCAGTGGCAACAACATAGGAAAGAGGAGGAGTTGAACAACAGAGGCTCTCTCTCATAGACCACCGCATTTATTCCCTAGTCCTCACCCATCATCAACCTATCCGACTAATCAGCAACAATTTTTAACCATGGGTTTAATCAACCTCATCAGCCATTTTATCCCCCTCATTATGCTCCTAGACCTAATTTCCCAAATCTTCCTCCAGCCCTTAATATCTTCCCATAACAGTTTGCTCCAAATCCCTATCCATCCTTACCTCAACCTCTTGTTGTGAAGCTTAATGACAATAATTGTGGAAGAAGCAGCTGCTGAATGTTGTTCAAGCCAATGGACTGGAAGGTTATCTTAATGGTACGGTTCCTGTTCCTTCAAAATATTTGGATGCTCAAAATATGCAGCTGAATCTAGAGTTTTCGACATGGGAAATGTATAATAGTTTCATTATGTGTTGGATTTACTCTTCATTTTCTGAAGAAAAGATGGGAGAAATTGTTAGTTTAGATACTGCTGCTAATATATGGAATTCGTTAAGAAGGTCTTATGATTCACAAACTACTGCACGTATTATGGGTTTAAAAGCCCATTTAAAAAGAACTTAGAAGGATGGTTCGTCTGTCAGTCAATATTTGTCTCAGATTAAAGAGGTTGCTGATAAGTTTAGTGCCATTGGAGAACTTATATCTTATAGAGATCATTTAGCTCATATTCTAGATAGTCTAGGAAGTGAATATAATGCTTTTGTCACTTATATTCAAAATTGCTCTGATAATCCTTCTATTGAAGATGTGAGAAGTTTATTGTTAGCTTATGAAGCCCCTTTGGAGAAACAAAATGTTGTTGATCAATTGAATGTTGCCCGAGCTAATTTTAGCAAGCTTTCTCTTCAACACAATAGCAAGCGGAATTCGTCTTGGTCCTTTCCAAACCCATCTTCCTCTGCTTCTCTAAGACCCTTTTCCCTTGTTTTTAATCTCCCCGCCTTCAATCAACCAAATCCAAACATACCGACGAGTGTTCTTGGTCGTCCTCAATTTTTCCCAAAATGGCCTCCAAAACCTTTTTCTTCTAAACCTCAATGCCAAATCTGTCACAAATTTGGTTATACCGCTCCTAACTGTCATCATCTTGCCTCTTTGGCCCACCAATCCATACCTCCTTAG

mRNA sequence

ATGGATGACATGGACATTCAACTTGACGAGTTAATCGCAATACTGACTGCTGTATATGCGGCCACCGTTACAATGATGTGTCATGAGAACACTCGTATGGATAAGAGAATGTTCACGATTCTATTTCAATTACTTAGGACGATGGGTGGGCTAAGGCCAATAAAATATGTAGACGTGGAGGAAATGGTGGCCATCTTCTTGCACATAGTCGCACACGATGTCTGGCTAGACAATGTCCAGGCACTTCAAAATATTGTGCTAAATGCAATATTGAGATTGTACGAAGATCTGTTGCGAAAACTAGAGCCAGTCACTAAGAATTGTACAGATGACAGATGGCGCTGGTTTCAGAATTGCTTAGGTGCATTTGACGGAACCCACATAAAGGTAAATGTCAGTGCAGCTGATCGACCACGCTATAGAACTAGAATGGGCGAGATCGCAACGAATGTCCTTATTGTATGCAATCAAAGTTGCGAGTTCATATTCGTTTTCACTGGATGGGAAGGATCAGTTGCTGACTCGAGAGTTCTGCGAGATGTAGTGTCCAGGCCAATTGGATTGAAAGTCCCAAGGGGGAACCCGCCAACAAACCTTAGAGAGATTTTCAATATGATACATTCATCTGCCTGGAACGTCATTGAGAGAGCGTTCGGGATGTTGAAAGGTCGATGGGCAATCCTCCGAGGAAAATCATACTACTCGAAGCAGCTGCTGAATGTTGTTCAAGCCAATGGACTGGAAGGTTATCTTAATGGTACGGTTCCTGTTCCTTCAAAATATTTGGATGCTCAAAATATGCAGCTGAATCTAGAGTTTTCGACATGGGAAATGTATAATAGTTTCATTATGTGTTGGATTTACTCTTCATTTTCTGAAGAAAAGATGGGAGAAATTGTTAGTTTAGATACTGCTGCTAATATATGGAATTCGTTAAGAAGTCAATATTTGTCTCAGATTAAAGAGGTTGCTGATAAGTTTAGTGCCATTGGAGAACTTATATCTTATAGAGATCATTTAGCTCATATTCTAGATAGTCTAGGAAGTGAATATAATGCTTTTGTCACTTATATTCAAAATTGCTCTGATAATCCTTCTATTGAAGATGTGAGAAGTTTATTGTTAGCTTATGAAGCCCCTTTGGAGAAACAAAATGTTGTTGATCAATTGAATGTTGCCCGAGCTAATTTTAGCAAGCTTTCTCTTCAACACAATAGCAAGCGGAATTCGTCTTGGTCCTTTCCAAACCCATCTTCCTCTGCTTCTCTAAGACCCTTTTCCCTTGTTTTTAATCTCCCCGCCTTCAATCAACCAAATCCAAACATACCGACGAGTGTTCTTGGTCGTCCTCAATTTTTCCCAAAATGGCCTCCAAAACCTTTTTCTTCTAAACCTCAATGCCAAATCTGTCACAAATTTGGTTATACCGCTCCTAACTGTCATCATCTTGCCTCTTTGGCCCACCAATCCATACCTCCTTAG

Coding sequence (CDS)

ATGGATGACATGGACATTCAACTTGACGAGTTAATCGCAATACTGACTGCTGTATATGCGGCCACCGTTACAATGATGTGTCATGAGAACACTCGTATGGATAAGAGAATGTTCACGATTCTATTTCAATTACTTAGGACGATGGGTGGGCTAAGGCCAATAAAATATGTAGACGTGGAGGAAATGGTGGCCATCTTCTTGCACATAGTCGCACACGATGTCTGGCTAGACAATGTCCAGGCACTTCAAAATATTGTGCTAAATGCAATATTGAGATTGTACGAAGATCTGTTGCGAAAACTAGAGCCAGTCACTAAGAATTGTACAGATGACAGATGGCGCTGGTTTCAGAATTGCTTAGGTGCATTTGACGGAACCCACATAAAGGTAAATGTCAGTGCAGCTGATCGACCACGCTATAGAACTAGAATGGGCGAGATCGCAACGAATGTCCTTATTGTATGCAATCAAAGTTGCGAGTTCATATTCGTTTTCACTGGATGGGAAGGATCAGTTGCTGACTCGAGAGTTCTGCGAGATGTAGTGTCCAGGCCAATTGGATTGAAAGTCCCAAGGGGGAACCCGCCAACAAACCTTAGAGAGATTTTCAATATGATACATTCATCTGCCTGGAACGTCATTGAGAGAGCGTTCGGGATGTTGAAAGGTCGATGGGCAATCCTCCGAGGAAAATCATACTACTCGAAGCAGCTGCTGAATGTTGTTCAAGCCAATGGACTGGAAGGTTATCTTAATGGTACGGTTCCTGTTCCTTCAAAATATTTGGATGCTCAAAATATGCAGCTGAATCTAGAGTTTTCGACATGGGAAATGTATAATAGTTTCATTATGTGTTGGATTTACTCTTCATTTTCTGAAGAAAAGATGGGAGAAATTGTTAGTTTAGATACTGCTGCTAATATATGGAATTCGTTAAGAAGTCAATATTTGTCTCAGATTAAAGAGGTTGCTGATAAGTTTAGTGCCATTGGAGAACTTATATCTTATAGAGATCATTTAGCTCATATTCTAGATAGTCTAGGAAGTGAATATAATGCTTTTGTCACTTATATTCAAAATTGCTCTGATAATCCTTCTATTGAAGATGTGAGAAGTTTATTGTTAGCTTATGAAGCCCCTTTGGAGAAACAAAATGTTGTTGATCAATTGAATGTTGCCCGAGCTAATTTTAGCAAGCTTTCTCTTCAACACAATAGCAAGCGGAATTCGTCTTGGTCCTTTCCAAACCCATCTTCCTCTGCTTCTCTAAGACCCTTTTCCCTTGTTTTTAATCTCCCCGCCTTCAATCAACCAAATCCAAACATACCGACGAGTGTTCTTGGTCGTCCTCAATTTTTCCCAAAATGGCCTCCAAAACCTTTTTCTTCTAAACCTCAATGCCAAATCTGTCACAAATTTGGTTATACCGCTCCTAACTGTCATCATCTTGCCTCTTTGGCCCACCAATCCATACCTCCTTAG

Protein sequence

MDDMDIQLDELIAILTAVYAATVTMMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWLDNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRWRWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRGNPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQLLNVVQANGLEGYLNGTVPVPSKYLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSEEKMGEIVSLDTAANIWNSLRSQYLSQIKEVADKFSAIGELISYRDHLAHILDSLGSEYNAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQNVVDQLNVARANFSKLSLQHNSKRNSSWSFPNPSSSASLRPFSLVFNLPAFNQPNPNIPTSVLGRPQFFPKWPPKPFSSKPQCQICHKFGYTAPNCHHLASLAHQSIPP
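
The protein sequence should follow from the CDS by translation in frame 0. The sketch below checks this for the first 30 and last 12 nucleotides of the CDS listed above. It assumes the standard nuclear genetic code, and `translate` is a hypothetical helper, not part of any tool used to build this record.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGGATGACATGGACATTCAACTTGACGAG"))
print(translate("ATACCTCCTTAG"))
```

The two fragments translate to MDDMDIQLDE and IPP, matching the start and end of the protein sequence; the final TAG is the stop codon.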
Homology
BLAST of Cla97C05G096795 vs. NCBI nr
Match: XP_022155181.1 (uncharacterized protein LOC111022315 [Momordica charantia])

HSP 1 Score: 276.9 bits (707), Expect = 3.3e-70
Identity = 150/295 (50.85%), Positives = 186/295 (63.05%), Query Frame = 0

Query: 234 YSKQLLNVVQANGLEGYLNGTVPVPSKYLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSE 293
           +  QLLN V ANGL GYL+GT+  P ++LD   +Q N  +  WE YN  +MCWIYSS SE
Sbjct: 42  WKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSE 101

Query: 294 EKMGEIVSLDTAANIWNSLR----------------------------SQYLSQIKEVAD 353
           EKMGE+VSL+T  +IW+SL                             SQYL++IKE+AD
Sbjct: 102 EKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIAD 161

Query: 354 KFSAIGELISYRDHLAHILDSLGSEYNAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQN 413
           KF+A+GE +SYRDHLAH+LD LGSEYNAFVT I N +D+PS+EDVRSLLLAYEA L+KQN
Sbjct: 162 KFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQN 221

Query: 414 VVDQLNVARANFSKLSLQHNSKR-NSSWSFPNPSSSASLRPFSLVFNLPAFNQPNPNIP- 473
            VDQLN+A+AN   LSLQHNSKR    +SFPN                  +    PN P 
Sbjct: 222 TVDQLNIAQANLVNLSLQHNSKRPPPKFSFPN-----------------HYKHSFPNSPI 281

Query: 474 -----TSVLGRPQFFPKWPPKPFSSKPQCQICHKFGYTAPNCHHLASLAHQSIPP 494
                 S+LG+PQ   KWPPKP SSK QCQIC K G++A  C+H  ++A+ +  P
Sbjct: 282 SAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLGHSAAVCYHRTNIAYHNASP 319
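
The Identity percentage in each HSP header is the count of matching columns divided by the alignment length, and the similarity percentage is derived the same way. A minimal sketch using the counts from the Momordica charantia HSP above; the function name `percent` is hypothetical.

```python
def percent(matches, alignment_length):
    """Share of alignment columns, as a percentage rounded to two decimals."""
    return round(100 * matches / alignment_length, 2)


# Counts reported in the header of the Momordica charantia HSP.
identical, similar, columns = 150, 186, 295
print(percent(identical, columns), percent(similar, columns))
```

This reproduces the reported 50.85 and 63.05. The denominator is the number of alignment columns, gaps included, not the length of the query protein.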

BLAST of Cla97C05G096795 vs. NCBI nr
Match: KAA0058874.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 266.5 bits (680), Expect = 4.4e-67
Identity = 134/237 (56.54%), Positives = 166/237 (70.04%), Query Frame = 0

Query: 25  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-------- 84
           ++C ++TRMD+R F IL  LLR + GL   + VDVEEMVA+FLH++AHDV          
Sbjct: 59  LVCRQSTRMDRRTFAILCHLLRNVAGLSSTEIVDVEEMVAMFLHVLAHDVKNRVIQREFV 118

Query: 85  ---DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRWRWFQNCLGAFDGTHIKVNVS 144
              + V    NIVL A+LRLYE+L+++  PVT NC D RW+ F+NCLGA DGT+IKVNV 
Sbjct: 119 RSGETVSRHFNIVLLAVLRLYEELIKRPVPVTSNCNDQRWKCFENCLGALDGTYIKVNVP 178

Query: 145 AADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG 204
           A DRP +RTR GEI TNVL VC+   +F++V  GWEGS ADSR+LR+ +SR  GL+VP+G
Sbjct: 179 AGDRPTFRTRKGEIVTNVLGVCDTKGDFVYVLAGWEGSAADSRILRNAISRENGLQVPKG 238

Query: 205 ------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQL 239
                       N PTN +E FNM HSSA NVIERAFG+LKGRWAILRGKSYY  Q+
Sbjct: 239 QWYHLQEWRGATNAPTNAKEYFNMKHSSARNVIERAFGVLKGRWAILRGKSYYPLQV 295

BLAST of Cla97C05G096795 vs. NCBI nr
Match: KAA0047510.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 266.5 bits (680), Expect = 4.4e-67
Identity = 136/254 (53.54%), Positives = 168/254 (66.14%), Query Frame = 0

Query: 25  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-------- 84
           ++C E+TRMD+R F IL  LLRT+ GL   + VDVEEMVA+FLHI+AHDV          
Sbjct: 59  LVCRESTRMDRRCFAILCHLLRTIAGLTSTEVVDVEEMVAMFLHILAHDVKNRVIQREFM 118

Query: 85  ---DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRWRWFQNCLGAFDGTHIKVNVS 144
              + +    N+VL A++RL+E+LL+K +PV   CTD RWRWF+NCLGA DGT+IKVNV 
Sbjct: 119 RSGETISCHFNMVLLAVIRLHEELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVP 178

Query: 145 AADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG 204
           A+DR RYRTR GE+ATNVL VC++  +F++V  GWEGS ADSR+LRD +SRP GLKVP+G
Sbjct: 179 ASDRARYRTRKGEVATNVLGVCDRKGDFVYVLAGWEGSAADSRILRDALSRPNGLKVPKG 238

Query: 205 ---------------------------------NPPTNLREIFNMIHSSAWNVIERAFGM 235
                                            N P+  +E FNM H SA NVIERAFG+
Sbjct: 239 YYYLVDAGYPNPEGFLAPYRGQRYHLQEWHGPENAPSTSKEFFNMKHPSARNVIERAFGV 298

BLAST of Cla97C05G096795 vs. NCBI nr
Match: KAA0035620.1 (retrotransposon protein [Cucumis melo var. makuwa] >KAA0038121.1 retrotransposon protein [Cucumis melo var. makuwa] >KAA0041646.1 retrotransposon protein [Cucumis melo var. makuwa] >KAA0047363.1 retrotransposon protein [Cucumis melo var. makuwa] >KAA0048378.1 retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 263.1 bits (671), Expect = 4.9e-66
Identity = 135/254 (53.15%), Positives = 167/254 (65.75%), Query Frame = 0

Query: 25  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-------- 84
           ++C ++TRMD+R F IL  LLRT+ GL   + VDVEEMVA+FLHI+AHDV          
Sbjct: 59  LVCRQSTRMDRRCFAILCHLLRTIAGLTSTEVVDVEEMVAMFLHILAHDVKNRVIQREFM 118

Query: 85  ---DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRWRWFQNCLGAFDGTHIKVNVS 144
              + +    N+VL A++RL+E+LL+K +PV   CTD RWRWF+NCLGA DGT+IKVNV 
Sbjct: 119 RSGETISRHFNMVLLAVIRLHEELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVP 178

Query: 145 AADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG 204
           A+DR RYRTR GE+ATNVL VC+   +F++V  GWEGS ADSR+LRD +SRP  LKVP+G
Sbjct: 179 ASDRARYRTRKGEVATNVLGVCDTKGDFVYVLAGWEGSAADSRILRDALSRPNRLKVPKG 238

Query: 205 ---------------------------------NPPTNLREIFNMIHSSAWNVIERAFGM 235
                                            N P+  +E FNM HSSA NVIERAFG+
Sbjct: 239 YYYLVDAGYPNAEGFLAPYRGQRYHLQEWRGPENAPSTSKEFFNMKHSSARNVIERAFGV 298

BLAST of Cla97C05G096795 vs. NCBI nr
Match: KAA0062547.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 262.3 bits (669), Expect = 8.4e-66
Identity = 133/230 (57.83%), Positives = 164/230 (71.30%), Query Frame = 0

Query: 25  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDV------WL-- 84
           ++C ++T MD+R F IL  LLR + GL   + VDVEEMVA+FLH++AHDV      W   
Sbjct: 32  LVCWQSTLMDRRTFAILCHLLRNVAGLSSTEIVDVEEMVAMFLHVLAHDVKNRVIQWEFV 91

Query: 85  ---DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRWRWFQNCLGAFDGTHIKVNVS 144
              + V    NIVL A+ RLYE+L+++  PVT NC D RW+ F+NCLGA DGT+IKVNV 
Sbjct: 92  RSGEIVSRHFNIVLLAVFRLYEELIKRHVPVTSNCNDQRWKCFENCLGALDGTYIKVNVP 151

Query: 145 AADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPR- 204
           A DRP +RTR GEIATNVL VC+   +F++V  GWEGS ADSR+LRD +SR  GL+VP+ 
Sbjct: 152 AEDRPTFRTRKGEIATNVLEVCDTKGDFVYVLAGWEGSAADSRILRDALSRENGLQVPKE 211

Query: 205 ----GNPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQL 239
                N PTN +E FNM HSSA NVIERAFG+LKGRWAILRGK YY  Q+
Sbjct: 212 WRGAANAPTNAKEYFNMKHSSARNVIERAFGVLKGRWAILRGKLYYPLQV 261

BLAST of Cla97C05G096795 vs. ExPASy TrEMBL
Match: A0A6J1DQX7 (uncharacterized protein LOC111022315 OS=Momordica charantia OX=3673 GN=LOC111022315 PE=4 SV=1)

HSP 1 Score: 276.9 bits (707), Expect = 1.6e-70
Identity = 150/295 (50.85%), Positives = 186/295 (63.05%), Query Frame = 0

Query: 234 YSKQLLNVVQANGLEGYLNGTVPVPSKYLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSE 293
           +  QLLN V ANGL GYL+GT+  P ++LD   +Q N  +  WE YN  +MCWIYSS SE
Sbjct: 42  WKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSE 101

Query: 294 EKMGEIVSLDTAANIWNSLR----------------------------SQYLSQIKEVAD 353
           EKMGE+VSL+T  +IW+SL                             SQYL++IKE+AD
Sbjct: 102 EKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIAD 161

Query: 354 KFSAIGELISYRDHLAHILDSLGSEYNAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQN 413
           KF+A+GE +SYRDHLAH+LD LGSEYNAFVT I N +D+PS+EDVRSLLLAYEA L+KQN
Sbjct: 162 KFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQN 221

Query: 414 VVDQLNVARANFSKLSLQHNSKR-NSSWSFPNPSSSASLRPFSLVFNLPAFNQPNPNIP- 473
            VDQLN+A+AN   LSLQHNSKR    +SFPN                  +    PN P 
Sbjct: 222 TVDQLNIAQANLVNLSLQHNSKRPPPKFSFPN-----------------HYKHSFPNSPI 281

Query: 474 -----TSVLGRPQFFPKWPPKPFSSKPQCQICHKFGYTAPNCHHLASLAHQSIPP 494
                 S+LG+PQ   KWPPKP SSK QCQIC K G++A  C+H  ++A+ +  P
Sbjct: 282 SAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLGHSAAVCYHRTNIAYHNASP 319

BLAST of Cla97C05G096795 vs. ExPASy TrEMBL
Match: A0A5A7TWH8 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold498G001380 PE=3 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 2.1e-67
Identity = 136/254 (53.54%), Positives = 168/254 (66.14%), Query Frame = 0

Query: 25  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-------- 84
           ++C E+TRMD+R F IL  LLRT+ GL   + VDVEEMVA+FLHI+AHDV          
Sbjct: 59  LVCRESTRMDRRCFAILCHLLRTIAGLTSTEVVDVEEMVAMFLHILAHDVKNRVIQREFM 118

Query: 85  ---DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRWRWFQNCLGAFDGTHIKVNVS 144
              + +    N+VL A++RL+E+LL+K +PV   CTD RWRWF+NCLGA DGT+IKVNV 
Sbjct: 119 RSGETISCHFNMVLLAVIRLHEELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVP 178

Query: 145 AADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG 204
           A+DR RYRTR GE+ATNVL VC++  +F++V  GWEGS ADSR+LRD +SRP GLKVP+G
Sbjct: 179 ASDRARYRTRKGEVATNVLGVCDRKGDFVYVLAGWEGSAADSRILRDALSRPNGLKVPKG 238

Query: 205 ---------------------------------NPPTNLREIFNMIHSSAWNVIERAFGM 235
                                            N P+  +E FNM H SA NVIERAFG+
Sbjct: 239 YYYLVDAGYPNPEGFLAPYRGQRYHLQEWHGPENAPSTSKEFFNMKHPSARNVIERAFGV 298

BLAST of Cla97C05G096795 vs. ExPASy TrEMBL
Match: A0A5A7UUT3 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold98G00380 PE=3 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 2.1e-67
Identity = 134/237 (56.54%), Positives = 166/237 (70.04%), Query Frame = 0

Query: 25  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-------- 84
           ++C ++TRMD+R F IL  LLR + GL   + VDVEEMVA+FLH++AHDV          
Sbjct: 59  LVCRQSTRMDRRTFAILCHLLRNVAGLSSTEIVDVEEMVAMFLHVLAHDVKNRVIQREFV 118

Query: 85  ---DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRWRWFQNCLGAFDGTHIKVNVS 144
              + V    NIVL A+LRLYE+L+++  PVT NC D RW+ F+NCLGA DGT+IKVNV 
Sbjct: 119 RSGETVSRHFNIVLLAVLRLYEELIKRPVPVTSNCNDQRWKCFENCLGALDGTYIKVNVP 178

Query: 145 AADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG 204
           A DRP +RTR GEI TNVL VC+   +F++V  GWEGS ADSR+LR+ +SR  GL+VP+G
Sbjct: 179 AGDRPTFRTRKGEIVTNVLGVCDTKGDFVYVLAGWEGSAADSRILRNAISRENGLQVPKG 238

Query: 205 ------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQL 239
                       N PTN +E FNM HSSA NVIERAFG+LKGRWAILRGKSYY  Q+
Sbjct: 239 QWYHLQEWRGATNAPTNAKEYFNMKHSSARNVIERAFGVLKGRWAILRGKSYYPLQV 295

BLAST of Cla97C05G096795 vs. ExPASy TrEMBL
Match: A0A5D3BDX0 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1112G00360 PE=3 SV=1)

HSP 1 Score: 263.1 bits (671), Expect = 2.4e-66
Identity = 135/254 (53.15%), Positives = 167/254 (65.75%), Query Frame = 0

Query: 25  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-------- 84
           ++C ++TRMD+R F IL  LLRT+ GL   + VDVEEMVA+FLHI+AHDV          
Sbjct: 59  LVCRQSTRMDRRCFAILCHLLRTIAGLTSTEVVDVEEMVAMFLHILAHDVKNRVIQREFM 118

Query: 85  ---DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRWRWFQNCLGAFDGTHIKVNVS 144
              + +    N+VL A++RL+E+LL+K +PV   CTD RWRWF+NCLGA DGT+IKVNV 
Sbjct: 119 RSGETISRHFNMVLLAVIRLHEELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVP 178

Query: 145 AADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG 204
           A+DR RYRTR GE+ATNVL VC+   +F++V  GWEGS ADSR+LRD +SRP  LKVP+G
Sbjct: 179 ASDRARYRTRKGEVATNVLGVCDTKGDFVYVLAGWEGSAADSRILRDALSRPNRLKVPKG 238

Query: 205 ---------------------------------NPPTNLREIFNMIHSSAWNVIERAFGM 235
                                            N P+  +E FNM HSSA NVIERAFG+
Sbjct: 239 YYYLVDAGYPNAEGFLAPYRGQRYHLQEWRGPENAPSTSKEFFNMKHSSARNVIERAFGV 298

BLAST of Cla97C05G096795 vs. ExPASy TrEMBL
Match: A0A5A7V6H4 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold79G00260 PE=3 SV=1)

HSP 1 Score: 262.3 bits (669), Expect = 4.0e-66
Identity = 133/230 (57.83%), Positives = 164/230 (71.30%), Query Frame = 0

Query: 25  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDV------WL-- 84
           ++C ++T MD+R F IL  LLR + GL   + VDVEEMVA+FLH++AHDV      W   
Sbjct: 32  LVCWQSTLMDRRTFAILCHLLRNVAGLSSTEIVDVEEMVAMFLHVLAHDVKNRVIQWEFV 91

Query: 85  ---DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRWRWFQNCLGAFDGTHIKVNVS 144
              + V    NIVL A+ RLYE+L+++  PVT NC D RW+ F+NCLGA DGT+IKVNV 
Sbjct: 92  RSGEIVSRHFNIVLLAVFRLYEELIKRHVPVTSNCNDQRWKCFENCLGALDGTYIKVNVP 151

Query: 145 AADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPR- 204
           A DRP +RTR GEIATNVL VC+   +F++V  GWEGS ADSR+LRD +SR  GL+VP+ 
Sbjct: 152 AEDRPTFRTRKGEIATNVLEVCDTKGDFVYVLAGWEGSAADSRILRDALSRENGLQVPKE 211

Query: 205 ----GNPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQL 239
                N PTN +E FNM HSSA NVIERAFG+LKGRWAILRGK YY  Q+
Sbjct: 212 WRGAANAPTNAKEYFNMKHSSARNVIERAFGVLKGRWAILRGKLYYPLQV 261

BLAST of Cla97C05G096795 vs. TAIR 10
Match: AT5G28950.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41980.1); Has 448 Blast hits to 446 proteins in 74 species: Archae - 0; Bacteria - 0; Metazoa - 31; Fungi - 21; Plants - 396; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 80.1 bits (196), Expect = 5.5e-15
Identity = 33/70 (47.14%), Positives = 47/70 (67.14%), Query Frame = 0

Query: 115 WFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVAD 174
           +F++C+GA D THI   VS    P +R R G+I+ N+L  CN   EF++V +GWEGS  D
Sbjct: 21  YFKDCVGAIDDTHIFAMVSQKKMPSFRNRKGDISQNMLAACNFDVEFMYVLSGWEGSAHD 80

Query: 175 SRVLRDVVSR 185
           S+VL D ++R
Sbjct: 81  SKVLNDALTR 90

BLAST of Cla97C05G096795 vs. TAIR 10
Match: AT1G43722.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G28730.1); Has 924 Blast hits to 912 proteins in 109 species: Archae - 0; Bacteria - 0; Metazoa - 222; Fungi - 31; Plants - 661; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 76.6 bits (187), Expect = 6.1e-14
Identity = 70/254 (27.56%), Positives = 103/254 (40.55%), Query Frame = 0

Query: 27  CHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAH-----DVWL----- 86
           C +  RM    FT L  +L+T   L+P   + +EE VA+FL I  H     DV L     
Sbjct: 66  CLQLLRMSLPCFTTLCNMLQTNYDLQPTLNISIEESVAMFLRICGHNEVYRDVGLRFGRN 125

Query: 87  -DNVQALQNIVLNAILRLYEDLLR-----KLEPVTKNCTDDR--WRWFQNCLGAFDGTHI 146
            + VQ     VL A   L  D +R     +L  + +    D+  W +F   +GA DGTH+
Sbjct: 126 QETVQRKFREVLTATELLACDYIRTPTRQELYRIPERLQVDQRYWPYFSGFVGAMDGTHV 185

Query: 147 KVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSR---- 206
            V V    +  Y  R    + N++ +C+    F +++ G  GS  D+ VL+         
Sbjct: 186 CVKVKPDLQGMYWNRHDNASLNIMAICDLKMLFTYIWNGAPGSCYDTAVLQIAQQSDSEF 245

Query: 207 PI-----------------GLKVP-----------------RGNPPTNLREIFNMIHSSA 225
           P+                 GL  P                  G  P N  E+FN  H+S 
Sbjct: 246 PLPPSEKYYLVDSGYPNKQGLLAPYRSSRNRVVRYHMSQFYYGPRPRNKHELFNQCHTSL 305

BLAST of Cla97C05G096795 vs. TAIR 10
Match: AT5G35695.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41980.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 56.2 bits (134), Expect = 8.5e-08
Identity = 40/116 (34.48%), Positives = 52/116 (44.83%), Query Frame = 0

Query: 161 FIFVFTGWEGSVADSRVLRDVV----------SRPIGLKVP-RG------------NPPT 220
           FI+V +GWEGS  DSRVL D +          +  +    P RG              P 
Sbjct: 25  FIYVLSGWEGSAHDSRVLSDALRKFYLVDCGFANRLNFLAPFRGVRYHLQEFAGQRRDPE 84

Query: 221 NLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKS--YYSKQLLNVVQANGLEGYL 252
              E+FN+ H S  NVIER FG+ K R+AI +      Y KQ   V+    L  +L
Sbjct: 85  TPHELFNLRHVSLRNVIERIFGIFKSRFAIFKSAPPFSYKKQAGLVLTCAALHNFL 140

BLAST of Cla97C05G096795 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 47.0 bits (110), Expect = 5.1e-05
Identity = 37/175 (21.14%), Positives = 73/175 (41.71%), Query Frame = 0

Query: 261 YLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSEEKMGEIVSLD-TAANIWNSLRS----- 320
           ++D  +    +    W+  +  +  WIY + ++  +  I+ +  TA ++W SL +     
Sbjct: 52  HIDGSSTPTPMTEKRWKERDGLVKMWIYGTITDSLLDTIIKVGCTARDLWLSLENLFRDN 111

Query: 321 -----------------------QYLSQIKEVADKFSAIGELISYRDHLAHILDSLGSEY 380
                                  +Y  ++K ++D  + +   IS R  + H+L+ L  +Y
Sbjct: 112 KEARALQFENELRTTTIDDLSVHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKY 171

Query: 381 NAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQNVVDQLNVARANFSKLSLQHNS 407
           +  +  I++ S  PS  + RS+LL  E+ L             +N SK SL H +
Sbjct: 172 DYILNVIKHKSPFPSFTEARSMLLMEESRL-------------SNKSKSSLSHTN 213

BLAST of Cla97C05G096795 vs. TAIR 10
Match: AT4G10890.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439 (InterPro:IPR018838); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 44.7 bits (104), Expect = 2.6e-04
Identity = 20/54 (37.04%), Positives = 29/54 (53.70%), Query Frame = 0

Query: 192 RGNPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQLLNVVQAN 246
           RG PP  ++E+FN  H    +VI+R FG+ K +W IL          +NV + N
Sbjct: 124 RGGPPVTVQELFNRKHLDLRSVIDRTFGVWKAKWRILDHTCKNVNIAINVTKTN 177

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022155181.1  3.3e-70  50.85         uncharacterized protein LOC111022315 [Momordica charantia]
KAA0058874.1    4.4e-67  56.54         retrotransposon protein [Cucumis melo var. makuwa]
KAA0047510.1    4.4e-67  53.54         retrotransposon protein [Cucumis melo var. makuwa]
KAA0035620.1    4.9e-66  53.15         retrotransposon protein [Cucumis melo var. makuwa] >KAA0038121.1 retrotransposon...
KAA0062547.1    8.4e-66  57.83         retrotransposon protein [Cucumis melo var. makuwa]

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1DQX7      1.6e-70  50.85         uncharacterized protein LOC111022315 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A5A7TWH8      2.1e-67  53.54         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5A7UUT3      2.1e-67  56.54         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3BDX0      2.4e-66  53.15         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7V6H4      4.0e-66  57.83         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...

vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT5G28950.1     5.5e-15  47.14         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G43722.1     6.1e-14  27.56         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G35695.1     8.5e-08  34.48         CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G48050.1     5.1e-05  21.14         CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
AT4G10890.1     2.6e-04  37.04         unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439...
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                               Source   Source Term      Source Description       Alignment
IPR027806  Harbinger transposase-derived nuclease domain PFAM     PF13359          DDE_Tnp_4                coord: 124..180; e-value: 2.4E-7; score: 30.6
None       No IPR available                              PFAM     PF14223          Retrotran_gag_2          coord: 315..379; e-value: 3.3E-9; score: 36.6
None       No IPR available                              PANTHER  PTHR22930        UNCHARACTERIZED          coord: 193..236
None       No IPR available                              PANTHER  PTHR22930:SF211  NUCLEASE HARBI1-RELATED  coord: 193..236
None       No IPR available                              PANTHER  PTHR22930:SF211  NUCLEASE HARBI1-RELATED  coord: 24..193
None       No IPR available                              PANTHER  PTHR22930        UNCHARACTERIZED          coord: 24..193
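
The alignment coordinates can be compared directly to see how the annotated regions relate to one another. The sketch below computes the length of each region and tests whether one interval lies inside another. It assumes the coordinates are 1-based and inclusive; `length` and `contains` are hypothetical helper names.

```python
# (source term, start, end) taken from the InterPro annotations above.
# Coordinates are assumed to be 1-based and inclusive.
FEATURES = [
    ("PF13359", 124, 180),
    ("PF14223", 315, 379),
    ("PTHR22930", 193, 236),
    ("PTHR22930", 24, 193),
]


def length(start, end):
    """Number of residues in a 1-based, inclusive interval."""
    return end - start + 1


def contains(outer, inner):
    """True if interval `inner` lies entirely inside interval `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


for name, start, end in FEATURES:
    print(name, f"{start}..{end}", length(start, end))

# Is each Pfam domain covered by the PANTHER match at 24..193?
print(contains((24, 193), (124, 180)))
print(contains((24, 193), (315, 379)))
```

Under these assumptions the DDE_Tnp_4 domain (57 residues) falls inside the PANTHER match at 24..193, while the Retrotran_gag_2 domain (65 residues) lies outside both PANTHER ranges.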

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C05G096795.1  Cla97C05G096795.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005488      binding