Cla97C10G191300 (gene) Watermelon (97103) v2.5

Overview
NameCla97C10G191300
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionNYN_YacP domain-containing protein
LocationCla97Chr10: 10760371 .. 10771260 (+)
RNA-Seq ExpressionCla97C10G191300
SyntenyCla97C10G191300
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTGTGGCTGCTTTAGCGACAGCGAAGCCATCGTCTTCCTCTTTTTCACCAAAAACCATACGAAAATTAACCGTGTAACCGGAGAAGCTGAGACGGAGCAATTGGTGGTGGAGCTCAATCGGAGAAAATTCCCGTTACTATGTGAATCGCAGGGACCGGAGCGAACCCTTCCGCAAAGTGAAAGTCTCTGTTGTCATGGAGTTCTTTTCTTTAACTTGAATCCATGGAGGTGGTCGGATCAGTGAAACTGTTTTCATCGTCACCGGCGAACACGGTTTCAGGCTTGAGTTACTCTCCCTCATCTTATTCTTCTTCCTTTTCTCCCAAGAAGAAGAAGAAGAAGCTGCTTGTAGTGTCGAAGAGCAAAAAGCAGCCTCAAACTTCATCGGTGATTATGCTTCAAGCTATTTTCTTTTTCATGCTCAATTTTGTTTCAGAAGTAGCTCGCTAGGTCTTCTGCCTTGCCGGACCAGAGAGATATCCTCATTCTGTAGTCTGTACATGCTTTAATAACTTGGTTATAATGTGCAGTTAAAGTTTCTGATATCATTGCTTATTGTTAAACACAATGTTCCGCCATTTTTGTGGATAGCCATTGTTGGAGTCCGTAGGTCCGCTAGAAATGCGTGAAAGTTCGACTATGTTAACGGATAGATGTACAATTTTGTGCAAAACCTAACTGGCTTCCAGCTGCAGGGTGCTTCTGATCCTCCTCCGAGAATTACGTCAAATCTGAAGCAGAATTTGCAGTTTCTGAGATTATGGAAGGTATGTTATTTGGTTCTGTTGCAACCTTTTCAACAAATTCGGGTATTTATAGCTCTCTCTCATATTACAAAGACTTGTAGGAGTTTCAAAAGAGAAAATCTGGCGCCCCTAAGCCTGCTACTAGTTACCGGAAGAAGAAGGTAGAGAAGGAGGACCTCCCAGGAGATACGGAGCTTTATCGTGATCCCACATTGGCTCTTTACTAGTACATCTCTGATTGACAAACTTGTTTTCATGACATTTTGTCTTTCAACCATGACATAACCCCAAAGTTGGTGGGAGAAATGCTGCTCTTAATGGCTTTTATTTTCCTGACAGTACAAACCAGGGCATAGACAACGCGGTCCCTGTCTTGCTGGTTGATGGTTATAATGTGTGTGGCTACTGGGTGAAACTGAAGAAACATTTTATGAATGGGAGACTTGACATAGCTCGCCAAAAGCTAATTGACGAGCTTATTACATTCAGTATGCTGAGAGGTTTGCAGTTTGCCCCTTCTTTATCCTCTAAAAAACGAGGCCTGAACCTTTGGGTTATAGACTTTTAGGTTTATGGTGTAGATTTTGGTTGATGTTTAGCAATTTTTATAGGAACTTGATTTTTAGTTGCCATTGTCCCTGATTACATACTTGCAACATTTACAGAGGTCAAAGTGGTAGTTGTATTTGATGCCTTGCAGTCTGGACTCCCAACACACAAGGAAAACTTTGCTGGGTACTTCCTATGATCTAAAATGTTAGCCTCGTTAATCTATATAATCTTCAGTTTGCTGCTAATCTTCTGCCAGTAGATGTGCATTTGGGATTAGGCTTTCCTATGTTTGCTATTTGCAAGCTTAATGACTTGTAAATAAATTTGAAGATGCAGAACAAATGGGAGATGATGAATTAGTGGCATAAGAAAAAGGGATATGTCTATGGGTGATATAGTTCGTGTAGATGGTTGATGAAACATAACCCAGCTTTTCTGCAGCATTAGAAACAACCTTTGACCAATTTCAGCTAGAAGAAAGAGTGAAATAGCCCCTTATCTCCTCGCTTAGGCCATTTTTCCTAAAAAAATTCAATAAGTGCACAATGACAGAAGATGGTTCATATTTGTTTAGTATGTTAACAGTAAATATATTTAGATGATGTCGGACAAATTAGAATTCAATACCTTAAAAAAAGATGATTTGTACTATGGTTTGAAGTTTCCATTGCTCTTTAAAGTTCTAAGCTGGAAATTTAAACAATTGGGGATGTTTAGGCCACCAACTTCAAGTTGGTGGAGTAGGCTATTATAACCTACCTCATGTTTGGGACATCAATTATAATAGTTGGTGTTTCCAACTATTATAATTTACAATTAACTACAATATTATTATTTCAAATCCCTCTTTGCTGCAGTGTTAACTATTTGTTACAGTGTTTACTATTTTCAACTCATTATCTTTTCCTCCTTGCTATAGTGTTTACTATTTCCTATTTAGAATGAAATAATCTATTCCTCAAACACATACTATTATAACTCATACTAAAATAGTCTGCACCCTGAACTCGGACTATTATAACCTAAAGAATATTATAATCCATAAACTATAATATCCACTGACTATAATAACCAACTCAACGCTCCAAATGCCCCCTTTCAGTTTTAAGACTTCAAATATCTTCTTGACCAAGAGGATGTGGATATAGCATACTCTCGCTAAGGGCTTTGGAAAATTTTCCTGGAATGTAATATTGATGGTCAGTCTCATATCAATGGGGCTTGTCTTTTTGAGGAACTTCTTGATGGGCATTTCCAGTTTCCTTTTGGGGCAAGTCTTTGTTGAAAATTAAACATGTTTTTATTAGATAAGTTATAGATGAATTAAAGGATGATAATTAATTATGTTAATATAAGAGTTAATTTAGAGTTTCATGAATTGAAATATGGGGTTAATATAAGTTTCATGCATTTAAGTATGAGAGTTATTGACCCATGAGAATCCCATAAATAGGATTGTTTTGTATTGTTTAGTGCAACCAAGTGTGTAGAAAAGAATACACAGAGAATTCAATAAAAGTGAGAATCAAGTCAGAAGAGTTTTTTCAATTATAAGTGTTTTGTTTTTCTCTCAAATACTCTTTGTTGAATATTATTTGTGAGCCCATCAGATGCTTCCAACAAAATGGTATCAAAGCCTGAGTTTGAAAAACAAAAATTGTTTTTTGCAAATGGCAAATAACAATTTGGTTCCCTTCCAAGTTCCTCGACTTATGGAAGAAAATTATAGCAGCTAGTGTATTCGTATGAAAGCTCTACTTGGTTGGCAAGATGCTTGGGACATTGTTGATAATGGTTATGAAGAACCAGAAAAAATGACGCAGCGTTAAATCAAGCTCAACGAGAAGCTTTGCAAAATACAAGAAAGAAAGACCAAAAGGCTCTCACCATCATTCATCAAGCCATTGATGATCCAAATTTTAAGAAGATTTCCGGAGCAACTATTGCACATCAAGCATGGCAAATATTGGAGAATACATATAAAGGAGTAGATCGAGTCAAGAAGGTTCACCTCCAAAGATTGTGAGGTGATCATGAATCACTACATATGAAGGTATTTGAATTGATTTTAGATTATACTTAAAGATTACTAGCAATAGTGAATGAAATGAAGTGATATGGTGAGAGAAAAAATGATGAGCAAGTAGTAGAAAAGATACTTCGCTCCTTGGATGAAAAATTCAATTTCATTGTTGTAGCTATTGAAGAATCAAAAGATTTGAGTACAATGTCCGTTGATCAATTTATGGGTACTTTACAAGCTCATGAAGAGATGCTTCTCAAAAAGAACAAACAGACAGCTGAGCAACTTTTTCAGTCAAAGTTGCAGTTGAGAGATAAGAAGATAGCCAAGACAAAGGTAATCGAGGTCGAGGACATGGTAACAATCGTGGACGTGGTGATTTTAGAGGTCGAAGTCGAGGAATTACTATGGTCAAAGAAAATTTGATGAGAGTAATTCAAACTCTAAGTCATCAAGAGGTCGTGGAAGACAAAATTATTCGAGGTCAAATGAAGAAAGATCAAATAATGCAAAAATAAAGTTGAATGTTATCGTTGCCATAAATTTGGCCATTGTTCTTGGGAATGCAAAAATAAAGTTGAAGAAAATGCAAATTATGTTGAGATAGATGAAGAAAGCAGTGAATCATCTTTGCTTCTAGCATTCAAAGGTGAAGAAACATGTGAAAACAATGCATGATATCTTGATAGCGGTGCAAGCAATCACATGTGTGAAAGTAAGTCGATGTCTGGAACTTGATGAATCTGTTGGTGGTAATATCATATTTGGTGATGCCACAAAAATTCATGTTAAAGGAAAAGGGAAGATTTTGATCAACTTGAAAAATGGAAAACATGAGTTTATCTCTAATGTTTATTATGTGCCTGATATGAAGAACAACATTTTGAGTTTGGGACAACTCCTAGAAAAAGGCTATAATATTTTGATGAAGGATTATAGTCTTTCAATAAGAGATAATCGGGACAACTCCTAGAGAAAGGCTATAATATTTTTTTTTCCTGATTATTTTGATTTGATATTTCAAATTTGCTTAATTAAAAAGAATATAGAATTTTCCTTTTTATATTTAATTAAAAAATATTTCTCTGCGGATGTTCTTTGATGTGTGCTGAAATGTTTACCCTTGGAAAATAAGCATTTATTGTGAGAATGAACTATGCTATACCTCATGTTATGTAAGGTCTCTTTTATGACCAGCTCACTCTCTCAGACCTTTATTTTTTCACGTGGCTTGTTACTCTGCAGTCTGCATACTTAGCAGTCATTACTCAATAGGAAATAAATAAATTTATTTATTGGCTCACTTAAGGCTTTTAATATGGTAGAATTGTTCTTGCAGAATCGATGTGGTTTATTCAGGTGAGACGTGTGCAGATACGTGGATTGAAACAGAGGTAAGATTAAGCTTCATGAATAAGATCCTGTAACAGTGGTAAAAAACAATAGAATAGAGTGTACTTCTCTATGCACGTGTCTCTGCTAGACTTATTTTGCTGGGAAGCAAGGGTTTCAGCCATATCGTGCTTGTCTAGAAAACTTATGGATGAGGCTCTGTGCATTTCAATTTGTAATATATAGAGATGTTTTCCGGTCCTTCACTTTTGGCAATGTCAGATTATTGATTCCATGGTATCGCATTTTCATTTTAAAAAGCAACTTGAATGAGAAACAACTTGTATCAATTTATTATTAAATTTTCTCTGAACCACATGCTTGATGTCGTTGATCTTAAATTTTGAAACTCCTGTGGTTTTCTTTATAATTATCTACATCTTTTTTGAGTTGCAGGTTGTAGCCCTGAAGGAGGATGGATGTCCCAAAGTTTGGGTAGTGACTTCTGATGTCTGTCAGCAGCATGCAGCCCATGGAGCAGTATGATCTCCTCATTGTTTGGGCATCACTGTATTATGCATTAACTTTTGCAGATATGGTCAGAGTATAATATTTTTATCTTTTATCTTCTAATGAAATTGATTCATAACAGGGAAGAATTTGTGGTCTGATCTGTTAAGGAAGGATTTCTTCTCATCAGTAACTTAGATTTTTATCTTTGGTATTTGTTAATTTTTATCTTTATCTTGTTTTTATTTCGCATGAGGTGGTTTTCTTAAGTGAGTCTTTTATTTAGAAGTTTTAGTTATTGATCTAATGTAAACCATCAATTATTGTCCTTACCAGAAAGAGAGAGAGAGAATGAATATAAAGGCTATCGTTGATTTGTCTGGTGGTCTAAAGTGGCAACCTATAAATTTAGATGACTTTGAGGGCATAGATCCGAACCATGCTAAGTCACCTAGTTGAGATGTTAATTACACATTATCATAATTTTTTGGGCTCCTATCATAAGTTTCTCTGACACTCATAGTTATTACTTATTTGTATATATTGCTGCTCCAATGTTGAAACAGAATATAATGCTTCACTGTGCTTTCTTTTCATCTATATGGTTTTGATTTACTTCTGTATTTCAAGTTTCTTTTTATTATTATTATTATTATTATTATTATTATTTTCTCTTTGAAGTGTTTAGTTGGGATGGTTAATTCATGTAAGTTCGCATATATAACTTTTGGTTGTTTAGACCAATATGCTGTTTATGGCTGAAAGGGCTGTATTCTGTTACTTAATTTTCAGGGAGCCTTTATTTGGAGTTGCAAGGCATTAGTTTCGGAGGTGAGAATAGTCGAATTTTATTTGGTGCATATTGATAACATGGTAAAAAACATATGATCTTTTTCCTGTGTAAACATTAATTCTAGGATGCCACCGTCTTTAGTTCACTATATTATAGACATTAGTCTTCAATAATTTTTCTTTTTGTTATTGTTTTCTCTATTCGGATGAAATGAATTTAGAATTCACTTTATGGCATTATTTCTTAAGCCAATTTATCCTTTCTTTTCGGTTAACAATTGGTGACCTAAATTCATATACACGTACACATGTATTCTCAATCTGTCTCAATCTATGGAAAGAGAAGAGTTCTATTTTTGAAGTTGTTATGAATTCTCTCGATGTGGGGGCACAGAAGAGGAAAGAACAAGGGGTACCACAAAATGAAGCCCTAAACAGTAAAAGAGGCCAAGTCCCAACTTTGCAGCCAAAATTAAAGACATGGTTTTCATTAATTAATGTAAACCCTTGAAACTCCTTTTTTTTTTCTTTTGGACCAATGTTGAGTAGATTGTCCAACATTGTAGTTAACATGAGGTGCACTGCAAGTGATGATTTACGCTTAAAGGGCGTGCTCAAAGGTTTGGACTCGACCTGTTCCCCTTGCAATATTGCAACGAGGTGTACTTACCTATTTGACGTTTAGGTCATAACATAATTTTTGTATATTCGTCTAATGGTGGATAATGTTGATAGGAATCGAAAATAGAATTTGGAGAAAGAAAGATAATATGGGAATCTTATTCAATTCATCAAAGAACCCAAAAGTCCAGGTACAATTCTCAGTCTCCTCAGAGAGCTAAGCCTCCTCTCAAAGCATTCACTACAAAAAGCATACCCCCGATTCCCCTTCACTCGTGCTTCTATAGACCCTCTCTAACTAACCATGGGGCCCATCCTTTCACTATTGGATGCTGATGGCACACTACCCATTGCATAGGCAATTCCCTTTTTACCCCTCCTATGATATTTTAATAATACTAGTGGTCTATCAAATGTCTTAAAATACCTATATTTGGATCATTGCATGAGCATCTGCTTTAGGGCTTCACATGGATTAGACTACAGTTCATTTTGGCACAGCCGACCTTTATTATTCTTGTAGGGCTTTGGTTATAAATGGATGATGTGGATGTGGATGTGGATGTGGAGTTGTATGAGGAACGTGAATTTCTCTTTTCTCATTAATGGGAGTCCTAGGGGTAAAGTTTTGGCCTCAAGAGTTCTAAGACAGAGAGACCCTATATCCCCCCTTCTTTTTCTTCTCATTGTGGATGTTTTAAATAGAATTGTCTCTAAGAGTGTTGTTGGTAAAATCTTTCATCCTTTTAAGGTTGGAAAGGAGATTGCGACTTTCCCACCTTCAATTTGTTGATGATACTATTTTCCTTTGCTCTGGAAATGAAACGTTACGAGGCTCAATTGTGATGACGAGAAGCTTTGAAGGTGGGGCGATTTGATGAGTTGTGAAGTTGGTTGCTTTCCCTCTTCTTACCTTGGTCTTCCCCTTGGGAGTAGTACCAAATCCATTTCTTTTTGGAACCCAGTGGTGGACAAAGTTAAGAAAAGGTTGGCCTCATGGAAAAATGCCTTTCTCCAAATCTGGTAGAATGACTTTGATTTGCTTTGTGTTGAGTGGTATTCCTATTTATTACTCTTTCCTTTATAGAGCTTTGAGGGAGGTATACAAGAGTATCAAGAAGCTTATGCTTAATTTCCTTTGGAAGGGTGTATGAATGAAAAGGAAAAGAGTCAAACTTGGTTAGATGGGAGGCCATTGAGAGGTCGGTTTCCTTAGGGGTTAGAAATTGGGAATTTAAGGATTCGTAACAAAGCTCTGTTAGCCAAATGGCTTTGGCGTCTCCCCTTAAGCCCAACTCTTTATGGTATAGGATCATTGTTAGTAAGTTTGGTCCTCACCCTTTTGAATGGCTGTCGCTTGGGTTGAAGGTACTAACCTGAATGTATGGAAGGATATTTCTAAAAAGCTCTTCCTCTTATTTGACTCATAGTGTGGTGGGGGAGGGGAAGGATATGTACTTTTGTGAAGATCAGTGTGAGGAATAGACCCATCTGCTCAGTTTTTCCACGTCTCTTTCATATGTCTTCCTTTAAAAATCGCGTGGTGTCCAACATCTCCTTTAGGTGAGTCGGTTTTTTTGCTCTCTAGAGGATTAAGGTTCCTAGGTTATTAAGTTCTTTATTTGGCAAATTTTTTATGGAAGAGCAAGCACTATGAATTGGCTTGTGAGGAAGTTGCCCCCATTAGTTGGCCCGTTTTGTTGCATTCTTTGTTGGTAGTCGGAGGAAGTCTTGGATCAGATTCTTTGGGAATGTGAGTTTGCATTATCCATGTGAAATTATTTCTTTCAACTGTTTGGTTTCTCACTAGCTCCACATAGGAATACTTCTCCCTTCTAAGAGAAGGGTTGTTTTTTATGGCTTGCTAGGGTGTGTGCTTTATTATGGGTCTTTGGGGGTGAGCTGAATAACCAAGTGTTCAGAGAATTGGAGGGTGATCCTAGTGATGTGTGGTCCCTCATTAGATTTAATGTACTTTTCCTCCAGGTTTTGATTTTAAAGACCTTTTGTAATTACTCTATAGTCACTATTTTGTATAGCTCAAACTCCTTCCTTTAGTTGCGCTCCCTTTTTGTAGGCTTGATTTTCTTTTGTATCCCTTATATTCTTTCATTTTTGTTTCTCAAAGACAGTTATAGAAAGAATGAAAAAAGAAAAAAAAAAAAAAAAAGAAAAACATAATACTCATGAAAACTAGGCTTTCAGTTCTTGCACAATGAGAAGATGTTGATGGCTTCAATATTTTCTGAATGAACATGATAATGCATATAGCACGTTATAGTTTACGTTTTATTTTGTGAATGTAGATTAAAGCGTCAAAAAAGGAGGTTGAAATGATGTTGCAAGAACACAGGTATGACTATTCCAGATGCAAATTTTACAATCTGTATGCAGATAGATTATAATAGGTTAGTGTCAACCACTAAAATCTGCTCCAGTTTGTAATTTGGTCTCTAAAGTTTAAAAGCAGAGTATAATTTGGTGTTGACCCTTGAAATCTCCAATTTTTAATTTGGTCGTTCAAAAATTTTATACGAGTCCCTCAAGTTTGAATCAGTTTTATTGTTGATAAGATGAAATGACTCAATGGACATACTATAAATCACTTGCATAGAAATCAAGGATAGTAAAGATTAAGGTGTGTCCAATTGTTCGAGAAGCATGAAAGAAATTCGAAGATCATGAGATATCTTATTAATTTTTTTGAACGTTTAAGATGGTAAATATCTAACAATTAGGAAATGAGTGGTCACCTTTCTCTACCGAACTCTTTAAACCTTAGGGAATGTTTGGAGCGCTGAGTTGGCTATTATAGTCTATGAGTTATAATAGTTTTTACGTTATGATAATCTGTATTTGGGGTGCAAACTATTTTAGTCTCAGTAGGAAATAGTAAACACTGTAACAAATAGTAATACTGAAGTTAATGATGAGTTATAATAATTGAAAACAAAAAAAAACATTCATTAAATTTGAAGTTGGGGCCCAAACAACACTTGTGATCAAAATATAACTTTTTGAACTTCATTGACCAAATTAGGTATAGGTACCAAAACTATATTTTAACCAATATGATACAAATTTAGGTCACTATATCTACTGTGAGATGCTTTTTGAGTTTTTCCTTTTCAATTTTTCTGAATTTCTCCTGAGTTGGTGATCTTTCTCCTTATATCTGATAAATTATTGTTTATTTTATTCCCATCCCCAAAGTGGCAAAAGCCCGACTGTCCCATCCTGCAGGATGAACCTTGCACTTGACACCTAATTGTCTTACCAGATAATTTAAATTGCATTAGACCACCCTGGACCTTATTCATACAAACCCTTAAAATTTCCCTTACCCTCTAATTCGAGGGATCGGCGATACTGCTACTGTATGATAGGGTAAAACTGATCTCGCTTCAACTCATTTACAGACTTGGAACTTTAGCATATCGATCAATAAGAAGAGTAGTGGCAACTACTATAAATCTCCTGCTTTATTTATTTATACTTTTTTTTCAGATCGACATCTTTCCAGGGGAAACTGCTGAAGCATAATCTTGATTCAGAAGTCGTCAATGCTCTCAATGACCTAAAAAGGAAGCTCAACGAAAGTGAACCGAAGTAGATGGGAAGCTTGTTTGTAGAAATACTCTATTTTTCTAACTCCATTCTTTGTTCATTAATTTAATCCGAATCTCATGTTCATAGGCATGTCACAGTCTGTTCCGAGCTGATTAATAGAAGACAAAATAACATTGTTTCTTTTGTGTACAAATCCTCTAGACTTCTGACAGAAAATAACCTTAAGTTTGTTGGTATGGGGATGAAGATGTAGGCTCCCTAGCCTTTTTCTCAATTCCAAATGTAGGGTCTCTATGTTCCAACAGTGAAGAGAAGCATAATATTTGTGCATGGTATGCTTTGGCATATTTAAAATTATAGGAGATAAAAGAAGAGAAGCCAGCTTTCTCATTTGTCAGCATTACGTTTGTAGAATAGCTTTTTAGTTCAGCTTTGGCCCATTTTCATTAAATGAAATAGAACTACCACCACCGCATTTGAAATTCATGAAGCTAACGGGGGATAGTACTGCCATTTCTTTTTTTTAAGTACAACAATTGGGGTTTAGGGGATTTGAGCTTTCCACCTCTAAGATGAAAGGTTATGTCAATTACCGCCGAATTAAGCTTGCTTTGGCGAGTATTGCTCATTTAAGTGGTTGGGTTTATGAAATGTTATGTAAATTTTGTAACGATCGATCTTAGGCAGAAAATTTCTTGGAGATGGTACTTTTTGTTGTATTATAATGTTGGTTAGGTTATAA

mRNA sequence

ATGTGTGGCTGCTTTAGCGACAGCGAAGCCATCGTCTTCCTCTTTTTCACCAAAAACCATACGAAAATTAACCGTGTAACCGGAGAAGCTGAGACGGAGCAATTGGTGGTGGAGCTCAATCGGAGAAAATTCCCGTTACTATGTGAATCGCAGGGACCGGAGCGAACCCTTCCGCAAAGTGAAAGTCTCTGTTGTCATGGAGTGGTCGGATCAGTGAAACTGTTTTCATCGTCACCGGCGAACACGGTTTCAGGCTTGAGTTACTCTCCCTCATCTTATTCTTCTTCCTTTTCTCCCAAGAAGAAGAAGAAGAAGCTGCTTGTAGTGTCGAAGAGCAAAAAGCAGCCTCAAACTTCATCGATGTACAATTTTGTGCAAAACCTAACTGGCTTCCAGCTGCAGGGTGCTTCTGATCCTCCTCCGAGAATTACGTCAAATCTGAAGCAGAATTTGCAGTTTCTGAGATTATGGAAGGAGTTTCAAAAGAGAAAATCTGGCGCCCCTAAGCCTGCTACTAGTTACCGGAAGAAGAAGGTAGAGAAGGAGGACCTCCCAGGAGATACGGAGCTTTATCGTGATCCCACATTGGCTCTTTACTATACAAACCAGGGCATAGACAACGCGGTCCCTGTCTTGCTGGTTGATGGTTATAATGTGTGTGGCTACTGGGTGAAACTGAAGAAACATTTTATGAATGGGAGACTTGACATAGCTCGCCAAAAGCTAATTGACGAGCTTATTACATTCAGTATGCTGAGAGAGGTCAAAGTGGTAGTTGTATTTGATGCCTTGCAGTCTGGACTCCCAACACACAAGGAAAACTTTGCTGGAATCGATGTGGTTTATTCAGGTGAGACGTGTGCAGATACGTGGATTGAAACAGAGGTTGTAGCCCTGAAGGAGGATGGATGTCCCAAAGTTTGGGTAGTGACTTCTGATGTCTGTCAGCAGCATGCAGCCCATGGAGCAGGAGCCTTTATTTGGAGTTGCAAGGCATTAGTTTCGGAGATTAAAGCGTCAAAAAAGGAGGTTGAAATGATGTTGCAAGAACACAGATCGACATCTTTCCAGGGGAAACTGCTGAAGCATAATCTTGATTCAGAAGTCGTCAATGCTCTCAATGACCTAAAAAGGAAGCTCAACGAAAGTGAACCGAAGTTATAA

Coding sequence (CDS)

ATGTGTGGCTGCTTTAGCGACAGCGAAGCCATCGTCTTCCTCTTTTTCACCAAAAACCATACGAAAATTAACCGTGTAACCGGAGAAGCTGAGACGGAGCAATTGGTGGTGGAGCTCAATCGGAGAAAATTCCCGTTACTATGTGAATCGCAGGGACCGGAGCGAACCCTTCCGCAAAGTGAAAGTCTCTGTTGTCATGGAGTGGTCGGATCAGTGAAACTGTTTTCATCGTCACCGGCGAACACGGTTTCAGGCTTGAGTTACTCTCCCTCATCTTATTCTTCTTCCTTTTCTCCCAAGAAGAAGAAGAAGAAGCTGCTTGTAGTGTCGAAGAGCAAAAAGCAGCCTCAAACTTCATCGATGTACAATTTTGTGCAAAACCTAACTGGCTTCCAGCTGCAGGGTGCTTCTGATCCTCCTCCGAGAATTACGTCAAATCTGAAGCAGAATTTGCAGTTTCTGAGATTATGGAAGGAGTTTCAAAAGAGAAAATCTGGCGCCCCTAAGCCTGCTACTAGTTACCGGAAGAAGAAGGTAGAGAAGGAGGACCTCCCAGGAGATACGGAGCTTTATCGTGATCCCACATTGGCTCTTTACTATACAAACCAGGGCATAGACAACGCGGTCCCTGTCTTGCTGGTTGATGGTTATAATGTGTGTGGCTACTGGGTGAAACTGAAGAAACATTTTATGAATGGGAGACTTGACATAGCTCGCCAAAAGCTAATTGACGAGCTTATTACATTCAGTATGCTGAGAGAGGTCAAAGTGGTAGTTGTATTTGATGCCTTGCAGTCTGGACTCCCAACACACAAGGAAAACTTTGCTGGAATCGATGTGGTTTATTCAGGTGAGACGTGTGCAGATACGTGGATTGAAACAGAGGTTGTAGCCCTGAAGGAGGATGGATGTCCCAAAGTTTGGGTAGTGACTTCTGATGTCTGTCAGCAGCATGCAGCCCATGGAGCAGGAGCCTTTATTTGGAGTTGCAAGGCATTAGTTTCGGAGATTAAAGCGTCAAAAAAGGAGGTTGAAATGATGTTGCAAGAACACAGATCGACATCTTTCCAGGGGAAACTGCTGAAGCATAATCTTGATTCAGAAGTCGTCAATGCTCTCAATGACCTAAAAAGGAAGCTCAACGAAAGTGAACCGAAGTTATAA

Protein sequence

MCGCFSDSEAIVFLFFTKNHTKINRVTGEAETEQLVVELNRRKFPLLCESQGPERTLPQSESLCCHGVVGSVKLFSSSPANTVSGLSYSPSSYSSSFSPKKKKKKLLVVSKSKKQPQTSSMYNFVQNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDLPGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLIDELITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGCPKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLKRKLNESEPKL
Homology
BLAST of Cla97C10G191300 vs. NCBI nr
Match: XP_038904183.1 (uncharacterized protein YacP [Benincasa hispida])

HSP 1 Score: 577.8 bits (1488), Expect = 7.1e-161
Identity = 298/321 (92.83%), Postives = 304/321 (94.70%), Query Frame = 0

Query: 68  VVGSVKLFSSSPANTVSGLSYSPSSYSSSFSP--KKKKKKLLVVSKSKKQPQTSSMYNFV 127
           V+GSVKLFSSSPANTVSGLSYSPSSYSSSFSP  KKKKKKLLVVSKSK+QPQTSS     
Sbjct: 3   VIGSVKLFSSSPANTVSGLSYSPSSYSSSFSPKKKKKKKKLLVVSKSKRQPQTSS----- 62

Query: 128 QNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDLP 187
                    GASDPPPRITSNLKQNLQFL+LWKEFQKRKSGAPKPATSYRKKKVEKEDLP
Sbjct: 63  ---------GASDPPPRITSNLKQNLQFLKLWKEFQKRKSGAPKPATSYRKKKVEKEDLP 122

Query: 188 GDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLIDE 247
           GDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLD+ARQKLIDE
Sbjct: 123 GDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDE 182

Query: 248 LITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGCP 307
           LITFSMLREVKVVVVFDA+ SGLPTHKENFAGIDVVYSGE+CADTWIETEVVALKEDGCP
Sbjct: 183 LITFSMLREVKVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCP 242

Query: 308 KVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNL 367
           KVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNL
Sbjct: 243 KVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNL 302

Query: 368 DSEVVNALNDLKRKLNESEPK 387
           DSEVVNALNDLKRKLNESEPK
Sbjct: 303 DSEVVNALNDLKRKLNESEPK 309

BLAST of Cla97C10G191300 vs. NCBI nr
Match: XP_022136580.1 (uncharacterized protein LOC111008254 isoform X1 [Momordica charantia] >XP_022136582.1 uncharacterized protein LOC111008254 isoform X1 [Momordica charantia])

HSP 1 Score: 560.8 bits (1444), Expect = 8.9e-156
Identity = 289/320 (90.31%), Postives = 298/320 (93.12%), Query Frame = 0

Query: 68  VVGSVKLFSSSPANTVSGLSYSPSSYSSSFSP---KKKKKKLLVVSKSKKQPQTSSMYNF 127
           VVG VKLFSSSPANTVSGL YS SSY+SSFSP   KKKKKKLLVVSKSKKQPQTSS    
Sbjct: 3   VVGPVKLFSSSPANTVSGLCYSASSYTSSFSPPKKKKKKKKLLVVSKSKKQPQTSS---- 62

Query: 128 VQNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDL 187
                   LQG SDPPPRITSNLKQNLQFLRLWKEFQKRKSG PKPATSYR+KKVEKEDL
Sbjct: 63  --------LQGGSDPPPRITSNLKQNLQFLRLWKEFQKRKSGTPKPATSYRRKKVEKEDL 122

Query: 188 PGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLID 247
           PGDT+LYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLD+ARQKLID
Sbjct: 123 PGDTDLYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLID 182

Query: 248 ELITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGC 307
           ELITFSMLREVKVVVVFDA+ SGLPTHKENFAGIDVV+SGE+CADTWIETEVVALKEDGC
Sbjct: 183 ELITFSMLREVKVVVVFDAMLSGLPTHKENFAGIDVVFSGESCADTWIETEVVALKEDGC 242

Query: 308 PKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN 367
           PKVWVVTSD+CQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN
Sbjct: 243 PKVWVVTSDICQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN 302

Query: 368 LDSEVVNALNDLKRKLNESE 385
           LDSEVVNALNDLKRKL E+E
Sbjct: 303 LDSEVVNALNDLKRKLTENE 310

BLAST of Cla97C10G191300 vs. NCBI nr
Match: XP_022136583.1 (uncharacterized protein LOC111008254 isoform X2 [Momordica charantia])

HSP 1 Score: 556.6 bits (1433), Expect = 1.7e-154
Identity = 287/320 (89.69%), Postives = 296/320 (92.50%), Query Frame = 0

Query: 68  VVGSVKLFSSSPANTVSGLSYSPSSYSSSFSP---KKKKKKLLVVSKSKKQPQTSSMYNF 127
           VVG VKLFSSSPANTVSGL YS SSY+SSFSP   KKKKKKLLVVSKSKKQPQTSS    
Sbjct: 3   VVGPVKLFSSSPANTVSGLCYSASSYTSSFSPPKKKKKKKKLLVVSKSKKQPQTSS---- 62

Query: 128 VQNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDL 187
                     G SDPPPRITSNLKQNLQFLRLWKEFQKRKSG PKPATSYR+KKVEKEDL
Sbjct: 63  ----------GGSDPPPRITSNLKQNLQFLRLWKEFQKRKSGTPKPATSYRRKKVEKEDL 122

Query: 188 PGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLID 247
           PGDT+LYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLD+ARQKLID
Sbjct: 123 PGDTDLYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLID 182

Query: 248 ELITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGC 307
           ELITFSMLREVKVVVVFDA+ SGLPTHKENFAGIDVV+SGE+CADTWIETEVVALKEDGC
Sbjct: 183 ELITFSMLREVKVVVVFDAMLSGLPTHKENFAGIDVVFSGESCADTWIETEVVALKEDGC 242

Query: 308 PKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN 367
           PKVWVVTSD+CQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN
Sbjct: 243 PKVWVVTSDICQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN 302

Query: 368 LDSEVVNALNDLKRKLNESE 385
           LDSEVVNALNDLKRKL E+E
Sbjct: 303 LDSEVVNALNDLKRKLTENE 308

BLAST of Cla97C10G191300 vs. NCBI nr
Match: XP_008452195.1 (PREDICTED: uncharacterized protein YacP isoform X1 [Cucumis melo])

HSP 1 Score: 556.6 bits (1433), Expect = 1.7e-154
Identity = 291/323 (90.09%), Postives = 298/323 (92.26%), Query Frame = 0

Query: 68  VVGSVKLFSSSPANTVSGLSYSPSSYSSS-FSP---KKKKKKLLVVSKSKKQPQTSSMYN 127
           VVGSVKLFS+SPAN+VSGLSYSPSSYSS+ FSP   KKKKKKLLVVSKSKKQPQ SS   
Sbjct: 3   VVGSVKLFSASPANSVSGLSYSPSSYSSTLFSPKKKKKKKKKLLVVSKSKKQPQISS--- 62

Query: 128 FVQNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKED 187
                       A   PPRITSNLKQNLQFLRLWKEFQKRKSG PKPATSYR+KKVEKED
Sbjct: 63  ------------AGSDPPRITSNLKQNLQFLRLWKEFQKRKSGVPKPATSYRRKKVEKED 122

Query: 188 LPGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLI 247
           LPGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLD+ARQKLI
Sbjct: 123 LPGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLI 182

Query: 248 DELITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDG 307
           DELITFSMLREVKVVVVFDA+ SGLPTHKENFAGIDVVYSGE+CADTWIETEVVALKEDG
Sbjct: 183 DELITFSMLREVKVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDG 242

Query: 308 CPKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKH 367
           CPKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKH
Sbjct: 243 CPKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKH 302

Query: 368 NLDSEVVNALNDLKRKLNESEPK 387
           NLDSEVVNALNDLKRKLNESEPK
Sbjct: 303 NLDSEVVNALNDLKRKLNESEPK 310

BLAST of Cla97C10G191300 vs. NCBI nr
Match: XP_004133742.2 (uncharacterized protein LOC101212837 isoform X1 [Cucumis sativus] >KAE8650197.1 hypothetical protein Csa_010763 [Cucumis sativus])

HSP 1 Score: 552.4 bits (1422), Expect = 3.2e-153
Identity = 286/321 (89.10%), Postives = 293/321 (91.28%), Query Frame = 0

Query: 68  VVGSVKLFSSSPANTVSGLSYSPSSYSSSFSP--KKKKKKLLVVSKSKKQPQTSSMYNFV 127
           VVGSVKLF +SPANTVSGL YSPSSYSS FSP  KKKKKKLLVVSKSKKQPQTSS     
Sbjct: 3   VVGSVKLFLASPANTVSGLVYSPSSYSSLFSPKKKKKKKKLLVVSKSKKQPQTSS----- 62

Query: 128 QNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDLP 187
                      SDPPPRITSNLKQNLQFLRLWK+FQKRKSG PKPATSYR+KKVEKEDLP
Sbjct: 63  ---------AGSDPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLP 122

Query: 188 GDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLIDE 247
            DTELYRDPTLALY+TNQGIDN  PVLLVDGYNVCGYWVKLKKHFMNGRLD+ARQKLIDE
Sbjct: 123 EDTELYRDPTLALYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDE 182

Query: 248 LITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGCP 307
           LITFSMLREVKVVVVFDA+ SGLPTHKENFAGIDVVYSGE+CADTWIETEVVALKEDGCP
Sbjct: 183 LITFSMLREVKVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCP 242

Query: 308 KVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNL 367
           KVWVVTSDVC QHAAHGAGAFIWSCKALVSEI ASKKEVEMMLQEHRSTSFQGKLLKHNL
Sbjct: 243 KVWVVTSDVCHQHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNL 302

Query: 368 DSEVVNALNDLKRKLNESEPK 387
           DSEVVNALNDLKRKLNESEPK
Sbjct: 303 DSEVVNALNDLKRKLNESEPK 309

BLAST of Cla97C10G191300 vs. ExPASy Swiss-Prot
Match: P37574 (Uncharacterized protein YacP OS=Bacillus subtilis (strain 168) OX=224308 GN=yacP PE=1 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 1.5e-09
Identity = 54/170 (31.76%), Postives = 86/170 (50.59%), Query Frame = 0

Query: 211 VLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLIDELITFSMLREVKVVVVFDA-LQSGLP 270
           +LLVDGYN+ G W +LK    N   + AR  LI ++  +      +V+VVFDA L  GL 
Sbjct: 3   ILLVDGYNMIGAWPQLKDLKANS-FEEARDVLIQKMAEYQSYTGNRVIVVFDAHLVKGLE 62

Query: 271 THKENFAGIDVVYSGET-CADTWIETEVVALKEDGCPKVWVVTSDVCQQHAAHGAGAFIW 330
             + N   ++V+++ E   AD  IE    AL  +   ++ V TSD  +Q A  G GA   
Sbjct: 63  KKQTNHR-VEVIFTKENETADERIEKLAQAL-NNIATQIHVATSDYTEQWAIFGQGALRK 122

Query: 331 SCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLKR 379
           S + L+ E++  ++ +E  +++  S    GK+    L  EV+      +R
Sbjct: 123 SARELLREVETIERRIERRVRKITSEKPAGKIA---LSEEVLKTFEKWRR 166

BLAST of Cla97C10G191300 vs. ExPASy TrEMBL
Match: A0A6J1C4B6 (uncharacterized protein LOC111008254 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111008254 PE=4 SV=1)

HSP 1 Score: 560.8 bits (1444), Expect = 4.3e-156
Identity = 289/320 (90.31%), Postives = 298/320 (93.12%), Query Frame = 0

Query: 68  VVGSVKLFSSSPANTVSGLSYSPSSYSSSFSP---KKKKKKLLVVSKSKKQPQTSSMYNF 127
           VVG VKLFSSSPANTVSGL YS SSY+SSFSP   KKKKKKLLVVSKSKKQPQTSS    
Sbjct: 3   VVGPVKLFSSSPANTVSGLCYSASSYTSSFSPPKKKKKKKKLLVVSKSKKQPQTSS---- 62

Query: 128 VQNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDL 187
                   LQG SDPPPRITSNLKQNLQFLRLWKEFQKRKSG PKPATSYR+KKVEKEDL
Sbjct: 63  --------LQGGSDPPPRITSNLKQNLQFLRLWKEFQKRKSGTPKPATSYRRKKVEKEDL 122

Query: 188 PGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLID 247
           PGDT+LYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLD+ARQKLID
Sbjct: 123 PGDTDLYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLID 182

Query: 248 ELITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGC 307
           ELITFSMLREVKVVVVFDA+ SGLPTHKENFAGIDVV+SGE+CADTWIETEVVALKEDGC
Sbjct: 183 ELITFSMLREVKVVVVFDAMLSGLPTHKENFAGIDVVFSGESCADTWIETEVVALKEDGC 242

Query: 308 PKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN 367
           PKVWVVTSD+CQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN
Sbjct: 243 PKVWVVTSDICQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN 302

Query: 368 LDSEVVNALNDLKRKLNESE 385
           LDSEVVNALNDLKRKL E+E
Sbjct: 303 LDSEVVNALNDLKRKLTENE 310

BLAST of Cla97C10G191300 vs. ExPASy TrEMBL
Match: A0A6J1C7Z2 (uncharacterized protein LOC111008254 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111008254 PE=4 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 8.2e-155
Identity = 287/320 (89.69%), Postives = 296/320 (92.50%), Query Frame = 0

Query: 68  VVGSVKLFSSSPANTVSGLSYSPSSYSSSFSP---KKKKKKLLVVSKSKKQPQTSSMYNF 127
           VVG VKLFSSSPANTVSGL YS SSY+SSFSP   KKKKKKLLVVSKSKKQPQTSS    
Sbjct: 3   VVGPVKLFSSSPANTVSGLCYSASSYTSSFSPPKKKKKKKKLLVVSKSKKQPQTSS---- 62

Query: 128 VQNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDL 187
                     G SDPPPRITSNLKQNLQFLRLWKEFQKRKSG PKPATSYR+KKVEKEDL
Sbjct: 63  ----------GGSDPPPRITSNLKQNLQFLRLWKEFQKRKSGTPKPATSYRRKKVEKEDL 122

Query: 188 PGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLID 247
           PGDT+LYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLD+ARQKLID
Sbjct: 123 PGDTDLYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLID 182

Query: 248 ELITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGC 307
           ELITFSMLREVKVVVVFDA+ SGLPTHKENFAGIDVV+SGE+CADTWIETEVVALKEDGC
Sbjct: 183 ELITFSMLREVKVVVVFDAMLSGLPTHKENFAGIDVVFSGESCADTWIETEVVALKEDGC 242

Query: 308 PKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN 367
           PKVWVVTSD+CQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN
Sbjct: 243 PKVWVVTSDICQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN 302

Query: 368 LDSEVVNALNDLKRKLNESE 385
           LDSEVVNALNDLKRKL E+E
Sbjct: 303 LDSEVVNALNDLKRKLTENE 308

BLAST of Cla97C10G191300 vs. ExPASy TrEMBL
Match: A0A1S3BUE5 (uncharacterized protein YacP isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493288 PE=4 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 8.2e-155
Identity = 291/323 (90.09%), Postives = 298/323 (92.26%), Query Frame = 0

Query: 68  VVGSVKLFSSSPANTVSGLSYSPSSYSSS-FSP---KKKKKKLLVVSKSKKQPQTSSMYN 127
           VVGSVKLFS+SPAN+VSGLSYSPSSYSS+ FSP   KKKKKKLLVVSKSKKQPQ SS   
Sbjct: 3   VVGSVKLFSASPANSVSGLSYSPSSYSSTLFSPKKKKKKKKKLLVVSKSKKQPQISS--- 62

Query: 128 FVQNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKED 187
                       A   PPRITSNLKQNLQFLRLWKEFQKRKSG PKPATSYR+KKVEKED
Sbjct: 63  ------------AGSDPPRITSNLKQNLQFLRLWKEFQKRKSGVPKPATSYRRKKVEKED 122

Query: 188 LPGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLI 247
           LPGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLD+ARQKLI
Sbjct: 123 LPGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLI 182

Query: 248 DELITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDG 307
           DELITFSMLREVKVVVVFDA+ SGLPTHKENFAGIDVVYSGE+CADTWIETEVVALKEDG
Sbjct: 183 DELITFSMLREVKVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDG 242

Query: 308 CPKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKH 367
           CPKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKH
Sbjct: 243 CPKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKH 302

Query: 368 NLDSEVVNALNDLKRKLNESEPK 387
           NLDSEVVNALNDLKRKLNESEPK
Sbjct: 303 NLDSEVVNALNDLKRKLNESEPK 310

BLAST of Cla97C10G191300 vs. ExPASy TrEMBL
Match: A0A6J1JLX1 (uncharacterized protein LOC111483584 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483584 PE=4 SV=1)

HSP 1 Score: 535.4 bits (1378), Expect = 1.9e-148
Identity = 278/322 (86.34%), Postives = 290/322 (90.06%), Query Frame = 0

Query: 68  VVGSVKLFSSSPANTVSGLSYSPSSYSSSFSPKKKKKK---LLVVSKSKKQPQTSSMYNF 127
           VVGSVKLFSSSP N VSGL YSPSSY+S FSPKKKKKK    LVVSKSKKQPQ+ S    
Sbjct: 3   VVGSVKLFSSSPTNIVSGLCYSPSSYTSLFSPKKKKKKKKNFLVVSKSKKQPQSPS---- 62

Query: 128 VQNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDL 187
                     G SD PPRITSNLKQNLQFL+LWK+FQKRKS APKPATSYRKKKVEKEDL
Sbjct: 63  ----------GDSD-PPRITSNLKQNLQFLKLWKDFQKRKSSAPKPATSYRKKKVEKEDL 122

Query: 188 PGDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLID 247
           PGDTELYRDPTL LYYTNQGIDN VPVLLVDGYNVCGYWVKLKKHFMNGRLD+ARQKLID
Sbjct: 123 PGDTELYRDPTLTLYYTNQGIDNTVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLID 182

Query: 248 ELITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGC 307
           ELITFSMLREVKVVVVFDA+ SGLPTHKE+FAGIDVVYSGE+CADTWIE EVVAL+EDGC
Sbjct: 183 ELITFSMLREVKVVVVFDAMLSGLPTHKEDFAGIDVVYSGESCADTWIEKEVVALREDGC 242

Query: 308 PKVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHN 367
           PKVWVVTSD+C QHAAHGAGAFIWSCKALV+EIKASKKEVEMMLQEHRSTSFQGKLLKHN
Sbjct: 243 PKVWVVTSDICHQHAAHGAGAFIWSCKALVTEIKASKKEVEMMLQEHRSTSFQGKLLKHN 302

Query: 368 LDSEVVNALNDLKRKLNESEPK 387
           LDSEVVNALNDLKRKLNE+E K
Sbjct: 303 LDSEVVNALNDLKRKLNENESK 309

BLAST of Cla97C10G191300 vs. ExPASy TrEMBL
Match: A0A6J1FQV5 (uncharacterized protein LOC111447625 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447625 PE=4 SV=1)

HSP 1 Score: 532.3 bits (1370), Expect = 1.6e-147
Identity = 276/321 (85.98%), Postives = 289/321 (90.03%), Query Frame = 0

Query: 68  VVGSVKLFSSSPANTVSGLSYSPSSYSSSFSPKKKKKK--LLVVSKSKKQPQTSSMYNFV 127
           VVGSVKLFSSSP NTVSGL YSPSSY+S  SPKKKKKK   LVVSKSKKQPQ+ S     
Sbjct: 3   VVGSVKLFSSSPTNTVSGLCYSPSSYTSLLSPKKKKKKKNFLVVSKSKKQPQSPS----- 62

Query: 128 QNLTGFQLQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDLP 187
                    G SD PPRITSNLKQNLQFL+LWK+FQKRKS  PKPATSYRKKKVEKEDLP
Sbjct: 63  ---------GDSD-PPRITSNLKQNLQFLKLWKDFQKRKSSVPKPATSYRKKKVEKEDLP 122

Query: 188 GDTELYRDPTLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLIDE 247
           GDTELYRDPTL LYYTNQGIDN VPVLLVDGYNVCGYWVKLKKHFM+GRLD+ARQKLIDE
Sbjct: 123 GDTELYRDPTLTLYYTNQGIDNTVPVLLVDGYNVCGYWVKLKKHFMSGRLDVARQKLIDE 182

Query: 248 LITFSMLREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGCP 307
           LITFSMLREVKVVVVFDA+ SGLPTHKE+FAGIDVVYSGE+CADTWIE EVVAL+EDGCP
Sbjct: 183 LITFSMLREVKVVVVFDAMLSGLPTHKEDFAGIDVVYSGESCADTWIEKEVVALREDGCP 242

Query: 308 KVWVVTSDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNL 367
           KVWVVTSD+C QHAAHGAGAFIWSCKALV+EIKASKKEVEMMLQEHRSTSFQGKLLKHNL
Sbjct: 243 KVWVVTSDICHQHAAHGAGAFIWSCKALVTEIKASKKEVEMMLQEHRSTSFQGKLLKHNL 302

Query: 368 DSEVVNALNDLKRKLNESEPK 387
           DSEVVNALNDLKRKLNE+E K
Sbjct: 303 DSEVVNALNDLKRKLNENESK 308

BLAST of Cla97C10G191300 vs. TAIR 10
Match: AT2G02410.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF901 (InterPro:IPR010298). )

HSP 1 Score: 407.5 bits (1046), Expect = 1.2e-113
Identity = 199/252 (78.97%), Postives = 227/252 (90.08%), Query Frame = 0

Query: 137 SDP-PPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDLPGDTELYRDPT 196
           S+P PPRI SN+K NLQ L+LWKEFQ R SG  KPATSYRKKKVEK++LP D+ELYRDPT
Sbjct: 8   SEPEPPRIKSNVKHNLQLLKLWKEFQSRGSGMAKPATSYRKKKVEKDELPDDSELYRDPT 67

Query: 197 LALYYTNQG-IDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLIDELITFSMLRE 256
             LYYTNQG +D+AVPVLLVDGYNVCGYW+KLKKHFM GRLD+ARQKL+DEL++FSM++E
Sbjct: 68  NTLYYTNQGLLDDAVPVLLVDGYNVCGYWMKLKKHFMKGRLDVARQKLVDELVSFSMVKE 127

Query: 257 VKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGCPKVWVVTSDV 316
           VKVVVVFDAL SGLPTHKE+FAG+DV++SGETCAD WIE EVVAL+EDGCPKVWVVTSDV
Sbjct: 128 VKVVVVFDALMSGLPTHKEDFAGVDVIFSGETCADAWIEKEVVALREDGCPKVWVVTSDV 187

Query: 317 CQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALN 376
           CQQ AAHGAGA+IWS KALVSEIK+  KEVE M+QE RSTSFQG+LLKHNLDSEVV+AL 
Sbjct: 188 CQQQAAHGAGAYIWSSKALVSEIKSMHKEVEKMMQETRSTSFQGRLLKHNLDSEVVDALK 247

Query: 377 DLKRKLNESEPK 387
           DL+ KL+E+E K
Sbjct: 248 DLRDKLSENETK 259

BLAST of Cla97C10G191300 vs. TAIR 10
Match: AT2G02410.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF901 (InterPro:IPR010298); Has 1151 Blast hits to 1151 proteins in 597 species: Archae - 0; Bacteria - 1105; Metazoa - 0; Fungi - 0; Plants - 42; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 405.2 bits (1040), Expect = 5.8e-113
Identity = 215/315 (68.25%), Postives = 248/315 (78.73%), Query Frame = 0

Query: 76  SSSPANTVSGLSYSPSSYSSSFSPKKKKKKLLVV---SKSKKQPQTSSMYNFVQNLTGFQ 135
           SSS +      S S  SY S         ++LVV    KSKK  Q+SS            
Sbjct: 13  SSSSSAVYFNWSSSSCSYDSC--------RVLVVKMGGKSKKPHQSSS------------ 72

Query: 136 LQGASDPPPRITSNLKQNLQFLRLWKEFQKRKSGAPKPATSYRKKKVEKEDLPGDTELYR 195
            + +   PPRI SN+K NLQ L+LWKEFQ R SG  KPATSYRKKKVEK++LP D+ELYR
Sbjct: 73  FKESEPEPPRIKSNVKHNLQLLKLWKEFQSRGSGMAKPATSYRKKKVEKDELPDDSELYR 132

Query: 196 DPTLALYYTNQG-IDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDIARQKLIDELITFSM 255
           DPT  LYYTNQG +D+AVPVLLVDGYNVCGYW+KLKKHFM GRLD+ARQKL+DEL++FSM
Sbjct: 133 DPTNTLYYTNQGLLDDAVPVLLVDGYNVCGYWMKLKKHFMKGRLDVARQKLVDELVSFSM 192

Query: 256 LREVKVVVVFDALQSGLPTHKENFAGIDVVYSGETCADTWIETEVVALKEDGCPKVWVVT 315
           ++EVKVVVVFDAL SGLPTHKE+FAG+DV++SGETCAD WIE EVVAL+EDGCPKVWVVT
Sbjct: 193 VKEVKVVVVFDALMSGLPTHKEDFAGVDVIFSGETCADAWIEKEVVALREDGCPKVWVVT 252

Query: 316 SDVCQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVN 375
           SDVCQQ AAHGAGA+IWS KALVSEIK+  KEVE M+QE RSTSFQG+LLKHNLDSEVV+
Sbjct: 253 SDVCQQQAAHGAGAYIWSSKALVSEIKSMHKEVEKMMQETRSTSFQGRLLKHNLDSEVVD 307

Query: 376 ALNDLKRKLNESEPK 387
           AL DL+ KL+E+E K
Sbjct: 313 ALKDLRDKLSENETK 307

BLAST of Cla97C10G191300 vs. TAIR 10
Match: AT2G02410.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF901 (InterPro:IPR010298). )

HSP 1 Score: 374.8 bits (961), Expect = 8.5e-104
Identity = 182/230 (79.13%), Postives = 208/230 (90.43%), Query Frame = 0

Query: 158 KEFQKRKSGAPKPATSYRKKKVEKEDLPGDTELYRDPTLALYYTNQG-IDNAVPVLLVDG 217
           + FQ R SG  KPATSYRKKKVEK++LP D+ELYRDPT  LYYTNQG +D+AVPVLLVDG
Sbjct: 13  RHFQSRGSGMAKPATSYRKKKVEKDELPDDSELYRDPTNTLYYTNQGLLDDAVPVLLVDG 72

Query: 218 YNVCGYWVKLKKHFMNGRLDIARQKLIDELITFSMLREVKVVVVFDALQSGLPTHKENFA 277
           YNVCGYW+KLKKHFM GRLD+ARQKL+DEL++FSM++EVKVVVVFDAL SGLPTHKE+FA
Sbjct: 73  YNVCGYWMKLKKHFMKGRLDVARQKLVDELVSFSMVKEVKVVVVFDALMSGLPTHKEDFA 132

Query: 278 GIDVVYSGETCADTWIETEVVALKEDGCPKVWVVTSDVCQQHAAHGAGAFIWSCKALVSE 337
           G+DV++SGETCAD WIE EVVAL+EDGCPKVWVVTSDVCQQ AAHGAGA+IWS KALVSE
Sbjct: 133 GVDVIFSGETCADAWIEKEVVALREDGCPKVWVVTSDVCQQQAAHGAGAYIWSSKALVSE 192

Query: 338 IKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLKRKLNESEPK 387
           IK+  KEVE M+QE RSTSFQG+LLKHNLDSEVV+AL DL+ KL+E+E K
Sbjct: 193 IKSMHKEVEKMMQETRSTSFQGRLLKHNLDSEVVDALKDLRDKLSENETK 242

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038904183.17.1e-16192.83uncharacterized protein YacP [Benincasa hispida][more]
XP_022136580.18.9e-15690.31uncharacterized protein LOC111008254 isoform X1 [Momordica charantia] >XP_022136... [more]
XP_022136583.11.7e-15489.69uncharacterized protein LOC111008254 isoform X2 [Momordica charantia][more]
XP_008452195.11.7e-15490.09PREDICTED: uncharacterized protein YacP isoform X1 [Cucumis melo][more]
XP_004133742.23.2e-15389.10uncharacterized protein LOC101212837 isoform X1 [Cucumis sativus] >KAE8650197.1 ... [more]
Match NameE-valueIdentityDescription
P375741.5e-0931.76Uncharacterized protein YacP OS=Bacillus subtilis (strain 168) OX=224308 GN=yacP... [more]
Match NameE-valueIdentityDescription
A0A6J1C4B64.3e-15690.31uncharacterized protein LOC111008254 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1C7Z28.2e-15589.69uncharacterized protein LOC111008254 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A1S3BUE58.2e-15590.09uncharacterized protein YacP isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493288 ... [more]
A0A6J1JLX11.9e-14886.34uncharacterized protein LOC111483584 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1FQV51.6e-14785.98uncharacterized protein LOC111447625 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
Match NameE-valueIdentityDescription
AT2G02410.31.2e-11378.97unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT2G02410.15.8e-11368.25unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF901 ... [more]
AT2G02410.28.5e-10479.13unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 330..350
NoneNo IPR availableCDDcd10912PIN_YacP-likecoord: 211..345
e-value: 4.86702E-49
score: 160.803
IPR010298Protein of unknown function DUF901PFAMPF05991NYN_YacPcoord: 212..378
e-value: 3.1E-40
score: 137.8
IPR010298Protein of unknown function DUF901PANTHERPTHR34547YACP-LIKE NYN DOMAIN PROTEINcoord: 102..384

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C10G191300.2Cla97C10G191300.2mRNA