Clc01G04800 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G04800
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: ATP synthase subunit a like
Location: ClcChr01: 4511807 .. 4522069 (-)
RNA-Seq Expression: Clc01G04800
Synteny: Clc01G04800
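
The Location field above packs chromosome, start, end and strand into one string. A minimal sketch of how it could be parsed and turned into a locus length follows. The function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(text):
    """Split a string like 'ClcChr01: 4511807 .. 4522069 (-)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the locus, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("ClcChr01: 4511807 .. 4522069 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 10263 bp on the minus strand of ClcChr01.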
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAGTCAGGTCTCTATCCACGGCTTAAATCTATTAGATGGTATAGCAAGTGAGAAAGGGTAGTGCAGTAATTACCAATGAAGGCAGCAGTAGCCCCAATAAGGAGGAGGAGTTCTCCATATGAAATGCCTAGCATCTGAGTTTCGTAGTGGCTCAGAACTCAGAATAGCAATCAAGCCAAGCCTGAAGTTTGAGAATGATCTCAGATGCAATAACTTTCACTGAATTCACTCAGTTCACTCCTCACTGATTAACTGTGCGGTTATCTAAACATTTTTGAGGGGTTCCACTTCAATTAAGGACCAACCTTCAGAAACAAAATAGGAAATTTCCCAATCAGAAATATTTTCAAATTTCCATAACTGACACACAAAAACTCTGTCTTTTCTTTTCCTATGAAATATATAGCATCTTTTTTAATATTCGTGAGTGTCCGGCCAAGTTTACATGTACCTCAACTAACCTCATGGGACAACCCGTCTGACTCTACAATATTTGGGTGCCAAGTAAACTAGTAGGATATTAAATCCTAAGTAGGTGGCAACCATAAATTGAACCATAACTTCTTTGCTTTTTTCTCTATTTACTACTAGATCAACTCATGATAGTTAAACATATAGCATTTCATTAATGGGAGACTTGTACAAGTACCAGATTCATTTGATACCTTTGATAGAATCCAAAACAGAACCAAAATGTATTGAAATATGGAAGATAAAAGAAATTATTAGAGGTACCCGTACCCCCTTTTGAGATACCCTAATTCATTCACTAAATCACCAATTCACTCACCAAATGCCTCCATCCATCCTACTAACCCACTCCATTTATAACAAAACTCTCCTAACAAACTCCTTACCTAAATACCAACATACTCTAATACACTAATTCTATTCTAATAACATTCCTATCATTACCCACCTCGTTAAACAAAACAAAACAAAAAAGAAAAATGCAAAATTGGGTACAGAGCAAACCTTTTTTCTTTTTATCCTTTTCGCGTTCACGAAACCAAACACCCTTCAGAAACTTTAAACTTAGCCTTGTGCGGGGGAAATTGGCTGACAGTGGCGATCGGCGGCGACAAACCAATACAAAAGCAAGGGTGTGGGGCGGCACAAGTGAAAAGAAAACAGAGGGGTTTTCTCCCTTTAGAATCGGGTCGTGTAGAGTGGCGAGTTGATGGCTGTGCAATGGACGGCGCGACGACAGGCGCCGGTACCGAGTATGGTGTGTGCTGAGTAAAAATGGCAGAATAGAGATGCGTATGATCAATGTTCGAAAAAAAAAATTAAAAACTTTTTTGGTGCTAAACTTTGGCTAAAGTGATTAATTTTAAAAATAAGTTATTTAAAAAAGAATTGAAATGTTTTAACATAATTTTTTTTAGTAGTAGTTAAACAGTTTTTTTTTTTTTAATCAAAATAGTTTTTTAAGTCAATTTGAACGGAATCTTTTGTAAAATAGTGTATGTGAAATACTACTTAATTTAAAAAATAAATGAATTTGCACAATCATTCATGTTTAAATTTTAAACCATTCTTATAAAAATATAAATAAAAAAATATTTATTTTTTAAGTTAATCGAAACTAATCTTAGTCCATTTTGTAACAATTTAATTTTTGTATTTTTAATTAGGTAACAATTTAGTCTATAAACTTTAGTATTTAACAATTTAATCCTATGTACTTTCAAATGTATAACAATTTAATCCCTACATGAAAGACATTTCGAAATTAGATGTCATTTTTTATTATTTTACAATGTAAACTTTGTATTTCATAAAATATTGAGTCTCTAATTATATTGACGGTTTATTTATGTCGAACATCTCATCAAAACTTAACGCTAATTCTACCATAAAATTAAATTGTTGCAAATTTTGAAGACTTAATTGTTAAGAAAATTATTATAAATAGAAAAATATCAAACTATTTACAAATATAGAAAAATTTAATTGTTTATCAGCAATAGAATGCGATAGACAATGATAGACT
TCTATGTTTATAAATAGTTTGACTCATTTTGCTATATTTGAAAACAATTTTAATTGTTAAATAATAAAAAGTTCATGAATTTAATTGTTACAAAATGGAAAATTCAAATACTAAATTGTTATAAATCAATGTCAATGACTAAATTGTTACTTTTATAAATGTTTAAGAACTAAAAGTGATTTTTAACCTTGACCAATGGTCAAAATATCGAGATCTGAATTTATGAAAATACGATATCAATGAAAATTTCAAAAAACTTTATAAAATAGATGAAAATTGGTAGATTTTGTTATAATTAGTTAATAAAATTTTGACCATGGGTCAATTAAACTATAATTAACTATTTTAGTATTCATTTATTATAAATTAAGAAAATATTTAAATAACAAAATAATATAAAGTTTTAGAAATATTTTAAAGTGAAAGGACAAAAATGAAAGTTTAACCTTTTTTTCAAAATCGAAGCTATAAAATGAAGACGGTTTGATATTCGGTCGATCACTGTTGCGATTTTTGAGGCAGTCGATCGATTACTGTTGCCCTTACAGCTACCACTCGTTCAATCGGGTAAGAATTCTTACTTCTTCCCTCAATTAATTGGTGAATTGCACGATATTTCAATTCCTAGACCGCCGCAAAAGCAACCACCTGTTTCTAATTCCAACAGATCCGATTCCATTTCTCAGCTGGGTTCGCTGTTAAGGTAAGCTAGTTGTAACATGTGATTGAATCTTGCACGTTGAAAACCAAATTTGTTCATTAGGACCTCATGTATTGATATAATTACTCTCCTCTGAATTCTTACTTTGCTTATTGATTATTTTCTGTAATCGCAGATTTTACTTACGCGTGTTGCGACCCTGTTCCTTTTATTTGTCATGTTGTATCGATTTCTTCTTCTTTTCTTTTCTTTTATCTCCACAATTATTTCTTTTTCCATCTTCATTAGTTACCTCATATTGATGTTGAATTGATTTGATGCATGTTTTTCCTAATCATCATTGTTATGAATCGAATTCGAGCACCTTAGTGATTTGTTATTTGTTTATATGTTTGTGAGTGTCCGTGTCAGCTTACACACCCTCTCCTAATCCCACGCGACAATCCATCTGTAAAAAAGATTGTAATATATTAAATTCTACGTAGATAGTCACTGGGAGTTTCATTACGCTCACAAGCAGAATCTTAGCTGGAGCCTACCTTTTGCCCTCCTTGGTGCTTGCCAAGATGCTATCATGTGATTGGTGCATGATCCCCATCGCTTCATCACCTTATAGACACCTCAATTGAATTCCACCTTAACGACAGAAAAAAGGATTCATCAAGTTCTATTGAGTTTGACTCATAGTAATGACTCGTTCCCTCTAAGTCTTTTATTGAACCCATAGTCCTTTTTTACCTAAGCTTAGGCACGACTGTTCATAATTCTCATGATTAAGTTTTCTAATACTTAAAATTGTTATCACTGTGGGGTCATAGGAAAATAATGGTTAAGGTATTTTCTTTTTTTACTTTATTGCAATAGTTGATGTCAATTAGAATAATTCTATAGTAATTGAACAAACCAATCAGCTCTCCTAAGTCCAGGCCTATTAGGGAAAACAAAAGTTTGTTCCTTGAGCTTAACATTGATCTTTGCATCCTCGAGTGGTCCTTGTAAGCTCCTCAATTGAATAAAAAGAAAAAAGAAGGAAAAAAAAGAAATAGCTGATGGTTTGAAAGTGAATGTAGGCAATCTTTTTTTTGTTCAACAATATGTGGGATGAGGGATTCAAACCTCTGACCTCTTTAGTTTCATATTAGTTGAGTAGCATGTTGGCGAATGAATGTTGTCAGTTTGTTGAACTAAACCACGGCAAATTACCCTTCTTCACTCTCTTGTTCTCCTCAAGGATGATAAATCACAAAAAGGTCAAATGCCTAGCCTAATAGACTTTTATTAGGAAGATTGACAATTTGACACTATTGACTAAATAAGATTGGCCTATTGGTGTCAGGGC
CTCACTGACCATTTTATGTTAGTGTGTTTTTTAGGATTTAGATCATATTTTGTGGGATTTTTAGTAATTATATTCTGATTTGAGACAGTTTCTTAGAGAGTTTTAGGTTAGCTCTGGCTCATTAAAAATCATGTTGCTTAGTGGTGGAGTTGTATTCCAGCTTTTGTTTTGGGGCAACAAGGGCCTTGAGCATGCACTTTTTTTTTTCGATTTTATAAAATATTTTACTTCAGAGAAATAGAAGAATATTTAGAGAGGTTGAGAGCCCTCGAGGGAGGTGTGCTATATGCTAGGTTCAATTGGCTCTCTTTGGATGTTCATAATTATGGCCAAGCAGTTTTTGTGATTTGTCTTTTTATTTTATTCCCAATAAGGGGTCTATCTTTTGTAGGTTTTTTTTTTTTTTTTTATTTTTAATTTTTGGTAGATTTTCCTTTTCTAAATGACATTTTCTTTTTCTCGTTCTTTTCTCTTTTTTCGTGTTATTTCTCAATGAACGTTTGGTTTCTCTTTCCTTCCTTTTTCTAAAAAAGACTTATCTTTTAACAGCTGGATTTGATATAACTACTGCTTGAATATATGTTGTATTTGGATGTCATGCCAAGTTTGCTACTGCTACCTTACTAGATATTATTGATTGATAGTTTGAGCCTTGATATTTATTTGTCTGGTGCTTACATACATTATAACACATAAATAAAACCGCGCATTAACTCAATTCTCTCATCTTGATACTACAGTTCAGCACGACATTTGTTTGTTGTTCATGCATGCTCTACCTAATAGGGCCATTCCTTTCGTTTCTCTTCATTTTTATTTGGCCTTTTCAGATAATGGAGGCATAGTTCGTATCAACAGTTTGTTGGACTAACTATGAATCAAGGTGTGGTATCCGTATCATGTCCTCCATCTATTACCTTACCCCACTACCACCATAAAAACTTCAAGATATTCAAGTCCTCAAAAATATTCAATACATTAAGTCTCCGTTCCCGACGTCCTCCAATTTGTTGCACGCAGACAAATCCTTGGGAACCTGCACCAATCACATTTGCTCCCAACAATGAAGAGGATGATACCTTCTTGAAGAAAACCGACAATATTTTTGAAAGTCTAAATGCTGATAAAACAACTGGAGTTCCGGAAGTAGAGACTAAAGAACTTGTGGAGGCAAGTACTCAACCAGAGGTGTATTTGCAGATTTTCAAATGGCCAATGTGGCTTTTGGGGCCTTCTCTTCTCCTGACAACTGGGATGGCCCCAACATTGTGGCTTCCTATGTCTTCTGTATTTCTCGGTCCCAATGTAGCCAGCCTCCTCTCTTTGATTGGACTTGACTGCATCTATAACCTCGGTGCTATGCTTTTTCTTCTCATGGCTGATGCTTGTGCACGGCCTAAACAACCCATGAAACCCATGAGCAGTGAGGCTCCTTTCAGTTACCAATTCTGGAACATGCTTGCAAATGTGTTTGGATTTGTGATTCCTTTAGTGATGCTCTATGGATCTGAAAGTGGATTGGTTCAACCCCATCTGCCTTTCATCTCCTTGGCAGTTCTATTGGGCCCCTATATTTTGCTCCTTTCAGTACAGATTTTGACCGAAATGCTGACGTGGCACTGGCGATCGCCTGTTTGGCTGGTTACTCCAATTGTATACGAGGGTTATCGAGTTTTGCAGCTGATGAGAGGGTTGAAACTTGGGGCTGAGCTCAGTGCACCGGCCTGGATGATGCACACAATCAGAGGATTGGTTTGCTGGTGGGTACTTATACTCGGTATTCAACTCATGAGAGTAGCTTGGTTTGCAGGTATTGCTGCCTCCCTATCTCATAAGCAGGAAATTGTCGCTAATGGTTCGTGATACTGCCAGTCATAACGCAAGGGTGAGTTTAGGATGACTTTTGAAAAAAAAAAAAAAAAAAAAGGTGTTTTTAAGCAAACACAGTACCTTCTAGAAACACTTTTTGAAGAAGCACAACCAAGTGTTTTACTTAACTA
CTTCTAGTGGACTAAAAACACTTTTCATTCTCCAAAAGTGATTCCAAACCTACCGTAACAAATAGGATATACTCCTATTTTGATATGAGATATGTACAAAGTTAAAGGTCTCAAGGGTTTGCCTCAAATAAGTTTAGAAATAGGCATGGTACTGGAGAAAATGTGAAAGTAGGGGACATGGTTGTTGTATATGTTGCCTATTTTTATTATTCACAGAATTAGGAGTTTATGTGTATAAAAGACTATTAAAAAAAAGAACATGATATTTGATTATTCAAATCTATGAGTGCAGTGTTACAAATTCTTTTTCTCAGTTTTAGGAGTAAGTTCGAAAAATATTGAACTTGATAAACACAGGGTTGGACCTCAGTTATGGTATTGGACTCTATTTGGAGAAATTGGAGTATATGGCAAATTATAAACTGGAAAACTGAAGTTTCCATGGATTTTTATTATGCCATGACCTTAGGTTGTGCAACTGTGTCCTTGAATCCTAAGAATTTTTTATTATGCCCTCAGGTTTGGTTGTGGGTTTTGACTTCATTGATTTTTTATTTTCAGTGTTGGGTTTGACGTTTAAGATAGTGAACTTGCCAGAATTGCCCTATATTTCATTCAGCAACTTAAATCTATCAAGTGCTTTTTCACCAAAATTTTGTTGTTTTTTAGTTCAATATGAATAACATAATTTTTGGTCTAATGATTCATGTATAATAATACTTCAAATACAATTCATATAAAACGAATACATTGTATTCATATAGCTATTATACAAGTTATGTTTTCATATGTTTTCTCAAATTTTATTCAAGTCACATTATAGTCCCTTCGATCGCCCTTTTATTATTAATACAGTGAACTCTCCACAATTATGCTACATTTCATTGCATTGTTCAATTTTATCATGGTACGTTTCACTTGTACAACTTTATTTATCTTTATTTACTGCAAGTACACTATACTTTTAGTCATTTTGCTCGACTCAAGTTCAGATCGGTGAATGAAAATTAAGGTGTATCTTTTAATTCTCATATATTTAAATTCAAACCTACCTAAGAAATAGTTCAATTGGTATAAATTTAAACTATCAACGAAGTTTGAGGTTAGATCTTCTCACCTTATCACCTTATATATATATGTATATTTCTTTCTTCTTTTTGAAACGAGCGGTTGAAGATGGTAAATGTAGGGAGAGGAGAGAGAAAAAAATTATTATTTCATTCTTATATTTTAATTTTTAAAATGCACACTAGCAAATGGTTGTTTATTTAATCGATTCTATCAATGGAGGATCAAAATAGGCATTTCCCAAATATTTGGGGTTTAGAGTTTGCCTACTAATCATTTGTTATTTTGTTTTTTGTTTTTGAGAATTAAGCTTATAAAAAATTCATTCAACCTCTAAATTTTATGCTTTGTTATCTACTTTTTTTTTATCAATATTTTTAAAAATAAAACCAAATATAGAATATTGTTTTTAAAAACTTGTTTTTGTTTTTGAAAATTGAGTATAGATTCAATTATTGTACTTAAAAGAGATACATATAGTTGTAAGAAATTGAGAGGAAATAGACAAAATTAAAAAAAAAAAAAAACAAAATGATTATCAAACGAGACTTTAATTATTCAATTTTGAAACTTAAGGATCAATTTGAAACCAACGTAGTCCTAAGATTGGGGTAAACCAATATAAACTTTCAATGTTATTTTCTCTCTTTTTTTTTCTTATCTTCTTTATTTTTATTTTGTGACCGATTACTTATTTAGCTTCTAACTTATCTTCTTTATTTTTATTTTATTTTGCCTAGTAAATCTTTGAATTCTAAAAAAATATAAAGTAATGTCAAAAACTTATTAGACACATAATAGAAAATTTCATGGATTTGTTGGACACAAAATTAAAAAATTTAGGAGCTTCTAGATATTTACAAAGACTTATTAGTACTTCAAATTTAAACTAAAAATTAAGGGGGCGTTTGGGCCAACTTCAAGTGTTTAAT
ATTTACTACTCATCCTCTTTTCCTTATTTATTAAAACGTTTAATATTTTCTACCTTAAAATAGTCTCCACCCTAAACATAGACTATTATAACTCATAAACTATAATAACTATAGACTATAATAACTAACCCAGTGTCTAAAATACCCTAAAGAGACTAAACTTATAATTTAATTTATATTTTAAATTGTCTAGAAACTAGTTGATAAATATCAAATCTTATTGTTAACCGAATCAAATAAAAACATTTTTCTTTTCTTGTGAGAATGAAAAGAAAATATATAAAACAAAAATGGTATACCATACTAATTTTCTTTTCATGTCCATAACGCATTTCTTAACTTTACTACCTTTATTAGGTATATATGAAATATGAGACATATATGGAATATGCTCAATTTAATCTTGGAAATGGCTTGGATATATGGAAAAACTATCAACCTTTGTTAATATTTACTAAAATAATAATTACCATTAATACTCTTATAGAATCCATATTGATGTTGTGTAATGATTCGTTGTTATTGCTATATAAAAATTTTAAAAAAATAATAATTTAAGTGTTACAATTACACACTATAAATTTATGGGAGAAATAAAAACATTATAAACCAAAAAGAAATATGGCAACTCATAAGCTTTCTGCAGTGTCCATCGCACTCGTGATTTTATTTGCCTGTTTTGTTGCATCAAGTAAGCTCTCGTCTGATCATTCCTTCCACTTTCTCCCCTTTTTTTCTATTCAAACATTGTAAAAGTAAAAATATATTTTGAAGTGATTATCAAAATATATTGATGTTGGATAAATGTTAAAATTTAATTTTGAAAAATCTAAGATAATTTGAATTGATTTTTCGATAATTAGATGAAATGTGATTTTCTGGCATATTACGTGATTTTACAATCTCTATAAGATACTTAAGTTACTTTTATTATTGTTAATAATATATTAAACCTCATAACAAAAAATTCAATTCAAGAATGGTGAACGTGAAGAGCCACCATGTTGAGATCCCATATGTTTGGCCAAAATCATCCAAAAAATTTGTCCAAATTTCATTCGCTTATTAAAATACCAAATTTGCCATTGTATTTAAGAATAATATATATATATATTTTTTGCCTCAGGTCACTTTTCTTTCTTTTGTGGAGGCTTGACCCTAAATTTGGAAGCCTAACACAATAAATTACTTCTAAATACATAAAAATTACTTTTAAAATTTTGGTATGATTATATGAAAATCAATTTTACAAAATTAATTGTACTTTAAATCACTTGTAATGCTAAAATCACTACTTCAACAATCCCTCTTAAGCATGTGCCAAGAGTATGTTTGAGAGTGAATTTGACATTTCCAAAATAACTTTACTTTCAAATATTGATGCTAGGAAAATGTTGAAATTGATTCTACAATAATTAGTTGAGATGTGATTCTTAGCAGAGTGATTGACCAAGAATATAACGTTTAGAAGTTTAAAACCAAACATCATATGAATCCTACCTTTAAATCTTTTTTTTCCCCAAGTCTAAATTATACAAAATACTCATAAACAAAAATACCCCTAAACTCTCAAAAGTTTCAAGAATACTTTTACACTTTCAAAAAAAAAAAAGTTAAAACATACCATTATCGTTCAGATGGAAAAAATAATATTTTGTTTAAAAAATATCCTTTGAACTTTTAAAAGTTTCAATAATACAATTAAACTTAATAAAAAAAAAAAAAAAAAAACCTTTACTGTTAGCATATAAACCCAAATTGTTAATACTTGTTAAAAAAATATCTTTAAACTTTCCAAGGTTGTATTGATATTTTCGATAAACTTTTGAAAATTCAAAGATATTTTTGGAACTTTTGAAAGTTCAAGGGTATTTTTGAAACAAAACCATTAATGGTTTCTATCTAAAACTAACGATAAAAGTATTTTGAAAGTTTAAAGATATTTATGAAACTTTTGAAAGTTCAAAGGTAATCTCGACACAAAGTAGTATGTAAAGTTG
AAGATTTTATTTATTTATTTATTTATAATTTTTTATAATATAGACTTTTTCTAAACTCGATATTTCAAATTAAATAAAAGTTAATAATATTTGATGGCATTACAGGTGAAGGACAAGGATTGTGGAATTGCCTTCGTCCATGTTCCATGGACTATAAAAATCTGTCATGTTATGTGGATTGCATCGCACAAAACAGAGGAACAGGATCTTGTATTCCTAAACGACTTGGCTCCGATGAATATGTTTGTTGTTGTACTGGTTGA

mRNA sequence

ATGGGAGTCAGAGCAAACCTTTTTTCTTTTTATCCTTTTCGCGTTCACGAAACCAAACACCCTTCAGAAACTTTAAACTTAGCCTTGTGCGGGGGAAATTGGCTGACAGTGGCGATCGGCGGCGACAAACCAATACAAAAGCAAGGGTGTGGGGCGGCACAAGTGAAAAGAAAACAGAGGGGTTTTCTCCCTTTAGAATCGGGTCGTGTAGAGTGGCGAGTTGATGGCTGTGCAATGGACGGCGCGACGACAGGCGCCGGTACCGAGTATGTCGATCGATTACTGTTGCCCTTACAGCTACCACTCGTTCAATCGGGTAAGAATTCTTACTTCTTCCCTCAATTAATTGGTGAATTGCACGATATTTCAATTCCTAGACCGCCGCAAAAGCAACCACCTGTTTCTAATTCCAACAGATCCGATTCCATTTCTCAGCTGGGTTCGCTGTTAAGGAAAATAATGGTTAAGACAAATCCTTGGGAACCTGCACCAATCACATTTGCTCCCAACAATGAAGAGGATGATACCTTCTTGAAGAAAACCGACAATATTTTTGAAAGTCTAAATGCTGATAAAACAACTGGAGTTCCGGAAGTAGAGACTAAAGAACTTGTGGAGGCAAGTACTCAACCAGAGGTGTATTTGCAGATTTTCAAATGGCCAATGTGGCTTTTGGGGCCTTCTCTTCTCCTGACAACTGGGATGGCCCCAACATTGTGGCTTCCTATGTCTTCTGTATTTCTCGGTCCCAATGTAGCCAGCCTCCTCTCTTTGATTGGACTTGACTGCATCTATAACCTCGGTGCTATGCTTTTTCTTCTCATGGCTGATGCTTGTGCACGGCCTAAACAACCCATGAAACCCATGAGCAGTGAGGCTCCTTTCAGTTACCAATTCTGGAACATGCTTGCAAATGTGTTTGGATTTGTGATTCCTTTAGTGATGCTCTATGGATCTGAAAGTGGATTGGTTCAACCCCATCTGCCTTTCATCTCCTTGGCAGTTCTATTGGGCCCCTATATTTTGCTCCTTTCAGTACAGATTTTGACCGAAATGCTGACGTGGCACTGGCGATCGCCTGTTTGGCTGGTTACTCCAATTGTATACGAGGGTTATCGAGTTTTGCAGCTGATGAGAGGGTTGAAACTTGGGGCTGAGCTCAGTGCACCGGCCTGGATGATGCACACAATCAGAGGATTGGTTTGCTGGTGGGTACTTATACTCGGTATTCAACTCATGAGAGTAGCTTGGTTTGCAGGTGAAGGACAAGGATTGTGGAATTGCCTTCGTCCATGTTCCATGGACTATAAAAATCTGTCATGTTATGTGGATTGCATCGCACAAAACAGAGGAACAGGATCTTGTATTCCTAAACGACTTGGCTCCGATGAATATGTTTGTTGTTGTACTGGTTGA

Coding sequence (CDS)

ATGGGAGTCAGAGCAAACCTTTTTTCTTTTTATCCTTTTCGCGTTCACGAAACCAAACACCCTTCAGAAACTTTAAACTTAGCCTTGTGCGGGGGAAATTGGCTGACAGTGGCGATCGGCGGCGACAAACCAATACAAAAGCAAGGGTGTGGGGCGGCACAAGTGAAAAGAAAACAGAGGGGTTTTCTCCCTTTAGAATCGGGTCGTGTAGAGTGGCGAGTTGATGGCTGTGCAATGGACGGCGCGACGACAGGCGCCGGTACCGAGTATGTCGATCGATTACTGTTGCCCTTACAGCTACCACTCGTTCAATCGGGTAAGAATTCTTACTTCTTCCCTCAATTAATTGGTGAATTGCACGATATTTCAATTCCTAGACCGCCGCAAAAGCAACCACCTGTTTCTAATTCCAACAGATCCGATTCCATTTCTCAGCTGGGTTCGCTGTTAAGGAAAATAATGGTTAAGACAAATCCTTGGGAACCTGCACCAATCACATTTGCTCCCAACAATGAAGAGGATGATACCTTCTTGAAGAAAACCGACAATATTTTTGAAAGTCTAAATGCTGATAAAACAACTGGAGTTCCGGAAGTAGAGACTAAAGAACTTGTGGAGGCAAGTACTCAACCAGAGGTGTATTTGCAGATTTTCAAATGGCCAATGTGGCTTTTGGGGCCTTCTCTTCTCCTGACAACTGGGATGGCCCCAACATTGTGGCTTCCTATGTCTTCTGTATTTCTCGGTCCCAATGTAGCCAGCCTCCTCTCTTTGATTGGACTTGACTGCATCTATAACCTCGGTGCTATGCTTTTTCTTCTCATGGCTGATGCTTGTGCACGGCCTAAACAACCCATGAAACCCATGAGCAGTGAGGCTCCTTTCAGTTACCAATTCTGGAACATGCTTGCAAATGTGTTTGGATTTGTGATTCCTTTAGTGATGCTCTATGGATCTGAAAGTGGATTGGTTCAACCCCATCTGCCTTTCATCTCCTTGGCAGTTCTATTGGGCCCCTATATTTTGCTCCTTTCAGTACAGATTTTGACCGAAATGCTGACGTGGCACTGGCGATCGCCTGTTTGGCTGGTTACTCCAATTGTATACGAGGGTTATCGAGTTTTGCAGCTGATGAGAGGGTTGAAACTTGGGGCTGAGCTCAGTGCACCGGCCTGGATGATGCACACAATCAGAGGATTGGTTTGCTGGTGGGTACTTATACTCGGTATTCAACTCATGAGAGTAGCTTGGTTTGCAGGTGAAGGACAAGGATTGTGGAATTGCCTTCGTCCATGTTCCATGGACTATAAAAATCTGTCATGTTATGTGGATTGCATCGCACAAAACAGAGGAACAGGATCTTGTATTCCTAAACGACTTGGCTCCGATGAATATGTTTGTTGTTGTACTGGTTGA

Protein sequence

MGVRANLFSFYPFRVHETKHPSETLNLALCGGNWLTVAIGGDKPIQKQGCGAAQVKRKQRGFLPLESGRVEWRVDGCAMDGATTGAGTEYVDRLLLPLQLPLVQSGKNSYFFPQLIGELHDISIPRPPQKQPPVSNSNRSDSISQLGSLLRKIMVKTNPWEPAPITFAPNNEEDDTFLKKTDNIFESLNADKTTGVPEVETKELVEASTQPEVYLQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLMADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLPFISLAVLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMMHTIRGLVCWWVLILGIQLMRVAWFAGEGQGLWNCLRPCSMDYKNLSCYVDCIAQNRGTGSCIPKRLGSDEYVCCCTG
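
The protein sequence above should correspond to a translation of the CDS. The sketch below shows one way to check that for any stretch of the CDS, assuming the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly. The `translate` helper is hypothetical and written for illustration only.

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons and last four codons of the CDS listed above.
print(translate("ATGGGAGTCAGAGCAAAC"))
print(translate("TGTACTGGTTGA"))
```

The two fragments translate to `MGVRAN` and `CTG`, matching the start and end of the listed protein. Passing the full CDS string to `translate` would allow the same comparison over the whole sequence.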
Homology
BLAST of Clc01G04800 vs. NCBI nr
Match: XP_038875766.1 (uncharacterized protein LOC120068138 [Benincasa hispida] >XP_038875767.1 uncharacterized protein LOC120068138 [Benincasa hispida] >XP_038875768.1 uncharacterized protein LOC120068138 [Benincasa hispida] >XP_038875769.1 uncharacterized protein LOC120068138 [Benincasa hispida])

HSP 1 Score: 505.0 bits (1299), Expect = 7.1e-139
Identity = 251/267 (94.01%), Positives = 258/267 (96.63%), Query Frame = 0

Query: 155 VKTNPWEPAPITFAPNNE-EDDTFLKKTDNIFESLNADKTTGVPEVETKELVEASTQPEV 214
           V+TNPWEPAPITFAPNNE +DDTFLKKTDNIFESLNAD+TT VPEV+TKELVEAS QPEV
Sbjct: 53  VQTNPWEPAPITFAPNNEKDDDTFLKKTDNIFESLNADRTTEVPEVDTKELVEASNQPEV 112

Query: 215 YLQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFL 274
           +LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLG NVASLLSLIGLDCIYNLGAMLFL
Sbjct: 113 HLQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGSNVASLLSLIGLDCIYNLGAMLFL 172

Query: 275 LMADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLPFISL 334
           LMADACARPKQ  KPMSSEAPFSYQFWNMLANVFGFVIP VMLYGSESG +QPHLPFISL
Sbjct: 173 LMADACARPKQLRKPMSSEAPFSYQFWNMLANVFGFVIPFVMLYGSESGFIQPHLPFISL 232

Query: 335 AVLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWM 394
           AVLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWM
Sbjct: 233 AVLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWM 292

Query: 395 MHTIRGLVCWWVLILGIQLMRVAWFAG 421
           MHTI+GLVCWWVLILGIQLMRV WFAG
Sbjct: 293 MHTIKGLVCWWVLILGIQLMRVVWFAG 319
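
The percentages in each HSP summary line are the match counts divided by the alignment length. A small sketch of how such a line could be parsed and the percentages recomputed follows. The function name `parse_hsp_summary` is hypothetical, and the pattern deliberately accepts both "Positives" and the misspelling "Postives" that some BLAST report pages print.

```python
import re

HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return recomputed identity and positives percentages for one HSP line."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "alignment_length": int(ident_len),
    }


line = "Identity = 251/267 (94.01%), Positives = 258/267 (96.63%), Query Frame = 0"
print(parse_hsp_summary(line))
```

For the Benincasa hispida hit this reproduces 94.01% identity and 96.63% positives over 267 aligned columns.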

BLAST of Clc01G04800 vs. NCBI nr
Match: XP_004152380.1 (uncharacterized protein LOC101219687 [Cucumis sativus] >XP_011654828.1 uncharacterized protein LOC101219687 [Cucumis sativus] >KGN50298.1 hypothetical protein Csa_000500 [Cucumis sativus])

HSP 1 Score: 503.8 bits (1296), Expect = 1.6e-138
Identity = 248/266 (93.23%), Positives = 259/266 (97.37%), Query Frame = 0

Query: 156 KTNPWEPAPITFAPNNEEDDTFLKKTDNIFESLNADKTTGVPEVETKELVEASTQPE-VY 215
           +TNPWEPAP+TFAPNNEED+TFLKKTDNIFESLNAD+TT V EVETKEL+EA+ QPE V+
Sbjct: 54  QTNPWEPAPVTFAPNNEEDETFLKKTDNIFESLNADRTTEVSEVETKELLEATNQPEVVH 113

Query: 216 LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL 275
           LQIFKWPMW LGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL
Sbjct: 114 LQIFKWPMWFLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL 173

Query: 276 MADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLPFISLA 335
           MADACARPKQP+KPMSSEAPFSYQFWNMLANVFGF+IPLVM YGSESGL+QPHLPFISLA
Sbjct: 174 MADACARPKQPIKPMSSEAPFSYQFWNMLANVFGFMIPLVMFYGSESGLIQPHLPFISLA 233

Query: 336 VLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM 395
           VLLGPYILLLSVQILTEML WHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM
Sbjct: 234 VLLGPYILLLSVQILTEMLIWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM 293

Query: 396 HTIRGLVCWWVLILGIQLMRVAWFAG 421
           HT+RGLVCWWVLILGIQLMRVAWFAG
Sbjct: 294 HTMRGLVCWWVLILGIQLMRVAWFAG 319

BLAST of Clc01G04800 vs. NCBI nr
Match: XP_008436987.1 (PREDICTED: uncharacterized protein LOC103482552 [Cucumis melo] >XP_008436989.1 PREDICTED: uncharacterized protein LOC103482552 [Cucumis melo])

HSP 1 Score: 498.4 bits (1282), Expect = 6.6e-137
Identity = 256/306 (83.66%), Positives = 271/306 (88.56%), Query Frame = 0

Query: 122 ISIPRPPQKQPPVSNSNRSDSISQLGSLLRK------IMVKTNPWEPAPITFAPNNEEDD 181
           I+IP    K    S S++  + S L SL            +TNPWEPAP+TFA NN+ED+
Sbjct: 14  ITIPHYHHKNFKTSKSSKVLNASTLHSLFMHSRRPPICCTQTNPWEPAPVTFAHNNKEDE 73

Query: 182 TFLKKTDNIFESLNADKTTGVPEVETKELVEASTQPE-VYLQIFKWPMWLLGPSLLLTTG 241
           TFLKKTDNIFESLNAD+TT V EVETKELVEAS QPE V+LQIFKWPMWLLGPSLLLTTG
Sbjct: 74  TFLKKTDNIFESLNADRTTEVSEVETKELVEASNQPELVHLQIFKWPMWLLGPSLLLTTG 133

Query: 242 MAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLMADACARPKQPMKPMSSEAP 301
           MAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLMADACARPK+P+KPMSSEAP
Sbjct: 134 MAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLMADACARPKEPIKPMSSEAP 193

Query: 302 FSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLPFISLAVLLGPYILLLSVQILTEMLT 361
           FSYQFWN+LANV GF+IPLVM YGSESGLVQPHLPFI LAVLLGPYILLLSVQILTEML 
Sbjct: 194 FSYQFWNILANVVGFMIPLVMFYGSESGLVQPHLPFIPLAVLLGPYILLLSVQILTEMLI 253

Query: 362 WHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMMHTIRGLVCWWVLILGIQLMR 421
           WHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMMHT+RGLVCWWVLILGIQLMR
Sbjct: 254 WHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMMHTMRGLVCWWVLILGIQLMR 313

BLAST of Clc01G04800 vs. NCBI nr
Match: KAA0043465.1 (uncharacterized protein E6C27_scaffold1167G00020 [Cucumis melo var. makuwa])

HSP 1 Score: 495.7 bits (1275), Expect = 4.3e-136
Identity = 246/266 (92.48%), Positives = 257/266 (96.62%), Query Frame = 0

Query: 156 KTNPWEPAPITFAPNNEEDDTFLKKTDNIFESLNADKTTGVPEVETKELVEASTQPE-VY 215
           +TNPWEPAP+TFA NN+ED+TFLKKTDNIFESLNAD+TT V EVETKELVEAS QPE V+
Sbjct: 12  QTNPWEPAPVTFAHNNKEDETFLKKTDNIFESLNADRTTEVSEVETKELVEASNQPELVH 71

Query: 216 LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL 275
           LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL
Sbjct: 72  LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL 131

Query: 276 MADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLPFISLA 335
           MADACARPK+P+KPMSSEAPFSYQFWN+LANV GF+IPLVM YGSESGLVQPHLPFI LA
Sbjct: 132 MADACARPKEPIKPMSSEAPFSYQFWNILANVVGFMIPLVMFYGSESGLVQPHLPFIPLA 191

Query: 336 VLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM 395
           VLLGPYILLLSVQILTEML WHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM
Sbjct: 192 VLLGPYILLLSVQILTEMLIWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM 251

Query: 396 HTIRGLVCWWVLILGIQLMRVAWFAG 421
           HT+RGLVCWWVLILGIQLMRVAWFAG
Sbjct: 252 HTMRGLVCWWVLILGIQLMRVAWFAG 277

BLAST of Clc01G04800 vs. NCBI nr
Match: KAG6579387.1 (hypothetical protein SDJN03_23835, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 485.3 bits (1248), Expect = 5.8e-133
Identity = 241/271 (88.93%), Positives = 249/271 (91.88%), Query Frame = 0

Query: 158 NPWEPAPITFAPNNEEDDTFLKKTDNIFESLNADKTTGVPEVETKEL--------VEAST 217
           NPWEPAPITFA  NEEDDTFLK+T+NIF SLNAD TT  PEVETKEL        VE S 
Sbjct: 56  NPWEPAPITFASENEEDDTFLKRTENIFGSLNADSTTEAPEVETKELVEVEAKEVVEVSN 115

Query: 218 QPEVYLQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGA 277
           QPEV+LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGA
Sbjct: 116 QPEVHLQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGA 175

Query: 278 MLFLLMADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLP 337
           MLFLLMADACARPKQP+KPM SEAPFSYQFWNMLANV GF IP +MLYGS SGLVQPHLP
Sbjct: 176 MLFLLMADACARPKQPIKPMRSEAPFSYQFWNMLANVVGFAIPFIMLYGSGSGLVQPHLP 235

Query: 338 FISLAVLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSA 397
           FISLAVLLGPY+LLLSVQILTEMLTWHWRSPVWLVTPIVYEGYR+LQLMRGLKLGAELSA
Sbjct: 236 FISLAVLLGPYVLLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRILQLMRGLKLGAELSA 295

Query: 398 PAWMMHTIRGLVCWWVLILGIQLMRVAWFAG 421
           PAW MHTIRGLVCWWVLILG+QLMRVAWFAG
Sbjct: 296 PAWTMHTIRGLVCWWVLILGVQLMRVAWFAG 326

BLAST of Clc01G04800 vs. ExPASy TrEMBL
Match: A0A0A0KKZ1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G166420 PE=4 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 7.6e-139
Identity = 248/266 (93.23%), Positives = 259/266 (97.37%), Query Frame = 0

Query: 156 KTNPWEPAPITFAPNNEEDDTFLKKTDNIFESLNADKTTGVPEVETKELVEASTQPE-VY 215
           +TNPWEPAP+TFAPNNEED+TFLKKTDNIFESLNAD+TT V EVETKEL+EA+ QPE V+
Sbjct: 54  QTNPWEPAPVTFAPNNEEDETFLKKTDNIFESLNADRTTEVSEVETKELLEATNQPEVVH 113

Query: 216 LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL 275
           LQIFKWPMW LGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL
Sbjct: 114 LQIFKWPMWFLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL 173

Query: 276 MADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLPFISLA 335
           MADACARPKQP+KPMSSEAPFSYQFWNMLANVFGF+IPLVM YGSESGL+QPHLPFISLA
Sbjct: 174 MADACARPKQPIKPMSSEAPFSYQFWNMLANVFGFMIPLVMFYGSESGLIQPHLPFISLA 233

Query: 336 VLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM 395
           VLLGPYILLLSVQILTEML WHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM
Sbjct: 234 VLLGPYILLLSVQILTEMLIWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM 293

Query: 396 HTIRGLVCWWVLILGIQLMRVAWFAG 421
           HT+RGLVCWWVLILGIQLMRVAWFAG
Sbjct: 294 HTMRGLVCWWVLILGIQLMRVAWFAG 319

BLAST of Clc01G04800 vs. ExPASy TrEMBL
Match: A0A1S3ASL3 (uncharacterized protein LOC103482552 OS=Cucumis melo OX=3656 GN=LOC103482552 PE=4 SV=1)

HSP 1 Score: 498.4 bits (1282), Expect = 3.2e-137
Identity = 256/306 (83.66%), Positives = 271/306 (88.56%), Query Frame = 0

Query: 122 ISIPRPPQKQPPVSNSNRSDSISQLGSLLRK------IMVKTNPWEPAPITFAPNNEEDD 181
           I+IP    K    S S++  + S L SL            +TNPWEPAP+TFA NN+ED+
Sbjct: 14  ITIPHYHHKNFKTSKSSKVLNASTLHSLFMHSRRPPICCTQTNPWEPAPVTFAHNNKEDE 73

Query: 182 TFLKKTDNIFESLNADKTTGVPEVETKELVEASTQPE-VYLQIFKWPMWLLGPSLLLTTG 241
           TFLKKTDNIFESLNAD+TT V EVETKELVEAS QPE V+LQIFKWPMWLLGPSLLLTTG
Sbjct: 74  TFLKKTDNIFESLNADRTTEVSEVETKELVEASNQPELVHLQIFKWPMWLLGPSLLLTTG 133

Query: 242 MAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLMADACARPKQPMKPMSSEAP 301
           MAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLMADACARPK+P+KPMSSEAP
Sbjct: 134 MAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLMADACARPKEPIKPMSSEAP 193

Query: 302 FSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLPFISLAVLLGPYILLLSVQILTEMLT 361
           FSYQFWN+LANV GF+IPLVM YGSESGLVQPHLPFI LAVLLGPYILLLSVQILTEML 
Sbjct: 194 FSYQFWNILANVVGFMIPLVMFYGSESGLVQPHLPFIPLAVLLGPYILLLSVQILTEMLI 253

Query: 362 WHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMMHTIRGLVCWWVLILGIQLMR 421
           WHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMMHT+RGLVCWWVLILGIQLMR
Sbjct: 254 WHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMMHTMRGLVCWWVLILGIQLMR 313

BLAST of Clc01G04800 vs. ExPASy TrEMBL
Match: A0A5A7TPX8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold1167G00020 PE=4 SV=1)

HSP 1 Score: 495.7 bits (1275), Expect = 2.1e-136
Identity = 246/266 (92.48%), Positives = 257/266 (96.62%), Query Frame = 0

Query: 156 KTNPWEPAPITFAPNNEEDDTFLKKTDNIFESLNADKTTGVPEVETKELVEASTQPE-VY 215
           +TNPWEPAP+TFA NN+ED+TFLKKTDNIFESLNAD+TT V EVETKELVEAS QPE V+
Sbjct: 12  QTNPWEPAPVTFAHNNKEDETFLKKTDNIFESLNADRTTEVSEVETKELVEASNQPELVH 71

Query: 216 LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL 275
           LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL
Sbjct: 72  LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLL 131

Query: 276 MADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLPFISLA 335
           MADACARPK+P+KPMSSEAPFSYQFWN+LANV GF+IPLVM YGSESGLVQPHLPFI LA
Sbjct: 132 MADACARPKEPIKPMSSEAPFSYQFWNILANVVGFMIPLVMFYGSESGLVQPHLPFIPLA 191

Query: 336 VLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM 395
           VLLGPYILLLSVQILTEML WHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM
Sbjct: 192 VLLGPYILLLSVQILTEMLIWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMM 251

Query: 396 HTIRGLVCWWVLILGIQLMRVAWFAG 421
           HT+RGLVCWWVLILGIQLMRVAWFAG
Sbjct: 252 HTMRGLVCWWVLILGIQLMRVAWFAG 277

BLAST of Clc01G04800 vs. ExPASy TrEMBL
Match: A0A6J1E860 (uncharacterized protein LOC111430277 OS=Cucurbita moschata OX=3662 GN=LOC111430277 PE=4 SV=1)

HSP 1 Score: 484.2 bits (1245), Expect = 6.3e-133
Identity = 240/271 (88.56%), Positives = 249/271 (91.88%), Query Frame = 0

Query: 158 NPWEPAPITFAPNNEEDDTFLKKTDNIFESLNADKTTGVPEVETKELVEAST-------- 217
           NPWEPAPITFA  NEEDDTFLK+T+NIF SLNAD TT  PEVETKELVE  T        
Sbjct: 56  NPWEPAPITFASENEEDDTFLKRTENIFGSLNADSTTEAPEVETKELVEVETKEVVEVSN 115

Query: 218 QPEVYLQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGA 277
           QP+V+LQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGA
Sbjct: 116 QPKVHLQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGA 175

Query: 278 MLFLLMADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLP 337
           MLFLLMADACARPKQP+KPM SEAPFSYQFWNMLANV GF IP +MLYGS SGLVQPHLP
Sbjct: 176 MLFLLMADACARPKQPIKPMRSEAPFSYQFWNMLANVVGFAIPFIMLYGSGSGLVQPHLP 235

Query: 338 FISLAVLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSA 397
           FISLAVLLGPY+LLLSVQILTEMLTWHWRSPVWLVTPIVYEGYR+LQLMRGLKLGAELSA
Sbjct: 236 FISLAVLLGPYVLLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRILQLMRGLKLGAELSA 295

Query: 398 PAWMMHTIRGLVCWWVLILGIQLMRVAWFAG 421
           PAW MHTIRGLVCWWVLILG+QLMRVAWFAG
Sbjct: 296 PAWTMHTIRGLVCWWVLILGVQLMRVAWFAG 326

BLAST of Clc01G04800 vs. ExPASy TrEMBL
Match: A0A6J1E102 (uncharacterized protein LOC111026155 OS=Momordica charantia OX=3673 GN=LOC111026155 PE=4 SV=1)

HSP 1 Score: 480.3 bits (1235), Expect = 9.0e-132
Identity = 234/265 (88.30%), Positives = 250/265 (94.34%), Query Frame = 0

Query: 156 KTNPWEPAPITFAPNNEEDDTFLKKTDNIFESLNADKTTGVPEVETKELVEASTQPEVYL 215
           ++NPWEPAPIT+A NNE DD+FLK+TDNIFESLNAD TT VPEVE KE+   S QPEV+L
Sbjct: 54  QSNPWEPAPITYASNNEADDSFLKRTDNIFESLNADSTTEVPEVEIKEVTGVSNQPEVHL 113

Query: 216 QIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLM 275
           Q FKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDC+YNLGA LFLLM
Sbjct: 114 QFFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCLYNLGATLFLLM 173

Query: 276 ADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLPFISLAV 335
           ADACARPKQP+K M+SEAPFSYQFWNM+ANVFG+VIPLVMLYGSESGL+QP LPFISLAV
Sbjct: 174 ADACARPKQPIKAMNSEAPFSYQFWNMVANVFGYVIPLVMLYGSESGLIQPQLPFISLAV 233

Query: 336 LLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMMH 395
           LLGPYILLLSVQ+LTEMLTW WRSPVWLVTPIVYEGYR+LQLMRGLKLGAELSAPAWMMH
Sbjct: 234 LLGPYILLLSVQVLTEMLTWRWRSPVWLVTPIVYEGYRILQLMRGLKLGAELSAPAWMMH 293

Query: 396 TIRGLVCWWVLILGIQLMRVAWFAG 421
           TIRGLV WWVLILG+QLMRVAWFAG
Sbjct: 294 TIRGLVSWWVLILGVQLMRVAWFAG 318

BLAST of Clc01G04800 vs. TAIR 10
Match: AT3G60590.3 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 339.3 bits (869), Expect = 4.8e-93
Identity = 174/302 (57.62%), Positives = 222/302 (73.51%), Query Frame = 0

Query: 125 PRPPQKQPPVSNSNRSDSISQLGSLLRKIMV-KTNPWEPAPITFAPNNEEDDTFLKKTDN 184
           PR   K P +    ++ S +     LR +   K + WEP+P   A   E  D  L KT N
Sbjct: 93  PRVRLKNPNMLQKLKTGSCNFRFRNLRVLCTPKLSQWEPSPFIHASAEEAADIVLDKTAN 152

Query: 185 IFESLNADKTTGVPEVETKELVEASTQPEV--YLQIFKWPMWLLGPSLLLTTGMAPTLWL 244
           +FES+       V E   +E V+ S Q      +Q+ KWP+WLLGPS+LLT+GMAPTLWL
Sbjct: 153 VFESI-------VSESAEEEKVDMSAQQRTNSQVQVLKWPIWLLGPSVLLTSGMAPTLWL 212

Query: 245 PMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLMADACARPKQPMKPMSSEAPFSYQFWN 304
           P+SSVFLG NV SLLSLIGLDCI+NLGA LFLLMAD+CARPK P +  +S+ PFSY+FWN
Sbjct: 213 PLSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSCARPKDPSQSCNSKPPFSYKFWN 272

Query: 305 MLANVFGFVIPLVMLYGSESGL---VQPHLPFISLAVLLGPYILLLSVQILTEMLTWHWR 364
           M + + GF++P+++L+GS+SGL   +QP +PF+S AV+L PY +LL+VQ LTE+LTWHW+
Sbjct: 273 MFSLIIGFLVPMLLLFGSQSGLLASLQPQIPFLSSAVILFPYFILLAVQTLTEILTWHWQ 332

Query: 365 SPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMMHTIRGLVCWWVLILGIQLMRVAWF 421
           SPVWLVTP+VYE YR+LQLMRGL L AE++AP W++H +RGLV WWVLILG+QLMRVAWF
Sbjct: 333 SPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPVWVVHMLRGLVSWWVLILGMQLMRVAWF 387

BLAST of Clc01G04800 vs. TAIR 10
Match: AT3G60590.2 (unknown protein; LOCATED IN: chloroplast, chloroplast inner membrane, chloroplast envelope; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 339.3 bits (869), Expect = 4.8e-93
Identity = 174/302 (57.62%), Positives = 222/302 (73.51%), Query Frame = 0

Query: 125 PRPPQKQPPVSNSNRSDSISQLGSLLRKIMV-KTNPWEPAPITFAPNNEEDDTFLKKTDN 184
           PR   K P +    ++ S +     LR +   K + WEP+P   A   E  D  L KT N
Sbjct: 18  PRVRLKNPNMLQKLKTGSCNFRFRNLRVLCTPKLSQWEPSPFIHASAEEAADIVLDKTAN 77

Query: 185 IFESLNADKTTGVPEVETKELVEASTQPEV--YLQIFKWPMWLLGPSLLLTTGMAPTLWL 244
           +FES+       V E   +E V+ S Q      +Q+ KWP+WLLGPS+LLT+GMAPTLWL
Sbjct: 78  VFESI-------VSESAEEEKVDMSAQQRTNSQVQVLKWPIWLLGPSVLLTSGMAPTLWL 137

Query: 245 PMSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLMADACARPKQPMKPMSSEAPFSYQFWN 304
           P+SSVFLG NV SLLSLIGLDCI+NLGA LFLLMAD+CARPK P +  +S+ PFSY+FWN
Sbjct: 138 PLSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSCARPKDPSQSCNSKPPFSYKFWN 197

Query: 305 MLANVFGFVIPLVMLYGSESGL---VQPHLPFISLAVLLGPYILLLSVQILTEMLTWHWR 364
           M + + GF++P+++L+GS+SGL   +QP +PF+S AV+L PY +LL+VQ LTE+LTWHW+
Sbjct: 198 MFSLIIGFLVPMLLLFGSQSGLLASLQPQIPFLSSAVILFPYFILLAVQTLTEILTWHWQ 257

Query: 365 SPVWLVTPIVYEGYRVLQLMRGLKLGAELSAPAWMMHTIRGLVCWWVLILGIQLMRVAWF 421
           SPVWLVTP+VYE YR+LQLMRGL L AE++AP W++H +RGLV WWVLILG+QLMRVAWF
Sbjct: 258 SPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPVWVVHMLRGLVSWWVLILGMQLMRVAWF 312

BLAST of Clc01G04800 vs. TAIR 10
Match: AT3G60590.1 (unknown protein; LOCATED IN: chloroplast, chloroplast inner membrane, chloroplast envelope; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48460.1); Has 81 Blast hits to 81 proteins in 19 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 315.8 bits (808), Expect = 5.7e-86
Identity = 147/219 (67.12%), Positives = 186/219 (84.93%), Query Frame = 0

Query: 205 VEASTQPEVYLQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCI 264
           + A  +    +Q+ KWP+WLLGPS+LLT+GMAPTLWLP+SSVFLG NV SLLSLIGLDCI
Sbjct: 1   MSAQQRTNSQVQVLKWPIWLLGPSVLLTSGMAPTLWLPLSSVFLGSNVVSLLSLIGLDCI 60

Query: 265 YNLGAMLFLLMADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGL- 324
           +NLGA LFLLMAD+CARPK P +  +S+ PFSY+FWNM + + GF++P+++L+GS+SGL 
Sbjct: 61  FNLGATLFLLMADSCARPKDPSQSCNSKPPFSYKFWNMFSLIIGFLVPMLLLFGSQSGLL 120

Query: 325 --VQPHLPFISLAVLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGL 384
             +QP +PF+S AV+L PY +LL+VQ LTE+LTWHW+SPVWLVTP+VYE YR+LQLMRGL
Sbjct: 121 ASLQPQIPFLSSAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGL 180

Query: 385 KLGAELSAPAWMMHTIRGLVCWWVLILGIQLMRVAWFAG 421
            L AE++AP W++H +RGLV WWVLILG+QLMRVAWFAG
Sbjct: 181 TLSAEVNAPVWVVHMLRGLVSWWVLILGMQLMRVAWFAG 219
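
The Identity and Positives figures reported for this HSP can be checked against the alignment itself. The following is a minimal sketch, assuming the usual BLAST midline convention (a letter marks an identical residue, "+" marks a conservative substitution, a space marks neither). The helper names percent and count_midline are illustrative and not part of any BLAST tool; the sequences are the last block of the alignment above.

```python
# Sketch: recompute identity/positive counts from one BLAST alignment block.
# Assumption: midline letters = identities, '+' = conservative substitutions.

def percent(part, total):
    """Return part/total as a percentage rounded to two decimals."""
    return round(100.0 * part / total, 2)

def count_midline(midline):
    """Count identities (letters) and positives (letters plus '+')."""
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives

# Last block of the AT3G60590.1 alignment (39 aligned columns).
query   = "KLGAELSAPAWMMHTIRGLVCWWVLILGIQLMRVAWFAG"
midline = " L AE++AP W++H +RGLV WWVLILG+QLMRVAWFAG"
sbjct   = "TLSAEVNAPVWVVHMLRGLVSWWVLILGMQLMRVAWFAG"

# Cross-check: identities counted directly from the two sequences.
direct_identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
identities, positives = count_midline(midline)

print(identities, positives, direct_identities)
# Whole-HSP percentages, using the counts reported in the record.
print(percent(147, 219), percent(186, 219))
```

Summing the per-block counts over all four blocks of the HSP should reproduce the reported totals of 147 identities and 186 positives over 219 columns.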

BLAST of Clc01G04800 vs. TAIR 10
Match: AT3G60590.4 (unknown protein; LOCATED IN: chloroplast inner membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48460.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 315.8 bits (808), Expect = 5.7e-86
Identity = 147/219 (67.12%), Positives = 186/219 (84.93%), Query Frame = 0

Query: 205 VEASTQPEVYLQIFKWPMWLLGPSLLLTTGMAPTLWLPMSSVFLGPNVASLLSLIGLDCI 264
           + A  +    +Q+ KWP+WLLGPS+LLT+GMAPTLWLP+SSVFLG NV SLLSLIGLDCI
Sbjct: 1   MSAQQRTNSQVQVLKWPIWLLGPSVLLTSGMAPTLWLPLSSVFLGSNVVSLLSLIGLDCI 60

Query: 265 YNLGAMLFLLMADACARPKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGL- 324
           +NLGA LFLLMAD+CARPK P +  +S+ PFSY+FWNM + + GF++P+++L+GS+SGL 
Sbjct: 61  FNLGATLFLLMADSCARPKDPSQSCNSKPPFSYKFWNMFSLIIGFLVPMLLLFGSQSGLL 120

Query: 325 --VQPHLPFISLAVLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGL 384
             +QP +PF+S AV+L PY +LL+VQ LTE+LTWHW+SPVWLVTP+VYE YR+LQLMRGL
Sbjct: 121 ASLQPQIPFLSSAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGL 180

Query: 385 KLGAELSAPAWMMHTIRGLVCWWVLILGIQLMRVAWFAG 421
            L AE++AP W++H +RGLV WWVLILG+QLMRVAWFAG
Sbjct: 181 TLSAEVNAPVWVVHMLRGLVSWWVLILGMQLMRVAWFAG 219

BLAST of Clc01G04800 vs. TAIR 10
Match: AT5G63040.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48460.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 45.1 bits (105), Expect = 1.9e-04
Identity = 45/174 (25.86%), Positives = 83/174 (47.70%), Query Frame = 0

Query: 222 MWLLGPSLLLTTGMAPTLWLP--MSSVFLGPNVASLLSLIGLDCIYNLGAMLFLLMADAC 281
           +WL+GP++L+++ + P ++L   +S+VF    +   L L   + ++  G   FLL+ D  
Sbjct: 151 LWLIGPAVLVSSFILPPVYLRRIVSAVFEDSLLTDFLILFFTEALFYCGVAAFLLIIDRS 210

Query: 282 AR-----PKQPMKPMSSEAPFSYQFWNMLANVFGFVIPLVMLYGSESGLVQPHLPFISLA 341
            +     P+  + P    +    +  ++   V   +IP+V +     G V P     + A
Sbjct: 211 RKGSGKVPQNRINP----SQLGQRISSVATLVLSLMIPMVTM-----GFVWPWTGPAASA 270

Query: 342 VLLGPYILLLSVQILTEMLTWHWRSPVWLVTPIVYEGYRVLQLMRGLKLGAELS 389
             L PY++ + VQ   E    +  SP   + PI+++ YR+ QL R  +L   LS
Sbjct: 271 T-LAPYLVGIVVQFAFEQYARYRNSPSSPIIPIIFQVYRLHQLNRAAQLVTALS 314

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038875766.1 | 7.1e-139 | 94.01 | uncharacterized protein LOC120068138 [Benincasa hispida] >XP_038875767.1 unchara...
XP_004152380.1 | 1.6e-138 | 93.23 | uncharacterized protein LOC101219687 [Cucumis sativus] >XP_011654828.1 uncharact...
XP_008436987.1 | 6.6e-137 | 83.66 | PREDICTED: uncharacterized protein LOC103482552 [Cucumis melo] >XP_008436989.1 P...
KAA0043465.1 | 4.3e-136 | 92.48 | uncharacterized protein E6C27_scaffold1167G00020 [Cucumis melo var. makuwa]
KAG6579387.1 | 5.8e-133 | 88.93 | hypothetical protein SDJN03_23835, partial [Cucurbita argyrosperma subsp. sorori...
Match Name | E-value | Identity | Description
A0A0A0KKZ1 | 7.6e-139 | 93.23 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G166420 PE=4 SV=1
A0A1S3ASL3 | 3.2e-137 | 83.66 | uncharacterized protein LOC103482552 OS=Cucumis melo OX=3656 GN=LOC103482552 PE=...
A0A5A7TPX8 | 2.1e-136 | 92.48 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A6J1E860 | 6.3e-133 | 88.56 | uncharacterized protein LOC111430277 OS=Cucurbita moschata OX=3662 GN=LOC1114302...
A0A6J1E102 | 9.0e-132 | 88.30 | uncharacterized protein LOC111026155 OS=Momordica charantia OX=3673 GN=LOC111026...
Match Name | E-value | Identity | Description
AT3G60590.3 | 4.8e-93 | 57.62 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G60590.2 | 4.8e-93 | 57.62 | unknown protein; LOCATED IN: chloroplast, chloroplast inner membrane, chloroplas...
AT3G60590.1 | 5.7e-86 | 67.12 | unknown protein; LOCATED IN: chloroplast, chloroplast inner membrane, chloroplas...
AT3G60590.4 | 5.7e-86 | 67.12 | unknown protein; LOCATED IN: chloroplast inner membrane; EXPRESSED IN: 23 plant ...
AT5G63040.1 | 1.9e-04 | 25.86 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 122..142
None | No IPR available | PANTHER | PTHR33918:SF3 | CYTOCHROME P450 FAMILY PROTEIN | coord: 154..421
None | No IPR available | PANTHER | PTHR33918 | OS01G0704200 PROTEIN | coord: 154..421

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Clc01G04800.2 | Clc01G04800.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane