Cla97C01G005640 (gene) Watermelon (97103) v2

NameCla97C01G005640
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
Descriptionprotein BOLA4, chloroplastic/mitochondrial
LocationCla97Chr01 : 5415081 .. 5422554 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGATGATTAAGGTTCCTCATGTCGCGTTCCGTCGGGTTCTCCCATTGTTCGTTCACCGCAACCAACCTTTTTCGGTTGTCTCCCGCCAAATCACTTCCACCTGTGTTTGCACCGCCGTTTTCCCCTCAATTACCTCCACATCCCCCACCGGCGTCCGTAGTAAGAGCAGTAATATTGCGAAAAATAGCCGTGCTGAAGGAACATTCCCAAAATGGGGTTCGAGCCGGAACTTGAGCATTAGAGCCACTCAAGTGAATGACTCTGGATCTATTGATTCTCCACTCATGCAGTCAATGGAGAACAAGGTAATGTTTGTTTAGTTTCTAGTTTCTTCATGGTTTTAAATTTATACTTTTCTACGTTTTCATTTATGTGAGCAAACCAGTCCGTATGCTCTTTGAGGGAAATTAGGGCAACTTATTGAGCTAATTTAAGAGGAATGGAATCTCCCAGTCGATTATAGTTCAGAAATGTTCTTAGTAGCACAGCGATAAAGACCTTGACGCCGATTTTATGTGATGGCTTTTGCTTATCTTTTGTTTTCATTTTAACACGTTTATTATCAAATATATGATTTTCAGTTTCTTTTCTGCAAATGTGTATTTGTTTGGAAAACGGATGTAGTTTGTGTTTCTCGCTCCAATATTCTATCCACCAAGTTGTTTAGTTCTAAGGGGAGCCATGTAATTAGAAGGGCTACAAGCACAGCTAGTTGACTTATTCATCCTGAGATCACTTGCGCCCAAAGTCGCATATGTAGCAATGGATGGCATACGTGATCAAAAATCAATTTATTCTCCAAAGAAAAGGATCTGTCCTTCGATGAAATGCAAACAGATTTATACACTCAACCAATCGCTATTACAGTTTAGCTAGCTACTAGCAAAGCCAACATCTGCCACTGAGTAACCAGAAATTCCAGCACGTGAGCGCTGCTGTACATCTGCAGAGCCAATGGTGGCCGGTGGTTCTGAAGGTTTTGATAGAACGTGTTCGAATCTTTCCATTTTTGGTTCTCGTACCGGCTTGGGGTGGTGGTGGACGACGGAAACAACTGGCGACGCTGAGGTGGTAGTCGTGATGGTGGTGGTGGTATAAGTAGTGGGATTATTGGATGATGATTATGAGAACAAATCCATTAATAAAAAAGCAGAAAATTAAAAAGATCAAGGAAAGCAAAAAAAAAAAAAAAAAAAAAAAAAGGAAAAAAAGGAAAAAAAAGAGAAATAGAGTAATCTTTCAAAGACCAGCTTTTTGTCATGGTGGAAGAAGGATCTCATCTTTGGGGGTTTATAAGAGAACCTTTTTCTTAACTAGGAGGTAGGTAGTAAATCAAGTAGTGATTCAAGATGAATATCAAAGGAAGAGAAGAAACAAGAAGTGGGTTTAATTTGCATGCCACGAAGAGATCAAGAAGAACAATGTGAAGGAGATCATAAAGGGATTGACATCATTGATAATATTGAACATGGTCCTTCTTTAAAAAAAAACTGGAAAGAGGGTTGTTTTTGTTTTGTGTTTCTTTAGAAGAAGAGAAATGGAGGAAGGAAGAAGAAGAAGAGAAGGAAAAAAGGTCATGAGTTTATGGTATGAAGAAGGGAAATGAGGAAATTATTAGAGATCAAAGAGAGATCATAAGATTTTCTGTTGGAAAGAAAACTGAGAGTTAAACACTATAGAGAGAGAGGGAGAAAATAAAGGTTGAAATGAGGAGTAAAGTTAGTAGTGGGAACAATAGTAAATAGGCATAAAGTTTTGTCCTTGGTAAGGACAAAAAAAAAGGGCTTTGTTGGCAAAATGGTTATGGCATTTTCCCCTTGAAAATGAATCTTTATGGCATAGGGTCATTGTTAGGAAGCATGACTCATCCATTTGAGTGGTCGTTGAGTGGGGTTAGAGGCACATTTCAGAATCTATGGAAGGATATTTCTCGAGAGCTACCCTCCTTTTCCCATCTCGTTTGTTGTGTGGTGGGGAAAGGTAAGAAAACATATTTTTGGGAAGATTAGTGGGTGGGGGATAAACCTCTCTACTCCTCATTTTCTTGTTTATATCATTTATCTTTCTTTAAAAATTATATAATCTCTAATCTGTTGGTTTGGTCGAGGAGCTTGGTTGTTGCCGTTTGTTGATCGATGAGGAAACAACAAATGTTGCCTCTCTTCTGTCCTTGATAGGGGAGGTTAATTTTATTCTTGGGAGAAGGGATATTCGTATTTGGAGTCTTGATCCCTAGTGGGTTTCTCTTGTAAGTCCTTTTTTCGGCATTTATTGGATCCGACTCCCATTAGCGAATTTGTTTTTGACTTCTTATGGAGGATTAAGATTCCTAAGAAAGTTAAGTTCTTTTCCTTACAACTTTTATTTGATTGTGTGAATATTATGGACCAACTTTCTAGGAAGCTACCCTCGCTAGTGGGCCCTTGCTGTTGTACTCGTGTTGGAAGGTGGAGGAAAACCTGGATCACTTGCTTTGGACGTGTGACTTTGTGAGGACAATGTGGAATAGTTTCTTTTAGGAGTTTGGTTTTGTGTCTGTTCATCATTGAGATATTTTCTTTGAAATAAAAACAAACTTTTCATTGAATTAATGAAATGAGACTAATGCTCAAAGTACATGAGAGAAAATAAGACTTAGAACTCAAACTTGCCAATATAAAATAAGAAATAAAACAAGACTAAATAGAGAGTCTTCAGTATCCACCGAACAAAGAGAACTAATTGAGGGCAAGAGAGAGTGACTTGGAAGCTTCAGAATTTGAGCAAATAAATGCCCCAAAGATCCTAACCCAAAAGTAATATAACTCTCCGTAGACAGCTTGAAATCCACAAAGGTTTGTACAGCTTGATAATCATGGGAGGAGATGCCACCAATGAGTTTGAAAAGAGAATTAACTGGGAATGAACCTGAAGAATCCAATTCTAACACTTCATCTTCCTTAAAAAAAACTTCTAAATATGAATGACCATAGTTGAGCAACTGGATCGCAATGATCAAGTACAAAACCTTTGGGATTGGAAGCAGATCTTAGTAAACTTGGGGAGAGAGAACTCAAAGAAGAAGTGTTCAGCCAAGTAACTTGCCAAAATGAAATTCTTCTTCCATTGCCGAGTTTAAAGGAGATCAGATATTCAACCAACCAGAAATCAATTTTATGCCTAGCGCATAATGTTTAAGACTCCTCCCAAGCACCGAGGATCAGAAATATAATGAGAGAGGGATAACAATTCTGACCAAAGGAATCTTCTTTCCTTGTAGTAACCATAATCAATTGTAATCCAAATGGGCTTCATGCACTTGCAATGAGTCAAAAACGTTGACATGTAGCCTCCTTTTAAAACTTCTAAATCCACAGGCTCTCCCTTCCCATGAGATTATCAATCCTCCAAATTTTCCAAAAGACTCCACAAAATTCCAGCCAATATCTCCTGAACTCCCCATTGATTTGATTAAATAATGACAAAATCCTCCCTTTTTGGTCTCCTGAATTAGAACTAAAGTTCTGATTTTTGATGAGGTTCGGGTTTTTGGGTCTTCATTCCACGACAACATTTCCCAAATTTTGGTTGGTCCTTCATTGCCCTCTAAAGCTCAACTATTATGGTCAAATGCGGTAAAAGCCATATTATCAGAAATTTGGTTTGAACACAATCAAAGGGTCTTCCATGACAAGTCTATTGATTGGTTGGAACGCTCGGCTTCTTGCATCTACTTGGTATTCACAATCCAAATACTTTGCTGAATATTCTATCCAAGACATATACCTAAATTGGAATGCTTTCATTCCTTTAGATTAATTATGTATTTTTGACGTTATGTCTCTCAAGTTGCTCTTTTCAATGTACTTTTGTAAAGGAAGATGAGGAGGGTGCTAAGGTGTCAACCTAGTTGAGATAGTCGGATACGCTCATTGGTTATAAACCATTGTAGTCATGTTATCTCATTTGAAACTTGGTTTCAACATTTCATTGTAACTTGAGCATTAGACTCTTTTCATTACATCAATGAAAAAGTTTAGTTTCCGTTTCAAAAAAAAAAAAAAAAAAAAAGAGGTTTTTTAGAGCAACTATTTGGAAATATCCCCAAGCCCTCTGGTAATCCATGAAATAATTTTCATACTGAAGATAAAAAAGAAAGGACTTTTACTTCACCTGATGAGGCAATTGACAAGAAACTTGACCCAATTGGATCCCACAAACTTCCACTATTGTGTTAAGGCTAGAAGGATCTGCGAAGGCAACAAAGATGTGAAACTTACTTGATTTGGCTGGGAAACGGAGTTTTCAACTGAATTTTGGAATAAGCTTTCCATGACTTCTCCATATTAATCTTCTGAAAATTGAGAATCTGGAATCGATGTCAATGATTGATAGTCATCTTCCTCACTACTCACACTTGCTGAGTTTGAATCATCTTTTGAATCTATTGGAGACAAGGGGCATTTTTGGGAAGAAAGGCAGGGAATACTTTTCCTGATCTTGATTGAAGAATCCGGTAGAATGATGGTGAATCTTTTAAACCCTGAGAGGGATGGTCTAATTTTGAGGATCCCAACCGCTTTGAGGATTAACTTGAGATCTGAGCAATTACACAACTGATTTCAATGGCATTATTATTGATCTTTTGAAGCCTGGAGAATTTAGAAGATGCTTTTTGTCTTGAGTAATGAAATGGAAATGCTGGAGTTAAAGATCCGACTGTTTCTTCTCTTTTTCCAAATTGAAGCTGAAATCTGACATTTTCCCTCCAAAAATTGATAAATATTAGAGATTGTTTATTGGAGATTTGATCGAGGAGTTTCTCGCCCTTCTGCCTTTTAAGGAGAAGGGTCAATTCTTGTGGTTAGCAGGGGTGTGTGCTTGTGGGATCTTTGGGGGCAAGAAATAATAGGATTTTAGAGGAGTGGAGAGGGACCCTAGTGCTGTTTGGTCCTTACTCCTTGGTGAGTTTTCATGTTTCTCTTTGGGTTTCGATTTCGACAACTTTTTGTAATATTCCATCGACAACATTATACCTAGTTGGTCCCCTTTTCTTTTTAGGAGATTTTTGTAGGGCTTGACATTTGTATGCTTATGTGTACTTCCATTTTCTCTAAGTGAAAACAATTGTTTCTATAATACAAAGGCAAGGTTGTGAGTCATGTCAAACTGAGGACAATCAAAAGCTTAGAAGTACATTCCAACATCATGTCAAACCTAGGACTATCAAATGTGATAGGAACCAAAACACACACAAGATAACAAAATAATTCGAGATACTTGGACCTTCCCCTCTTGAACACACTTACGAGATAACAAAATAATTTGAGATACTTGGACCCTCCCCTCTTGAGACCTCTTTCAAGGCCTAATTCACTCTTCAAGATAATGCCTATCTTTCCTAATTTCCTTCTCTCTATTTTTATATCCAAATTTCATCATTTATGGTTAGAAAACAGAAGCTTTGCAGTCCTTGTCTTGAATATTTATTGAAATTAGTCTGGTAAAGTGTGTTTGAATGGAAATTGTTATTTCTTTCTAGAGTTTTTTCTTCAGAGTTGTGGCTGTCTCTTTCAAGATGATTAAAATTTTGTTTGTTTAGTTGACTTTTCAAATTTCTTCATCAGATTAAGGAACAACTAGATGCACAATCTGTCTCGGTGAAAGATGCTTATGGGGATGGACGACATGTTAGGTGAGGACTTTTTCCCTACAGTTGCTTTTCTATGGTAGCTTAGTGTAAGAAGCTTGATCTTTCCAGAATGCCTGAGAAATGTATATACTGGCCAGAAAAAAAAAGGAAACATTTTCTTTTCTTCCATTTGTTCCTGATTCACGTTTGGTCTTCTTATAAAATTTGTTTGCCTTCAAAATTCAAATCATGGTGTTTAAAGACTCCAATCGTCCAGAGCTCGTTAGCAGATTTTTTTTCTTCGAGTCTTTTAGCAATGAAACAGAAAATAGGAAAATTATCATAAATAGAAAAAATATCAAACTGTTTACAAATATAGCGAAATTTTACTTTCTATTTGTGATAGACTGTGATAGACCGTGATAGACCCAAATAGACATTTATCCGTGTGATAAATGTCTATTTGGGTCTATCGCAGTCTATCACAGATAGAAAGTAAAATTTTGCTATATTTGTAATTATTTTCAATAGTTTTTCTATTTTTGGAAACATCCCGAAAAAATATTGAGACTTACGTTTGAATTGGAGCACATTTATTTTTCCTAAATAGGTCTTATGTTTAACTAAATTAAGTTAGTATTATTATTTTATTTGTGTCATTGTATTTTGAGCATTAGCCCCTTTTTATAGCCCCTTTTTATTTCACGAATTATAATTTTCATTAAAAAAAAAAAAAGAAGAAGAAGAAAACAAAACAAAAGAAAAAAGTTGAATTGAGAGCCCAAAGAGAGGTATTGGAGGTGGGGGCCGTTTCATTGTCTTGCGAGGTAAATATATACATATTTTGTCTGTTAAATGTATATATAGTCTAGCAAAAAAAACTTTGGTATACAGTTTGTATGGGAGTGAGTTTGATAGGAATTAAAACACAATCAAACTATATTGAAATGTAAATGACTGTGAATGACTAAAACTTAGAGAGAGAACCAAGTAATTCGAAGTACTCGGACCTTTCCCTCTTGAGACCTCTTCAAGTCCTAATTCATTCATCAAGATAATATCTATCTTCCCTATCTTTCCTATTCTCCTTTCTCTCTATTTATAACCAAATTCCATAACCAACTCCCCCATCTAATTACTAATTACTAATATACTCTTAATATCCGCTGAACACCCTAATACTATTCCTATGAGAGATCGAAAGCCCTTTTGGCATCTGTTTCTTGCCAAATCATAGTTTATGACCTATGGGTTTTCTATCCTTCAAATCTCTGATATTTCAAGTTTAAATTTAACATTTTCCATAAAGATGGTCCCTCTCCTTTAGCTTATTCGAAGACTTCCCATTCTTGTAAATGATTGATGTCCCTAATTTGAGTCTTGTTCAACTGGTTAAAATATATATTTTGATCAAAAGGTAAGAAGATTGAATCTCTACCTCTGCATATTGTTGGACTAAAAAAAATGATCCTGGGCCTCCAATATATTTCCCTAATAAATTGATCAAACGTCTTTATATCTGTTTTACCTACTTGAACATCCCGTGGTGCACTCCTCTTAAAAAAGTTCATTGAGAGGCATATTGGTTAGGGAGTTTCCTGTAGTTTGCATGTAACGTTGAAGTATAATTGGCTTCATAAGCCATATCCAAAGAACAAATCAAGATGAATTGATTACTCAATTAAGTTAATTTTGCTTGCAGCATTGATGTTGTTTCTTCAGCCTTTGAGGGACAGACGGCTGTGAATAGGCAGAGGATGGTATACAAAGCTATTTGGGAAGAGCTACAAAGCACAGTGCATGCAGTGGATCAAATGACCACCAAAACACCTGCTGAAGCAGCAGCTCAGAATTCATAA

mRNA sequence

ATGATGATGATTAAGGTTCCTCATGTCGCGTTCCGTCGGGTTCTCCCATTGTTCGTTCACCGCAACCAACCTTTTTCGGTTGTCTCCCGCCAAATCACTTCCACCTGTGTTTGCACCGCCGTTTTCCCCTCAATTACCTCCACATCCCCCACCGGCGTCCGTAGTAAGAGCAGTAATATTGCGAAAAATAGCCGTGCTGAAGGAACATTCCCAAAATGGGGTTCGAGCCGGAACTTGAGCATTAGAGCCACTCAAGTGAATGACTCTGGATCTATTGATTCTCCACTCATGCAGTCAATGGAGAACAAGATTAAGGAACAACTAGATGCACAATCTGTCTCGGTGAAAGATGCTTATGGGGATGGACGACATGTTAGCATTGATGTTGTTTCTTCAGCCTTTGAGGGACAGACGGCTGTGAATAGGCAGAGGATGGTATACAAAGCTATTTGGGAAGAGCTACAAAGCACAGTGCATGCAGTGGATCAAATGACCACCAAAACACCTGCTGAAGCAGCAGCTCAGAATTCATAA

Coding sequence (CDS)

ATGATGATGATTAAGGTTCCTCATGTCGCGTTCCGTCGGGTTCTCCCATTGTTCGTTCACCGCAACCAACCTTTTTCGGTTGTCTCCCGCCAAATCACTTCCACCTGTGTTTGCACCGCCGTTTTCCCCTCAATTACCTCCACATCCCCCACCGGCGTCCGTAGTAAGAGCAGTAATATTGCGAAAAATAGCCGTGCTGAAGGAACATTCCCAAAATGGGGTTCGAGCCGGAACTTGAGCATTAGAGCCACTCAAGTGAATGACTCTGGATCTATTGATTCTCCACTCATGCAGTCAATGGAGAACAAGATTAAGGAACAACTAGATGCACAATCTGTCTCGGTGAAAGATGCTTATGGGGATGGACGACATGTTAGCATTGATGTTGTTTCTTCAGCCTTTGAGGGACAGACGGCTGTGAATAGGCAGAGGATGGTATACAAAGCTATTTGGGAAGAGCTACAAAGCACAGTGCATGCAGTGGATCAAATGACCACCAAAACACCTGCTGAAGCAGCAGCTCAGAATTCATAA

Protein sequence

MMMIKVPHVAFRRVLPLFVHRNQPFSVVSRQITSTCVCTAVFPSITSTSPTGVRSKSSNIAKNSRAEGTFPKWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAYGDGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQNS
BLAST of Cla97C01G005640 vs. NCBI nr
Match: XP_004143637.1 (PREDICTED: protein BOLA4, chloroplastic/mitochondrial [Cucumis sativus] >KGN50406.1 hypothetical protein Csa_5G172860 [Cucumis sativus])

HSP 1 Score: 294.3 bits (752), Expect = 2.8e-76
Identity = 158/177 (89.27%), Postives = 166/177 (93.79%), Query Frame = 0

Query: 2   MMIKVPHVAFRRVLPLFVHRNQPFSVVSRQITSTCVCT-AVFPSITSTSPTGVRSKSSNI 61
           MMIKVP VAFRRVLP+F+ RN+PFSVVSRQ+TS CVCT AV   I STS TG+RSKSSN+
Sbjct: 1   MMIKVPQVAFRRVLPIFLPRNRPFSVVSRQVTSNCVCTPAVVAVIISTSTTGLRSKSSNV 60

Query: 62  AKNSRAEGTFPKWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAYG 121
           AK SRA GTFPKWGSSRNLSIRATQVN+SGSIDSPLMQSMENKIKEQLDAQSVSVKDAYG
Sbjct: 61  AKYSRANGTFPKWGSSRNLSIRATQVNESGSIDSPLMQSMENKIKEQLDAQSVSVKDAYG 120

Query: 122 DGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQNS 178
           DGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTT+TPAEAAAQNS
Sbjct: 121 DGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTRTPAEAAAQNS 177

BLAST of Cla97C01G005640 vs. NCBI nr
Match: XP_008467240.1 (PREDICTED: protein BOLA4, chloroplastic/mitochondrial [Cucumis melo])

HSP 1 Score: 290.0 bits (741), Expect = 5.3e-75
Identity = 157/178 (88.20%), Postives = 164/178 (92.13%), Query Frame = 0

Query: 1   MMMIKVPHVAFRRVLPLFVHRNQPFSVVSRQITSTCVCT-AVFPSITSTSPTGVRSKSSN 60
           M MIKVP +AFRRVLP+ + RNQPFSVVSRQITS CVCT AV   ITS S TG+RSKSSN
Sbjct: 1   MTMIKVPQLAFRRVLPMLLPRNQPFSVVSRQITSNCVCTAAVVAVITSKSNTGLRSKSSN 60

Query: 61  IAKNSRAEGTFPKWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAY 120
           + K SRA GTFPKWGSSRNLSIRATQVN+SGSIDSPLMQSMENKIKEQLDAQSVSVKDAY
Sbjct: 61  VVKYSRANGTFPKWGSSRNLSIRATQVNESGSIDSPLMQSMENKIKEQLDAQSVSVKDAY 120

Query: 121 GDGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQNS 178
           GDGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTT+TPAEAAAQNS
Sbjct: 121 GDGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTRTPAEAAAQNS 178

BLAST of Cla97C01G005640 vs. NCBI nr
Match: XP_022159690.1 (protein BOLA4, chloroplastic/mitochondrial [Momordica charantia])

HSP 1 Score: 269.6 bits (688), Expect = 7.4e-69
Identity = 145/177 (81.92%), Postives = 158/177 (89.27%), Query Frame = 0

Query: 1   MMMIKVPHVAFRRVLPLFVHRNQPFSVVSRQITSTCVCTAVFPSITSTSPTGVRSKSSNI 60
           M M++VP VAFRRVLPLFVHR+ PF VVSRQIT+ CV TA   +I+S + T VR KSSN+
Sbjct: 1   MAMVRVPQVAFRRVLPLFVHRDVPFLVVSRQITTNCVRTAA-AAISSRTTTSVRCKSSNV 60

Query: 61  AKNSRAEGTFPKWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAYG 120
           AK   A GTF KWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSV+VKDAYG
Sbjct: 61  AKYGSANGTFSKWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVTVKDAYG 120

Query: 121 DGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQNS 178
           DGRHVSIDV+SSAFEGQ+AVNRQRMVYKAIWEELQSTVHAVDQMTT+TPAEAA +NS
Sbjct: 121 DGRHVSIDVISSAFEGQSAVNRQRMVYKAIWEELQSTVHAVDQMTTRTPAEAAVKNS 176

BLAST of Cla97C01G005640 vs. NCBI nr
Match: XP_022996142.1 (protein BOLA4, chloroplastic/mitochondrial [Cucurbita maxima])

HSP 1 Score: 263.1 bits (671), Expect = 6.9e-67
Identity = 142/176 (80.68%), Postives = 154/176 (87.50%), Query Frame = 0

Query: 3   MIKVPHVAFRRVLPLFVHRNQPFSVVSRQITSTCVCT-AVFPSITSTSPTGVRSKSSNIA 62
           MIKVP VAFRRVLPLFVHRN+ F +VS+Q T+ CV T AV  +IT+T+ TGV SKSSN+ 
Sbjct: 6   MIKVPQVAFRRVLPLFVHRNEQFFIVSQQFTTNCVRTAAVSAAITTTTATGVCSKSSNVL 65

Query: 63  KNSRAEGTFPKWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAYGD 122
           K S   G F  WGSSRN+ +RATQVNDSGSIDSPLMQSMENKIKEQL+AQSVSVKDAYGD
Sbjct: 66  KFSYVNGRFSNWGSSRNMCVRATQVNDSGSIDSPLMQSMENKIKEQLEAQSVSVKDAYGD 125

Query: 123 GRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQNS 178
           GRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTT+TP EA AQNS
Sbjct: 126 GRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTRTPTEAEAQNS 181

BLAST of Cla97C01G005640 vs. NCBI nr
Match: XP_023534696.1 (protein BOLA4, chloroplastic/mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 261.5 bits (667), Expect = 2.0e-66
Identity = 142/176 (80.68%), Postives = 153/176 (86.93%), Query Frame = 0

Query: 3   MIKVPHVAFRRVLPLFVHRNQPFSVVSRQITSTCVCT-AVFPSITSTSPTGVRSKSSNIA 62
           MIK P VAFRRVLPL VHRN+ F VVS+Q T+ CV T AV  +IT+T+ TGV SKSSN+ 
Sbjct: 1   MIKAPQVAFRRVLPLIVHRNEQFFVVSQQFTTNCVRTAAVSAAITTTTATGVGSKSSNVL 60

Query: 63  KNSRAEGTFPKWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAYGD 122
           K S   G F  WGSSRN+ +RATQVNDSGSIDSPLMQSMENKIKEQL+AQSVSVKDAYGD
Sbjct: 61  KFSYVNGRFSNWGSSRNMCVRATQVNDSGSIDSPLMQSMENKIKEQLEAQSVSVKDAYGD 120

Query: 123 GRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQNS 178
           GRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTT+TPAEA AQNS
Sbjct: 121 GRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTRTPAEAEAQNS 176

BLAST of Cla97C01G005640 vs. TrEMBL
Match: tr|A0A0A0KRR5|A0A0A0KRR5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G172860 PE=3 SV=1)

HSP 1 Score: 294.3 bits (752), Expect = 1.9e-76
Identity = 158/177 (89.27%), Postives = 166/177 (93.79%), Query Frame = 0

Query: 2   MMIKVPHVAFRRVLPLFVHRNQPFSVVSRQITSTCVCT-AVFPSITSTSPTGVRSKSSNI 61
           MMIKVP VAFRRVLP+F+ RN+PFSVVSRQ+TS CVCT AV   I STS TG+RSKSSN+
Sbjct: 1   MMIKVPQVAFRRVLPIFLPRNRPFSVVSRQVTSNCVCTPAVVAVIISTSTTGLRSKSSNV 60

Query: 62  AKNSRAEGTFPKWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAYG 121
           AK SRA GTFPKWGSSRNLSIRATQVN+SGSIDSPLMQSMENKIKEQLDAQSVSVKDAYG
Sbjct: 61  AKYSRANGTFPKWGSSRNLSIRATQVNESGSIDSPLMQSMENKIKEQLDAQSVSVKDAYG 120

Query: 122 DGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQNS 178
           DGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTT+TPAEAAAQNS
Sbjct: 121 DGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTRTPAEAAAQNS 177

BLAST of Cla97C01G005640 vs. TrEMBL
Match: tr|A0A1S3CT29|A0A1S3CT29_CUCME (protein BOLA4, chloroplastic/mitochondrial OS=Cucumis melo OX=3656 GN=LOC103504639 PE=3 SV=1)

HSP 1 Score: 290.0 bits (741), Expect = 3.5e-75
Identity = 157/178 (88.20%), Postives = 164/178 (92.13%), Query Frame = 0

Query: 1   MMMIKVPHVAFRRVLPLFVHRNQPFSVVSRQITSTCVCT-AVFPSITSTSPTGVRSKSSN 60
           M MIKVP +AFRRVLP+ + RNQPFSVVSRQITS CVCT AV   ITS S TG+RSKSSN
Sbjct: 1   MTMIKVPQLAFRRVLPMLLPRNQPFSVVSRQITSNCVCTAAVVAVITSKSNTGLRSKSSN 60

Query: 61  IAKNSRAEGTFPKWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAY 120
           + K SRA GTFPKWGSSRNLSIRATQVN+SGSIDSPLMQSMENKIKEQLDAQSVSVKDAY
Sbjct: 61  VVKYSRANGTFPKWGSSRNLSIRATQVNESGSIDSPLMQSMENKIKEQLDAQSVSVKDAY 120

Query: 121 GDGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQNS 178
           GDGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTT+TPAEAAAQNS
Sbjct: 121 GDGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTRTPAEAAAQNS 178

BLAST of Cla97C01G005640 vs. TrEMBL
Match: tr|A0A2C9VHY8|A0A2C9VHY8_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_08G158500 PE=3 SV=1)

HSP 1 Score: 172.2 bits (435), Expect = 1.1e-39
Identity = 87/103 (84.47%), Postives = 98/103 (95.15%), Query Frame = 0

Query: 73  WGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAYGDGRHVSIDVVSS 132
           +  SR L++RAT VND+GSIDSPLMQSME KIKE+L+A+SV+VKDAYGDGRHVSIDV+SS
Sbjct: 86  FAGSRGLTVRATDVNDAGSIDSPLMQSMERKIKEELNAESVTVKDAYGDGRHVSIDVISS 145

Query: 133 AFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQ 176
           AFEGQ+AVNRQRMVYKAIWEELQSTVHAVDQMTTKTP+EAAAQ
Sbjct: 146 AFEGQSAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPSEAAAQ 188

BLAST of Cla97C01G005640 vs. TrEMBL
Match: tr|B9SDE8|B9SDE8_RICCO (Transcription regulator, putative OS=Ricinus communis OX=3988 GN=RCOM_1517150 PE=3 SV=1)

HSP 1 Score: 169.1 bits (427), Expect = 9.0e-39
Identity = 97/134 (72.39%), Postives = 109/134 (81.34%), Query Frame = 0

Query: 44  SITSTSPTGVRSKSSNIAKNSRAEGTFPKWG--SSRNLSIRATQVNDSGSIDSPLMQSME 103
           S T+ + T ++++SS   K S+       +G   +R  SIRAT VND+GSIDSPLMQSME
Sbjct: 45  SCTNNTKTKLQTESSGYNKLSKL-----GYGVVGNRRFSIRATHVNDAGSIDSPLMQSME 104

Query: 104 NKIKEQLDAQSVSVKDAYGDGRHVSIDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAV 163
            KIKE LDA SV VKDAYGDGRHVSIDV+SSAFEGQ+AVNRQRMVYKAIWEELQSTVHAV
Sbjct: 105 KKIKESLDADSVIVKDAYGDGRHVSIDVISSAFEGQSAVNRQRMVYKAIWEELQSTVHAV 164

Query: 164 DQMTTKTPAEAAAQ 176
           DQMTTKTPAEAAAQ
Sbjct: 165 DQMTTKTPAEAAAQ 173

BLAST of Cla97C01G005640 vs. TrEMBL
Match: tr|A0A199VU84|A0A199VU84_ANACO (Protein BOLA4, chloroplastic/mitochondrial OS=Ananas comosus OX=4615 GN=ACMD2_20599 PE=3 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 7.6e-38
Identity = 87/106 (82.08%), Postives = 95/106 (89.62%), Query Frame = 0

Query: 71  PKWGSSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAYGDGRHVSIDVV 130
           P +   R  S+RATQV+DSGSI+SP+MQ+MENKIKEQL+A  V VKDAYGDGRHVSIDVV
Sbjct: 79  PPFPVRRLCSVRATQVSDSGSINSPMMQAMENKIKEQLEADVVIVKDAYGDGRHVSIDVV 138

Query: 131 SSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQN 177
           S AFEGQ+AVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAA N
Sbjct: 139 SKAFEGQSAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAADN 184

BLAST of Cla97C01G005640 vs. Swiss-Prot
Match: sp|Q9LF68|BOLA4_ARATH (Protein BOLA4, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=BOLA4 PE=1 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 9.9e-33
Identity = 85/157 (54.14%), Postives = 106/157 (67.52%), Query Frame = 0

Query: 17  LFVHRNQPFSVVSRQIT--STCVCTAVFPSITSTSPTGVRSKSSNIAKNSRAEGTFPKWG 76
           LF    +PF+  S  +   +   C +   S   T+P+ +        + SR  G      
Sbjct: 25  LFQSSKRPFASFSAPLVRFTNSRCVSAVLSRKETAPSSIYGN-----RVSRVSGFGALDI 84

Query: 77  SSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAYGDGRHVSIDVVSSAF 136
            S N S +++Q+ND+GSID  LMQSME KIKEQL+A+SV+V D  GDGRHV I+VVSSAF
Sbjct: 85  RSLNFSTKSSQINDAGSIDQTLMQSMELKIKEQLNAESVTVTDMSGDGRHVCINVVSSAF 144

Query: 137 EGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAE 172
           EGQ+AVNRQRMVYKAIWEELQ+ VHAVDQMTTKTP+E
Sbjct: 145 EGQSAVNRQRMVYKAIWEELQNVVHAVDQMTTKTPSE 176

BLAST of Cla97C01G005640 vs. TAIR10
Match: AT5G17560.1 (BolA-like family protein)

HSP 1 Score: 141.4 bits (355), Expect = 5.5e-34
Identity = 85/157 (54.14%), Postives = 106/157 (67.52%), Query Frame = 0

Query: 17  LFVHRNQPFSVVSRQIT--STCVCTAVFPSITSTSPTGVRSKSSNIAKNSRAEGTFPKWG 76
           LF    +PF+  S  +   +   C +   S   T+P+ +        + SR  G      
Sbjct: 25  LFQSSKRPFASFSAPLVRFTNSRCVSAVLSRKETAPSSIYGN-----RVSRVSGFGALDI 84

Query: 77  SSRNLSIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDAYGDGRHVSIDVVSSAF 136
            S N S +++Q+ND+GSID  LMQSME KIKEQL+A+SV+V D  GDGRHV I+VVSSAF
Sbjct: 85  RSLNFSTKSSQINDAGSIDQTLMQSMELKIKEQLNAESVTVTDMSGDGRHVCINVVSSAF 144

Query: 137 EGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAE 172
           EGQ+AVNRQRMVYKAIWEELQ+ VHAVDQMTTKTP+E
Sbjct: 145 EGQSAVNRQRMVYKAIWEELQNVVHAVDQMTTKTPSE 176

BLAST of Cla97C01G005640 vs. TAIR10
Match: AT4G26500.1 (chloroplast sulfur E)

HSP 1 Score: 44.3 bits (103), Expect = 9.2e-05
Identity = 28/87 (32.18%), Postives = 49/87 (56.32%), Query Frame = 0

Query: 100 MENKIKEQLDAQSVSVKD--------------AYGDGR-HVSIDVVSSAFEGQTAVNRQR 159
           +  K++++LD   + V+D              A  DG  H ++ +VS AF+G++ V R R
Sbjct: 285 IREKLEKELDPVELEVEDVSYQHAGHAAVRGSAGDDGETHFNLRIVSDAFQGKSLVKRHR 344

Query: 160 MVYKAIWEELQSTVHAVDQMTTKTPAE 172
           ++Y  + +EL+S +HA+  +  KTPAE
Sbjct: 345 LIYDLLQDELKSGLHAL-SIVAKTPAE 370

BLAST of Cla97C01G005640 vs. TAIR10
Match: AT1G55805.1 (BolA-like family protein)

HSP 1 Score: 42.7 bits (99), Expect = 2.7e-04
Identity = 45/170 (26.47%), Postives = 78/170 (45.88%), Query Frame = 0

Query: 20  HRNQPFSVVSRQITSTCVCTAVFPSITSTSPTGVRSKSSNIAKNSRAEGTFPKWGSSRNL 79
           HR QP       + S     +VF S+     +   SKS+     S A  +  K GS    
Sbjct: 14  HRTQP-------LKSPVNSPSVFISVPKFFNS--ESKSTGTGSRSVAMSSVEKTGS---- 73

Query: 80  SIRATQVNDSGSIDSPLMQSMENKIKEQLDAQSVSVKDA-------------YGDGRHVS 139
                   DSG+I++   + M  K++++L+   + ++D                D  H +
Sbjct: 74  --------DSGAIENRASR-MREKLQKELEPVELVIEDVSYQHAGHAGMKGRTDDETHFN 133

Query: 140 IDVVSSAFEGQTAVNRQRMVYKAIWEELQSTVHAVDQMTTKTPAEAAAQN 177
           + +VS  FEG   V R R+VY  + EEL + +HA+  + +KTP+E+ +++
Sbjct: 134 VKIVSKGFEGMNLVKRHRLVYHLLREELDTGLHAL-SIVSKTPSESPSKD 160

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004143637.12.8e-7689.27PREDICTED: protein BOLA4, chloroplastic/mitochondrial [Cucumis sativus] >KGN5040... [more]
XP_008467240.15.3e-7588.20PREDICTED: protein BOLA4, chloroplastic/mitochondrial [Cucumis melo][more]
XP_022159690.17.4e-6981.92protein BOLA4, chloroplastic/mitochondrial [Momordica charantia][more]
XP_022996142.16.9e-6780.68protein BOLA4, chloroplastic/mitochondrial [Cucurbita maxima][more]
XP_023534696.12.0e-6680.68protein BOLA4, chloroplastic/mitochondrial [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
tr|A0A0A0KRR5|A0A0A0KRR5_CUCSA1.9e-7689.27Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G172860 PE=3 SV=1[more]
tr|A0A1S3CT29|A0A1S3CT29_CUCME3.5e-7588.20protein BOLA4, chloroplastic/mitochondrial OS=Cucumis melo OX=3656 GN=LOC1035046... [more]
tr|A0A2C9VHY8|A0A2C9VHY8_MANES1.1e-3984.47Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_08G158500 PE=3 SV=... [more]
tr|B9SDE8|B9SDE8_RICCO9.0e-3972.39Transcription regulator, putative OS=Ricinus communis OX=3988 GN=RCOM_1517150 PE... [more]
tr|A0A199VU84|A0A199VU84_ANACO7.6e-3882.08Protein BOLA4, chloroplastic/mitochondrial OS=Ananas comosus OX=4615 GN=ACMD2_20... [more]
Match NameE-valueIdentityDescription
sp|Q9LF68|BOLA4_ARATH9.9e-3354.14Protein BOLA4, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=BO... [more]
Match NameE-valueIdentityDescription
AT5G17560.15.5e-3454.14BolA-like family protein[more]
AT4G26500.19.2e-0532.18chloroplast sulfur E[more]
AT1G55805.12.7e-0426.47BolA-like family protein[more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR036065BolA-like_sf
IPR002634BolA
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0019243 methylglyoxal catabolic process to D-lactate via S-lactoyl-glutathione
cellular_component GO:0005575 cellular_component
cellular_component GO:0009507 chloroplast
cellular_component GO:0005739 mitochondrion
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G005640.1Cla97C01G005640.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:3.30.300.90coord: 96..174
e-value: 1.1E-21
score: 78.5
NoneNo IPR availablePANTHERPTHR12735:SF6PROTEIN BOLA4, CHLOROPLASTIC/MITOCHONDRIALcoord: 47..173
IPR002634BolA proteinPFAMPF01722BolAcoord: 118..171
e-value: 1.3E-13
score: 50.9
IPR002634BolA proteinPANTHERPTHR12735BOLA-LIKE PROTEIN-RELATEDcoord: 47..173
IPR036065BolA-like superfamilySUPERFAMILYSSF82657BolA-likecoord: 85..175