Clc02G01460 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc02G01460
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: protein THYLAKOID FORMATION1, chloroplastic
Location: ClcChr02: 1426390 .. 1429732 (+)
RNA-Seq Expression: Clc02G01460
Synteny: Clc02G01460
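The Location field packs the sequence ID, start, end, and strand into one string. A minimal Python sketch of how it could be split and the feature span computed follows. It assumes the coordinates are 1-based and inclusive (as in GFF3), which this page does not state; `parse_location` and `feature_length` are hypothetical helper names, not part of any database API.

```python
import re

# Pattern for strings such as "ClcChr02: 1426390 .. 1429732 (+)".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:\s]+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)$"
)

def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return (
        match.group("seqid"),
        int(match.group("start")),
        int(match.group("end")),
        match.group("strand"),
    )

def feature_length(start, end):
    """Span in bases, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1

seqid, start, end, strand = parse_location("ClcChr02: 1426390 .. 1429732 (+)")
print(seqid, strand, feature_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 3343 bases.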
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AATATTTCTGAAAGCCCAATCCATATTGGACCTGGCCCATACCAACTATGCCACGTGTCGATCTTTTACTGGCCTCTTTGGGAAGGTGTAGAAATTAACACTTGGATAAGCCTCTCTATCATGGCCTCTCAGTCTCACCCTCTCTTCTTTCCATTTTTCCCAGAAAATTTCTTCCAATTTCATTCTCTATACCTCTTTTCTCCCTCTCTTCTTCGATATGAAACCCATTTTCTCCAAAAATTCATAAGCTTCCTACATTTTCCCTCATTCTTCTTTAATGGCGGCTGTTAATTCCATTTCAATCTCAACATTAAACCAATGTTCTGATAGAAGATTGCCGGTTCCGTCCCCTCGTTCACTCGCCTCCAATTTCGACGCCTTTCGTTTTCGTACGAGCGTTTTCAGTCATTATTCCCGAGTAAGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCTGCCGGAACAGGTAGCTTCTCATGATTTGTTTTCCTTACTCACTTATCAGACTATTCCCTTGCCAGAAGAAAACAGCCTCGCTTCTCTGATTTCGATTGAGTTTTCGAATCTCTTTTTGTTTTGATTTTCCTCATTTGTAGAGATGGAATGTTATGGAGGGAATTAGAAGATTTAAGTAACAAATAAGCTTTCAATAGTTCAGTTGAATCAGTTCTATGCTTGTGTAGTGTGTTGAGATTTCTCTAGTTTCGTTTTCTTGCTACTTAGGTCTAGAAACTGATGCGTTATTTTGACGGCTCAGATGTGACCACTGTAGCCGAGACAAAATTGAACTTCCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAACACTGTTCTGCAAGAATTGATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATACCGTTATGACCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGATGCCATCTTCCAAGCATACATTAAGGCATTGAATGAGGATCCAGAGCAATATAGGTTTGGCCATGTATTCCCCTTTGCTTTTACCATCCCTGATCTCTATCTATTCTTGTCACTTTTCAGCTAAAAACTTCAGTAGGCTTATTTATCGGTAGACACTGTTAAGTTGAAAAATCATAAGCATAATTCACTTTATAGCTGTCTAAACGAATGACCTGGATTGGTATCGAATGTTGGTGCATGTTCAAAAAGTAGTATCGATGGCTTTTATTTGTACCTGGTAATCAAATTTATGAATGTTAATTCTTTGTCCAATAACTCCAAGAATGTAATGTTCTTCACTGGAAATTATTTCCAACCGTCGACAATTTTTCTAATAATTTCGAGAAGGGTTTTTCTTTATTTTAAGTAATATTTTCTGTTATGCCGTCCAGAATTGATGCTAAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCCACTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTACGTTGAAGGACATTGCAGAACGGGCAGGGAATAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTTTTCCGACTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGGTTTATTTCATATTTCTTTCTGTGATATAGCTTGAAAGTAGATGAAGATTATGGTATGACAATTTTTATCAAGCTATTTATTGCTAAATTAAATGACAACCAGCCCCTCTCATAGGACTTTTATTGATAGATTATATTATTTCTCTGAGTGCTGCCATTTATAAAGTATAATACCGTTTCTGGTTCATTAGTTATCTTTTTACCCGTTAGAGAAATTGAATCTTCATGATATATTGCCTAACCTTATTCTGTTCGAGACTTGAGAGAATTAGGTTCAAAGGCTTCTTTCCTGCATAAGGCTGCATCTTATTTTCTAGCTAACTATTTTTTTTTGGAACATTTGCCCCTCCCCTTCCACCCTTGCCCCTGAC
CCAACACACACACACAGACTTACAGAGCTTTTGTGCGATTTCCCTTCGCCCTTGTGAAGACTGACTCTATTTACTGTGTCTATCACTGAAGATATAATACTTTTGATCTTTTAAAACAGCTAAAGCTATATAGTCGAACAGTTTCTAAATCAGCTGAAGTCATGGATGGAGATTTGACCCCTCAGTATTGGCATCTAACTTATTGCTACTTCTGAACAGCTCTGTGCTGCTTTAAACATTGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGCGAAGGAGCTCCTAAAGGAATACGTCGATAGGTAAGAGACTGTTAAGTTCATGGTTCTAGGCATGATAAAGTAGGTCTAACTACTCTTCCAAACCATCATTTTTTCACAGATGTTATGCATCTATTTGAAAACTGTCAATCTTGCATGTGTCAAATTGTCTGCAAGTCACTGTCATAGCATTTATATTCATTGGATATCCACCACATGTTCCTCATCCTTGATTGTGATTCTACATTTTAAGGACAAAAGTTTGTATGATTAACCAGTGGTTTTCAATTGTATAGAGAGAAGAAGAAAAGAGATGAGAGGAGTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCATGCAGACTGGTTTGTAAAGAAAAGATGATTGGAGCACTTCCTAATTTGAGATAGACTTGAGAGTTTATAGCAATATTCTAAATTACCCTATAGATATGCATCTGTAATGAATTGTTGGGTCTCATGCATTTGGTAAATTTTGTATTCAGCCAGGCACTACCTTTATTTCTCATTTGATATAACCACCACTCAAGCTATTTTTTAATCATTTGTATTACATATTTTAGAGCGCAATTGTTAATACAAATGCTATCGCACTCTGTGCAGTAATCACTCTCCAAGTGATATTTGACCATCTGGCTTTTGTATTGTGCAATCATTTCTGGATTATTATGATCTTTCACCCTTTGCTTTTGTAGGGTCTAAATTCATTCAGTTACAAACATTTAGGATTGTTTTTAAATACAAATTGGCTAGGGGATTTGAACTACGAACCTCTTGATTGCACCTATGTTAGTTGAGTTGAGCTTTTTTTTGGCACATTTAGGATTTTTATTGATGTTTTAAGTGATACGACCTCCAAAGTTTAGATGTATACATTTAATTTTTTTCCCCTTTTTAACCCATTGTTGAAGACATGAGCTTTCTTGAACACATGATTTGTTTTATGGGTGTTATTTAGATTGTGTAGAGTTGTGAAGCTCAATGACAAAATTTAATTAAATTAGTTAACT

mRNA sequence

AATATTTCTGAAAGCCCAATCCATATTGGACCTGGCCCATACCAACTATGCCACGTGTCGATCTTTTACTGGCCTCTTTGGGAAGGTGTAGAAATTAACACTTGGATAAGCCTCTCTATCATGGCCTCTCAGTCTCACCCTCTCTTCTTTCCATTTTTCCCAGAAAATTTCTTCCAATTTCATTCTCTATACCTCTTTTCTCCCTCTCTTCTTCGATATGAAACCCATTTTCTCCAAAAATTCATAAGCTTCCTACATTTTCCCTCATTCTTCTTTAATGGCGGCTGTTAATTCCATTTCAATCTCAACATTAAACCAATGTTCTGATAGAAGATTGCCGGTTCCGTCCCCTCGTTCACTCGCCTCCAATTTCGACGCCTTTCGTTTTCGTACGAGCGTTTTCAGTCATTATTCCCGAGTAAGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCTGCCGGAACAGATGTGACCACTGTAGCCGAGACAAAATTGAACTTCCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAACACTGTTCTGCAAGAATTGATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATACCGTTATGACCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGATGCCATCTTCCAAGCATACATTAAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCCACTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTACGTTGAAGGACATTGCAGAACGGGCAGGGAATAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTTTTCCGACTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGCTCTGTGCTGCTTTAAACATTGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGCGAAGGAGCTCCTAAAGGAATACGTCGATAGAGAGAAGAAGAAAAGAGATGAGAGGAGTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCATGCAGACTGGTTTGTAAAGAAAAGATGATTGGAGCACTTCCTAATTTGAGATAGACTTGAGAGTTTATAGCAATATTCTAAATTACCCTATAGATATGCATCTGTAATGAATTGTTGGGTCTCATGCATTTGGTAAATTTTGTATTCAGCCAGGCACTACCTTTATTTCTCATTTGATATAACCACCACTCAAGCTATTTTTTAATCATTTGTATTACATATTTTAGAGCGCAATTGTTAATACAAATGCTATCGCACTCTGTGCAGTAATCACTCTCCAAGTGATATTTGACCATCTGGCTTTTGTATTGTGCAATCATTTCTGGATTATTATGATCTTTCACCCTTTGCTTTTGTAGGGTCTAAATTCATTCAGTTACAAACATTTAGGATTGTTTTTAAATACAAATTGGCTAGGGGATTTGAACTACGAACCTCTTGATTGCACCTATGTTAGTTGAGTTGAGCTTTTTTTTGGCACATTTAGGATTTTTATTGATGTTTTAAGTGATACGACCTCCAAAGTTTAGATGTATACATTTAATTTTTTTCCCCTTTTTAACCCATTGTTGAAGACATGAGCTTTCTTGAACACATGATTTGTTTTATGGGTGTTATTTAGATTGTGTAGAGTTGTGAAGCTCAATGACAAAATTTAATTAAATTAGTTAACT

Coding sequence (CDS)

ATGGCGGCTGTTAATTCCATTTCAATCTCAACATTAAACCAATGTTCTGATAGAAGATTGCCGGTTCCGTCCCCTCGTTCACTCGCCTCCAATTTCGACGCCTTTCGTTTTCGTACGAGCGTTTTCAGTCATTATTCCCGAGTAAGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCTGCCGGAACAGATGTGACCACTGTAGCCGAGACAAAATTGAACTTCCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAACACTGTTCTGCAAGAATTGATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATACCGTTATGACCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGATGCCATCTTCCAAGCATACATTAAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCCACTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTACGTTGAAGGACATTGCAGAACGGGCAGGGAATAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTTTTCCGACTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGCTCTGTGCTGCTTTAAACATTGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGCGAAGGAGCTCCTAAAGGAATACGTCGATAGAGAGAAGAAGAAAAGAGATGAGAGGAGTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCATGCAGACTGGTTTGTAA

Protein sequence

MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
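
The protein sequence above is the conceptual translation of the CDS. The sketch below shows one way to reproduce that translation in plain Python. It assumes the standard nuclear genetic code, and `translate` is a hypothetical helper written for illustration; it is not the pipeline that generated this annotation.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First six codons and last three codons of the CDS listed above.
print(translate("ATGGCGGCTGTTAATTCC"))  # MAAVNS
print(translate("GGTTTGTAA"))           # GL
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.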
Homology
BLAST of Clc02G01460 vs. NCBI nr
Match: XP_038888611.1 (protein THYLAKOID FORMATION1, chloroplastic [Benincasa hispida])

HSP 1 Score: 555.4 bits (1430), Expect = 2.9e-154
Identity = 287/298 (96.31%), Positives = 292/298 (97.99%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAV SIS STLNQCSDRRLPVPS RSLASNFD FRFRTSVFSHYSRVRASTFSS MVIH
Sbjct: 1   MAAVYSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSSMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASR 180
           VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLV+FASR
Sbjct: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVDFASR 180

Query: 181 EGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVESTLKDIAERAG+KGNFSYSRFFAIGLFRLLELANATEPSILEKLCA+LNIDKKGV
Sbjct: 181 EGEVESTLKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCASLNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL 298
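
The percentages in each HSP summary line follow directly from the residue counts: matches divided by alignment length. A small Python sketch of that arithmetic, using the counts of the Benincasa hispida hit above; `percent` is a hypothetical helper name, and rounding to two decimals is an assumption chosen to match the precision shown in the report.

```python
def percent(count, alignment_length):
    """Share of aligned positions, as a percentage rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the first NCBI nr hit (XP_038888611.1).
identity = percent(287, 298)    # identical residues
positives = percent(292, 298)   # identical or conservatively substituted
print(identity, positives)      # 96.31 97.99
```

Positives is never lower than identity, because every identical position also counts as a positive-scoring one.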

BLAST of Clc02G01460 vs. NCBI nr
Match: XP_008455361.1 (PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] >KAA0031606.1 protein THYLAKOID FORMATION1 [Cucumis melo var. makuwa] >TYK07058.1 protein THYLAKOID FORMATION1 [Cucumis melo var. makuwa])

HSP 1 Score: 552.0 bits (1421), Expect = 3.2e-153
Identity = 283/298 (94.97%), Positives = 291/298 (97.65%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAVNSIS STLNQCSDRR PVPS RSL+SNFD FRFRTS+F+HYSRVR STFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASR 180
           VTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVES LKDIAERAG+KGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL 299
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL 298

BLAST of Clc02G01460 vs. NCBI nr
Match: XP_004136805.1 (protein THYLAKOID FORMATION1, chloroplastic [Cucumis sativus])

HSP 1 Score: 543.9 bits (1400), Expect = 8.7e-151
Identity = 282/298 (94.63%), Positives = 288/298 (96.64%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAVNSIS STLNQCSDRRL +PS RS +SNF  F FRTSVF+HYSRVRASTFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASR 180
           VTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKK EEWARSQTA SLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180

Query: 181 EGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVES LKDIAERAG+KGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL 298

BLAST of Clc02G01460 vs. NCBI nr
Match: KAE8645760.1 (hypothetical protein Csa_020345 [Cucumis sativus])

HSP 1 Score: 542.3 bits (1396), Expect = 2.5e-150
Identity = 281/297 (94.61%), Positives = 287/297 (96.63%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAVNSIS STLNQCSDRRL +PS RS +SNF  F FRTSVF+HYSRVRASTFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASR 180
           VTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKK EEWARSQTA SLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180

Query: 181 EGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVES LKDIAERAG+KGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTG 298
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTANEAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297

BLAST of Clc02G01460 vs. NCBI nr
Match: XP_022969189.1 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 529.6 bits (1363), Expect = 1.7e-146
Identity = 270/298 (90.60%), Positives = 286/298 (95.97%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAVNS+S S L+QCSDRRLP+PS RSLAS+FD FRFR SVF HYS VR S+FSSRMVIH
Sbjct: 1   MAAVNSVSFSGLSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTSSFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASR 180
           VTVYD+LMEGYPSDEDRDAIFQAYI ALNEDPEQYRIDA+KLEEWARSQTA SLVEFASR
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVES LKDIAERAG+KGNFSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTA+EAITKCLGEYSMQTGL
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTASEAITKCLGEYSMQTGL 298

BLAST of Clc02G01460 vs. ExPASy Swiss-Prot
Match: Q7XAB8 (Protein THYLAKOID FORMATION1, chloroplastic OS=Solanum tuberosum OX=4113 GN=THF1 PE=2 SV=1)

HSP 1 Score: 392.1 bits (1006), Expect = 5.5e-108
Identity = 202/293 (68.94%), Positives = 243/293 (82.94%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAV S+S S + Q ++R+  V S RS+    D FRFR++       VR+S  +SR V+H
Sbjct: 1   MAAVTSVSFSAITQSAERKSSVSSSRSI----DTFRFRSNFSFDSVNVRSSNSTSRFVVH 60

Query: 61  C-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALG 120
           C  S+  D+ TVA+TKL FL AYKRPIP++YNTVLQELIVQQHL RYK++Y+YDPVFALG
Sbjct: 61  CTSSSAADLPTVADTKLKFLTAYKRPIPTVYNTVLQELIVQQHLTRYKKSYQYDPVFALG 120

Query: 121 FVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFAS 180
           FVTVYDQLMEGYPS+EDR+AIF+AYI+AL EDPEQYR DA+KLEEWAR+Q A +LV+F+S
Sbjct: 121 FVTVYDQLMEGYPSEEDRNAIFKAYIEALKEDPEQYRADAQKLEEWARTQNANTLVDFSS 180

Query: 181 REGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKG 240
           +EGE+E+  KDIA+RAG K  F YSR FA+GLFRLLELAN T+P+ILEKLCAALN++KK 
Sbjct: 181 KEGEIENIFKDIAQRAGTKDGFCYSRLFAVGLFRLLELANVTDPTILEKLCAALNVNKKS 240

Query: 241 VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEY 293
           VDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER  +Q ANE +TKCLG+Y
Sbjct: 241 VDRDLDVYRNLLSKLVQAKELLKEYVEREKKKRGERE-TQKANETVTKCLGDY 288

BLAST of Clc02G01460 vs. ExPASy Swiss-Prot
Match: Q9SKT0 (Protein THYLAKOID FORMATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THF1 PE=1 SV=1)

HSP 1 Score: 383.6 bits (984), Expect = 2.0e-105
Identity = 209/290 (72.07%), Positives = 244/290 (84.14%), Query Frame = 0

Query: 3   AVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCM 62
           A++S+S   L Q SD+     S R LAS   A R  T  FS  S    ST  S+ +IHCM
Sbjct: 5   AISSLSFPALGQ-SDKISNFASSRPLAS---AIRICTK-FSRLSLNSRST--SKSLIHCM 64

Query: 63  SAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFV 122
           S  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQQHLMRYK+TYRYDPVFALGFV
Sbjct: 65  SNVTADVPPVSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFV 124

Query: 123 TVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASRE 182
           TVYDQLMEGYPSD+DRDAIF+AYI+ALNEDP+QYRIDA+K+EEWARSQT+ SLV+F+S+E
Sbjct: 125 TVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSKE 184

Query: 183 GEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVD 242
           G++E+ LKDIA RAG+K  FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VD
Sbjct: 185 GDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVD 244

Query: 243 RDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGE 292
           RDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ER+ SQ ANE I+KCLG+
Sbjct: 245 RDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISKCLGD 287

BLAST of Clc02G01460 vs. ExPASy Swiss-Prot
Match: Q84PB7 (Protein THYLAKOID FORMATION1, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=THF1 PE=2 SV=1)

HSP 1 Score: 364.4 bits (934), Expect = 1.2e-99
Identity = 197/288 (68.40%), Positives = 235/288 (81.60%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAA++S+  + L + +D R   PS  + A+   A     SV     R R     SR V+ 
Sbjct: 1   MAAISSLPFAALRRAADCR---PSTAAAAAGAGAGAVVLSV-----RPRR---GSRSVVR 60

Query: 61  CMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALG 120
           C++   DV  TVAETK+NFLK+YKRPI SIY+TVLQEL+VQQHLMRYK TY+YD VFALG
Sbjct: 61  CVATAGDVPPTVAETKMNFLKSYKRPILSIYSTVLQELLVQQHLMRYKTTYQYDAVFALG 120

Query: 121 FVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFAS 180
           FVTVYDQLMEGYPS+EDRDAIF+AYI ALNEDPEQYR DA+K+EEWARSQ   SLVEF+S
Sbjct: 121 FVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNSLVEFSS 180

Query: 181 REGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKG 240
           ++GE+E+ LKDI+ERA  KG+FSYSRFFA+GLFRLLELANATEP+IL+KLCAALNI+K+ 
Sbjct: 181 KDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAALNINKRS 240

Query: 241 VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITK 288
           VDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ERS +  +NEA+TK
Sbjct: 241 VDRDLDVYRNILSKLVQAKELLKEYVEREKKKREERSETPKSNEAVTK 277

BLAST of Clc02G01460 vs. ExPASy Swiss-Prot
Match: B0C3M8 (Protein Thf1 OS=Acaryochloris marina (strain MBIC 11017) OX=329726 GN=thf1 PE=3 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 7.4e-36
Identity = 83/219 (37.90%), Positives = 136/219 (62.10%), Query Frame = 0

Query: 67  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQ 126
           ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+
Sbjct: 3   NLRTVSDTKRAFYSIHTRPVNSVYRRVVEELMVEMHLLRVNEDFRYDPIFALGVTTSFDR 62

Query: 127 LMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEF---ASREG- 186
            M+GY  + D+DAIF A  KA   DP Q + D ++L E A+S++A  ++++   A+  G 
Sbjct: 63  FMDGYQPENDKDAIFSAICKAQEADPVQMKKDGQRLTELAQSKSAQEMLDWITQAANSGG 122

Query: 187 -EVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALN 246
            E++  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LN
Sbjct: 123 DELQWQLRNIAQNP----KFKYSRLFAIGLFTLLELSEGNITQDEESLAEFLPNICTVLN 182

Query: 247 IDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRD 274
           I +  + +DL++YR  L K+ Q ++ + + ++ +KK+R+
Sbjct: 183 ISESKLQKDLEIYRGNLDKIAQVRQAMDDILEAQKKRRE 217

BLAST of Clc02G01460 vs. ExPASy Swiss-Prot
Match: Q116P5 (Protein Thf1 OS=Trichodesmium erythraeum (strain IMS101) OX=203124 GN=thf1 PE=3 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 2.4e-34
Identity = 80/216 (37.04%), Positives = 130/216 (60.19%), Query Frame = 0

Query: 70  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLME 129
           TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+
Sbjct: 6   TVSDTKKTFYHFHTRPINSIYNRVIEELLVEMHLISVNVDYSYNPFYALGVVTAFDRFMQ 65

Query: 130 GYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVEST-- 189
           GY   ED+ +IF A I+   EDP +YR DAK LE+ A   +A+ ++ +      +++T  
Sbjct: 66  GYSPQEDKTSIFNALIQGQEEDPNKYRSDAKGLEDLAGKISASDILSWICLSKNIDNTQY 125

Query: 190 LKDIAERAGNKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGV 249
           L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ +
Sbjct: 126 LQDDLRAISENSKFRYSRLFAIGLFTLLEIVDTELIKEQEKRTEALKKICQSLNLVEEKL 185

Query: 250 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERS 277
            +D+D+Y + L ++ QA+  +++ +   +KKR++RS
Sbjct: 186 LKDIDLYLSNLERVAQARSAMEDTLAAMRKKREKRS 221

BLAST of Clc02G01460 vs. ExPASy TrEMBL
Match: A0A5D3C7D3 (Protein THYLAKOID FORMATION1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G003760 PE=3 SV=1)

HSP 1 Score: 552.0 bits (1421), Expect = 1.5e-153
Identity = 283/298 (94.97%), Positives = 291/298 (97.65%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAVNSIS STLNQCSDRR PVPS RSL+SNFD FRFRTS+F+HYSRVR STFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASR 180
           VTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVES LKDIAERAG+KGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL 299
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL 298

BLAST of Clc02G01460 vs. ExPASy TrEMBL
Match: A0A1S3C0V5 (protein THYLAKOID FORMATION1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495547 PE=3 SV=1)

HSP 1 Score: 552.0 bits (1421), Expect = 1.5e-153
Identity = 283/298 (94.97%), Positives = 291/298 (97.65%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAVNSIS STLNQCSDRR PVPS RSL+SNFD FRFRTS+F+HYSRVR STFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASR 180
           VTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVES LKDIAERAG+KGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL 299
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL 298

BLAST of Clc02G01460 vs. ExPASy TrEMBL
Match: A0A0A0K3P0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G046130 PE=3 SV=1)

HSP 1 Score: 543.9 bits (1400), Expect = 4.2e-151
Identity = 282/298 (94.63%), Positives = 288/298 (96.64%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAVNSIS STLNQCSDRRL +PS RS +SNF  F FRTSVF+HYSRVRASTFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASR 180
           VTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKK EEWARSQTA SLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180

Query: 181 EGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVES LKDIAERAG+KGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL 298

BLAST of Clc02G01460 vs. ExPASy TrEMBL
Match: A0A6J1I1V8 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111468259 PE=3 SV=1)

HSP 1 Score: 529.6 bits (1363), Expect = 8.2e-147
Identity = 270/298 (90.60%), Positives = 286/298 (95.97%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAVNS+S S L+QCSDRRLP+PS RSLAS+FD FRFR SVF HYS VR S+FSSRMVIH
Sbjct: 1   MAAVNSVSFSGLSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTSSFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASR 180
           VTVYD+LMEGYPSDEDRDAIFQAYI ALNEDPEQYRIDA+KLEEWARSQTA SLVEFASR
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVES LKDIAERAG+KGNFSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTA+EAITKCLGEYSMQTGL
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTASEAITKCLGEYSMQTGL 298

BLAST of Clc02G01460 vs. ExPASy TrEMBL
Match: A0A6J1C3B8 (protein THYLAKOID FORMATION1, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007981 PE=3 SV=1)

HSP 1 Score: 528.9 bits (1361), Expect = 1.4e-146
Identity = 271/297 (91.25%), Positives = 283/297 (95.29%), Query Frame = 0

Query: 1   MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIH 60
           MAAVNS+S S L+QCS+RRL VPS RSLASNFD FRFRTSVF HYS VR S++SSRMV+H
Sbjct: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASR 180
           VTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKKLEEWARSQTA SLVEFAS+
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180

Query: 181 EGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVES LKDIAERAG KG+FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK V
Sbjct: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTG 298
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTANEAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297

BLAST of Clc02G01460 vs. TAIR 10
Match: AT2G20890.1 (photosystem II reaction center PSB29 protein )

HSP 1 Score: 383.6 bits (984), Expect = 1.4e-106
Identity = 209/290 (72.07%), Positives = 244/290 (84.14%), Query Frame = 0

Query: 3   AVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCM 62
           A++S+S   L Q SD+     S R LAS   A R  T  FS  S    ST  S+ +IHCM
Sbjct: 5   AISSLSFPALGQ-SDKISNFASSRPLAS---AIRICTK-FSRLSLNSRST--SKSLIHCM 64

Query: 63  SAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFV 122
           S  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQQHLMRYK+TYRYDPVFALGFV
Sbjct: 65  SNVTADVPPVSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFV 124

Query: 123 TVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASRE 182
           TVYDQLMEGYPSD+DRDAIF+AYI+ALNEDP+QYRIDA+K+EEWARSQT+ SLV+F+S+E
Sbjct: 125 TVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSKE 184

Query: 183 GEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVD 242
           G++E+ LKDIA RAG+K  FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VD
Sbjct: 185 GDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVD 244

Query: 243 RDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGE 292
           RDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ER+ SQ ANE I+KCLG+
Sbjct: 245 RDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISKCLGD 287

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value     Identity (%)   Description
XP_038888611.1    2.9e-154    96.31          protein THYLAKOID FORMATION1, chloroplastic [Benincasa hispida]
XP_008455361.1    3.2e-153    94.97          PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] >KAA003160...
XP_004136805.1    8.7e-151    94.63          protein THYLAKOID FORMATION1, chloroplastic [Cucumis sativus]
KAE8645760.1      2.5e-150    94.61          hypothetical protein Csa_020345 [Cucumis sativus]
XP_022969189.1    1.7e-146    90.60          protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima]

ExPASy Swiss-Prot
Match Name        E-value     Identity (%)   Description
Q7XAB8            5.5e-108    68.94          Protein THYLAKOID FORMATION1, chloroplastic OS=Solanum tuberosum OX=4113 GN=THF1...
Q9SKT0            2.0e-105    72.07          Protein THYLAKOID FORMATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=...
Q84PB7            1.2e-99     68.40          Protein THYLAKOID FORMATION1, chloroplastic OS=Oryza sativa subsp. japonica OX=3...
B0C3M8            7.4e-36     37.90          Protein Thf1 OS=Acaryochloris marina (strain MBIC 11017) OX=329726 GN=thf1 PE=3 ...
Q116P5            2.4e-34     37.04          Protein Thf1 OS=Trichodesmium erythraeum (strain IMS101) OX=203124 GN=thf1 PE=3 ...

ExPASy TrEMBL
Match Name        E-value     Identity (%)   Description
A0A5D3C7D3        1.5e-153    94.97          Protein THYLAKOID FORMATION1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A1S3C0V5        1.5e-153    94.97          protein THYLAKOID FORMATION1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495...
A0A0A0K3P0        4.2e-151    94.63          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G046130 PE=3 SV=1
A0A6J1I1V8        8.2e-147    90.60          protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 OS=Cucurbita maxima ...
A0A6J1C3B8        1.4e-146    91.25          protein THYLAKOID FORMATION1, chloroplastic isoform X1 OS=Momordica charantia OX...

TAIR 10
Match Name        E-value     Identity (%)   Description
AT2G20890.1       1.4e-106    72.07          photosystem II reaction center PSB29 protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term    IPR Description    Source    Source Term     Source Description                             Alignment
None        No IPR available   COILS     Coil            Coil                                           coord: 247..274
None        No IPR available   PANTHER   PTHR34793:SF7   PHOTOSYSTEM II BIOGENESIS PROTEIN              coord: 1..286
IPR017499   Protein Thf1       PFAM      PF11264         ThylakoidFormat                                coord: 70..275; e-value: 4.8E-76; score: 255.2
IPR017499   Protein Thf1       TIGRFAM   TIGR03060       TIGR03060                                      coord: 67..272; e-value: 1.7E-48; score: 163.1
IPR017499   Protein Thf1       PANTHER   PTHR34793       PROTEIN THYLAKOID FORMATION 1, CHLOROPLASTIC   coord: 1..286
IPR017499   Protein Thf1       HAMAP     MF_01843        Thf1                                           coord: 65..273; score: 22.407061

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Clc02G01460.1    Clc02G01460.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0010207       photosystem II assembly
biological_process   GO:0045037       protein import into chloroplast stroma
biological_process   GO:0045038       protein import into chloroplast thylakoid membrane
biological_process   GO:0010027       thylakoid membrane organization
biological_process   GO:0015979       photosynthesis