MC04g0167 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC04g0167
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protein THYLAKOID FORMATION1, chloroplastic
Location: MC04: 1269585 .. 1272829 (+)
RNA-Seq Expression: MC04g0167
Synteny: MC04g0167
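
As a small worked example, the Location field can be parsed to recover the genomic span of the gene. This is only a sketch: it assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, but not stated on this page), and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("MC04: 1269585 .. 1272829 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 3245 bp on the plus strand of MC04.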
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAATTTCTAAGTTAGCCCACAGCAATAGAGCCCAATTCATATCCCATATGCCACTAAAGTCCAAACTTACAACTTTTTTTTCTTTTTTGAATAATTCCTTTTGAGTGAAAAAGAGGGCATGTACTAATGATCTGGTCCATGCAAATGGAAAATCGAAAGTACGGAGTCCGCCAGGTGTCGATATTCTACTGGCTTGTTGATGAAGGCGTAGAAATTAACACTTGGGATAAGCCTCTCTATCATGGCCTCTCAGTCCCACCCTCTCCTCTTTCTATTTTCCGACAAAATTCGTTCGAAACTCATATTCTATATGTGTTTTCTCCCACCTTGCTCCGATATGAAATCTATTTTCTCCGGAAGTTCGTAAGCTTCGCAAGTTTCTTCTCTATCTTTGCCAATGGCGGCTGTTAATTCCGTGTCATTCTCCGCATTAAGTCAATGTTCTGAAAGAAGATTGCTGGTTCCTTCGGCTCGTTCACTAGCCTCGAATTTCGACGGGTTTCGTTTTCGTACAAGCGTTTTCTGCCATTATTCGGGAGTTCGGACATCGAGTTACAGTTCTCGAATGGTCGTCCATTGCATGTCTGCCGGAACAGGTATTTTCTGGTGATTTGTTTTCCTTGCTCGCTTATCGGACAGTTTCATTGCCACAAATGATTGCAGAGAAAACGAGGCTCGTTTCGCGTAGTCCGGTTGAGTTTGATTTTCAAGCTGAATTTGTGTTTTGGCTTTTCCTCAATTGTAGAGATTTGTGTTTTGTTATTTTGGACGGAATTAGGCTTTCAGTAGTTCGGTTGGATTAATGCTAACTTCGGCAATGTGTTTAAGAATTCTCTAGTTCGATTTTTTACTACTTGAGTTGAGTCTAAAAACTGATCATTGTTTTGACGGCTCAGATGTGACCACCGTGGCCGAGACAAAGGCGAACTTCCTCAAGGTGTATAAGCGGCCTATTCCTAGCATTTACAATACTGTTCTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCCCTTGGTTTTGTTACAGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGAGAGGCCATTTTCCAAGCATATATTAAGGCATTGAATGAGGATCCAGAGCAATATAGGTTTGGTCATGTATTCCCCTTTGCTTTTATCATCCCTGGTCCGTATCTCTTGCCAGCTAATAATTTCAATAGCCTTATTTATAGGTAGACATTGTTAAACTAAAAAATCATTACATGATTCCATTTATAGTAAAAGAACAGTGATTCCGATTGGAACTTTATCCTCTTTTCCAGCATTCATGTGTTAATTTGGACTAGATTGATTGGTCAAATTACCTGCTATCAGATGTTAGGGGAGGGTTTAAGTAGTCTTCATGACTTTTATCTGTATCTGGAAATCAAACTAATGAATGTTTATTTTATTCAATAACTTCAAGATTCTAATGTTCTTCCCTGGAAATAATATTCCAACCTCTGAGTCTTGGACTATTCTTCCAATAATTTCAAGAAGGTTTTTTCCTTATTATAACTAGTCTTTTCTGTTATACTGTCCAGAATTGATGCTAAAAAATTGGAGGAGTGGGCTCGGTCTCAGACAGCAGCTTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAAGTTGAGAGTATTTTAAAGGACATTGCAGAACGGGCAGGGGGTAAGGGGAGTTTCAGTTACAGCCGTTTTTTTGCTATTGGGCTGTTTCGACTCCTTGAATTGGCCAATGCTACCGAACCCAGTATCTTGGAAAAGGTATGAATATTTTTATGTTACAGTTCATTGAACTTATTATTCCTAAATAAAGCGATAGAAATCCCCTCTGATAGGACCTTTACTGATAGATTATATTCTTTTCTCTGAGTGCTGCCATTTGTGAAGTATAACTGCATTTCTTGCACATTACTTATCTTTTAGAGAAATTGGGGTCTTGATCGAA
TGGTGCTTAACCTTATTCTGTTTGAGAGAATTAAGTTCATATTTGGTTGAGGAGAAATATAACAATTTTTTAGATAGGTTTCTAGGGAGCAGCAAGCTAGTATTCATTTCAACTTGTGGAGGGTGTGCTGATCTTTCGAATGTAGTGTATGCTCTTTATTAGTTTCTGCTGGCCCTCTTCATTCAGTTTATTAGCTCTTGTGAACGGAAACATATACCTGTTCCCCTGCCCTAGGAACAACAGTTATTGCAAATGATTGCCTTAGTTGGCCGACTCTATTTAGCAAGCGTGTCTATCACAAAATATTTTGTTGTTTCAAAACATCTAAAGCTATATATTCTAACATTTTCCAAAACAGCTGAAGCCATGTGATGGAGATTTGACCAAATATTTGCATCTAACTTTTTGCTGCTTCTGAACAGCTCTGTGCTGCTTTAAATGTCAACAAAAAAAGTGTGGACCGAGACCTAGATGTCTACCGCAACCTGCTTTCAAAGTTGGTTCAGGCAAAAGAGCTCCTAAAGGAATACGTGGATAGGTAAGAGATTGTTAAGTTCCTAGTTCCAGGCATGTAGATTAACATACTCAGCAATATGCTCTTTCAAACTACCTTATTTGCATCGATTTTATGAATCCGTAATTGATTCGAAAATTTTCAACCTTGTACTCGTTAAATCTCCTGCAAGTCACTGTCATAGCATTTATAATCACTGGATATCTACAACTTCACCATCCTTGATTCTACACCTTGCTTTTTCACATATGGATCTTGGCTGAAGTCTAAATGATGTAGAATTTGGTGGATCATTGTGGGTCTAGAGTAAAGTTTTAACAATAAAGGTTCTATGACTAACCAGTGCTTTGGATTGCATTTTTACCTATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCTCAGACAGCTAATGAGGCCATAACAAAATGCTTGGGAGAGTACAGCATGCAGACTGGTTTTTGAGAGGTAATGACATCAATTGAAGTACTGCCTAATTTGAGATAGACTTGAGTTTTAGCAATATTCTAAATACCTGATAGATGTGCTTCTGTAATGTATTATTGGGTTTTACGCATTTGGTAAATTTTGCATTCATATCCTCATTTGATATAACCACCTTAAGCTATTTTTTCATCATTTGTATTTTGTATTACATAGTTTGAGTGCAATTGTCAATACAAATGCTTTCGTACTCTGTGCAGTAATCATAGTTTTCAGTTCCTGTCT

mRNA sequence

CAATTTCTAAGTTAGCCCACAGCAATAGAGCCCAATTCATATCCCATATGCCACTAAAGTCCAAACTTACAACTTTTTTTTCTTTTTTGAATAATTCCTTTTGAGTGAAAAAGAGGGCATGTACTAATGATCTGGTCCATGCAAATGGAAAATCGAAAGTACGGAGTCCGCCAGGTGTCGATATTCTACTGGCTTGTTGATGAAGGCGTAGAAATTAACACTTGGGATAAGCCTCTCTATCATGGCCTCTCAGTCCCACCCTCTCCTCTTTCTATTTTCCGACAAAATTCGTTCGAAACTCATATTCTATATGTGTTTTCTCCCACCTTGCTCCGATATGAAATCTATTTTCTCCGGAAGTTCGTAAGCTTCGCAAGTTTCTTCTCTATCTTTGCCAATGGCGGCTGTTAATTCCGTGTCATTCTCCGCATTAAGTCAATGTTCTGAAAGAAGATTGCTGGTTCCTTCGGCTCGTTCACTAGCCTCGAATTTCGACGGGTTTCGTTTTCGTACAAGCGTTTTCTGCCATTATTCGGGAGTTCGGACATCGAGTTACAGTTCTCGAATGGTCGTCCATTGCATGTCTGCCGGAACAGATGTGACCACCGTGGCCGAGACAAAGGCGAACTTCCTCAAGGTGTATAAGCGGCCTATTCCTAGCATTTACAATACTGTTCTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCCCTTGGTTTTGTTACAGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGAGAGGCCATTTTCCAAGCATATATTAAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAGGAGTGGGCTCGGTCTCAGACAGCAGCTTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAAGTTGAGAGTATTTTAAAGGACATTGCAGAACGGGCAGGGGGTAAGGGGAGTTTCAGTTACAGCCGTTTTTTTGCTATTGGGCTGTTTCGACTCCTTGAATTGGCCAATGCTACCGAACCCAGTATCTTGGAAAAGCTCTGTGCTGCTTTAAATGTCAACAAAAAAAGTGTGGACCGAGACCTAGATGTCTACCGCAACCTGCTTTCAAAGTTGGTTCAGGCAAAAGAGCTCCTAAAGGAATACGTGGATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCTCAGACAGCTAATGAGGCCATAACAAAATGCTTGGGAGAGTACAGCATGCAGACTGGTTTTTGAGAGGTAATGACATCAATTGAAGTACTGCCTAATTTGAGATAGACTTGAGTTTTAGCAATATTCTAAATACCTGATAGATGTGCTTCTGTAATGTATTATTGGGTTTTACGCATTTGGTAAATTTTGCATTCATATCCTCATTTGATATAACCACCTTAAGCTATTTTTTCATCATTTGTATTTTGTATTACATAGTTTGAGTGCAATTGTCAATACAAATGCTTTCGTACTCTGTGCAGTAATCATAGTTTTCAGTTCCTGTCT

Coding sequence (CDS)

ATGGCGGCTGTTAATTCCGTGTCATTCTCCGCATTAAGTCAATGTTCTGAAAGAAGATTGCTGGTTCCTTCGGCTCGTTCACTAGCCTCGAATTTCGACGGGTTTCGTTTTCGTACAAGCGTTTTCTGCCATTATTCGGGAGTTCGGACATCGAGTTACAGTTCTCGAATGGTCGTCCATTGCATGTCTGCCGGAACAGATGTGACCACCGTGGCCGAGACAAAGGCGAACTTCCTCAAGGTGTATAAGCGGCCTATTCCTAGCATTTACAATACTGTTCTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCCCTTGGTTTTGTTACAGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGAGAGGCCATTTTCCAAGCATATATTAAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAGGAGTGGGCTCGGTCTCAGACAGCAGCTTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAAGTTGAGAGTATTTTAAAGGACATTGCAGAACGGGCAGGGGGTAAGGGGAGTTTCAGTTACAGCCGTTTTTTTGCTATTGGGCTGTTTCGACTCCTTGAATTGGCCAATGCTACCGAACCCAGTATCTTGGAAAAGCTCTGTGCTGCTTTAAATGTCAACAAAAAAAGTGTGGACCGAGACCTAGATGTCTACCGCAACCTGCTTTCAAAGTTGGTTCAGGCAAAAGAGCTCCTAAAGGAATACGTGGATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCTCAGACAGCTAATGAGGCCATAACAAAATGCTTGGGAGAGTACAGCATGCAGACTGGTTTTTGA

Protein sequence

MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF
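
The protein sequence above corresponds to the CDS read codon by codon. The sketch below illustrates this on the first 18 and last 12 nucleotides of the CDS. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; the function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGGCTGTTAATTCC"))  # start of the CDS -> MAAVNS
print(translate("ACTGGTTTTTGA"))        # end of the CDS   -> TGF
```

Both fragments agree with the first and last residues of the protein sequence listed above.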
Homology
BLAST of MC04g0167 vs. ExPASy Swiss-Prot
Match: Q7XAB8 (Protein THYLAKOID FORMATION1, chloroplastic OS=Solanum tuberosum OX=4113 GN=THF1 PE=2 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 1.3e-109
Identity = 210/293 (71.67%), Positives = 247/293 (84.30%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAV SVSFSA++Q +ER+  V S+RS+    D FRFR++       VR+S+ +SR VVH
Sbjct: 1   MAAVTSVSFSAITQSAERKSSVSSSRSI----DTFRFRSNFSFDSVNVRSSNSTSRFVVH 60

Query: 61  C-MSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALG 120
           C  S+  D+ TVA+TK  FL  YKRPIP++YNTVLQELIVQQHL RYK++Y+YDPVFALG
Sbjct: 61  CTSSSAADLPTVADTKLKFLTAYKRPIPTVYNTVLQELIVQQHLTRYKKSYQYDPVFALG 120

Query: 121 FVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFAS 180
           FVTVYDQLMEGYPS+EDR AIF+AYI+AL EDPEQYR DA+KLEEWAR+Q A +LV+F+S
Sbjct: 121 FVTVYDQLMEGYPSEEDRNAIFKAYIEALKEDPEQYRADAQKLEEWARTQNANTLVDFSS 180

Query: 181 KEGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKS 240
           KEGE+E+I KDIA+RAG K  F YSR FA+GLFRLLELAN T+P+ILEKLCAALNVNKKS
Sbjct: 181 KEGEIENIFKDIAQRAGTKDGFCYSRLFAVGLFRLLELANVTDPTILEKLCAALNVNKKS 240

Query: 241 VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEY 293
           VDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER  +Q ANE +TKCLG+Y
Sbjct: 241 VDRDLDVYRNLLSKLVQAKELLKEYVEREKKKRGERE-TQKANETVTKCLGDY 288
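
The percentages in each HSP header are the matching-column counts divided by the alignment length. The sketch below recomputes them from a statistics line of the form used in this report; the regular expression and the function name `hsp_percentages` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# The optional "i" also accepts the misspelling "Postives".
STATS_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError("no identity/positives counts found")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 210/293 (71.67%), Positives = 247/293 (84.30%), Query Frame = 0"
print(hsp_percentages(line))  # (71.67, 84.3)
```

For the potato THF1 hit, 210 identical and 247 similar columns over 293 aligned columns give 71.67% and 84.30%.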

BLAST of MC04g0167 vs. ExPASy Swiss-Prot
Match: Q9SKT0 (Protein THYLAKOID FORMATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THF1 PE=1 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 4.0e-106
Identity = 206/291 (70.79%), Positives = 248/291 (85.22%), Query Frame = 0

Query: 3   AVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYS-SRMVVHC 62
           A++S+SF AL Q S++     S+R LAS          +   +S +  +S S S+ ++HC
Sbjct: 5   AISSLSFPALGQ-SDKISNFASSRPLAS-------AIRICTKFSRLSLNSRSTSKSLIHC 64

Query: 63  MSAGT-DVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 122
           MS  T DV  V+ETK+ FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 65  MSNVTADVPPVSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGF 124

Query: 123 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 182
           VTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDA+K+EEWARSQT+ASLV+F+SK
Sbjct: 125 VTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSK 184

Query: 183 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 242
           EG++E++LKDIA RAG K  FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN+NKKSV
Sbjct: 185 EGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSV 244

Query: 243 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE 292
           DRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE I+KCLG+
Sbjct: 245 DRDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISKCLGD 287

BLAST of MC04g0167 vs. ExPASy Swiss-Prot
Match: Q84PB7 (Protein THYLAKOID FORMATION1, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=THF1 PE=2 SV=1)

HSP 1 Score: 366.3 bits (939), Expect = 3.3e-100
Identity = 199/288 (69.10%), Positives = 235/288 (81.60%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAA++S+ F+AL + ++ R   PS  + A+         SV             SR VV 
Sbjct: 1   MAAISSLPFAALRRAADCR---PSTAAAAAGAGAGAVVLSV--------RPRRGSRSVVR 60

Query: 61  CMSAGTDV-TTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALG 120
           C++   DV  TVAETK NFLK YKRPI SIY+TVLQEL+VQQHLMRYK TY+YD VFALG
Sbjct: 61  CVATAGDVPPTVAETKMNFLKSYKRPILSIYSTVLQELLVQQHLMRYKTTYQYDAVFALG 120

Query: 121 FVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFAS 180
           FVTVYDQLMEGYPS+EDR+AIF+AYI ALNEDPEQYR DA+K+EEWARSQ   SLVEF+S
Sbjct: 121 FVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNSLVEFSS 180

Query: 181 KEGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKS 240
           K+GE+E+ILKDI+ERA GKGSFSYSRFFA+GLFRLLELANATEP+IL+KLCAALN+NK+S
Sbjct: 181 KDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAALNINKRS 240

Query: 241 VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITK 288
           VDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ER+ +  +NEA+TK
Sbjct: 241 VDRDLDVYRNILSKLVQAKELLKEYVEREKKKREERSETPKSNEAVTK 277

BLAST of MC04g0167 vs. ExPASy Swiss-Prot
Match: B0C3M8 (Protein Thf1 OS=Acaryochloris marina (strain MBIC 11017) OX=329726 GN=thf1 PE=3 SV=1)

HSP 1 Score: 151.4 bits (381), Expect = 1.6e-35
Identity = 81/219 (36.99%), Positives = 138/219 (63.01%), Query Frame = 0

Query: 67  DVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQ 126
           ++ TV++TK  F  ++ RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+
Sbjct: 3   NLRTVSDTKRAFYSIHTRPVNSVYRRVVEELMVEMHLLRVNEDFRYDPIFALGVTTSFDR 62

Query: 127 LMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEF---ASKEG- 186
            M+GY  + D++AIF A  KA   DP Q + D ++L E A+S++A  ++++   A+  G 
Sbjct: 63  FMDGYQPENDKDAIFSAICKAQEADPVQMKKDGQRLTELAQSKSAQEMLDWITQAANSGG 122

Query: 187 -EVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALN 246
            E++  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LN
Sbjct: 123 DELQWQLRNIAQNP----KFKYSRLFAIGLFTLLELSEGNITQDEESLAEFLPNICTVLN 182

Query: 247 VNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRD 274
           +++  + +DL++YR  L K+ Q ++ + + ++ +KK+R+
Sbjct: 183 ISESKLQKDLEIYRGNLDKIAQVRQAMDDILEAQKKRRE 217

BLAST of MC04g0167 vs. ExPASy Swiss-Prot
Match: Q116P5 (Protein Thf1 OS=Trichodesmium erythraeum (strain IMS101) OX=203124 GN=thf1 PE=3 SV=1)

HSP 1 Score: 143.7 bits (361), Expect = 3.4e-33
Identity = 80/216 (37.04%), Positives = 129/216 (59.72%), Query Frame = 0

Query: 70  TVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLME 129
           TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+
Sbjct: 6   TVSDTKKTFYHFHTRPINSIYNRVIEELLVEMHLISVNVDYSYNPFYALGVVTAFDRFMQ 65

Query: 130 GYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEF--ASKEGEVESI 189
           GY   ED+ +IF A I+   EDP +YR DAK LE+ A   +A+ ++ +   SK  +    
Sbjct: 66  GYSPQEDKTSIFNALIQGQEEDPNKYRSDAKGLEDLAGKISASDILSWICLSKNIDNTQY 125

Query: 190 LKDIAERAGGKGSFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNVNKKSV 249
           L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ +
Sbjct: 126 LQDDLRAISENSKFRYSRLFAIGLFTLLEIVDTELIKEQEKRTEALKKICQSLNLVEEKL 185

Query: 250 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA 277
            +D+D+Y + L ++ QA+  +++ +   +KKR++R+
Sbjct: 186 LKDIDLYLSNLERVAQARSAMEDTLAAMRKKREKRS 221

BLAST of MC04g0167 vs. NCBI nr
Match: XP_022136235.1 (protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia])

HSP 1 Score: 573 bits (1476), Expect = 1.23e-205
Identity = 298/298 (100.00%), Positives = 298/298 (100.00%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH
Sbjct: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60

Query: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180

Query: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
           EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV
Sbjct: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF 298
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF 298

BLAST of MC04g0167 vs. NCBI nr
Match: XP_023554556.1 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 537 bits (1384), Expect = 1.30e-191
Identity = 274/297 (92.26%), Positives = 292/297 (98.32%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAVNSVSF+ALSQCS+RRL +PSARSLAS+FDGFRFR SVFCHYSGVRTSS+SSRMV+H
Sbjct: 1   MAAVNSVSFTALSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTSSFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETKANFLK YKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
           VTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
           EGEVESILKDIAERAGGKG+FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNV+KK V
Sbjct: 181 EGEVESILKDIAERAGGKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTASEAITKCLGEYSMQTG 297

BLAST of MC04g0167 vs. NCBI nr
Match: XP_022969189.1 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 535 bits (1377), Expect = 1.51e-190
Identity = 273/297 (91.92%), Positives = 290/297 (97.64%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAVNSVSFS LSQCS+RRL +PSARSLAS+FDGFRFR SVFCHYSGVRTSS+SSRMV+H
Sbjct: 1   MAAVNSVSFSGLSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTSSFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETKANFLK YKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
           VTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
           EGEVESILKDIAERAG KG+FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNV+KK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTASEAITKCLGEYSMQTG 297

BLAST of MC04g0167 vs. NCBI nr
Match: XP_022952157.1 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 533 bits (1373), Expect = 6.16e-190
Identity = 272/297 (91.58%), Positives = 290/297 (97.64%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAVNSVSFSALSQCS+RRL +PSARSLAS+FDGFRFR SVFCHYSGVRT S++SRMV+H
Sbjct: 1   MAAVNSVSFSALSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTPSFNSRMVIH 60

Query: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETKANFLK YKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
           VTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
           EGEVESILKDIAERAG KG+FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNV+KK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTASEAITKCLGEYSMQTG 297

BLAST of MC04g0167 vs. NCBI nr
Match: XP_008455361.1 (PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] >KAA0031606.1 protein THYLAKOID FORMATION1 [Cucumis melo var. makuwa] >TYK07058.1 protein THYLAKOID FORMATION1 [Cucumis melo var. makuwa])

HSP 1 Score: 532 bits (1371), Expect = 1.24e-189
Identity = 272/297 (91.58%), Positives = 287/297 (96.63%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAVNS+SFS L+QCS+RR  VPS+RSL+SNFDGFRFRTS+F HYS VR S++SSRMV+H
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
           EGEVESILKDIAERAG KG+FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297

BLAST of MC04g0167 vs. ExPASy TrEMBL
Match: A0A6J1C3B8 (protein THYLAKOID FORMATION1, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007981 PE=3 SV=1)

HSP 1 Score: 573 bits (1476), Expect = 5.98e-206
Identity = 298/298 (100.00%), Positives = 298/298 (100.00%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH
Sbjct: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60

Query: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180

Query: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
           EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV
Sbjct: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF 298
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF 298

BLAST of MC04g0167 vs. ExPASy TrEMBL
Match: A0A6J1I1V8 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111468259 PE=3 SV=1)

HSP 1 Score: 535 bits (1377), Expect = 7.33e-191
Identity = 273/297 (91.92%), Positives = 290/297 (97.64%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAVNSVSFS LSQCS+RRL +PSARSLAS+FDGFRFR SVFCHYSGVRTSS+SSRMV+H
Sbjct: 1   MAAVNSVSFSGLSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTSSFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETKANFLK YKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
           VTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
           EGEVESILKDIAERAG KG+FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNV+KK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTASEAITKCLGEYSMQTG 297

BLAST of MC04g0167 vs. ExPASy TrEMBL
Match: A0A6J1GJN0 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111454918 PE=3 SV=1)

HSP 1 Score: 533 bits (1373), Expect = 2.98e-190
Identity = 272/297 (91.58%), Positives = 290/297 (97.64%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAVNSVSFSALSQCS+RRL +PSARSLAS+FDGFRFR SVFCHYSGVRT S++SRMV+H
Sbjct: 1   MAAVNSVSFSALSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTPSFNSRMVIH 60

Query: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETKANFLK YKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
           VTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
           EGEVESILKDIAERAG KG+FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNV+KK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTASEAITKCLGEYSMQTG 297

BLAST of MC04g0167 vs. ExPASy TrEMBL
Match: A0A5D3C7D3 (Protein THYLAKOID FORMATION1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G003760 PE=3 SV=1)

HSP 1 Score: 532 bits (1371), Expect = 6.02e-190
Identity = 272/297 (91.58%), Positives = 287/297 (96.63%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAVNS+SFS L+QCS+RR  VPS+RSL+SNFDGFRFRTS+F HYS VR S++SSRMV+H
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
           EGEVESILKDIAERAG KG+FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297

BLAST of MC04g0167 vs. ExPASy TrEMBL
Match: A0A1S3C0V5 (protein THYLAKOID FORMATION1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495547 PE=3 SV=1)

HSP 1 Score: 532 bits (1371), Expect = 6.02e-190
Identity = 272/297 (91.58%), Positives = 287/297 (96.63%), Query Frame = 0

Query: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
           MAAVNS+SFS L+QCS+RR  VPS+RSL+SNFDGFRFRTS+F HYS VR S++SSRMV+H
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
           EGEVESILKDIAERAG KG+FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297

BLAST of MC04g0167 vs. TAIR 10
Match: AT2G20890.1 (photosystem II reaction center PSB29 protein )

HSP 1 Score: 386.0 bits (990), Expect = 2.8e-107
Identity = 206/291 (70.79%), Positives = 248/291 (85.22%), Query Frame = 0

Query: 3   AVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYS-SRMVVHC 62
           A++S+SF AL Q S++     S+R LAS          +   +S +  +S S S+ ++HC
Sbjct: 5   AISSLSFPALGQ-SDKISNFASSRPLAS-------AIRICTKFSRLSLNSRSTSKSLIHC 64

Query: 63  MSAGT-DVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 122
           MS  T DV  V+ETK+ FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 65  MSNVTADVPPVSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGF 124

Query: 123 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 182
           VTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDA+K+EEWARSQT+ASLV+F+SK
Sbjct: 125 VTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSK 184

Query: 183 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 242
           EG++E++LKDIA RAG K  FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN+NKKSV
Sbjct: 185 EGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSV 244

Query: 243 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE 292
           DRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE I+KCLG+
Sbjct: 245 DRDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISKCLGD 287

The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name      E-value    Identity (%)  Description
Q7XAB8          1.3e-109   71.67         Protein THYLAKOID FORMATION1, chloroplastic OS=Solanum tuberosum OX=4113 GN=THF1...
Q9SKT0          4.0e-106   70.79         Protein THYLAKOID FORMATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=...
Q84PB7          3.3e-100   69.10         Protein THYLAKOID FORMATION1, chloroplastic OS=Oryza sativa subsp. japonica OX=3...
B0C3M8          1.6e-35    36.99         Protein Thf1 OS=Acaryochloris marina (strain MBIC 11017) OX=329726 GN=thf1 PE=3 ...
Q116P5          3.4e-33    37.04         Protein Thf1 OS=Trichodesmium erythraeum (strain IMS101) OX=203124 GN=thf1 PE=3 ...

NCBI nr
Match Name      E-value    Identity (%)  Description
XP_022136235.1  1.23e-205  100.00        protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia]
XP_023554556.1  1.30e-191  92.26         protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita pepo subs...
XP_022969189.1  1.51e-190  91.92         protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima]
XP_022952157.1  6.16e-190  91.58         protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita moschata]
XP_008455361.1  1.24e-189  91.58         PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] >KAA003160...

ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A6J1C3B8      5.98e-206  100.00        protein THYLAKOID FORMATION1, chloroplastic isoform X1 OS=Momordica charantia OX...
A0A6J1I1V8      7.33e-191  91.92         protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 OS=Cucurbita maxima ...
A0A6J1GJN0      2.98e-190  91.58         protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 OS=Cucurbita moschat...
A0A5D3C7D3      6.02e-190  91.58         Protein THYLAKOID FORMATION1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A1S3C0V5      6.02e-190  91.58         protein THYLAKOID FORMATION1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495...

TAIR 10
Match Name      E-value    Identity (%)  Description
AT2G20890.1     2.8e-107   70.79         photosystem II reaction center PSB29 protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source   Source Term    Source Description                            Alignment
None       No IPR available  COILS    Coil           Coil                                          coord: 247..274
None       No IPR available  PANTHER  PTHR34793:SF7  PHOTOSYSTEM II BIOGENESIS PROTEIN             coord: 1..286
IPR017499  Protein Thf1      TIGRFAM  TIGR03060      TIGR03060                                     coord: 67..272; e-value: 1.6E-47; score: 160.0
IPR017499  Protein Thf1      PFAM     PF11264        ThylakoidFormat                               coord: 70..275; e-value: 1.3E-75; score: 253.8
IPR017499  Protein Thf1      PANTHER  PTHR34793      PROTEIN THYLAKOID FORMATION 1, CHLOROPLASTIC  coord: 1..286
IPR017499  Protein Thf1      HAMAP    MF_01843       Thf1                                          coord: 65..273; score: 22.203957
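
The coordinates in the Alignment column can be mapped back onto the protein sequence to see which residues each signature covers. The sketch below assumes the coordinates are 1-based and inclusive positions on the 298-residue protein listed earlier on this page; the helper `subseq` and the `HITS` dictionary are hypothetical names introduced for illustration.

```python
# Protein sequence of MC04g0167, copied from the "Protein sequence" section.
PROTEIN = (
    "MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH"
    "CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF"
    "VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK"
    "EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV"
    "DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF"
)

# Signature hits and their coordinates as listed in the table above.
HITS = {
    "PFAM PF11264": (70, 275),
    "TIGRFAM TIGR03060": (67, 272),
    "HAMAP MF_01843": (65, 273),
    "COILS Coil": (247, 274),
}

def subseq(seq, start, end):
    # Assumption: 1-based, inclusive coordinates, hence the start - 1 offset.
    return seq[start - 1:end]

for name, (start, end) in HITS.items():
    region = subseq(PROTEIN, start, end)
    print(f"{name}: {len(region)} aa, begins {region[:6]}")
```

Under that assumption the Pfam ThylakoidFormat match covers 206 residues, starting just downstream of the N-terminal region that is absent from the cyanobacterial Thf1 hits in the BLAST results.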

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MC04g0167.1   MC04g0167.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0010207      photosystem II assembly
biological_process  GO:0045037      protein import into chloroplast stroma
biological_process  GO:0045038      protein import into chloroplast thylakoid membrane
biological_process  GO:0010027      thylakoid membrane organization
biological_process  GO:0015979      photosynthesis