Tan0018023 (gene) Snake gourd v1

Overview
Name: Tan0018023
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein THYLAKOID FORMATION1, chloroplastic
Location: LG02: 2293785 .. 2296717 (+)
RNA-Seq Expression: Tan0018023
Synteny: Tan0018023
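
The Location field packs the linkage group, start, end and strand into one string. As a minimal sketch, it could be split apart as follows. The function name parse_location is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'LG02: 2293785 .. 2296717 (+)' style string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }

loc = parse_location("LG02: 2293785 .. 2296717 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2933 bp on the plus strand of LG02.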
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACCCGCTCTTTCTTTCTTTTTTCCCAGAACTTTTCTTCCAATTTCATGTTCTATAGGATTTTTCTCCCACTCTTCTTCTTCGATATGAAATCCATTTGCTCTGGAAGTTCGTAAGCTTTTCAAGTTTCTTCTCATTCTTCTACAATGGCGGCTGTTAATTCCGTATCATTCTCAACATTAAGTCAATGTGCTGATAGAAGGTTGCCGGTTTCGTCGGCTCGTTCACTCGCCTCGAATTTCGACGGGTTCCGTTTTCGTTCGAGCGTTTTCTGTCATTATTCGGGAGTTCGAACCTCGGGTTTCAGTTCTCGCTTGGTTATTCATTGCATGTCCGCCGGAACAGGTACCTTCTGGTGATTTGTTTTCCTTGCTCGCTTATTAGACTGTTCCGTTGCTAGATAATGGTAGCAGAAAAGAATAGGCTCGTCTTGCTGATTCTGATCGAGTTTGATTTTCGTAGTGAATATGTGTTTTGATTTTTCGTCAGTTGTAGAGATGGAAAGTTATGGACGGAATTATTAGTTTCGATAGTGCCGGTGGATCAATTCTAAATTTGTGCAGTGCGTTTGAGATTTCTCTAGTTCGATTTTTTAGTGTAGAAACTGATGCGATATTTTGACGGCTCAGATGTGACTACTGTAGCCGAGACTAAATCGAACTTTCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAATACTGTTCTACAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGAGAGGCCATTTTCCAAGCATACATTAAGGCATTGAATGAGGATCCAGAGCAATATAGGTTTGGCCATGTATTCCCCTTTGCTTTTACCATCCCTGGTCTCTATCTCTTCTTGCCAGTTAATAATCTCATTAGCCTTGTTTATGGGTAGTAAACACTGTTAAGTTGAAAAATCATGGCATGATTCCAATTACAGCAAAAAGTCGATGATTCTTTTTTGAACTCTATATCCTCTTTTCTAGTATTTCACGTGTTAATTTCGACATTGTAGTCTAAACGACTATTGATCGGTCAAATTACATGGTATCAAATGTTAATGGATGGTCAAATTAGTATCAATGACTTTTATTTGTACCTGGTGAAAAATAGTATCAAGAGTGTAATGTTCTTCATCACTGGAAATTATATTCCAACCTTCAACCATTATTCCAATCATTTCTAGAACGTTTTTCCTTATTTTAAGTAGTATTTTTCTTTTATACTGTTCAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAGGCTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGACTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGGTTTGTTTCATATTCCTTCTTGTGATATAGCTCTTAAATAGATGAAGAATTTTGATGTGACCATTTATGTAGCTTATTATTGCTACATTAGATGATAGCAAGCCGAATAGGACTTTTACTGGTAGATTATATTCTCTCTCCGAGTGCTGCTATTTGTAAAGTATTATTGCGTTTCTTGCTCATTGGAGAAATTGGTTCTTGATAGAATATTCCTTAACTTTATTCTGTTCCAGAGAATAAGGTTCAAGGATTCTCTCCTGCATAAGACTGCATTTTATTTTTCTAGTTAACATAAGTCTTTTTGTGGTGTTTGGTCGGAGATGAATATGAGCCCCACACACACGCAGTCACAGGGCTTTTGTGGGATTTCCTTTCTTCTCCTCTATGGAGGTTGTGCTGATCTTTTGGATGTAGTTCATACTCTTTAGTAGGCTCTGCTGGTCATCTTCATCCACTTT
ATTAGCTCTTGTAAATGATTGTACGACTCTATTTATCACATCTGTCACCAAAGATACTTTTGTTCTTTTAAAACAGCTGAAGCTATATAGTTGAACATTTTCTAATTTTTCTGCTTCTGAACAGCTCTGTGCCGCTTTAAATGTTGACAAAAAGAGCGTGGACAGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAGGCGAAAGAGCTCTTAAAGGAATATGTTGACAGGTAAGTGATTGTTGTGTTCACAGTTCCAGGCATAGTAGATAAACAAACTCACAATATGCTCTCACACCACCATATTTTCATAGATTTTATGAATCTATAATCGATCTGAAATCTGCCAATCTAGTATTTGTCAAATCTCCTGGTGGATCATGGTGCATCTGAGAATCAGTTTTTAACAATAAAAGTTGTATGATTAACCAGTGCTTTTCAACTGTGCTTTTTTACCTATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCTGGATCTGGATCACAGACAGCTAATGAGGCAATAACAAAATGCTTGGGAGAATACAACTTGTAGACTGGTTTTTGAGAGCTGATTACATCAATTGGAGCACTGCCTAATTTGAGATAAGACTTGAGAGTTATAGCAATATTCTAAATTTACCTGATAGATATGCTGCATTTGGTAATGTGTTGTTGGGTCTTTTCGCATTTGGTAAATTTTGTATTAAGCCAGGCACTACCTTTATTCAAGCTATTTTTTAATCATTTGTATTACATATTTTGAGTGCAATTGTCAATACAAATGCTTTCGCACTCTGTGTAGTAATCATATTTTGCAGTTCTCTCCTTCAAGGGATATTTGACCATTTACTTTTGTTTGTGCAATTATTTCTGGATTCTTAGTGAGTTCAAAATATATATATTGAAGGTATATGAGAA

mRNA sequence

CACCCGCTCTTTCTTTCTTTTTTCCCAGAACTTTTCTTCCAATTTCATGTTCTATAGGATTTTTCTCCCACTCTTCTTCTTCGATATGAAATCCATTTGCTCTGGAAGTTCGTAAGCTTTTCAAGTTTCTTCTCATTCTTCTACAATGGCGGCTGTTAATTCCGTATCATTCTCAACATTAAGTCAATGTGCTGATAGAAGGTTGCCGGTTTCGTCGGCTCGTTCACTCGCCTCGAATTTCGACGGGTTCCGTTTTCGTTCGAGCGTTTTCTGTCATTATTCGGGAGTTCGAACCTCGGGTTTCAGTTCTCGCTTGGTTATTCATTGCATGTCCGCCGGAACAGATGTGACTACTGTAGCCGAGACTAAATCGAACTTTCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAATACTGTTCTACAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGAGAGGCCATTTTCCAAGCATACATTAAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAGGCTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGACTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGCTCTGTGCCGCTTTAAATGTTGACAAAAAGAGCGTGGACAGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAGGCGAAAGAGCTCTTAAAGGAATATGTTGACAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCTGGATCTGGATCACAGACAGCTAATGAGGCAATAACAAAATGCTTGGGAGAATACAACTTGTAGACTGGTTTTTGAGAGCTGATTACATCAATTGGAGCACTGCCTAATTTGAGATAAGACTTGAGAGTTATAGCAATATTCTAAATTTACCTGATAGATATGCTGCATTTGGTAATGTGTTGTTGGGTCTTTTCGCATTTGGTAAATTTTGTATTAAGCCAGGCACTACCTTTATTCAAGCTATTTTTTAATCATTTGTATTACATATTTTGAGTGCAATTGTCAATACAAATGCTTTCGCACTCTGTGTAGTAATCATATTTTGCAGTTCTCTCCTTCAAGGGATATTTGACCATTTACTTTTGTTTGTGCAATTATTTCTGGATTCTTAGTGAGTTCAAAATATATATATTGAAGGTATATGAGAA

Coding sequence (CDS)

ATGGCGGCTGTTAATTCCGTATCATTCTCAACATTAAGTCAATGTGCTGATAGAAGGTTGCCGGTTTCGTCGGCTCGTTCACTCGCCTCGAATTTCGACGGGTTCCGTTTTCGTTCGAGCGTTTTCTGTCATTATTCGGGAGTTCGAACCTCGGGTTTCAGTTCTCGCTTGGTTATTCATTGCATGTCCGCCGGAACAGATGTGACTACTGTAGCCGAGACTAAATCGAACTTTCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAATACTGTTCTACAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGAGAGGCCATTTTCCAAGCATACATTAAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAGGCTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGACTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGCTCTGTGCCGCTTTAAATGTTGACAAAAAGAGCGTGGACAGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAGGCGAAAGAGCTCTTAAAGGAATATGTTGACAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCTGGATCTGGATCACAGACAGCTAATGAGGCAATAACAAAATGCTTGGGAGAATACAACTTGTAG

Protein sequence

MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
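
The protein sequence above should be the translation of the coding sequence. The sketch below checks this on the first 30 and last 18 nucleotides of the CDS only, assuming the standard nuclear genetic code applies. The names CODON_TABLE and translate are illustrative and not part of any database tooling.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)

cds_head = "ATGGCGGCTGTTAATTCCGTATCATTCTCA"  # first 30 nt of the CDS above
cds_tail = "GGAGAATACAACTTGTAG"              # last 18 nt, including the stop codon

print(translate(cds_head))  # matches the start of the protein: MAAVNSVSFS
print(translate(cds_tail))  # matches the end of the protein: GEYNL
```

Passing the full CDS string to translate would allow the same comparison over the whole protein.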
Homology
BLAST of Tan0018023 vs. ExPASy Swiss-Prot
Match: Q7XAB8 (Protein THYLAKOID FORMATION1, chloroplastic OS=Solanum tuberosum OX=4113 GN=THF1 PE=2 SV=1)

HSP 1 Score: 397.9 bits (1021), Expect = 1.0e-109
Identity = 211/297 (71.04%), Positives = 247/297 (83.16%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAV SVSFS ++Q A+R+  VSS+RS+    D FRFRS+       VR+S  +SR V+H
Sbjct: 1   MAAVTSVSFSAITQSAERKSSVSSSRSI----DTFRFRSNFSFDSVNVRSSNSTSRFVVH 60

Query: 61  C-MSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALG 120
           C  S+  D+ TVA+TK  FL AYKRPIP++YNTVLQELIVQQHL RYK++Y+YDPVFALG
Sbjct: 61  CTSSSAADLPTVADTKLKFLTAYKRPIPTVYNTVLQELIVQQHLTRYKKSYQYDPVFALG 120

Query: 121 FVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS 180
           FVTVYDQLMEGYPS+EDR AIF+AYI+AL EDPEQYR DAQKLEEWAR+Q A +LV+F+S
Sbjct: 121 FVTVYDQLMEGYPSEEDRNAIFKAYIEALKEDPEQYRADAQKLEEWARTQNANTLVDFSS 180

Query: 181 KEGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKS 240
           KEGE E+I KDIA+RAG+K  F YSR FA+GLFRLLELAN T+P+ILEKLCAALNV+KKS
Sbjct: 181 KEGEIENIFKDIAQRAGTKDGFCYSRLFAVGLFRLLELANVTDPTILEKLCAALNVNKKS 240

Query: 241 VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEY 297
           VDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER      +Q ANE +TKCLG+Y
Sbjct: 241 VDRDLDVYRNLLSKLVQAKELLKEYVEREKKKRGERE-----TQKANETVTKCLGDY 288
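
The percentages in each HSP header are the identical or positive-scoring positions divided by the alignment length. The sketch below reproduces the figures for the Q7XAB8 hit. The helper name hsp_percentages is hypothetical, and rounding to two decimals is an assumption that happens to match the values shown on this page.

```python
def hsp_percentages(identities, positives, align_len):
    """Return (identity %, positives %) for one HSP, rounded to two decimals."""
    if align_len <= 0:
        raise ValueError("alignment length must be positive")
    identity_pct = round(100 * identities / align_len, 2)
    positives_pct = round(100 * positives / align_len, 2)
    return identity_pct, positives_pct

# Counts taken from the Q7XAB8 HSP header: 211/297 identical, 247/297 positive.
identity_pct, positives_pct = hsp_percentages(211, 247, 297)
print(identity_pct, positives_pct)
```

The same function applied to the other headers, for example 208, 248 and 295 for Q9SKT0, gives the percentages reported there.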

BLAST of Tan0018023 vs. ExPASy Swiss-Prot
Match: Q9SKT0 (Protein THYLAKOID FORMATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THF1 PE=1 SV=1)

HSP 1 Score: 380.6 bits (976), Expect = 1.7e-104
Identity = 208/295 (70.51%), Positives = 248/295 (84.07%), Query Frame = 0

Query: 3   AVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVR-TSGFSSRLVIHC 62
           A++S+SF  L Q +D+    +S+R LAS          +   +S +   S  +S+ +IHC
Sbjct: 5   AISSLSFPALGQ-SDKISNFASSRPLASAI-------RICTKFSRLSLNSRSTSKSLIHC 64

Query: 63  MSAGT-DVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 122
           MS  T DV  V+ETKS FLKAYKRPIPSIYNTVLQELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 65  MSNVTADVPPVSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGF 124

Query: 123 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 182
           VTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQT+ASLV+F+SK
Sbjct: 125 VTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSK 184

Query: 183 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 242
           EG+ E++LKDIA RAGSK  FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN++KKSV
Sbjct: 185 EGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSV 244

Query: 243 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGE 296
           DRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA     SQ ANE I+KCLG+
Sbjct: 245 DRDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERA----QSQKANETISKCLGD 287

BLAST of Tan0018023 vs. ExPASy Swiss-Prot
Match: Q84PB7 (Protein THYLAKOID FORMATION1, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=THF1 PE=2 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 4.0e-98
Identity = 200/298 (67.11%), Positives = 236/298 (79.19%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAA++S+ F+ L + AD R   ++A + A          +V       R     SR V+ 
Sbjct: 1   MAAISSLPFAALRRAADCRPSTAAAAAGAG-------AGAVVLSVRPRR----GSRSVVR 60

Query: 61  CMSAGTDV-TTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALG 120
           C++   DV  TVAETK NFLK+YKRPI SIY+TVLQEL+VQQHLMRYK TY+YD VFALG
Sbjct: 61  CVATAGDVPPTVAETKMNFLKSYKRPILSIYSTVLQELLVQQHLMRYKTTYQYDAVFALG 120

Query: 121 FVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS 180
           FVTVYDQLMEGYPS+EDR+AIF+AYI ALNEDPEQYR DAQK+EEWARSQ   SLVEF+S
Sbjct: 121 FVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNSLVEFSS 180

Query: 181 KEGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKS 240
           K+GE E+ILKDI+ERA  KGSFSYSRFFA+GLFRLLELANATEP+IL+KLCAALN++K+S
Sbjct: 181 KDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAALNINKRS 240

Query: 241 VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYN 298
           VDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ER    S +  +NEA+TK  G  N
Sbjct: 241 VDRDLDVYRNILSKLVQAKELLKEYVEREKKKREER----SETPKSNEAVTKFDGSLN 283

BLAST of Tan0018023 vs. ExPASy Swiss-Prot
Match: B0C3M8 (Protein Thf1 OS=Acaryochloris marina (strain MBIC 11017) OX=329726 GN=thf1 PE=3 SV=1)

HSP 1 Score: 151.8 bits (382), Expect = 1.3e-35
Identity = 88/238 (36.97%), Positives = 141/238 (59.24%), Query Frame = 0

Query: 67  DVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQ 126
           ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+
Sbjct: 3   NLRTVSDTKRAFYSIHTRPVNSVYRRVVEELMVEMHLLRVNEDFRYDPIFALGVTTSFDR 62

Query: 127 LMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEF---ASKEG- 186
            M+GY  + D++AIF A  KA   DP Q + D Q+L E A+S++A  ++++   A+  G 
Sbjct: 63  FMDGYQPENDKDAIFSAICKAQEADPVQMKKDGQRLTELAQSKSAQEMLDWITQAANSGG 122

Query: 187 -EAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALN 246
            E +  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LN
Sbjct: 123 DELQWQLRNIAQNP----KFKYSRLFAIGLFTLLELSEGNITQDEESLAEFLPNICTVLN 182

Query: 247 VDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKR--DERAGSGSGSQTANEAIT 291
           + +  + +DL++YR  L K+ Q ++ + + ++ +KK+R  D+    GS      EA T
Sbjct: 183 ISESKLQKDLEIYRGNLDKIAQVRQAMDDILEAQKKRREADQAKKEGSDDTPTTEAST 236

BLAST of Tan0018023 vs. ExPASy Swiss-Prot
Match: Q116P5 (Protein Thf1 OS=Trichodesmium erythraeum (strain IMS101) OX=203124 GN=thf1 PE=3 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 5.9e-33
Identity = 83/233 (35.62%), Positives = 138/233 (59.23%), Query Frame = 0

Query: 70  TVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLME 129
           TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+
Sbjct: 6   TVSDTKKTFYHFHTRPINSIYNRVIEELLVEMHLISVNVDYSYNPFYALGVVTAFDRFMQ 65

Query: 130 GYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEF--ASKEGEAESI 189
           GY   ED+ +IF A I+   EDP +YR DA+ LE+ A   +A+ ++ +   SK  +    
Sbjct: 66  GYSPQEDKTSIFNALIQGQEEDPNKYRSDAKGLEDLAGKISASDILSWICLSKNIDNTQY 125

Query: 190 LKDIAERAGSKGSFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNVDKKSV 249
           L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ +
Sbjct: 126 LQDDLRAISENSKFRYSRLFAIGLFTLLEIVDTELIKEQEKRTEALKKICQSLNLVEEKL 185

Query: 250 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA-----GSGSGSQTANEA 289
            +D+D+Y + L ++ QA+  +++ +   +KKR++R+      S SG++T+ ++
Sbjct: 186 LKDIDLYLSNLERVAQARSAMEDTLAAMRKKREKRSLEKVNLSTSGNKTSEDS 238

BLAST of Tan0018023 vs. NCBI nr
Match: XP_022136235.1 (protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia])

HSP 1 Score: 530.4 bits (1365), Expect = 1.0e-146
Identity = 276/298 (92.62%), Positives = 287/298 (96.31%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAVNSVSFS LSQC++RRL V SARSLASNFDGFRFR+SVFCHYSGVRTS +SSR+V+H
Sbjct: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60

Query: 61  CMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFASK
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180

Query: 181 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
           EGE ESILKDIAERAG KGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNV+KKSV
Sbjct: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTANEAITKCLGEY++
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA----GSQTANEAITKCLGEYSM 294

BLAST of Tan0018023 vs. NCBI nr
Match: XP_022969189.1 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 523.9 bits (1348), Expect = 9.3e-145
Identity = 270/298 (90.60%), Positives = 287/298 (96.31%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAVNSVSFS LSQC+DRRLP+ SARSLAS+FDGFRFR SVFCHYSGVRTS FSSR+VIH
Sbjct: 1   MAAVNSVSFSGLSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTSSFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETK+NFLKAYKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 180
           VTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
           EGE ESILKDIAERAGSKG+FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNVDKK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTA+EAITKCLGEY++
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA----GSQTASEAITKCLGEYSM 294

BLAST of Tan0018023 vs. NCBI nr
Match: XP_008455361.1 (PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] >KAA0031606.1 protein THYLAKOID FORMATION1 [Cucumis melo var. makuwa] >TYK07058.1 protein THYLAKOID FORMATION1 [Cucumis melo var. makuwa])

HSP 1 Score: 523.5 bits (1347), Expect = 1.2e-144
Identity = 271/298 (90.94%), Positives = 285/298 (95.64%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAVNS+SFSTL+QC+DRR PV S+RSL+SNFDGFRFR+S+F HYS VR S FSSR+VIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETK NFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
           EGE ESILKDIAERAGSKG+FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL 299
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERA    GSQTANEAITKCLGEY++
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERA----GSQTANEAITKCLGEYSM 294

BLAST of Tan0018023 vs. NCBI nr
Match: XP_023554556.1 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 521.9 bits (1343), Expect = 3.5e-144
Identity = 268/298 (89.93%), Positives = 286/298 (95.97%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAVNSVSF+ LSQC+DRRLP+ SARSLAS+FDGFRFR SVFCHYSGVRTS FSSR+VIH
Sbjct: 1   MAAVNSVSFTALSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTSSFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETK+NFLKAYKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 180
           VTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
           EGE ESILKDIAERAG KG+FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNVDKK V
Sbjct: 181 EGEVESILKDIAERAGGKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTA+EAITKCLGEY++
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA----GSQTASEAITKCLGEYSM 294

BLAST of Tan0018023 vs. NCBI nr
Match: XP_022952157.1 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 521.5 bits (1342), Expect = 4.6e-144
Identity = 268/298 (89.93%), Positives = 286/298 (95.97%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAVNSVSFS LSQC+DRRLP+ SARSLAS+FDGFRFR SVFCHYSGVRT  F+SR+VIH
Sbjct: 1   MAAVNSVSFSALSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTPSFNSRMVIH 60

Query: 61  CMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETK+NFLKAYKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 180
           VTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
           EGE ESILKDIAERAGSKG+FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNVDKK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTA+EAITKCLGEY++
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA----GSQTASEAITKCLGEYSM 294

BLAST of Tan0018023 vs. ExPASy TrEMBL
Match: A0A6J1C3B8 (protein THYLAKOID FORMATION1, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007981 PE=3 SV=1)

HSP 1 Score: 530.4 bits (1365), Expect = 4.8e-147
Identity = 276/298 (92.62%), Positives = 287/298 (96.31%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAVNSVSFS LSQC++RRL V SARSLASNFDGFRFR+SVFCHYSGVRTS +SSR+V+H
Sbjct: 1   MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60

Query: 61  CMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFASK
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180

Query: 181 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
           EGE ESILKDIAERAG KGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNV+KKSV
Sbjct: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTANEAITKCLGEY++
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA----GSQTANEAITKCLGEYSM 294

BLAST of Tan0018023 vs. ExPASy TrEMBL
Match: A0A6J1I1V8 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111468259 PE=3 SV=1)

HSP 1 Score: 523.9 bits (1348), Expect = 4.5e-145
Identity = 270/298 (90.60%), Positives = 287/298 (96.31%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAVNSVSFS LSQC+DRRLP+ SARSLAS+FDGFRFR SVFCHYSGVRTS FSSR+VIH
Sbjct: 1   MAAVNSVSFSGLSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTSSFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETK+NFLKAYKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 180
           VTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
           EGE ESILKDIAERAGSKG+FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNVDKK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTA+EAITKCLGEY++
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA----GSQTASEAITKCLGEYSM 294

BLAST of Tan0018023 vs. ExPASy TrEMBL
Match: A0A5D3C7D3 (Protein THYLAKOID FORMATION1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G003760 PE=3 SV=1)

HSP 1 Score: 523.5 bits (1347), Expect = 5.9e-145
Identity = 271/298 (90.94%), Positives = 285/298 (95.64%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAVNS+SFSTL+QC+DRR PV S+RSL+SNFDGFRFR+S+F HYS VR S FSSR+VIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETK NFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
           EGE ESILKDIAERAGSKG+FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL 299
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERA    GSQTANEAITKCLGEY++
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERA----GSQTANEAITKCLGEYSM 294

BLAST of Tan0018023 vs. ExPASy TrEMBL
Match: A0A1S3C0V5 (protein THYLAKOID FORMATION1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495547 PE=3 SV=1)

HSP 1 Score: 523.5 bits (1347), Expect = 5.9e-145
Identity = 271/298 (90.94%), Positives = 285/298 (95.64%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAVNS+SFSTL+QC+DRR PV S+RSL+SNFDGFRFR+S+F HYS VR S FSSR+VIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETK NFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
           EGE ESILKDIAERAGSKG+FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL 299
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERA    GSQTANEAITKCLGEY++
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERA----GSQTANEAITKCLGEYSM 294

BLAST of Tan0018023 vs. ExPASy TrEMBL
Match: A0A6J1GJN0 (protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111454918 PE=3 SV=1)

HSP 1 Score: 521.5 bits (1342), Expect = 2.2e-144
Identity = 268/298 (89.93%), Positives = 286/298 (95.97%), Query Frame = 0

Query: 1   MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIH 60
           MAAVNSVSFS LSQC+DRRLP+ SARSLAS+FDGFRFR SVFCHYSGVRT  F+SR+VIH
Sbjct: 1   MAAVNSVSFSALSQCSDRRLPIPSARSLASHFDGFRFRKSVFCHYSGVRTPSFNSRMVIH 60

Query: 61  CMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CM++GTDVTTVAETK+NFLKAYKRPIPSIYNTV+QELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 61  CMASGTDVTTVAETKANFLKAYKRPIPSIYNTVVQELIVQQHLMRYKKTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 180
           VTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+
Sbjct: 121 VTVYDRLMEGYPSDEDRDAIFQAYINALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
           EGE ESILKDIAERAGSKG+FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNVDKK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANASEPSILEKLCAALNVDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL 299
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTA+EAITKCLGEY++
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA----GSQTASEAITKCLGEYSM 294

BLAST of Tan0018023 vs. TAIR 10
Match: AT2G20890.1 (photosystem II reaction center PSB29 protein )

HSP 1 Score: 380.6 bits (976), Expect = 1.2e-105
Identity = 208/295 (70.51%), Positives = 248/295 (84.07%), Query Frame = 0

Query: 3   AVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVR-TSGFSSRLVIHC 62
           A++S+SF  L Q +D+    +S+R LAS          +   +S +   S  +S+ +IHC
Sbjct: 5   AISSLSFPALGQ-SDKISNFASSRPLASAI-------RICTKFSRLSLNSRSTSKSLIHC 64

Query: 63  MSAGT-DVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 122
           MS  T DV  V+ETKS FLKAYKRPIPSIYNTVLQELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 65  MSNVTADVPPVSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGF 124

Query: 123 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASK 182
           VTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQT+ASLV+F+SK
Sbjct: 125 VTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSK 184

Query: 183 EGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 242
           EG+ E++LKDIA RAGSK  FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN++KKSV
Sbjct: 185 EGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSV 244

Query: 243 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGE 296
           DRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA     SQ ANE I+KCLG+
Sbjct: 245 DRDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERA----QSQKANETISKCLGD 287

The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q7XAB8          1.0e-109  71.04         Protein THYLAKOID FORMATION1, chloroplastic OS=Solanum tuberosum OX=4113 GN=THF1...
Q9SKT0          1.7e-104  70.51         Protein THYLAKOID FORMATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=...
Q84PB7          4.0e-98   67.11         Protein THYLAKOID FORMATION1, chloroplastic OS=Oryza sativa subsp. japonica OX=3...
B0C3M8          1.3e-35   36.97         Protein Thf1 OS=Acaryochloris marina (strain MBIC 11017) OX=329726 GN=thf1 PE=3 ...
Q116P5          5.9e-33   35.62         Protein Thf1 OS=Trichodesmium erythraeum (strain IMS101) OX=203124 GN=thf1 PE=3 ...

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022136235.1  1.0e-146  92.62         protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia]
XP_022969189.1  9.3e-145  90.60         protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima]
XP_008455361.1  1.2e-144  90.94         PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] >KAA003160...
XP_023554556.1  3.5e-144  89.93         protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita pepo subs...
XP_022952157.1  4.6e-144  89.93         protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita moschata]

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1C3B8      4.8e-147  92.62         protein THYLAKOID FORMATION1, chloroplastic isoform X1 OS=Momordica charantia OX...
A0A6J1I1V8      4.5e-145  90.60         protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 OS=Cucurbita maxima ...
A0A5D3C7D3      5.9e-145  90.94         Protein THYLAKOID FORMATION1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A1S3C0V5      5.9e-145  90.94         protein THYLAKOID FORMATION1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495...
A0A6J1GJN0      2.2e-144  89.93         protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 OS=Cucurbita moschat...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT2G20890.1     1.2e-105  70.51         photosystem II reaction center PSB29 protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source   Source Term    Source Description                            Alignment
None       No IPR available  COILS    Coil           Coil                                          coord: 247..274
None       No IPR available  PANTHER  PTHR34793:SF7  PHOTOSYSTEM II BIOGENESIS PROTEIN             coord: 1..290
IPR017499  Protein Thf1      PFAM     PF11264        ThylakoidFormat                               coord: 70..275; e-value: 2.9E-76; score: 255.9
IPR017499  Protein Thf1      TIGRFAM  TIGR03060      TIGR03060                                     coord: 67..272; e-value: 1.2E-48; score: 163.7
IPR017499  Protein Thf1      PANTHER  PTHR34793      PROTEIN THYLAKOID FORMATION 1, CHLOROPLASTIC  coord: 1..290
IPR017499  Protein Thf1      HAMAP    MF_01843       Thf1                                          coord: 65..273; score: 22.490385

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0018023.1  Tan0018023.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0010207      photosystem II assembly
biological_process  GO:0045037      protein import into chloroplast stroma
biological_process  GO:0045038      protein import into chloroplast thylakoid membrane
biological_process  GO:0010027      thylakoid membrane organization
biological_process  GO:0015979      photosynthesis