Lag0004043 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0004043
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: PsbP domain-containing protein
Location: chr6: 670010 .. 675310 (-)
RNA-Seq Expression: Lag0004043
Synteny: Lag0004043
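The Location field can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the arithmetic assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split a string like 'chr6: 670010 .. 675310 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("chr6: 670010 .. 675310 (-)")
print(seqid, strand, span_length(start, end))  # chr6 - 5301
```

Under that assumption the gene spans 5301 positions on the minus strand of chr6.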
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGTCCCTTCCTTCACCGAGCGCTGTAATCCAACCTCCTCGCTCATGGCGCTTCTCAGAATCATCACTCTGCAATGGTAATCCTTCTACTCTACTCATCTTCCTCGCCTTTCACTTCATGCGCCATTTTAAATCCTATCTTCTTCCTTCTTCAATTTCAGGAATCGCCATTCCCATCCGCTCCAGACTCCGTGTCTTCTGCTCTTGCAAGAACACCGACATTTCCAATCGACAGCCCTGGTAAATCGGCATTTCTTCCTCTATCGTCACTATTACTCGTTTTCCTGTTCAATTCAAATGTTGCTTTAATGGAATCAATGCTCGTTCCTTCTGTTGGTTCTTAGTTATTGGGCGAACGGAGTTAGTAGACGAGAAGTTATGCTAGGCATGGGATTGGCCGCGTTTTCTTTTCAAGAAGTTGTTCCTAATGCCCTAGCTGAGAGTGGTACCGATTTGCCCTTCTTCCTCTCGAGCGTGTTCTCAGAAATTACATTATCCCCCCTTTGGTATTGTTTTATTTGAAGATTGAATGCGTTCAGTTTACGGGTGATAGTTGTTGTTGCTGAGGATTATCGGACATACACAGACGAAGCAAATAAGTTCAGATTGGTGATTCCTCAAGGTGTGTGGCTTAGCTTTTCGTATAGAGAATTAGAGATTGAACTGTATTAATTAGATGAACTTGTCCGATCTGTGTAGATTGGCAAGTGGGCAATGGTGAACCGAATGGATTCAAGTCGGTTACAGCCTTTTATCCTCAAGAAACTTCAAGTTCCAATGGTATTTTTAGATCTTCTCTTCTTTAATCTTCCTCGCTTCAGTTTTGAATATTTTGATAAAGCGAAATTTGGTTCAGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTTCACGAGAATGGAATCCTTTGGCAAGGTTGAAGAATTTGCCGATACCCTGGTATGACTTGCATATCATCTTGATGTTTTCGAAATGAAAAAAGAAAGCCAAAATTGGATCTCCTTTTGCAGTTCTTGAACGAAACAAGAAACTAGCTAGCTCTTTTCTCTTGCAGTTTCTTAAATAGACAAAAGAAAGTCAAAATGGTGTCTATTATTTTGCAGTTTTTTAATGGAAATAGACATCTTAAATCTGGATTTTCCTCTTCTCTTGATATCTCTTTCAGGTAAGCGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCGAAACTTATAGACTGTAGATCATCTAAAGGTATACCAGTTGACTCCGCATTTACATAACCATAGTTAAAATGAAACTCAATAGAAACTTGTTTATAAAACTTTATAAAACATATTAATAGTTTCCTAGACTTTCAAATTTATGTCTAATACATATATATATATATATATATTTTTTGACAATTCAAATTTATGTCTAATAAGTTCCTAACATATTCCAGAATTTTTAAAATTAACAAAAACAATTTAGGTTTATTTGGACCATAAACTTCAAATTTTGTATATTGTAGGCTTGTGAGTTTTAAAAAATGTTGAATAGGTTAGACACAAAATTGAGAGTTGAAGAACTTATTGCACACAAAAGTGAAAGTTTAAGGACTCACTAGATATTTTTTAAAATTCAAGGATCTATTGAGCATAAAATTGAAATTTTCGAACCCTTATATTTTTAAAGTTTAGATATCAAATTAATACCATCCTCAAAATTTAGGGACTACAATTGTAATTAAACCTAAAAACTGTATATCCCACCTTTTGTCATATTTCATAATATAACGAGGTTTGATTTGGACAAAAAAAAAATCGCAGGAATATATTACATTGAGTACACGCTGCAGAATCCAGGTGAAAGCCGCAAACATTTATACTCTGCAATTGGGATGGCAAACAATGGCTGGTACAACAGACTTTACACCATAACAGGACAGGTAGCTTCCTCCGCCGTTCCTATCAAAACACTTGGTTTGTTTTTGGCTCAAAAATTTTAAGCCTTATGATTGAGTTATAAA
ATTTAATTCACAATGTTATAAAAATTCTCGAAGGAGTCCTCCCATAGAAAATTTGTTAGCAAAAAGGTAATGAGATTTTAAAACAGATTCAAGTTTTTGTCAAGTGATTAATAGATAATTGACTGCAAGTAGTGTTGTAAAAGGCTCCAGACGCACTAAGGCGCAATGGCCTCCTGAGGCCTAGGCGCAAGGTGCACAAATAGGCGTTGATTATTTTCTTTAAGGCACACTAAATAGAAAAAAAACTCACAAAAATATATACATGTAAATACAACACTTTCATGACAAGTAATGAAGTTTCTAACCAAGAAAAATAGATAAAAGATAAACAAGGTTCAAATATTAGTCTTCCAACATAGAGTCAAATGTCAATACTCCACAAAAAAATATCCTTGAATCTAATATTTGACACCCCGAACGACCAAGTACGGGAGTCTCGTGCCTCAAGGGCTTTGAGGAAGGTGCGTGTCTTGGCCGCGCCTTCTTCATTGCGCCTTGCGTCTAGGCATGCTCGGGCGCACACCTTTTAAAACACTGACGGCAAGGACTAGAATAGTCTTTAAATTGAGAGACTAAAACAAATGTTTGAAAGTTGAATAACCAAAAATACCATATAAGAGACCAAAATAAGATTCAAATGAATCTAGATTTTTTTTTTTTTTTTTTATTTAGTTTGTTGAGTGGTGTAATGTGTTATATAGTATGCAGATGAAGAATCGGAGAACTATAGTTCGAAAGTTCAGAAGGTAAGTGAGAGATATCCCAAGTTTGAGTGGTTAATTGTCATATAATCTAATTCTGGAATCTGGCGTTTGTTTCACTGAAAGATAAGTGTGAATGTTGGTTTTGTAGGTTGTCAATTCCTTCACTTTCATTTAATGATGTCACAGAACTGGCTTCCACTACATTTGCTCATGGGGTTAAAGTTTTTCCACTTCAGCTTCCAATTATGGTGCCTTTTCTTTTTGTTCCATTTCTTATCCCCAATTTTATATTTGAAATACCTACATGATAATAATTTTGTTTTCTTAAATCATGCCTATACACACACACACACACACATAACTAAATCATGCTTATTTACTGATTTTTTTTTTTTTTTTTTTAATATACTTCAAATGTTTTGAAAGTTCTACTCAAGGTTTAAAAATAGTTTTCAAAAAAAATATTTTTTTCATTTTCAAAAGCATTTTTAAGAAGCTAAGAATTAAGATTTTTATGAATTCATCTTTTTTGTTTTTAAAGAGATTTTTTCGACTTGGTGAATTGAAAAAGAAAAAAAAAATTGTCAGAAGAGAGTTAAAATTGCTTTCCTCCCGAACTTGAGTGAGCAAATGGTGTAGTTATACGATTGAGATAGCTTTTGAAAAAGGTTGTTTTTTTAAACAAAATAAAATAAAAAAATATATTTTTTAAAATCAATACTGGAAGAATTACCAACTTAAAAGTGCTCTATACCACTTTCTATATGGATTTAGATGTCAATTTTTTTTTTTAAAGTTTAAATTGTTAGATAATAAATAAAGAGCGTTTTAAAGAATTAGATTGATTTAAAAACTCTTTGATAAAAAAAATTTTGAGAAATTATTATCCAAATGTGACTTTATTATTAATAAAAAAAATCACTTTGAAGAATAAAAACATATTTTATTAGTATTTGTGATGAGTAGAAATATTTTTAAATTTAATTCAACCAAATAAAATTTGTTTTAAGAGTACTTTTTTTAAAGTACAAATCAAATACATAATTAATAGTACTTTTTTACTAAAAATATTTTAAGAAGAAAAAAACTTAAATTAAAAGCCTTAACAATACATTATCTAAACACTACTCCATTGGTTAGTGTTAATATTCTATAATCTATAATATATAGGAAAAGTTATGTAAGAGAGTTTTTGCCACTAAATTTTTTTCATAAATTTATACTTTATTCTTCTCTTCCCATAATTAATTGTTTGAGTTGAGGTGTCTATTAATAGAAGTTGATTCAAGCATTTAT
TTGTATATGGGTATGATTCAAATTGTAAATGTGAAGCATCACAATCTAACGGTTGTAAATATGAAAGTACGGACGAACACATCTATCCAAAGCCACCTTTATAAATGAGGTTGGGGAGGAAATCTTTGCTTTTCTATTTTGCTTTTAAAATTTTTGTAAATTCTTATTTTTTTCATTTTATTCAACTATTGAAGTCGAGATGTTTGTTGAGAAAAGTTACTCCAATGGAGGTGGTTACATGTCAAAACAATACATCCATTTGCATATGGGTATGGTCCAACGATGTGTTTGTACCAATGAGCACATCCTTTATTAATGAGGCTATGATGAAGAATTTTTGCCTTCATATTTGGTCCTCTAAATTTTTATAAATTCTTATGTTGTCCTTTTCCAATAGTTTTGACTATCCAATTTTAAGTGTTTTTTTGAGAGAAATTAATCCAATGAAGGTAGTTACATGATGAAGTGAATCGATCCATCCATTCGCGTAAGAGTGTGATTTGTATGTATGAAACATCCACAATTTAATGTTGAAAATGTGTCTATAAAGACAGACACATAATCCAAAGAAAGCCACCTTTATATCTAATTCTTTGAAGCATTGTATCCCAAATGGTTAGATATTTGAAAAAAAAAACATTTATTGCAAACCATTTGTAATTATATATATATATAGATTATTCTAATGGATTGGGAAAAAAAAAAAAATCATATTTCATGTTTGGAAGAAAAATAAATAAATTCTTTTATCATAGCTTTTGGTTAACAAAAATGGGACTTAATTTGTAGTTGCACGATGAAATTGGCACTATGGAATCATTTGACAACCAATGAAGGTTTATGATAAACAATAATTTCAACACCGATTTTAAAGTGGACATGTATGAAGTTAATACTTTATGGACATACCATCTTTTTAATGTTATTTCATTTTAATCAATAAATCATATTTGACTATTTTTAATGTTATTTCTTTTTTTTTTCCTCTATGTTTCTAAAGTCAACATATCATGTTATAGGCAATTTTGGACCACCCCGATATACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTACTCGGCCGAGGCCCATGGCCGAGGCCGAGCATATGGTCGGCCGAGGCCGACCCTCGGTCCGCTCGTGCGGGCTGAGTCCGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGGTTGCCCCGGTTTTGCCTGGTTTGACCTAA

mRNA sequence

ATGGCGTCCCTTCCTTCACCGAGCGCTGTAATCCAACCTCCTCGCTCATGGCGCTTCTCAGAATCATCACTCTGCAATGGTAATCCTTCTACTCTACTCATCTTCCTCGCCTTTCACTTCATGCGCCATTTTAAATCCTATCTTCTTCCTTCTTCAATTTCAGGAATCGCCATTCCCATCCGCTCCAGACTCCGTGTCTTCTGCTCTTGCAAGAACACCGACATTTCCAATCGACAGCCCTGTTATTGGGCGAACGGAGTTAGTAGACGAGAAGTTATGCTAGGCATGGGATTGGCCGCGTTTTCTTTTCAAGAAGTTGTTCCTAATGCCCTAGCTGAGAGTGTTGTTGTTGCTGAGGATTATCGGACATACACAGACGAAGCAAATAAGTTCAGATTGGTGATTCCTCAAGATTGGCAAGTGGGCAATGGTGAACCGAATGGATTCAAGTCGGTTACAGCCTTTTATCCTCAAGAAACTTCAAGTTCCAATGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTTCACGAGAATGGAATCCTTTGGCAAGGTTGAAGAATTTGCCGATACCCTGGTAAGCGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCGAAACTTATAGACTGTAGATCATCTAAAGGAATATATTACATTGAGTACACGCTGCAGAATCCAGGTGAAAGCCGCAAACATTTATACTCTGCAATTGGGATGGCAAACAATGGCTGGTACAACAGACTTTACACCATAACAGGACAGGCTCCAGACGCACTAAGGCGCAATGGCCTCCTGAGGCCTAGGCGCAAGATGAAGAATCGGAGAACTATAGTTCGAAAGTTCAGAAGGTTGTCAATTCCTTCACTTTCATTTAATGATGTCACAGAACTGGCTTCCACTACATTTGCTCATGGGGTTAAAGTTTTTCCACTTCAGCTTCCAATTATGGCAATTTTGGACCACCCCGATATACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTACTCGGCCGAGGCCCATGGCCGAGGCCGAGCATATGGTCGGCCGAGGCCGACCCTCGGTCCGCTCGTGCGGGCTGAGTCCGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGGTTGCCCCGGTTTTGCCTGGTTTGACCTAA

Coding sequence (CDS)

ATGGCGTCCCTTCCTTCACCGAGCGCTGTAATCCAACCTCCTCGCTCATGGCGCTTCTCAGAATCATCACTCTGCAATGGTAATCCTTCTACTCTACTCATCTTCCTCGCCTTTCACTTCATGCGCCATTTTAAATCCTATCTTCTTCCTTCTTCAATTTCAGGAATCGCCATTCCCATCCGCTCCAGACTCCGTGTCTTCTGCTCTTGCAAGAACACCGACATTTCCAATCGACAGCCCTGTTATTGGGCGAACGGAGTTAGTAGACGAGAAGTTATGCTAGGCATGGGATTGGCCGCGTTTTCTTTTCAAGAAGTTGTTCCTAATGCCCTAGCTGAGAGTGTTGTTGTTGCTGAGGATTATCGGACATACACAGACGAAGCAAATAAGTTCAGATTGGTGATTCCTCAAGATTGGCAAGTGGGCAATGGTGAACCGAATGGATTCAAGTCGGTTACAGCCTTTTATCCTCAAGAAACTTCAAGTTCCAATGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTTCACGAGAATGGAATCCTTTGGCAAGGTTGAAGAATTTGCCGATACCCTGGTAAGCGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCGAAACTTATAGACTGTAGATCATCTAAAGGAATATATTACATTGAGTACACGCTGCAGAATCCAGGTGAAAGCCGCAAACATTTATACTCTGCAATTGGGATGGCAAACAATGGCTGGTACAACAGACTTTACACCATAACAGGACAGGCTCCAGACGCACTAAGGCGCAATGGCCTCCTGAGGCCTAGGCGCAAGATGAAGAATCGGAGAACTATAGTTCGAAAGTTCAGAAGGTTGTCAATTCCTTCACTTTCATTTAATGATGTCACAGAACTGGCTTCCACTACATTTGCTCATGGGGTTAAAGTTTTTCCACTTCAGCTTCCAATTATGGCAATTTTGGACCACCCCGATATACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTACTCGGCCGAGGCCCATGGCCGAGGCCGAGCATATGGTCGGCCGAGGCCGACCCTCGGTCCGCTCGTGCGGGCTGAGTCCGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGGTTGCCCCGGTTTTGCCTGGTTTGACCTAA

Protein sequence

MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPDALRRNGLLRPRRKMKNRRTIVRKFRRLSIPSLSFNDVTELASTTFAHGVKVFPLQLPIMAILDHPDIQGADEDNRGEIGLKDGPRRQNRQMGRAKTEGVGFSARPPTRPRPMAEAEHMVGRGRPSVRSCGLSPFGLVWSPPPLVAPVLPGLT
Homology
BLAST of Lag0004043 vs. NCBI nr
Match: XP_038885576.1 (psbP domain-containing protein 3, chloroplastic [Benincasa hispida])

HSP 1 Score: 412.9 bits (1060), Expect = 3.2e-111
Identity = 208/265 (78.49%), Positives = 220/265 (83.02%), Query Frame = 0

Query: 1   MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPI 60
           MASLPSPSAVIQ PRSWRFS+SS  NG P                            IPI
Sbjct: 1   MASLPSPSAVIQRPRSWRFSQSSPSNGLP----------------------------IPI 60

Query: 61  RSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVVVAED 120
           RS+LRVFCS  N +ISN+Q CYWA+GV+RRE+MLG+GL AFSFQEVV NALAESV+VAED
Sbjct: 61  RSKLRVFCSGNNINISNQQSCYWASGVNRREIMLGIGLTAFSFQEVVSNALAESVMVAED 120

Query: 121 YRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRME 180
           YRTYTDEANKFRL IPQDWQVGNGEPNGFKSVTAF+PQETSSSNVSVVISGLGPDFTRME
Sbjct: 121 YRTYTDEANKFRLAIPQDWQVGNGEPNGFKSVTAFFPQETSSSNVSVVISGLGPDFTRME 180

Query: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAI 240
           SFGKVEEFADTLVSGLDRSWKRPPGVAAKLI+CRSSKGIYYIEYTLQNPGESRKHLYSAI
Sbjct: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLINCRSSKGIYYIEYTLQNPGESRKHLYSAI 237

Query: 241 GMANNGWYNRLYTITGQAPDALRRN 266
           GMA+NGWYNRLYTITGQ  D    N
Sbjct: 241 GMASNGWYNRLYTITGQYADEESEN 237
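The percentages in an HSP statistics line are the matched count divided by the alignment length. A minimal sketch of that arithmetic, assuming the `label = matched/total (percent%)` layout used in the hit above; the function names `parse_hsp_stats` and `percent` are hypothetical and not part of any BLAST tool.

```python
import re

def parse_hsp_stats(line):
    """Extract 'label = matched/total (pct%)' fields from an HSP statistics line."""
    stats = {}
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    for label, matched, total, pct in re.findall(pattern, line):
        stats[label] = (int(matched), int(total), float(pct))
    return stats

def percent(matched, total):
    # Percentage of aligned columns, rounded to two decimals as in the report.
    return round(100.0 * matched / total, 2)

line = "Identity = 208/265 (78.49%), Positives = 220/265 (83.02%), Query Frame = 0"
for label, (matched, total, reported) in parse_hsp_stats(line).items():
    print(label, percent(matched, total), reported)
```

For this hit the recomputed values agree with the reported ones: 208/265 gives 78.49% and 220/265 gives 83.02%.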

BLAST of Lag0004043 vs. NCBI nr
Match: XP_022155884.1 (psbP domain-containing protein 3, chloroplastic isoform X1 [Momordica charantia])

HSP 1 Score: 412.9 bits (1060), Expect = 3.2e-111
Identity = 220/301 (73.09%), Positives = 234/301 (77.74%), Query Frame = 0

Query: 1   MASL--PSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAI 60
           MASL  PSPSAVIQ PR WRF ESSL N                            GIAI
Sbjct: 1   MASLPSPSPSAVIQRPRPWRFRESSLSN----------------------------GIAI 60

Query: 61  PIRSRLR--VFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVV 120
            IRS+ +  V CSC N DIS+ Q CYWA+GV+RRE+MLG+ L+ FSFQ VV N+LAESVV
Sbjct: 61  HIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSTFSFQAVVSNSLAESVV 120

Query: 121 VAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDF 180
           VAED+RTYTDEANKFRLVIPQDW VGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDF
Sbjct: 121 VAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDF 180

Query: 181 TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHL 240
           TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHL
Sbjct: 181 TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHL 240

Query: 241 YSAIGMANNGWYNRLYTITGQAPDALRRNGLLRPRRKMKNRRTIVRKFRRLSIPSLSFND 298
           YSAIGMA+NGWYNRLYTITGQ                MKNRRTIV K RR SIPSLSF++
Sbjct: 241 YSAIGMASNGWYNRLYTITGQ----------------MKNRRTIVPKLRRSSIPSLSFDE 257

BLAST of Lag0004043 vs. NCBI nr
Match: XP_022946502.1 (psbP domain-containing protein 3, chloroplastic [Cucurbita moschata] >KAG6599083.1 PsbP domain-containing protein 3, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 396.4 bits (1017), Expect = 3.1e-106
Identity = 201/260 (77.31%), Positives = 214/260 (82.31%), Query Frame = 0

Query: 1   MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPI 60
           MASLPSPSAVIQ PRSWRF+ SSL N                            GIAIPI
Sbjct: 1   MASLPSPSAVIQRPRSWRFTPSSLSN----------------------------GIAIPI 60

Query: 61  RSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVVVAED 120
           R+RLRVFCS KN DI +++PC W +GV+RRE++LGMGL AFSFQEVV  ALAES VVAED
Sbjct: 61  RTRLRVFCSGKNIDIPDQKPCCWTSGVNRREIVLGMGLTAFSFQEVVSIALAES-VVAED 120

Query: 121 YRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRME 180
           YRTYTDEANKFRLVIPQDWQVGNGEPNGFK VTAF+P+ET SSNVSVVISGLGPDFTRME
Sbjct: 121 YRTYTDEANKFRLVIPQDWQVGNGEPNGFKLVTAFFPKETLSSNVSVVISGLGPDFTRME 180

Query: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAI 240
           SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAI
Sbjct: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGEGRNHLYSAI 231

Query: 241 GMANNGWYNRLYTITGQAPD 261
           GMA+NGWYNRLYT+TGQ  D
Sbjct: 241 GMASNGWYNRLYTVTGQYGD 231

BLAST of Lag0004043 vs. NCBI nr
Match: KAG7030019.1 (PsbP domain-containing protein 3, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 396.4 bits (1017), Expect = 3.1e-106
Identity = 201/260 (77.31%), Positives = 214/260 (82.31%), Query Frame = 0

Query: 1   MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPI 60
           MASLPSPSAVIQ PRSWRF+ SSL N                            GIAIPI
Sbjct: 1   MASLPSPSAVIQRPRSWRFTPSSLSN----------------------------GIAIPI 60

Query: 61  RSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVVVAED 120
           R+RLRVFCS KN DI +++PC W +GV+RRE++LGMGL AFSFQEVV  ALAES VVAED
Sbjct: 61  RTRLRVFCSGKNIDIPDQKPCCWTSGVNRREIVLGMGLTAFSFQEVVSIALAES-VVAED 120

Query: 121 YRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRME 180
           YRTYTDEANKFRLVIPQDWQVGNGEPNGFK VTAF+P+ET SSNVSVVISGLGPDFTRME
Sbjct: 121 YRTYTDEANKFRLVIPQDWQVGNGEPNGFKLVTAFFPKETLSSNVSVVISGLGPDFTRME 180

Query: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAI 240
           SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAI
Sbjct: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGEGRNHLYSAI 231

Query: 241 GMANNGWYNRLYTITGQAPD 261
           GMA+NGWYNRLYT+TGQ  D
Sbjct: 241 GMASNGWYNRLYTVTGQYGD 231

BLAST of Lag0004043 vs. NCBI nr
Match: XP_023546046.1 (psbP domain-containing protein 3, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 394.0 bits (1011), Expect = 1.5e-105
Identity = 200/260 (76.92%), Positives = 213/260 (81.92%), Query Frame = 0

Query: 1   MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPI 60
           MASLPSPSAVIQ PRSWRF+ SSL N                            GIAIPI
Sbjct: 1   MASLPSPSAVIQRPRSWRFTPSSLSN----------------------------GIAIPI 60

Query: 61  RSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVVVAED 120
           R+RLRVFCS KN DI +++PC W +GV+RRE++LGMGL AFSFQEVV  ALAES VVAED
Sbjct: 61  RTRLRVFCSGKNIDIPDQKPCCWTSGVNRREIVLGMGLTAFSFQEVVSIALAES-VVAED 120

Query: 121 YRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRME 180
           YRTY DEANKFRLVIPQDWQVGNGEPNGFK VTAF+P+ET SSNVSVVISGLGPDFTRME
Sbjct: 121 YRTYIDEANKFRLVIPQDWQVGNGEPNGFKLVTAFFPRETLSSNVSVVISGLGPDFTRME 180

Query: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAI 240
           SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAI
Sbjct: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGEGRNHLYSAI 231

Query: 241 GMANNGWYNRLYTITGQAPD 261
           GMA+NGWYNRLYT+TGQ  D
Sbjct: 241 GMASNGWYNRLYTVTGQYGD 231

BLAST of Lag0004043 vs. ExPASy Swiss-Prot
Match: Q9S720 (PsbP domain-containing protein 3, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PPD3 PE=1 SV=2)

HSP 1 Score: 262.7 bits (670), Expect = 7.0e-69
Identity = 124/175 (70.86%), Positives = 143/175 (81.71%), Query Frame = 0

Query: 86  GVSRREVMLGMGLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGE 145
           G+ RR+VML +  + F     +  A AE+   +E +R YTDE NKF + IPQDWQVG  E
Sbjct: 54  GMKRRDVMLQIASSVFFLPLAISPAFAET-NASEAFRVYTDETNKFEISIPQDWQVGQAE 113

Query: 146 PNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPG 205
           PNGFKS+TAFYPQETS+SNVS+ I+GLGPDFTRMESFGKVE FA+TLVSGLDRSW++P G
Sbjct: 114 PNGFKSITAFYPQETSTSNVSIAITGLGPDFTRMESFGKVEAFAETLVSGLDRSWQKPVG 173

Query: 206 VAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD 261
           V AKLID R+SKG YYIEYTLQNPGE+RKHLYSAIGMA NGWYNRLYT+TGQ  D
Sbjct: 174 VTAKLIDSRASKGFYYIEYTLQNPGEARKHLYSAIGMATNGWYNRLYTVTGQFTD 227

BLAST of Lag0004043 vs. ExPASy TrEMBL
Match: A0A6J1DRN2 (psbP domain-containing protein 3, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022894 PE=4 SV=1)

HSP 1 Score: 412.9 bits (1060), Expect = 1.5e-111
Identity = 220/301 (73.09%), Positives = 234/301 (77.74%), Query Frame = 0

Query: 1   MASL--PSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAI 60
           MASL  PSPSAVIQ PR WRF ESSL N                            GIAI
Sbjct: 1   MASLPSPSPSAVIQRPRPWRFRESSLSN----------------------------GIAI 60

Query: 61  PIRSRLR--VFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVV 120
            IRS+ +  V CSC N DIS+ Q CYWA+GV+RRE+MLG+ L+ FSFQ VV N+LAESVV
Sbjct: 61  HIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSTFSFQAVVSNSLAESVV 120

Query: 121 VAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDF 180
           VAED+RTYTDEANKFRLVIPQDW VGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDF
Sbjct: 121 VAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDF 180

Query: 181 TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHL 240
           TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHL
Sbjct: 181 TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHL 240

Query: 241 YSAIGMANNGWYNRLYTITGQAPDALRRNGLLRPRRKMKNRRTIVRKFRRLSIPSLSFND 298
           YSAIGMA+NGWYNRLYTITGQ                MKNRRTIV K RR SIPSLSF++
Sbjct: 241 YSAIGMASNGWYNRLYTITGQ----------------MKNRRTIVPKLRRSSIPSLSFDE 257

BLAST of Lag0004043 vs. ExPASy TrEMBL
Match: A0A6J1G3W9 (psbP domain-containing protein 3, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111450544 PE=4 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 1.5e-106
Identity = 201/260 (77.31%), Positives = 214/260 (82.31%), Query Frame = 0

Query: 1   MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPI 60
           MASLPSPSAVIQ PRSWRF+ SSL N                            GIAIPI
Sbjct: 1   MASLPSPSAVIQRPRSWRFTPSSLSN----------------------------GIAIPI 60

Query: 61  RSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVVVAED 120
           R+RLRVFCS KN DI +++PC W +GV+RRE++LGMGL AFSFQEVV  ALAES VVAED
Sbjct: 61  RTRLRVFCSGKNIDIPDQKPCCWTSGVNRREIVLGMGLTAFSFQEVVSIALAES-VVAED 120

Query: 121 YRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRME 180
           YRTYTDEANKFRLVIPQDWQVGNGEPNGFK VTAF+P+ET SSNVSVVISGLGPDFTRME
Sbjct: 121 YRTYTDEANKFRLVIPQDWQVGNGEPNGFKLVTAFFPKETLSSNVSVVISGLGPDFTRME 180

Query: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAI 240
           SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAI
Sbjct: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGEGRNHLYSAI 231

Query: 241 GMANNGWYNRLYTITGQAPD 261
           GMA+NGWYNRLYT+TGQ  D
Sbjct: 241 GMASNGWYNRLYTVTGQYGD 231

BLAST of Lag0004043 vs. ExPASy TrEMBL
Match: A0A6J1KI01 (psbP domain-containing protein 3, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111494008 PE=4 SV=1)

HSP 1 Score: 393.7 bits (1010), Expect = 9.7e-106
Identity = 200/260 (76.92%), Positives = 213/260 (81.92%), Query Frame = 0

Query: 1   MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPI 60
           MASLPSPSAVIQ PRSWRF+ SSL N                            GIAIPI
Sbjct: 1   MASLPSPSAVIQRPRSWRFTPSSLSN----------------------------GIAIPI 60

Query: 61  RSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVVVAED 120
           R+RLRVFCS KN DI +++PC W +GV+RRE+ LGMGL AFSFQEVV  ALAE+ VVAED
Sbjct: 61  RTRLRVFCSGKNIDIPDQKPCCWTSGVNRREIGLGMGLTAFSFQEVVSIALAEN-VVAED 120

Query: 121 YRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRME 180
           YRTYTDEANKFRLVIPQDWQVGNGEPNGFK VTAF+P+ET SSNVSVVISGLGPDFTRME
Sbjct: 121 YRTYTDEANKFRLVIPQDWQVGNGEPNGFKLVTAFFPKETLSSNVSVVISGLGPDFTRME 180

Query: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAI 240
           SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAI
Sbjct: 181 SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGEGRNHLYSAI 231

Query: 241 GMANNGWYNRLYTITGQAPD 261
           GMA+NGWYNRLYT+TGQ  D
Sbjct: 241 GMASNGWYNRLYTVTGQYGD 231

BLAST of Lag0004043 vs. ExPASy TrEMBL
Match: A0A6J1DNN6 (psbP domain-containing protein 3, chloroplastic isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022894 PE=4 SV=1)

HSP 1 Score: 392.9 bits (1008), Expect = 1.7e-105
Identity = 204/269 (75.84%), Positives = 215/269 (79.93%), Query Frame = 0

Query: 1   MASL--PSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAI 60
           MASL  PSPSAVIQ PR WRF ESSL N                            GIAI
Sbjct: 1   MASLPSPSPSAVIQRPRPWRFRESSLSN----------------------------GIAI 60

Query: 61  PIRSRLR--VFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVV 120
            IRS+ +  V CSC N DIS+ Q CYWA+GV+RRE+MLG+ L+ FSFQ VV N+LAESVV
Sbjct: 61  HIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSTFSFQAVVSNSLAESVV 120

Query: 121 VAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDF 180
           VAED+RTYTDEANKFRLVIPQDW VGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDF
Sbjct: 121 VAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDF 180

Query: 181 TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHL 240
           TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHL
Sbjct: 181 TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHL 240

Query: 241 YSAIGMANNGWYNRLYTITGQAPDALRRN 266
           YSAIGMA+NGWYNRLYTITGQ  D    N
Sbjct: 241 YSAIGMASNGWYNRLYTITGQYADEESEN 241

BLAST of Lag0004043 vs. ExPASy TrEMBL
Match: A0A0A0KDI5 (PsbP domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G401450 PE=4 SV=1)

HSP 1 Score: 379.8 bits (974), Expect = 1.5e-101
Identity = 197/262 (75.19%), Positives = 209/262 (79.77%), Query Frame = 0

Query: 1   MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPI 60
           MASL SPSAVI  P S RFS+SSL NG                              IPI
Sbjct: 3   MASLLSPSAVILRPHSLRFSQSSLSNGFS---------------------------IIPI 62

Query: 61  RSRLRVFCSCKNTDI--SNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNALAESVVVA 120
           RS LRVFCS     I  SN++P Y A+GV+RRE+MLG+G  AFSFQEV  NALAESVVVA
Sbjct: 63  RSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVA 122

Query: 121 EDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTR 180
           EDYRTYTDEANKF LVIPQDWQVGNGEPNGFKSVTAF+PQETS+SNVSVVISGLGPD+TR
Sbjct: 123 EDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTR 182

Query: 181 MESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYS 240
           MESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYS
Sbjct: 183 MESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYS 237

Query: 241 AIGMANNGWYNRLYTITGQAPD 261
           AIGM++NGWYNRLYTITGQ  D
Sbjct: 243 AIGMSSNGWYNRLYTITGQYAD 237

BLAST of Lag0004043 vs. TAIR 10
Match: AT1G76450.1 (Photosystem II reaction center PsbP family protein )

HSP 1 Score: 262.7 bits (670), Expect = 5.0e-70
Identity = 124/175 (70.86%), Positives = 143/175 (81.71%), Query Frame = 0

Query: 86  GVSRREVMLGMGLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGE 145
           G+ RR+VML +  + F     +  A AE+   +E +R YTDE NKF + IPQDWQVG  E
Sbjct: 54  GMKRRDVMLQIASSVFFLPLAISPAFAET-NASEAFRVYTDETNKFEISIPQDWQVGQAE 113

Query: 146 PNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPG 205
           PNGFKS+TAFYPQETS+SNVS+ I+GLGPDFTRMESFGKVE FA+TLVSGLDRSW++P G
Sbjct: 114 PNGFKSITAFYPQETSTSNVSIAITGLGPDFTRMESFGKVEAFAETLVSGLDRSWQKPVG 173

Query: 206 VAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD 261
           V AKLID R+SKG YYIEYTLQNPGE+RKHLYSAIGMA NGWYNRLYT+TGQ  D
Sbjct: 174 VTAKLIDSRASKGFYYIEYTLQNPGEARKHLYSAIGMATNGWYNRLYTVTGQFTD 227

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038885576.1 | 3.2e-111 | 78.49 | psbP domain-containing protein 3, chloroplastic [Benincasa hispida]
XP_022155884.1 | 3.2e-111 | 73.09 | psbP domain-containing protein 3, chloroplastic isoform X1 [Momordica charantia]
XP_022946502.1 | 3.1e-106 | 77.31 | psbP domain-containing protein 3, chloroplastic [Cucurbita moschata] >KAG6599083...
KAG7030019.1 | 3.1e-106 | 77.31 | PsbP domain-containing protein 3, chloroplastic, partial [Cucurbita argyrosperma...
XP_023546046.1 | 1.5e-105 | 76.92 | psbP domain-containing protein 3, chloroplastic [Cucurbita pepo subsp. pepo]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9S720 | 7.0e-69 | 70.86 | PsbP domain-containing protein 3, chloroplastic OS=Arabidopsis thaliana OX=3702 ...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DRN2 | 1.5e-111 | 73.09 | psbP domain-containing protein 3, chloroplastic isoform X1 OS=Momordica charanti...
A0A6J1G3W9 | 1.5e-106 | 77.31 | psbP domain-containing protein 3, chloroplastic OS=Cucurbita moschata OX=3662 GN...
A0A6J1KI01 | 9.7e-106 | 76.92 | psbP domain-containing protein 3, chloroplastic OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1DNN6 | 1.7e-105 | 75.84 | psbP domain-containing protein 3, chloroplastic isoform X2 OS=Momordica charanti...
A0A0A0KDI5 | 1.5e-101 | 75.19 | PsbP domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G401450 PE=4 S...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G76450.1 | 5.0e-70 | 70.86 | Photosystem II reaction center PsbP family protein
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002683 | PsbP, C-terminal | PFAM | PF01789 | PsbP | coord: 120..261; e-value: 1.9E-29; score: 102.8
None | No IPR available | GENE3D | 3.40.1000.10 | Mog1/PsbP, alpha/beta/alpha sandwich | coord: 112..266; e-value: 3.4E-30; score: 107.1
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 333..371
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 333..352
None | No IPR available | PANTHER | PTHR31407 | FAMILY NOT NAMED | coord: 63..261
None | No IPR available | PANTHER | PTHR31407:SF17 | PSBP DOMAIN-CONTAINING PROTEIN 3, CHLOROPLASTIC | coord: 63..261
IPR016123 | Mog1/PsbP, alpha/beta/alpha sandwich | SUPERFAMILY | 55724 | Mog1p/PsbP-like | coord: 122..260

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lag0004043.1 | Lag0004043.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015979 | photosynthesis
cellular_component | GO:0009507 | chloroplast
cellular_component | GO:0019898 | extrinsic component of membrane
cellular_component | GO:0009654 | photosystem II oxygen evolving complex
cellular_component | GO:0009523 | photosystem II
molecular_function | GO:0005509 | calcium ion binding