Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GACAAATGTAGAACATTGTACAAAGAAAATCCATCTCTAATCCTTGAGTGGACTACAAGCATGGCGATGCCCTCCCTTCTTTCACCCAGCGCTCTAATCCAACGCCCTCACTCATTCCGCTTCTCACAATCATCACTCTCCAATGGTAATCTTCTATCGTCTCTACTCTACTCTCCTCATCTTCCTCCTCCTTCTCTTCAATCCCATCTTCTTCCTTCTTCCATTTCAGGATTCTCCATTTTTCCTATCCGCTCAACGCTCCGAGTTTTCTGCTCTGCCAATAAAAAACCCAGGTTAAATCCCCCTCTTTTCCTCCAGATTTTATCACATTTGCTTGTTGTCCTGTTTAATACCACTGTTGCTTCGTTTTAGTAAGAAGATTCAAATGCTCGTTTCTTCTATTGGTTCTTAGTTATTTGGCGAGCAGAGTCAACAGACGAGAAATTATGCTAGGGATTGGATTCACCGCATTTTCATTGCAAGAAGTTGTTTCTAATGCTCTAGCTGAGAGTGGTACCAATTTGCCTTCTTCACCTTTGGTATCAGAGATTACGTTATCCCCTTTCGGTATTGTTTTTATTTGAAGACTGAATTCGTTCAGTTTACGAGTGATAGTTGTGGTTGCTGAGGATTATCGGACGTACACAGACGAAGCGAATAAGTTCAGCTTGGTGATTCCTCAAGGTCTGTAGAACGAGTTTGAGATTTTTGTTTAGAGAATTAGAGATTAAACTGTGTTAATTAGTTGAACTTGTGTGATTTATGTAGATTGGCAAGTGGGTAATGGTGAACCGAATGGATTCAAGTCGGTTACGGCATTTTTTCCTCAAGAAACTTCAACTTCCAATGGTAATTTTCTTAGATCTTCTCTCATTTATTTATTTTTTTTTTCTCACTTGCTTCAGTTTTGAATATTTTGATGGACTGAAATTTGGTTTAGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTTCACGAGGATGGAATCCTTTGGCAAGGTTGAGGAATTTGCTGATACATTGGTATGTCTTTCATATCATTTTGCAGTTATTGAACCAAACATTTAACCCAAAATTGATTCTCTTATCCTGAAGTTGCTTAAATAGAAAGACAGCCAAAATTGGACATCTTTTTGGGTTTTCTGTATGAAAACTGAAAGCCAAAATTGGGTCTCTTCTTTCGCAGTTTACGAATGGAAATAGATAGCCTAAATATGGATTTTCCTCTTCTCTTGATATCTCTTTCAGGTGAGTGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCAAAACTTATCAACTGTAGATCGTCGAAAGGTATGTGGCTCAACTACAACCAATCGACTCTGCATTTACATAATTAAGCATAGTTCGAGCTTGATAGAAACTTGTTTGTAAACATGAAACTTACTCAAGAGAACCGCGCAATAGATTTTGTCCGGAAGTTCTCTTCTGTTAGGGGCCAACAAAGTTCCACGTTAGTTAAAGAACAGTGACAGGTAAATGAAGGTAACTATTTTTTTATTAGGAAGTATTTAGACTGAAACTAAAAGTCGAAGTTATGAGCACTTATGTCCAAAGTGAACAATATCATACCAATGTAGAGAAAGGACTCTTCCATCTTGTCTAGCAATGTCTAGGAGATTGAGCAAGAAAGGGACGATTTCTTGGTTACTATCCCTACTCTCCATTTCATTCTCCATCCACACAAAGTTCTCTATGGAGATTGGGGTTGAGATACATTTACACCAAGAATTTTTCTCCTTTTGGTTTTTTATATTTAAAAAACTTGTGAACTTAGATCACTACTATGAAAAAGTTTTGACGGATAAGAAAAAACTCTCTAAACAGAATTTGTTAAAAATAAAACATGTCATCGAGCATAATACGAAACATGTTATACACGCCACTCAAGTTTTGCAGGTAATAAAATCAACTAATCATACTACATTAACATTCACGATGCCTTTTGCTAGTCTAACATGTAATTTCATTATCCTTACTAGAAAGATCACATCCCTCATTCTAGGCTTTCTATACATGCTTCTAAAGCTACTTGATTAGGTATGGCATTAAGTACATTACTCTCAAGTGTGCTCACAAGTTCCTCAAACGAATTATAATACAAAACCGTCTAAAATAATCTCTTTTATCATCAATTATAAATGGACTTTGATTTCATGAGTCTAAAAATACATAAATCCTTGACATCGGAGAAGAATGAAGTTTGATTTGAAAAGAAATTGGCAGGGATATATTACATAGAGTATACACTGCAGAATCCAGGTGAAAGCCGCAGACATTTATACTCGGCAATTGGGATGACATCCAATGGCTGGTACAATAGACTTTACACCATAACAGGACAGGTAGCTTACTCATTATTTTATCCTACCACCTATTTTATTATTTTATTATTGTCTAAAATTATTATGGCTGAGTTACAAGAGTTTTCTTTTGAAATATATAGATCATTTTAGTTTGGATTGTAATAGTAACTAAGTGTAAACAATTTTAAATTAATATGGAAATAGAAAATGCTATAGTAAAAAATAAAAATGGTGATGTAGTACTAAATATCATATAAAACTGTAAATATAGTAAATGAAGGGTTTTAAAATAATGTTGACGATAGTTAATTAAGAATTTTTAAATATATTTACTATAACAAAGATTATAATCTTAGGACAGTCCTCCCAAATCTCTCAAATATTTAAATATCTTGAAGGTAATTTTAATTTTAGTCTAAATCTGTTCGTAATCACAATTTTTATCACAAACTTTATTAGCAAAAAGTTGATGCGCATCAACATATTTGGATGTGCCCCCTAGACGCATCAAACTTAACCTATGCGTCTACAAAACGTAGGCGGTCAGTTAAAGATATCTAAGTAGCATGCCTCTGTATCCAATTATGTTATTATTAACAATAGTTTATTGACACATATTTTATGTTAATATATTATTTTAGTCATGTGCTTTAAAACTTGTTTAATTATAATCTTTATACTTTCAATGTCTTAAATTTTGTCTAACATAGAGTTTAAGTTGATTTTTATAAAACAGATAATCTTTTAGCTTTTTTTATTATTCAGAACATATAGGGTGTATTTTCTTCCATCCAAATTATCGTTATTATTCAACCTATTTTGTTTAAATAATAAATAACTTTAAGGAATTAAATTTAATATTTATTAAAAATATAGAGACTATAATTAGACAGTGAACTAAAATTGTATTAGTTTTAAAGTTTCGAACTAAAATACCATTTTTACTTATAGTTTAATAATTGAGATTATTATGTATTAGGGATAAAATTGATAATTTCTTTTGTCTGTTTATCTTTTTTAAAAGAATTTTTTTTTAACAATATGTAAGGCATGAGAATGTTTACTCATTTTGGACTTTTTGATGCTTTTCTAATAAGTAAGTTTATCTAAGTGAGGCCATAAGATTTTCAAATAGATTTATAGTTCTTTTTGGGGTGGAATGATTGATGATGGATGGCTAGGACTAAAATGGTTTCTAAATTGAGTTACTAAAAACAACACATGTAGAAGTTGATTGGCTCAATAAAGACCCAACCAATAAGAAATATATATGAGAAAATATAATTTTATTTTCTTATTAATAGTTGTTTTGATTTGTTATTCAGTATGCAGATGAAGAATCGGCGAACTATAGCTCCAAAATTGAGAAGGTTAGTGAAAGATATGTGAAGCATGAATTTTTAATTTTCATAAAATTAAGGGCTTTTTAAGAATAATCTATTATAAGGCAAATAAATTTTTTGGTAAAGTATAAATATTTTGTGAAAGTTTGGTATGCTTGAGAAAGTTTCTAAAAAATTAATCTATCATTGAGTTTCCCTCAAAAAATGAGAGTGTGAGAATGTTGGTTTTGTAGGTTGTCAATTCCTTCAGTTTCATTTGATGATTGCCACAGAATTGGCTTCCACTACACCATTCATTATGGGTTAAATATTTTCCACTTCTCTCTCTCTCTCTCTCTCTAATTATTATTATTTCATTACTATTTCTATTATTATTATTAATAGTGTTGTTTCTAATCTAAACCCAATATTAATTTTTTACTGATGCATCGATATTTTTACATTTTTATAGTTCCG
mRNA sequence
GACAAATGTAGAACATTGTACAAAGAAAATCCATCTCTAATCCTTGAGTGGACTACAAGCATGGCGATGCCCTCCCTTCTTTCACCCAGCGCTCTAATCCAACGCCCTCACTCATTCCGCTTCTCACAATCATCACTCTCCAATGGATTCTCCATTTTTCCTATCCGCTCAACGCTCCGAGTTTTCTGCTCTGCCAATAAAAAACCCAGTTATTTGGCGAGCAGAGTCAACAGACGAGAAATTATGCTAGGGATTGGATTCACCGCATTTTCATTGCAAGAAGTTGTTTCTAATGCTCTAGCTGAGAGTGTTGTGGTTGCTGAGGATTATCGGACGTACACAGACGAAGCGAATAAGTTCAGCTTGGTGATTCCTCAAGATTGGCAAGTGGGTAATGGTGAACCGAATGGATTCAAGTCGGTTACGGCATTTTTTCCTCAAGAAACTTCAACTTCCAATGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTTCACGAGGATGGAATCCTTTGGCAAGGTTGAGGAATTTGCTGATACATTGGTGAGTGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCAAAACTTATCAACTGTAGATCGTCGAAAGGGATATATTACATAGAGTATACACTGCAGAATCCAGGTGAAAGCCGCAGACATTTATACTCGGCAATTGGGATGACATCCAATGGCTGGTACAATAGACTTTACACCATAACAGGACAGTATGCAGATGAAGAATCGGCGAACTATAGCTCCAAAATTGAGAAGGTTGTCAATTCCTTCAGTTTCATTTGATGATTGCCACAGAATTGGCTTCCACTACACCATTCATTATGGGTTAAATATTTTCCACTTCTCTCTCTCTCTCTCTCTCTAATTATTATTATTTCATTACTATTTCTATTATTATTATTAATAGTGTTGTTTCTAATCTAAACCCAATATTAATTTTTTACTGATGCATCGATATTTTTACATTTTTATAGTTCCG
Coding sequence (CDS)
ATGGCGATGCCCTCCCTTCTTTCACCCAGCGCTCTAATCCAACGCCCTCACTCATTCCGCTTCTCACAATCATCACTCTCCAATGGATTCTCCATTTTTCCTATCCGCTCAACGCTCCGAGTTTTCTGCTCTGCCAATAAAAAACCCAGTTATTTGGCGAGCAGAGTCAACAGACGAGAAATTATGCTAGGGATTGGATTCACCGCATTTTCATTGCAAGAAGTTGTTTCTAATGCTCTAGCTGAGAGTGTTGTGGTTGCTGAGGATTATCGGACGTACACAGACGAAGCGAATAAGTTCAGCTTGGTGATTCCTCAAGATTGGCAAGTGGGTAATGGTGAACCGAATGGATTCAAGTCGGTTACGGCATTTTTTCCTCAAGAAACTTCAACTTCCAATGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTTCACGAGGATGGAATCCTTTGGCAAGGTTGAGGAATTTGCTGATACATTGGTGAGTGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCAAAACTTATCAACTGTAGATCGTCGAAAGGGATATATTACATAGAGTATACACTGCAGAATCCAGGTGAAAGCCGCAGACATTTATACTCGGCAATTGGGATGACATCCAATGGCTGGTACAATAGACTTTACACCATAACAGGACAGTATGCAGATGAAGAATCGGCGAACTATAGCTCCAAAATTGAGAAGGTTGTCAATTCCTTCAGTTTCATTTGA
Protein sequence
MAMPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSANKKPSYLASRVNRREIMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLINCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKIEKVVNSFSFI
Homology
BLAST of IVF0007374 vs. ExPASy Swiss-Prot
Match:
Q9S720 (PsbP domain-containing protein 3, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PPD3 PE=1 SV=2)
HSP 1 Score: 292.0 bits (746), Expect = 6.5e-78
Identity = 146/244 (59.84%), Postives = 183/244 (75.00%), Query Frame = 0
Query: 10 SALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSANKKPSYLASR----VNRREIMLGI 69
S + P SF + ++++ I + + V S+N++ ++SR + RR++ML I
Sbjct: 5 SPWLSSPQSFSNPRVTITDSRRCSSISAAISVLDSSNEEQHRISSRDHVGMKRRDVMLQI 64
Query: 70 GFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFF 129
+ F L +S A AE+ +E +R YTDE NKF + IPQDWQVG EPNGFKS+TAF+
Sbjct: 65 ASSVFFLPLAISPAFAET-NASEAFRVYTDETNKFEISIPQDWQVGQAEPNGFKSITAFY 124
Query: 130 PQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLINCRSS 189
PQETSTSNVS+ I+GLGPDFTRMESFGKVE FA+TLVSGLDRSW++P GV AKLI+ R+S
Sbjct: 125 PQETSTSNVSIAITGLGPDFTRMESFGKVEAFAETLVSGLDRSWQKPVGVTAKLIDSRAS 184
Query: 190 KGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKIEKVVNS 249
KG YYIEYTLQNPGE+R+HLYSAIGM +NGWYNRLYT+TGQ+ DEESA SSKI+K V S
Sbjct: 185 KGFYYIEYTLQNPGEARKHLYSAIGMATNGWYNRLYTVTGQFTDEESAEQSSKIQKTVKS 244
BLAST of IVF0007374 vs. ExPASy TrEMBL
Match:
A0A1S3CMG5 (psbP domain-containing protein 3, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502604 PE=4 SV=1)
HSP 1 Score: 495.0 bits (1273), Expect = 1.9e-136
Identity = 249/249 (100.00%), Postives = 249/249 (100.00%), Query Frame = 0
Query: 1 MAMPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSANKKPSYLASRVNRRE 60
MAMPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSANKKPSYLASRVNRRE
Sbjct: 1 MAMPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSANKKPSYLASRVNRRE 60
Query: 61 IMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKS 120
IMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKS
Sbjct: 61 IMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKS 120
Query: 121 VTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLI 180
VTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLI
Sbjct: 121 VTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLI 180
Query: 181 NCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKIE 240
NCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKIE
Sbjct: 181 NCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKIE 240
Query: 241 KVVNSFSFI 250
KVVNSFSFI
Sbjct: 241 KVVNSFSFI 249
BLAST of IVF0007374 vs. ExPASy TrEMBL
Match:
A0A0A0KDI5 (PsbP domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G401450 PE=4 SV=1)
HSP 1 Score: 458.4 bits (1178), Expect = 1.9e-125
Identity = 234/257 (91.05%), Postives = 241/257 (93.77%), Query Frame = 0
Query: 1 MAMPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSA--------NKKPSYL 60
MAM SLLSPSA+I RPHS RFSQSSLSNGFSI PIRSTLRVFCSA NKKPSYL
Sbjct: 1 MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYL 60
Query: 61 ASRVNRREIMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGN 120
AS VNRREIMLGIGFTAFS QEV SNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGN
Sbjct: 61 ASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGN 120
Query: 121 GEPNGFKSVTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRP 180
GEPNGFKSVTAFFPQETSTSNVSVVISGLGPD+TRMESFGKVEEFADTLVSGLDRSWKRP
Sbjct: 121 GEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRP 180
Query: 181 PGVAAKLINCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEES 240
PGVAAKLI+CRSSKGIYYIEYTLQNPGESR+HLYSAIGM+SNGWYNRLYTITGQYADEES
Sbjct: 181 PGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEES 240
Query: 241 ANYSSKIEKVVNSFSFI 250
+YSSKIEKVVNSF+FI
Sbjct: 241 ESYSSKIEKVVNSFAFI 257
BLAST of IVF0007374 vs. ExPASy TrEMBL
Match:
A0A6J1DNN6 (psbP domain-containing protein 3, chloroplastic isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022894 PE=4 SV=1)
HSP 1 Score: 394.4 bits (1012), Expect = 3.4e-106
Identity = 205/250 (82.00%), Postives = 218/250 (87.20%), Query Frame = 0
Query: 8 SPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLR--VFCSANK------KPSYLASRVNRR 67
SPSA+IQRP +RF +SSLSNG +I IRS + V CS N + Y AS VNRR
Sbjct: 8 SPSAVIQRPRPWRFRESSLSNGIAIH-IRSKSKPGVLCSCNNIDISDPQLCYWASGVNRR 67
Query: 68 EIMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFK 127
EIMLGI + FS Q VVSN+LAESVVVAED+RTYTDEANKF LVIPQDW VGNGEPNGFK
Sbjct: 68 EIMLGIALSTFSFQAVVSNSLAESVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFK 127
Query: 128 SVTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKL 187
SVTAF+PQETS+SNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKL
Sbjct: 128 SVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKL 187
Query: 188 INCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKI 247
I+CRSSKGIYYIEYTLQNPGESR+HLYSAIGM SNGWYNRLYTITGQYADEES NYSSKI
Sbjct: 188 IDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQYADEESENYSSKI 247
Query: 248 EKVVNSFSFI 250
EKVVNSFSFI
Sbjct: 248 EKVVNSFSFI 256
BLAST of IVF0007374 vs. ExPASy TrEMBL
Match:
A0A6J1G3W9 (psbP domain-containing protein 3, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111450544 PE=4 SV=1)
HSP 1 Score: 392.9 bits (1008), Expect = 1.0e-105
Identity = 203/253 (80.24%), Postives = 221/253 (87.35%), Query Frame = 0
Query: 3 MPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSA------NKKPSYLASRV 62
M SL SPSA+IQRP S+RF+ SSLSNG +I PIR+ LRVFCS ++KP S V
Sbjct: 1 MASLPSPSAVIQRPRSWRFTPSSLSNGIAI-PIRTRLRVFCSGKNIDIPDQKPCCWTSGV 60
Query: 63 NRREIMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPN 122
NRREI+LG+G TAFS QEVVS ALAES VVAEDYRTYTDEANKF LVIPQDWQVGNGEPN
Sbjct: 61 NRREIVLGMGLTAFSFQEVVSIALAES-VVAEDYRTYTDEANKFRLVIPQDWQVGNGEPN 120
Query: 123 GFKSVTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVA 182
GFK VTAFFP+ET +SNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVA
Sbjct: 121 GFKLVTAFFPKETLSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVA 180
Query: 183 AKLINCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYS 242
AKLI+CRSSKGIYYIEYTLQNPGE R HLYSAIGM SNGWYNRLYT+TGQY DE+S +S
Sbjct: 181 AKLIDCRSSKGIYYIEYTLQNPGEGRNHLYSAIGMASNGWYNRLYTVTGQYGDEDSEKFS 240
Query: 243 SKIEKVVNSFSFI 250
S+I+KVVNSF+FI
Sbjct: 241 SEIKKVVNSFTFI 251
BLAST of IVF0007374 vs. ExPASy TrEMBL
Match:
A0A6J1KI01 (psbP domain-containing protein 3, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111494008 PE=4 SV=1)
HSP 1 Score: 390.2 bits (1001), Expect = 6.5e-105
Identity = 202/253 (79.84%), Postives = 220/253 (86.96%), Query Frame = 0
Query: 3 MPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSA------NKKPSYLASRV 62
M SL SPSA+IQRP S+RF+ SSLSNG +I PIR+ LRVFCS ++KP S V
Sbjct: 1 MASLPSPSAVIQRPRSWRFTPSSLSNGIAI-PIRTRLRVFCSGKNIDIPDQKPCCWTSGV 60
Query: 63 NRREIMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPN 122
NRREI LG+G TAFS QEVVS ALAE+ VVAEDYRTYTDEANKF LVIPQDWQVGNGEPN
Sbjct: 61 NRREIGLGMGLTAFSFQEVVSIALAEN-VVAEDYRTYTDEANKFRLVIPQDWQVGNGEPN 120
Query: 123 GFKSVTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVA 182
GFK VTAFFP+ET +SNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVA
Sbjct: 121 GFKLVTAFFPKETLSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVA 180
Query: 183 AKLINCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYS 242
AKLI+CRSSKGIYYIEYTLQNPGE R HLYSAIGM SNGWYNRLYT+TGQY DE+S +S
Sbjct: 181 AKLIDCRSSKGIYYIEYTLQNPGEGRNHLYSAIGMASNGWYNRLYTVTGQYGDEDSEKFS 240
Query: 243 SKIEKVVNSFSFI 250
S+I+KVVNSF+FI
Sbjct: 241 SEIKKVVNSFTFI 251
BLAST of IVF0007374 vs. NCBI nr
Match:
XP_008464804.1 (PREDICTED: psbP domain-containing protein 3, chloroplastic [Cucumis melo])
HSP 1 Score: 494 bits (1272), Expect = 3.51e-176
Identity = 249/249 (100.00%), Postives = 249/249 (100.00%), Query Frame = 0
Query: 1 MAMPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSANKKPSYLASRVNRRE 60
MAMPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSANKKPSYLASRVNRRE
Sbjct: 1 MAMPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSANKKPSYLASRVNRRE 60
Query: 61 IMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKS 120
IMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKS
Sbjct: 61 IMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKS 120
Query: 121 VTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLI 180
VTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLI
Sbjct: 121 VTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLI 180
Query: 181 NCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKIE 240
NCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKIE
Sbjct: 181 NCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKIE 240
Query: 241 KVVNSFSFI 249
KVVNSFSFI
Sbjct: 241 KVVNSFSFI 249
BLAST of IVF0007374 vs. NCBI nr
Match:
XP_004146765.1 (psbP domain-containing protein 3, chloroplastic isoform X1 [Cucumis sativus])
HSP 1 Score: 457 bits (1177), Expect = 1.42e-161
Identity = 234/257 (91.05%), Postives = 241/257 (93.77%), Query Frame = 0
Query: 1 MAMPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSAN--------KKPSYL 60
MAM SLLSPSA+I RPHS RFSQSSLSNGFSI PIRSTLRVFCSAN KKPSYL
Sbjct: 1 MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYL 60
Query: 61 ASRVNRREIMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGN 120
AS VNRREIMLGIGFTAFS QEV SNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGN
Sbjct: 61 ASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGN 120
Query: 121 GEPNGFKSVTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRP 180
GEPNGFKSVTAFFPQETSTSNVSVVISGLGPD+TRMESFGKVEEFADTLVSGLDRSWKRP
Sbjct: 121 GEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRP 180
Query: 181 PGVAAKLINCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEES 240
PGVAAKLI+CRSSKGIYYIEYTLQNPGESR+HLYSAIGM+SNGWYNRLYTITGQYADEES
Sbjct: 181 PGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEES 240
Query: 241 ANYSSKIEKVVNSFSFI 249
+YSSKIEKVVNSF+FI
Sbjct: 241 ESYSSKIEKVVNSFAFI 257
BLAST of IVF0007374 vs. NCBI nr
Match:
KAE8647284.1 (hypothetical protein Csa_002893 [Cucumis sativus])
HSP 1 Score: 457 bits (1177), Expect = 7.49e-161
Identity = 234/257 (91.05%), Postives = 241/257 (93.77%), Query Frame = 0
Query: 1 MAMPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSAN--------KKPSYL 60
MAM SLLSPSA+I RPHS RFSQSSLSNGFSI PIRSTLRVFCSAN KKPSYL
Sbjct: 47 MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYL 106
Query: 61 ASRVNRREIMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGN 120
AS VNRREIMLGIGFTAFS QEV SNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGN
Sbjct: 107 ASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGN 166
Query: 121 GEPNGFKSVTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRP 180
GEPNGFKSVTAFFPQETSTSNVSVVISGLGPD+TRMESFGKVEEFADTLVSGLDRSWKRP
Sbjct: 167 GEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRP 226
Query: 181 PGVAAKLINCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEES 240
PGVAAKLI+CRSSKGIYYIEYTLQNPGESR+HLYSAIGM+SNGWYNRLYTITGQYADEES
Sbjct: 227 PGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEES 286
Query: 241 ANYSSKIEKVVNSFSFI 249
+YSSKIEKVVNSF+FI
Sbjct: 287 ESYSSKIEKVVNSFAFI 303
BLAST of IVF0007374 vs. NCBI nr
Match:
XP_038885576.1 (psbP domain-containing protein 3, chloroplastic [Benincasa hispida])
HSP 1 Score: 426 bits (1094), Expect = 5.28e-149
Identity = 220/253 (86.96%), Postives = 229/253 (90.51%), Query Frame = 0
Query: 3 MPSLLSPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCS------ANKKPSYLASRV 62
M SL SPSA+IQRP S+RFSQSS SNG I PIRS LRVFCS +N++ Y AS V
Sbjct: 1 MASLPSPSAVIQRPRSWRFSQSSPSNGLPI-PIRSKLRVFCSGNNINISNQQSCYWASGV 60
Query: 63 NRREIMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPN 122
NRREIMLGIG TAFS QEVVSNALAESV+VAEDYRTYTDEANKF L IPQDWQVGNGEPN
Sbjct: 61 NRREIMLGIGLTAFSFQEVVSNALAESVMVAEDYRTYTDEANKFRLAIPQDWQVGNGEPN 120
Query: 123 GFKSVTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVA 182
GFKSVTAFFPQETS+SNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVA
Sbjct: 121 GFKSVTAFFPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVA 180
Query: 183 AKLINCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYS 242
AKLINCRSSKGIYYIEYTLQNPGESR+HLYSAIGM SNGWYNRLYTITGQYADEES NYS
Sbjct: 181 AKLINCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQYADEESENYS 240
Query: 243 SKIEKVVNSFSFI 249
SKIEKVVNSF+FI
Sbjct: 241 SKIEKVVNSFTFI 252
BLAST of IVF0007374 vs. NCBI nr
Match:
XP_022155885.1 (psbP domain-containing protein 3, chloroplastic isoform X2 [Momordica charantia])
HSP 1 Score: 393 bits (1010), Expect = 3.80e-136
Identity = 205/250 (82.00%), Postives = 218/250 (87.20%), Query Frame = 0
Query: 8 SPSALIQRPHSFRFSQSSLSNGFSIFPIRSTLR--VFCSANK------KPSYLASRVNRR 67
SPSA+IQRP +RF +SSLSNG +I IRS + V CS N + Y AS VNRR
Sbjct: 8 SPSAVIQRPRPWRFRESSLSNGIAIH-IRSKSKPGVLCSCNNIDISDPQLCYWASGVNRR 67
Query: 68 EIMLGIGFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFK 127
EIMLGI + FS Q VVSN+LAESVVVAED+RTYTDEANKF LVIPQDW VGNGEPNGFK
Sbjct: 68 EIMLGIALSTFSFQAVVSNSLAESVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFK 127
Query: 128 SVTAFFPQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKL 187
SVTAF+PQETS+SNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKL
Sbjct: 128 SVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKL 187
Query: 188 INCRSSKGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKI 247
I+CRSSKGIYYIEYTLQNPGESR+HLYSAIGM SNGWYNRLYTITGQYADEES NYSSKI
Sbjct: 188 IDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQYADEESENYSSKI 247
Query: 248 EKVVNSFSFI 249
EKVVNSFSFI
Sbjct: 248 EKVVNSFSFI 256
BLAST of IVF0007374 vs. TAIR 10
Match:
AT1G76450.1 (Photosystem II reaction center PsbP family protein )
HSP 1 Score: 292.0 bits (746), Expect = 4.6e-79
Identity = 146/244 (59.84%), Postives = 183/244 (75.00%), Query Frame = 0
Query: 10 SALIQRPHSFRFSQSSLSNGFSIFPIRSTLRVFCSANKKPSYLASR----VNRREIMLGI 69
S + P SF + ++++ I + + V S+N++ ++SR + RR++ML I
Sbjct: 5 SPWLSSPQSFSNPRVTITDSRRCSSISAAISVLDSSNEEQHRISSRDHVGMKRRDVMLQI 64
Query: 70 GFTAFSLQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFF 129
+ F L +S A AE+ +E +R YTDE NKF + IPQDWQVG EPNGFKS+TAF+
Sbjct: 65 ASSVFFLPLAISPAFAET-NASEAFRVYTDETNKFEISIPQDWQVGQAEPNGFKSITAFY 124
Query: 130 PQETSTSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLINCRSS 189
PQETSTSNVS+ I+GLGPDFTRMESFGKVE FA+TLVSGLDRSW++P GV AKLI+ R+S
Sbjct: 125 PQETSTSNVSIAITGLGPDFTRMESFGKVEAFAETLVSGLDRSWQKPVGVTAKLIDSRAS 184
Query: 190 KGIYYIEYTLQNPGESRRHLYSAIGMTSNGWYNRLYTITGQYADEESANYSSKIEKVVNS 249
KG YYIEYTLQNPGE+R+HLYSAIGM +NGWYNRLYT+TGQ+ DEESA SSKI+K V S
Sbjct: 185 KGFYYIEYTLQNPGEARKHLYSAIGMATNGWYNRLYTVTGQFTDEESAEQSSKIQKTVKS 244
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9S720 | 6.5e-78 | 59.84 | PsbP domain-containing protein 3, chloroplastic OS=Arabidopsis thaliana OX=3702 ... | [more] |
Match Name | E-value | Identity | Description | |
A0A1S3CMG5 | 1.9e-136 | 100.00 | psbP domain-containing protein 3, chloroplastic OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A0A0KDI5 | 1.9e-125 | 91.05 | PsbP domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G401450 PE=4 S... | [more] |
A0A6J1DNN6 | 3.4e-106 | 82.00 | psbP domain-containing protein 3, chloroplastic isoform X2 OS=Momordica charanti... | [more] |
A0A6J1G3W9 | 1.0e-105 | 80.24 | psbP domain-containing protein 3, chloroplastic OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1KI01 | 6.5e-105 | 79.84 | psbP domain-containing protein 3, chloroplastic OS=Cucurbita maxima OX=3661 GN=L... | [more] |
Match Name | E-value | Identity | Description | |
XP_008464804.1 | 3.51e-176 | 100.00 | PREDICTED: psbP domain-containing protein 3, chloroplastic [Cucumis melo] | [more] |
XP_004146765.1 | 1.42e-161 | 91.05 | psbP domain-containing protein 3, chloroplastic isoform X1 [Cucumis sativus] | [more] |
KAE8647284.1 | 7.49e-161 | 91.05 | hypothetical protein Csa_002893 [Cucumis sativus] | [more] |
XP_038885576.1 | 5.28e-149 | 86.96 | psbP domain-containing protein 3, chloroplastic [Benincasa hispida] | [more] |
XP_022155885.1 | 3.80e-136 | 82.00 | psbP domain-containing protein 3, chloroplastic isoform X2 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
AT1G76450.1 | 4.6e-79 | 59.84 | Photosystem II reaction center PsbP family protein | [more] |