Bhi10G000327 (gene) Wax gourd

Name: Bhi10G000327
Type: gene
Organism: Benincasa hispida (Wax gourd)
Description: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic
Location: chr10 : 9001223 .. 9008119 (-)
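As a quick sanity check on the location above, the genomic span of the gene can be computed from its coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (common for GFF-style annotation, but not stated on this page); `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'chr10 : 9001223 .. 9008119 (-)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("chr10 : 9001223 .. 9008119 (-)")
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(seqid, strand, span)
```

The "(-)" strand means the gene is read from the reverse strand of chr10.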
The following sequences are available for this feature:

Gene sequence (with intron)

GCCCTATTGAATCATGGCACTTCGCTAATTAAGAGTATAGAGTAGCAGATCGAGTGGCGCAGAGCAGAGAGAAGATGATGGCGTCTTCTTCTCATCTATCGGCCATTCCCCTGCGCCCGTCTTCCGCCTCTACACCTTCCTTATACCACCGTGAGCACTCTCTCTCCCTTTCTCGCTCTCTCTCTCTCTCCTCTATCCTCTGCAACTTGCTTGGTGTTATGCCCAATTGTCGACTATCCTGAAACGTGCATTGTTGCATACTTGTATTTCGTTGAGGGGTTTAATCTCCTCTGCTCAATGTTTTCCGCCGGAGTACATAATTTCCTCTGTATTTATTCATCTCTTTAAACTATATGGATATTCTTGTGTTTGTTTTCTGTTATGCCCATTCACGTATGTATACTGAATTTCTGAATTATCAGCAGATGCGCGTTTTCTTTTTGGTTGATTAGCTGATTTTCCTTCGTTTCTTTCACTTCGGTGGACTGTTTGTGTAATCTTATCCCTTTCTCATTGAGTTTTGTTGCGGCGTTTGTCATCCCCATCTCCAATTGTTATTTCGGACTATCTTAATCTTGAGTTGTGAAAAGTTTTGTGCCACCCGATTCGCTTTTAATCGTCTTCCTTTAATGCTCTGTGATAAAACTATCGTACTGAGCTGGTTAACCACGAGCGATAGTACTACAAGTTTACTACCAAGTTATCTTTTATCTCGTTTCCAAAGGGGTATATATTTTTCATGGTCTATTTTTAAGTTGTATCTTATTGGGTGGTTGATCAGATCTCCTTCGAAAGAATAGAAGGTTTACAGTAAATTAGAATTTAGAAGGTTAAGTTAAGATTGCATAACATGATGAAATGTTTTGGTAGTAGTGTTGGATTCATAACAGTATTTTCTGGTCCTTGTACTAATCTGACATTTCTTTGTATTGATCTGTGTTAAACATGCTGATCATTCTATAGAAAAGTTTTGATCCTCATTCTCCTGAACTCAACAGCTAACTCAAAGCCTGTTGTACTTCATGTGACATCAAAATCTGACGGTGAAAGCTGCAGTACTGGAGTTTCAAATCCACCGTCAAAACCGCTGAAGGTAAGGATCTTGGTTACTTATTTGTCTTTTTAGCTTTCTTTCTGTGTAACAACTTATTCATGTTAGTGATACGTGCTTCAGTTTCCTCCTTGTACCTATAATTTACAACCACAACGCTTCACTGGTTAAGCAACAAGTTTAGCTCATATACACTTGCGTAGTTCCTCACATGATATGCGTCTTTCGACATGGCTTGCGTCAGAGCCTTCCGAATTTTCAAATGTGGGTGGCCGATAAATGTATAATTCCATTGGAATATGACAAGGTTCAAAAGCCTTGTAGATTTTCTCATCAGTTTTTCAACCAGTTAACCATCCTTTAGAAAGTGACACCTTAAATGAATTTGCAAGTTTTTCTGTGATTAGACCTTTTAAGAATATAACCTGATCTCTTTGTGACAGCATTGGGAATGAAAACGTATAGCTTCATTAATGTGTAGTCTTAAGTAGGTTACTACGTTTTTATCTTCCAATTAGTAATATAGAACAACTTTTATTTACCACAGGGAACTCAAAACTTGATTTCTCGTCGATGGTGCCTCACATGTTTATGTTCCTCTGTGACATTGTTAAAGAATTATGGGGGCACAGGGACTGAAGCTATTGCAAATACCATGGATGGAAAACCTGTGTGCCGAAATTGTGGAGGAAGTGGTGCCGTACTTTGTAAGCAGGGCTTTTTGATTTATTAGACATAACACCTTTTGGTCGTTAGATATAATTTTTACTCTCCTTTCATGAAGGGCTTGTGATTTAAAAGTATTGAGCTTCTTCATGGCCTTATAACATCTAGAAGTCAACGAAATGAACCATACGAACTTGGGAATAAAATATTTTTGGTAAAGAAAAGTTATTTGTTTATTCTGGTAACCACTTATATCATTTTGGCATTCTTTTCATAACTCAAG
TATTCTAATTCTTGCAAAAAATAATTCATCAAGTACTTCCATATATGAGCTTATCTTTGGAGTTAAAATACTCCTTTCCTTTTTCTCATTTCTTATCTTCCCCGTTGAAAAGTAATGCGTAATTTTTCCTTGCTAAAAGGTGTATAGGAGTTTGATGAGAACATACATTTATTATTTTTTCCATTTTACTTGCCGTAACTGTTTTTTTGTCTAATAATTTTTTAAAATTAGTTTCATATATCTGTGCTCCCTATTTCCCGTGGGATGCTTACTGTTCTTCGTTTTAGGTGACATGTGCGGTGGTACAGGGAAATGGAAAGCTCTTAACAGAAAACGGGCTAAAGATGTCTATGAGTTTACAGAATGTCCAAATTGTTATGGTAAGTACTCATCTCTGTGTATAAATTATGATGTTCTCATCAAGAATGCTTCATATATTTGCATGGTTTCAGTTTGATACTTCACGTCTGTTGGAAAGCAATGCCCTCTTGTAATGCTATAGCCGTGAGATGTCTCTCTCTCTCTCAGATTCATATCCAGAAATTTAGACCTTTGCAGATGGTGAACGTTTTGGTTCACCATGTTCTCGTATGATCAATCATGTCTAGTTAGATTTCAAGGTCAAACGCTCATATGACAAGTACAACTTGTGCAGATAGCAATTTTTTTTTGGAATAACCGTCACTTATTGGGGAAGTAATATTGGGGCTCGGGTTTAAGTTATCGTCCTTAAACTACTTGTATACACATCTTTTGGTGTTTGTTTTTAAGCTAAGATTATTTTTTCTTTCCTCCCTCCAATGTGAAACTAACAAGGGACACGGATATAAATATTAAAACTAAAACATTCCTTGGACGTCAATATGCTGTTGAGCCGGCTTTCAAATCCATATCCTGAATCTGAAGAAAGAATCTTGGTTGATCATTCACTTAGCTATTTCTAAAGCAGCGGAATACATGAGTTCAGCTAATGATGATCTTTAATATGCTAACAGTAGAAAGAAGAAATATGTTGTTTCATCTAGGCTTAACGATAATCTTGAAGTACCAATCTAAATTTGGTCATTGTGCTTCACCAAATTTATTATGTTGCCACTATAAATATTTGAGTCGATGATCGTTTGATCTTCTGTGAGATAGCAAGCTTGACCCTTTTACAATAACCAAAATTCTGTAATTTCCGATCCTAATTTACAGAAAATGCCGACAAGTAGGAAATGGTGGACTTCTGTTCCTTAGGCATTGTGTTTTTTTTGTTGCATTTATCATTCGTAATGTGTTATTGAACGTCAATATCAGGTGTTGACTGATTACAAAAGGCACGAGATTTAAAATTTTGATTCATATGGATTTGAATTGTATCAAATTTAAACTTCCAATATATGTGTTCATCTCTTACCCTTTGTACTAATTTCAAATTTCATTGTGGCAGGTAGAGGAAAACTTGTATGTCCTGTTTGTCTAGGAACTGGTTTACCAAACAACAAAGGTCTCCTAAGAAGGCCTGACGCACGAAAATTGCTTGATAAGATGTACAATGGTCGCTTATTACCAAATTCTTGAACACCTCTTTGCAAATTTGGGTCCATGTCCATAGATGGTAGTAGATTTATAACTTGTTTCTAGCTGGTATAAGTTACTAAATTGCGGCTTTTGTCATGGTTGCTTCATGACAGAGTTCAGTGAGTGGAAAAACATAGCTACAAAGTTAGATGGCGATCCTGTTTACAGTCTCTTCTAATCGCCAAATGCATATTGTTTATACCAAAAAAATACACGAAAATCACACTGTTCTGACCTCATCACAAAGCATGGTCAAACCAAGTTCAACTTTTATTATATATATGAAATCAGGGTTATTGCAAATGAAAGGGAACAAAGAATAGAAAATAAATTTGAAGAAAAACTAGGAATGAATCTTGATGGATTTTGAAAAGACATGTACGACCTGACTACAGAAGGTAAAACCGCGACTCTCAACTTTTGGGGCTGCAAAGTCG
CAGAGGAAACCATCAATTCCTCAAGTTTATAATTACTATCCATTCTAGGTGTTGCAACTGCCTCCCGCAATGAATACAGAAGCTAAAGCCTTCCTTGAAAAGGAAGCTGTGGATTTAATATATCATTCTCTGATATGACGCAACTTACTTAGGGAGTGTGATGATCTTCACAGGCATTGTAGTTTTACACCACCTCATATACGTCATGAATATGAAGTAAATATACTGGAGGTGAAAGTCGCACGAATCCAAGAAACTTATCACCGCTTGAAAGGCTTGATGATGCTTGCATTGCCATTTCTCAGCATAAATTCCATCAAACACCATACTTGTACATTGATGTGGATCACTTTCTGTACAATGCTACAAAATTTAAGCAGAATCATAGTCTGAGCGAGCTTTCGAGACTGATGTTTCAGCCAGCACATGAACTTTAGGGTCACCCTTTTTCTTCTTTTCACGTTGATCAAGTGGTTTAGTTGCTACCTGATCAGCCTTGGAGACAATTTTGGAGATTGTAAATCCAGCATCTTCAAAGAAGTTTGATCTGTACGGAATGACAGAACTTCGTATAACAACCTCGACCCCCTCGTAAGTCCATATGCTTAAGACTGCCTCTCCTTTTAGTCCAATGGCGAACCAACCGAGGCCAGCTGCAGCAGCATCTACACAACTTGAATCCCAACTACTTCCACAAACACGAAATTCCCTTCTCACCCATTTTCCTAATTCTGCAACTCGATCTTTG

mRNA sequence

GCCCTATTGAATCATGGCACTTCGCTAATTAAGAGTATAGAGTAGCAGATCGAGTGGCGCAGAGCAGAGAGAAGATGATGGCGTCTTCTTCTCATCTATCGGCCATTCCCCTGCGCCCGTCTTCCGCCTCTACACCTTCCTTATACCACCCTAACTCAAAGCCTGTTGTACTTCATGTGACATCAAAATCTGACGGTGAAAGCTGCAGTACTGGAGTTTCAAATCCACCGTCAAAACCGCTGAAGGGAACTCAAAACTTGATTTCTCGTCGATGGTGCCTCACATGTTTATGTTCCTCTGTGACATTGTTAAAGAATTATGGGGGCACAGGGACTGAAGCTATTGCAAATACCATGGATGGAAAACCTGTGTGCCGAAATTGTGGAGGAAGTGGTGCCGTACTTTGTGACATGTGCGGTGGTACAGGGAAATGGAAAGCTCTTAACAGAAAACGGGCTAAAGATGTCTATGAGTTTACAGAATGTCCAAATTGTTATGGTAGAGGAAAACTTGTATGTCCTGTTTGTCTAGGAACTGGTTTACCAAACAACAAAGGTCTCCTAAGAAGGCCTGACGCACGAAAATTGCTTGATAAGATGTACAATGGTCGCTTATTACCAAATTCTTGAACACCTCTTTGCAAATTTGGGTCCATGTCCATAGATGGTAGTAGATTTATAACTTGTTTCTAGCTGGTATAAGTTACTAAATTGCGGCTTTTGTCATGGTTGCTTCATGACAGAGTTCAGTGAGTGGAAAAACATAGCTACAAAGTTAGATGGCGATCCTGTTTACAGTCTCTTCTAATCGCCAAATGCATATTGTTTATACCAAAAAAATACACGAAAATCACACTGTTCTGACCTCATCACAAAGCATGGTCAAACCAAGTTCAACTTTTATTATATATATGAAATCAGGGTTATTGCAAATGAAAGGGAACAAAGAATAGAAAATAAATTTGAAGAAAAACTAGGAATGAATCTTGATGGATTTTGAAAAGACATGTACGACCTGACTACAGAAGGTGTTGCAACTGCCTCCCGCAATGAATACAGAAGCTAAAGCCTTCCTTGAAAAGGAAGCTGTGGATTTAATATATCATTCTCTGATATGACGCAACTTACTTAGGGAGTGTGATGATCTTCACAGGCATTGTAGTTTTACACCACCTCATATACGTCATGAATATGAAGTAAATATACTGGAGGTGAAAGTCGCACGAATCCAAGAAACTTATCACCGCTTGAAAGGCTTGATGATGCTTGCATTGCCATTTCTCAGCATAAATTCCATCAAACACCATACTTGTACATTGATGTGGATCACTTTCTGTACAATGCTACAAAATTTAAGCAGAATCATAGTCTGAGCGAGCTTTCGAGACTGATGTTTCAGCCAGCACATGAACTTTAGGGTCACCCTTTTTCTTCTTTTCACGTTGATCAAGTGGTTTAGTTGCTACCTGATCAGCCTTGGAGACAATTTTGGAGATTGTAAATCCAGCATCTTCAAAGAAGTTTGATCTGTACGGAATGACAGAACTTCGTATAACAACCTCGACCCCCTCGTAAGTCCATATGCTTAAGACTGCCTCTCCTTTTAGTCCAATGGCGAACCAACCGAGGCCAGCTGCAGCAGCATCTACACAACTTGAATCCCAACTACTTCCACAAACACGAAATTCCCTTCTCACCCATTTTCCTAATTCTGCAACTCGATCTTTG

Coding sequence (CDS)

ATGATGGCGTCTTCTTCTCATCTATCGGCCATTCCCCTGCGCCCGTCTTCCGCCTCTACACCTTCCTTATACCACCCTAACTCAAAGCCTGTTGTACTTCATGTGACATCAAAATCTGACGGTGAAAGCTGCAGTACTGGAGTTTCAAATCCACCGTCAAAACCGCTGAAGGGAACTCAAAACTTGATTTCTCGTCGATGGTGCCTCACATGTTTATGTTCCTCTGTGACATTGTTAAAGAATTATGGGGGCACAGGGACTGAAGCTATTGCAAATACCATGGATGGAAAACCTGTGTGCCGAAATTGTGGAGGAAGTGGTGCCGTACTTTGTGACATGTGCGGTGGTACAGGGAAATGGAAAGCTCTTAACAGAAAACGGGCTAAAGATGTCTATGAGTTTACAGAATGTCCAAATTGTTATGGTAGAGGAAAACTTGTATGTCCTGTTTGTCTAGGAACTGGTTTACCAAACAACAAAGGTCTCCTAAGAAGGCCTGACGCACGAAAATTGCTTGATAAGATGTACAATGGTCGCTTATTACCAAATTCTTGA

Protein sequence

MMASSSHLSAIPLRPSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGTQNLISRRWCLTCLCSSVTLLKNYGGTGTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
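
The protein sequence above should follow from the CDS by standard translation. The sketch below builds the standard genetic code and translates a sequence up to the first stop codon. `translate` is a hypothetical helper written for illustration, and only the first ten and last five codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGATGGCGTCTTCTTCTCATCTATCGGCC"  # first 10 codons of the CDS above
cds_end = "TTACCAAATTCTTGA"                  # last 5 codons, including the stop
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS string to `translate` in the same way gives a sequence that can be compared against the complete protein listed above.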
BLAST of Bhi10G000327 vs. TAIR10
Match: AT2G34860.1 (DnaJ/Hsp40 cysteine-rich domain superfamily protein)

HSP 1 Score: 120.2 bits (300), Expect = 1.4e-27
Identity = 102/190 (53.68%), Positives = 121/190 (63.68%), Query Frame = 0

Query: 1   MMASSSHLSAIPLRPSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSK---PLK 60
           M ASSSHL A+P    S ++P L  PN   V +   S  + +S  +  S+  S+     +
Sbjct: 1   MAASSSHLFALP----SPASPFLSAPNRNRVRVLAKSCPENQSFDSNDSDSSSETTHKAQ 60

Query: 61  GTQNLISRR-WCLTCLCSSVTLLKNYGGTGTEAIANTMDGKP--VCRNCGGSGAVXXXXX 120
           G Q  +SRR W   C+C+S  L+ N     +   A  +D KP   CRNC GSGAV     
Sbjct: 61  GDQKSVSRRQWMTACVCASAALISNSYTFVSVQSAAALDKKPGGSCRNCQGSGAVLCDMC 120

Query: 121 XXXXKWKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDK 180
               KWKALNRKRAKDVY   XXXXXXXXXXXXXXXXXXXXXPNNKGLLRRP AR+LL+K
Sbjct: 121 GGTGKWKALNRKRAKDVYXXXXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPGARELLEK 180

Query: 181 MYNGRLLPNS 185
           MYNGRLLP+S
Sbjct: 181 MYNGRLLPDS 186
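
The percentages reported for each HSP are the match counts divided by the alignment length, which includes gap columns (hence 190 columns for a 185-residue query). A small sketch reproducing the figures for the hit above; `percent` is a hypothetical helper.

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage rounded to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)

identity = percent(102, 190)   # identical residues / alignment columns
positives = percent(121, 190)  # identical plus conservative substitutions
print(f"{identity:.2f}% identity, {positives:.2f}% positives")
```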

BLAST of Bhi10G000327 vs. Swiss-Prot
Match: sp|Q6YUA8|PSA22_ORYSJ (Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=PSA2 PE=2 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 5.0e-27
Identity = 99/174 (56.90%), Positives = 118/174 (67.82%), Query Frame = 0

Query: 14  RPSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGTQNLISRRWCLTCLC 73
           RP++A  P+     ++  +   +   D E+CST  S P +   +  +   SRR CL CLC
Sbjct: 21  RPAAAHRPA----KARSHISCCSRHDDAEACST--SKPLTNGKEEEKTTPSRRKCLACLC 80

Query: 74  SSVTLLKNYGGT--GTEAIANTMDGKP-VCRNCGGSGAVXXXXXXXXXKWKALNRKRAKD 133
            +VTL+   G T      +A+ M  KP VCRNC GSGAV         KWKALNRKRAKD
Sbjct: 81  -AVTLISASGPTMLTPNGLASDMMSKPAVCRNCNGSGAVLCDMCGGTGKWKALNRKRAKD 140

Query: 134 VYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGRLLPNS 185
           VY FTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDA+KLLDKMYNG++LP+S
Sbjct: 141 VYLFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDAKKLLDKMYNGKILPDS 187

BLAST of Bhi10G000327 vs. Swiss-Prot
Match: sp|O64750|PSA2_ARATH (Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PSA2 PE=1 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 2.5e-26
Identity = 102/190 (53.68%), Positives = 121/190 (63.68%), Query Frame = 0

Query: 1   MMASSSHLSAIPLRPSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSK---PLK 60
           M ASSSHL A+P    S ++P L  PN   V +   S  + +S  +  S+  S+     +
Sbjct: 1   MAASSSHLFALP----SPASPFLSAPNRNRVRVLAKSCPENQSFDSNDSDSSSETTHKAQ 60

Query: 61  GTQNLISRR-WCLTCLCSSVTLLKNYGGTGTEAIANTMDGKP--VCRNCGGSGAVXXXXX 120
           G Q  +SRR W   C+C+S  L+ N     +   A  +D KP   CRNC GSGAV     
Sbjct: 61  GDQKSVSRRQWMTACVCASAALISNSYTFVSVQSAAALDKKPGGSCRNCQGSGAVLCDMC 120

Query: 121 XXXXKWKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDK 180
               KWKALNRKRAKDVY   XXXXXXXXXXXXXXXXXXXXXPNNKGLLRRP AR+LL+K
Sbjct: 121 GGTGKWKALNRKRAKDVYXXXXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPGARELLEK 180

Query: 181 MYNGRLLPNS 185
           MYNGRLLP+S
Sbjct: 181 MYNGRLLPDS 186

BLAST of Bhi10G000327 vs. Swiss-Prot
Match: sp|A0A1D6KL43|PSA2_MAIZE (Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Zea mays OX=4577 GN=PSA2 PE=2 SV=1)

HSP 1 Score: 109.8 bits (273), Expect = 3.3e-23
Identity = 83/124 (66.94%), Positives = 92/124 (74.19%), Query Frame = 0

Query: 64  SRRWCLTCLCSSVTLLKNYG---GTGTEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXKW 123
           SRR CL CL  +VTL+   G   G   +A+      K VCRNC GSGAV         KW
Sbjct: 74  SRRRCLVCL-GAVTLISATGPPNGLAADAMNKAGVQKAVCRNCNGSGAVICDMCGGTGKW 133

Query: 124 KALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGRL 183
           KALNRKRAKDVYEF XXXXXXXXXXXXXXXXXXXXXPNNKGLLRRP+A++LLDKMYNG++
Sbjct: 134 KALNRKRAKDVYEFXXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPEAKQLLDKMYNGKI 193

Query: 184 LPNS 185
           LP S
Sbjct: 194 LPRS 196

BLAST of Bhi10G000327 vs. TrEMBL
Match: tr|A0A1S4DYE8|A0A1S4DYE8_CUCME (protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491567 PE=4 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 4.3e-68
Identity = 171/184 (92.93%), Positives = 175/184 (95.11%), Query Frame = 0

Query: 1   MMASSSHLSAIPL-RPSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGT 60
           MMASSSHLSAIPL RPSS+STPSL H NSKPVVLH+TS SD ESCSTG S  PSKPLKGT
Sbjct: 1   MMASSSHLSAIPLRRPSSSSTPSLSHSNSKPVVLHLTSNSDDESCSTGDSRTPSKPLKGT 60

Query: 61  QNLISRRWCLTCLCSSVTLLKNYGGTGTEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXK 120
           QNLISRRWCLTCLCSSVTL+KNYGGT TEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXK
Sbjct: 61  QNLISRRWCLTCLCSSVTLMKNYGGT-TEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXK 120

Query: 121 WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGR 180
           WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDAR+LLDKMYNGR
Sbjct: 121 WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARQLLDKMYNGR 180

Query: 181 LLPN 184
           LLPN
Sbjct: 181 LLPN 183

BLAST of Bhi10G000327 vs. TrEMBL
Match: tr|A0A0A0KX46|A0A0A0KX46_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G285690 PE=4 SV=1)

HSP 1 Score: 255.8 bits (652), Expect = 7.6e-65
Identity = 158/185 (85.41%), Positives = 162/185 (87.57%), Query Frame = 0

Query: 2   MASSSHLSAIPL-RPSSASTPSLYH-PNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGT 61
           MASSSHLSAIPL RPSS+S PSL H  N KPVVLHVTS SD ESCSTG S  PSKPLKGT
Sbjct: 1   MASSSHLSAIPLRRPSSSSPPSLSHSANLKPVVLHVTSNSDDESCSTGDSKTPSKPLKGT 60

Query: 62  QNLISRRWCLTCLCSSVTLLKNYGGTGTEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXK 121
           Q LISRRWCLTCLCSSVTL+K+YGGT TEAIANTMDGKP CRNCGGSGAV         K
Sbjct: 61  QKLISRRWCLTCLCSSVTLMKSYGGTVTEAIANTMDGKPACRNCGGSGAVLCDMCGGTGK 120

Query: 122 WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGR 181
           WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDAR+LLDKMYNGR
Sbjct: 121 WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARQLLDKMYNGR 180

Query: 182 LLPNS 185
           LLPNS
Sbjct: 181 LLPNS 185

BLAST of Bhi10G000327 vs. TrEMBL
Match: tr|A0A1S4DXN8|A0A1S4DXN8_CUCME (protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491567 PE=4 SV=1)

HSP 1 Score: 233.4 bits (594), Expect = 4.0e-58
Identity = 150/164 (91.46%), Positives = 154/164 (93.90%), Query Frame = 0

Query: 20  TPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGTQNLISRRWCLTCLCSSVTLL 79
           T +L   NSKPVVLH+TS SD ESCSTG S  PSKPLKGTQNLISRRWCLTCLCSSVTL+
Sbjct: 18  TTNLLPTNSKPVVLHLTSNSDDESCSTGDSRTPSKPLKGTQNLISRRWCLTCLCSSVTLM 77

Query: 80  KNYGGTGTEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXKWKALNRKRAKDVYEFTXXXX 139
           KNYGGT TEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXKWKALNRKRAKDVYEFTXXXX
Sbjct: 78  KNYGGT-TEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXKWKALNRKRAKDVYEFTXXXX 137

Query: 140 XXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGRLLPN 184
           XXXXXXXXXXXXXXXXXPNNKGLLRRPDAR+LLDKMYNGRLLPN
Sbjct: 138 XXXXXXXXXXXXXXXXXPNNKGLLRRPDARQLLDKMYNGRLLPN 180

BLAST of Bhi10G000327 vs. TrEMBL
Match: tr|A0A1S4DXP4|A0A1S4DXP4_CUCME (protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X3 OS=Cucumis melo OX=3656 GN=LOC103491567 PE=4 SV=1)

HSP 1 Score: 231.5 bits (589), Expect = 1.5e-57
Identity = 151/169 (89.35%), Positives = 154/169 (91.12%), Query Frame = 0

Query: 15  PSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGTQNLISRRWCLTCLCS 74
           PS     S    NSKPVVLH+TS SD ESCSTG S  PSKPLKGTQNLISRRWCLTCLCS
Sbjct: 2   PSLVRFCSKLEANSKPVVLHLTSNSDDESCSTGDSRTPSKPLKGTQNLISRRWCLTCLCS 61

Query: 75  SVTLLKNYGGTGTEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXKWKALNRKRAKDVYEF 134
           SVTL+KNYGGT TEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXKWKALNRKRAKDVYEF
Sbjct: 62  SVTLMKNYGGT-TEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXKWKALNRKRAKDVYEF 121

Query: 135 TXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGRLLPN 184
           TXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDAR+LLDKMYNGRLLPN
Sbjct: 122 TXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARQLLDKMYNGRLLPN 169

BLAST of Bhi10G000327 vs. TrEMBL
Match: tr|A0A2P5F6X6|A0A2P5F6X6_9ROSA (Heat shock protein DnaJ, cysteine-rich domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_105360 PE=4 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 2.3e-45
Identity = 138/185 (74.59%), Positives = 152/185 (82.16%), Query Frame = 0

Query: 2   MASSSHLSAIPLRPSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGTQN 61
           MASSSHLSAIP R    S PSL H NSK V LHV S  + ESC TG     SKP+K +Q 
Sbjct: 1   MASSSHLSAIPQR---LSKPSLSHLNSKTVRLHVRSSLENESCGTGDPGESSKPVKRSQL 60

Query: 62  LISRRWCLTCLCSSVTLLKNYGGTGTEAIANTMDG--KPVCRNCGGSGAVXXXXXXXXXK 121
            ++RR CLTCLCS+V L+ ++G + ++A A+TMDG  KPVCRNCGGSGAVXXXXXXXXXK
Sbjct: 61  AMNRRLCLTCLCSTVGLINDFGTSTSKAKASTMDGRDKPVCRNCGGSGAVXXXXXXXXXK 120

Query: 122 WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGR 181
           WKALNRKRAKDVYE  XXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDAR+LLDKMYNGR
Sbjct: 121 WKALNRKRAKDVYEXXXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARQLLDKMYNGR 180

Query: 182 LLPNS 185
           LLPNS
Sbjct: 181 LLPNS 182

BLAST of Bhi10G000327 vs. NCBI nr
Match: XP_016900745.1 (PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 266.5 bits (680), Expect = 6.5e-68
Identity = 171/184 (92.93%), Positives = 175/184 (95.11%), Query Frame = 0

Query: 1   MMASSSHLSAIPL-RPSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGT 60
           MMASSSHLSAIPL RPSS+STPSL H NSKPVVLH+TS SD ESCSTG S  PSKPLKGT
Sbjct: 1   MMASSSHLSAIPLRRPSSSSTPSLSHSNSKPVVLHLTSNSDDESCSTGDSRTPSKPLKGT 60

Query: 61  QNLISRRWCLTCLCSSVTLLKNYGGTGTEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXK 120
           QNLISRRWCLTCLCSSVTL+KNYGGT TEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXK
Sbjct: 61  QNLISRRWCLTCLCSSVTLMKNYGGT-TEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXK 120

Query: 121 WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGR 180
           WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDAR+LLDKMYNGR
Sbjct: 121 WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARQLLDKMYNGR 180

Query: 181 LLPN 184
           LLPN
Sbjct: 181 LLPN 183

BLAST of Bhi10G000327 vs. NCBI nr
Match: XP_011653566.1 (PREDICTED: uncharacterized protein LOC101213672 isoform X2 [Cucumis sativus])

HSP 1 Score: 260.4 bits (664), Expect = 4.7e-66
Identity = 158/184 (85.87%), Positives = 162/184 (88.04%), Query Frame = 0

Query: 2   MASSSHLSAIPL-RPSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGTQ 61
           MASSSHLSAIPL RPSS+S PSL H N KPVVLHVTS SD ESCSTG S  PSKPLKGTQ
Sbjct: 1   MASSSHLSAIPLRRPSSSSPPSLSHSNLKPVVLHVTSNSDDESCSTGDSKTPSKPLKGTQ 60

Query: 62  NLISRRWCLTCLCSSVTLLKNYGGTGTEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXKW 121
            LISRRWCLTCLCSSVTL+K+YGGT TEAIANTMDGKP CRNCGGSGAV         KW
Sbjct: 61  KLISRRWCLTCLCSSVTLMKSYGGTVTEAIANTMDGKPACRNCGGSGAVLCDMCGGTGKW 120

Query: 122 KALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGRL 181
           KALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDAR+LLDKMYNGRL
Sbjct: 121 KALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARQLLDKMYNGRL 180

Query: 182 LPNS 185
           LPNS
Sbjct: 181 LPNS 184

BLAST of Bhi10G000327 vs. NCBI nr
Match: XP_004142153.1 (PREDICTED: uncharacterized protein LOC101213672 isoform X1 [Cucumis sativus] >KGN54093.1 hypothetical protein Csa_4G285690 [Cucumis sativus])

HSP 1 Score: 255.8 bits (652), Expect = 1.2e-64
Identity = 158/185 (85.41%), Positives = 162/185 (87.57%), Query Frame = 0

Query: 2   MASSSHLSAIPL-RPSSASTPSLYH-PNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGT 61
           MASSSHLSAIPL RPSS+S PSL H  N KPVVLHVTS SD ESCSTG S  PSKPLKGT
Sbjct: 1   MASSSHLSAIPLRRPSSSSPPSLSHSANLKPVVLHVTSNSDDESCSTGDSKTPSKPLKGT 60

Query: 62  QNLISRRWCLTCLCSSVTLLKNYGGTGTEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXK 121
           Q LISRRWCLTCLCSSVTL+K+YGGT TEAIANTMDGKP CRNCGGSGAV         K
Sbjct: 61  QKLISRRWCLTCLCSSVTLMKSYGGTVTEAIANTMDGKPACRNCGGSGAVLCDMCGGTGK 120

Query: 122 WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGR 181
           WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDAR+LLDKMYNGR
Sbjct: 121 WKALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARQLLDKMYNGR 180

Query: 182 LLPNS 185
           LLPNS
Sbjct: 181 LLPNS 185

BLAST of Bhi10G000327 vs. NCBI nr
Match: XP_022958131.1 (protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita moschata] >XP_023532563.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 249.2 bits (635), Expect = 1.1e-62
Identity = 152/184 (82.61%), Positives = 159/184 (86.41%), Query Frame = 0

Query: 1   MMASSSHLSAIPLRPSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGTQ 60
           MMASSSHLSAIPLR SS+S PSL H NSKPVVL VTS  D ES +TG S+ PSKPLKGT+
Sbjct: 1   MMASSSHLSAIPLRSSSSSRPSLSHSNSKPVVLQVTSNLDDESSTTGDSSTPSKPLKGTR 60

Query: 61  NLISRRWCLTCLCSSVTLLKNYGGTGTEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXKW 120
            LISRRWCLTCLCSS TL+K+YGGT  EAIANTMDGKP CRNCGGSGAV         KW
Sbjct: 61  ILISRRWCLTCLCSSPTLIKDYGGTMAEAIANTMDGKPACRNCGGSGAVLCDMCGGTGKW 120

Query: 121 KALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGRL 180
           KALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKM+NGRL
Sbjct: 121 KALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMFNGRL 180

Query: 181 LPNS 185
           LPNS
Sbjct: 181 LPNS 184

BLAST of Bhi10G000327 vs. NCBI nr
Match: XP_022996228.1 (protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita maxima])

HSP 1 Score: 248.8 bits (634), Expect = 1.4e-62
Identity = 152/184 (82.61%), Positives = 157/184 (85.33%), Query Frame = 0

Query: 1   MMASSSHLSAIPLRPSSASTPSLYHPNSKPVVLHVTSKSDGESCSTGVSNPPSKPLKGTQ 60
           MMASSSHLSAIPLR SS+S P L H NSKPVVL VTS  D ESCSTG  + PSKPLKGT 
Sbjct: 1   MMASSSHLSAIPLRSSSSSRPFLSHSNSKPVVLQVTSNLDDESCSTGDLSTPSKPLKGTP 60

Query: 61  NLISRRWCLTCLCSSVTLLKNYGGTGTEAIANTMDGKPVCRNCGGSGAVXXXXXXXXXKW 120
            LISRRWCLTCLCSS TL+K+YGGT  EAIANTMDGKP CRNCGGSGAV         KW
Sbjct: 61  ILISRRWCLTCLCSSPTLIKDYGGTMAEAIANTMDGKPACRNCGGSGAVLCDMCGGTGKW 120

Query: 121 KALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMYNGRL 180
           KALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKM+NGRL
Sbjct: 121 KALNRKRAKDVYEFTXXXXXXXXXXXXXXXXXXXXXPNNKGLLRRPDARKLLDKMFNGRL 180

Query: 181 LPNS 185
           LPNS
Sbjct: 181 LPNS 184

The following BLAST results are available for this feature:
Match Name                      E-value  Identity (%)  Description
AT2G34860.1                     1.4e-27  53.68         DnaJ/Hsp40 cysteine-rich domain superfamily protein

Match Name                      E-value  Identity (%)  Description
sp|Q6YUA8|PSA22_ORYSJ           5.0e-27  56.90         Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Oryza sativa subsp. japonica ...
sp|O64750|PSA2_ARATH            2.5e-26  53.68         Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Arabidopsis thaliana OX=3702 ...
sp|A0A1D6KL43|PSA2_MAIZE        3.3e-23  66.94         Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Zea mays OX=4577 GN=PSA2 PE=2...

Match Name                      E-value  Identity (%)  Description
tr|A0A1S4DYE8|A0A1S4DYE8_CUCME  4.3e-68  92.93         protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X1 OS=Cucumis mel...
tr|A0A0A0KX46|A0A0A0KX46_CUCSA  7.6e-65  85.41         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G285690 PE=4 SV=1
tr|A0A1S4DXN8|A0A1S4DXN8_CUCME  4.0e-58  91.46         protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X2 OS=Cucumis mel...
tr|A0A1S4DXP4|A0A1S4DXP4_CUCME  1.5e-57  89.35         protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X3 OS=Cucumis mel...
tr|A0A2P5F6X6|A0A2P5F6X6_9ROSA  2.3e-45  74.59         Heat shock protein DnaJ, cysteine-rich domain containing protein OS=Trema orient...

Match Name                      E-value  Identity (%)  Description
XP_016900745.1                  6.5e-68  92.93         PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X1 [Cu...
XP_011653566.1                  4.7e-66  85.87         PREDICTED: uncharacterized protein LOC101213672 isoform X2 [Cucumis sativus]
XP_004142153.1                  1.2e-64  85.41         PREDICTED: uncharacterized protein LOC101213672 isoform X1 [Cucumis sativus] >KG...
XP_022958131.1                  1.1e-62  82.61         protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita moschata] >XP_0235325...
XP_022996228.1                  1.4e-62  82.61         protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita maxima]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR036410  HSP_DnaJ_Cys-rich_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016117 carotenoid biosynthetic process
biological_process GO:0044271 cellular nitrogen compound biosynthetic process
biological_process GO:0048564 photosystem I assembly
biological_process GO:0008150 biological_process
biological_process GO:0044281 small molecule metabolic process
biological_process GO:0016070 RNA metabolic process
biological_process GO:0009657 plastid organization
biological_process GO:0046148 pigment biosynthetic process
biological_process GO:1901564 organonitrogen compound metabolic process
biological_process GO:1901362 organic cyclic compound biosynthetic process
biological_process GO:0008299 isoprenoid biosynthetic process
biological_process GO:0018130 heterocycle biosynthetic process
biological_process GO:0019682 glyceraldehyde-3-phosphate metabolic process
biological_process GO:0051186 cofactor metabolic process
biological_process GO:0044267 cellular protein metabolic process
biological_process GO:0019438 aromatic compound biosynthetic process
biological_process GO:0015995 chlorophyll biosynthetic process
biological_process GO:0019684 photosynthesis, light reaction
biological_process GO:0009902 chloroplast relocation
biological_process GO:0019288 isopentenyl diphosphate biosynthetic process, methylerythritol 4-phosphate pathway
biological_process GO:0009561 megagametogenesis
biological_process GO:0034660 ncRNA metabolic process
biological_process GO:0006098 pentose-phosphate shunt
biological_process GO:0010304 PSII associated light-harvesting complex II catabolic process
biological_process GO:0035304 regulation of protein dephosphorylation
biological_process GO:0010027 thylakoid membrane organization
biological_process GO:0042793 transcription from plastid promoter
cellular_component GO:0009507 chloroplast
cellular_component GO:0009535 chloroplast thylakoid membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0051082 unfolded protein binding
molecular_function GO:0031072 heat shock protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi10M000327  Bhi10M000327  mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR Term   IPR Description                                            Source       Source Term     Source Description                                      Alignment
None       No IPR available                                           PANTHER      PTHR15852:SF29  PROTEIN EMBRYO SAC DEVELOPMENT ARREST 3, CHLOROPLASTIC  coord: 95..155
None       No IPR available                                           PANTHER      PTHR15852       FAMILY NOT NAMED                                        coord: 95..155
IPR036410  Heat shock protein DnaJ, cysteine-rich domain superfamily  SUPERFAMILY  SSF57938        DnaJ/Hsp40 cysteine-rich domain                         coord: 99..156