Bhi01G002303 (gene) Wax gourd (B227) v1

Overview
Name: Bhi01G002303
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Pentatricopeptide repeat (PPR) superfamily protein isoform 2
Location: chr1: 73184990 .. 73189088 (+)
RNA-Seq Expression: Bhi01G002303
Synteny: Bhi01G002303
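
The Location field can be turned into a structured interval for downstream use. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
import re


def parse_location(text):
    """Split a location string such as 'chr1: 73184990 .. 73189088 (+)'
    into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr1: 73184990 .. 73189088 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.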
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAAATCCCTATCAAACGTCCTTAATATTATTTCTACATGAAATTTCTACTCAAACGACGTCGTTGTTGTCTCCTCTATATATATAACCCTTCCTTTGCCCTATTTTTTTCCACCCGAGTTTCGTCTTCCTTCGGCGGCAAGGAATTTGACTCTTCCGCAAAGCTCAGGAACTTTTCATCTCTTTCGTTTAGGTTTTCCCTAATTCTCTGCCCTAATTCTTCACTGCTATTTCTCCGATCGGTTTTAGCGGCGAGTTATCTTCTCCTCAAGAGGTGTGTGTTCTTTCTTCTCCGTTTTGATCTACGATGCTTACTTTACGATGGAAACCCTAGATGTTTTGTCTCCGATTTCGTGGTTCTTTTGTATGGTTGCTGTTTGATTTCTTGTGGGAGCCGGAGTAGACTTTTGCCCCGTGAGTTATGGCTTTTCATTTGATATATCATAAGGTTTGAGTGCGAGTGGGAAGTTTCTTGGTTCTGTTATGGTTGTATTGTTGAAGGTCTCTGTATTTGGATGCAGTTTCCTTTTTCGGGGTTTTTTTCGATGTATAATGAGACCATAGAAGTATATTGTATCTATTGGTAACTTTTGTTTCTGCAACTTATGTAAGGTTATGGCTGGTTTCTTTGAGTTTTCTGTGCCATGTGTCTGATTTACGCAGTATTGTTCTTTTTCTTGATTCTTTGAACGAACCCTGGCGAGGAGCTGCATGATTTTGTGCGCTTTTCTGAGCGGCTTTGCGTCCCTATTGTTTTGAATTTATTTTGAATGTTACTTTTCTTTGTTCACTACCAACTCTATTTTTCTAACTTTGTGTTAGATGGCGGCTAAACCACTTACTACCGAGGCGATTGCCATAACTGAGAAGAAGATGGATATGGCTTTAGGTTGGGCTTTTGAATTTCTTGCTTCAGTACTTTTGAATTTATGTACATCTTAAAAATTTATTGATTATTCTGTTTCTACATTTGTTTTGAATTAGACGACATTATTAAAATGTCCAAAAATACGGGAAATAAAGGCACGAAGCAAAGAAGGATACCGGTAATAACCTGTTTTCTTTATCATGCCTTTAATCTGGGTAGTCCATTTTTGAAATTGGAAAATTTCTTTTTATTATGATAATGTAGTGATTTATCTTTCCACATGCAGAACAAAATGCAGAAATTTCCAAATAATGCTTCTCAAAATAGACCCAGAAAGTTGCAGCGTTTCATGGACTCTAGGTCTTCTCTGAGACAGGTTTGTGCAGTCATTTAACTTGATTATAATTTACGTTGCTATTTTGTATGATGAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAAATTTTAGGGGGCTTTGGCCAACAGAAGGTCAAGCTTTCAAGGGAATCAGTTTCCTTTGGCAACTGAGGTTGCAAGAAAGGCTGCAGTTGCTCCAATTCGCCCTAGAGCTTTTAATCGTAGGGCAACCAATTGGAATAAAACAAGGTATTGACCAATACCTCGTAACCTACAGAAAGGTCTTCTCGTAAATTATGAACATATGTTGAAAATCAGCAACCTTCTAAAACTGTATGATATATAAAAGAAGAAATGCTATCAGTTTAACAATTGAAACCTGGTTGCTGATTGTTTTAATGTGTTATTTGATGCTGTGACAGGTAAATCACGTTTATACACAATTGATGACTTCTACAATGTCCATCAAGTTTTACTATATGAAGTTAGCAACATAATATGTTGGTTGGAAAAACTGGATGATAAGTTTTTGCTTCAACCAGTTAAATTGGAGTGGGACTATTTCCAATAATTTTGATTCTTCCATGCTGACTGGGAATTGTTTCTGCTTTATCTTCTTAAGTCTGGTTTTGGAATGCATCGAGTTATCTTTTGGTTTTCTGACGAGTAAGTGATTGGTGGAGTCTGCAGAAGTTGGGACAAAGATAATTTGGAAATTTTGAATGAATTTGTTTGCTGCGGTCATCTGGAAAAATGTGGTGGTAAAAAGGGCTTTTG
TGTGCATTGGATGAGTGTGTATACTTTTGAGTTTTTGCTCCTTGTTAAATTTGTTAATTGTCTTGGATTTCTCAAGCCTTCGTAGACAACCCATATCATTAGAGCAGCCTTGACCGGATACCTCTTTTGTTGCAAGGAAGTGGAAGCTGTATTTCAAGATGTGACGTTTTAACTACACCAGATGGTTTAAAATTGTGGCATCTCCCTTTTATTGATTATTCCACAATAGTCTTGTTGTTCATCATGCTCAAACTCCGCTGTTAACTTCCATTCCTGGTGGATGACAATTGACATATCTTCATTGGCTTGCAATAATGCAGGCAAAGAACCTTTTGGGCCTGTGGTATTGTGAGATTTTGGTAAATTTTGTATCTATTTTAGTGATCCGTCTGCTAGTTTCCTTTATTCCACTTGGTGTTCCCTTTTCACTTGCTGTCGCATCCTGGGGACATTAGTTAGCCTTCCAACTACCATGTTGTGACAAGTCTACAAGCACTTCTACTTGTTCACTAGCCATAGGACTCAAATGAGGAAACCCTCGACCACTGAAAGTTTTCCACTTGTGAAATATGGTTCTATAAATTCCCGTAGTCAGCCACTCAGCCTGGCCCATGGGAGATGATATTTCATTTTCTATAGTAGATCCTTGGGGTGGAATTGAAATTTTGCTTGTTGCAAGTAAAACAAGAACTCCGATTTTGGTAATTTGTTGTTGCATAATCCCTGTGATAACATAAGTGCTGAAGGAACTATGAATGGTGAGTTAAAATGTGTCGAAAGATGACTTTATGCTGTTCTGTGAGGTATCATGTGTAGATCTTGGGTTTAGTCAGTGGAATACACTAATACTGCAGAAGTTCATGCTGCTTGATTTTCTGGTTGAAGCCTTCATTTTCTGGTTCAAGTTCCGGACAGGAATCTTCCTGGAGTATATTTTTTGTTCATTGATAATACTAGTTTGGAGCTTTTCATTGTGCATTAAAGCAAGCAAAGCATGTGGGGTGTTTGATTTGAGGTTTGGATGGATCATCCTATTTAACCAGAAAGTGCTTTTCTGGATCCTATCAAGCGTAACGCATTTTGAGCTGATTCACATTTTTCGATGACTGCATTCTCCTGATCTGGTTTGTCTTTTATTTCTGTTTATGGGGTTACCATTTCAATTTTGATCACGGAGGTAAGACACTCTTGATCGACTTGTATCTAGACACTCAAGGACATGTTGAAGATTGTTGTTGTTTGTGAAATTCTAAATTGGATAATACATGTTTTACCTAGCGGGTTTACTTATAAATGTTTTATCTTCTGTTCATCAATCATTTCAACATTTTTTGGTACAGGGTTGATACTCCACCGGTTCCAAGGAAGCCTTTCATTAATGGAACCTTTGTTCCCAAGGTACTGTGTTCCTTACTGAGTTGGCCAGAAGTGTAGAGGGCTCCTCAATGGTTTGCAATTGTTTGATCTCTCGTTTATTTCCCTCACCCACCTTCCGTCTACAGGTATCTGCACCGGCCCAGCCGCAAACAAATGTCACGCCGAGACAGAGGCCACAAACTCTTGACTCACTGTTTGCCAACATGAAGGAACAGAGGTTGAGGGTGTTAGCACAGCGACAAAATAGCGGCGGGGCACAAAGGAATGGTGGTTGGCAGCAAAGACCTCCATGGGGGAAGAGGCCGTTTTGGTAACTGAAGAATACACAACACATACCTGGTGCAATGAATTGATCTTTTGTGTGGGAAATGTAGATGATGCTTAATTTGGCTTGCTTGTCTTCGGCACCCGATAGCTGATAGGGGATAAAAAGGAATCTGGTTGTTTCTTTTTTTCTTTTTTCCTTTTTTTTTTTAATCCAAACCCTTTGGCTTGCTATTTTTAGCTTTTTCTAGCTTCAGTACGAACAGAGATCTTTCTGTATCCTTGCAGATTGTGTACAGTGGTTTTTCTTTTCCTTCGTGTTCTTGTTGGCCTAATTTCTCTACTTGTTCTCTTAACAA
AAAAATCCTATTTAGACGCCCGTTCCCTTCAACTTTGCGTCTTTGAAATACTGTTGTAAGAGGATTATTATTTCAAATGGTATTTACAGTATAGACAAG

mRNA sequence

CAAATCCCTATCAAACGTCCTTAATATTATTTCTACATGAAATTTCTACTCAAACGACGTCGTTGTTGTCTCCTCTATATATATAACCCTTCCTTTGCCCTATTTTTTTCCACCCGAGTTTCGTCTTCCTTCGGCGGCAAGGAATTTGACTCTTCCGCAAAGCTCAGGAACTTTTCATCTCTTTCGTTTAGGTTTTCCCTAATTCTCTGCCCTAATTCTTCACTGCTATTTCTCCGATCGGTTTTAGCGGCGAGTTATCTTCTCCTCAAGAGATGGCGGCTAAACCACTTACTACCGAGGCGATTGCCATAACTGAGAAGAAGATGGATATGGCTTTAGACGACATTATTAAAATGTCCAAAAATACGGGAAATAAAGGCACGAAGCAAAGAAGGATACCGAACAAAATGCAGAAATTTCCAAATAATGCTTCTCAAAATAGACCCAGAAAGTTGCAGCGTTTCATGGACTCTAGGTCTTCTCTGAGACAGGGGGCTTTGGCCAACAGAAGGTCAAGCTTTCAAGGGAATCAGTTTCCTTTGGCAACTGAGGTTGCAAGAAAGGCTGCAGTTGCTCCAATTCGCCCTAGAGCTTTTAATCGTAGGGCAACCAATTGGAATAAAACAAGGGTTGATACTCCACCGGTTCCAAGGAAGCCTTTCATTAATGGAACCTTTGTTCCCAAGGTATCTGCACCGGCCCAGCCGCAAACAAATGTCACGCCGAGACAGAGGCCACAAACTCTTGACTCACTGTTTGCCAACATGAAGGAACAGAGGTTGAGGGTGTTAGCACAGCGACAAAATAGCGGCGGGGCACAAAGGAATGGTGGTTGGCAGCAAAGACCTCCATGGGGGAAGAGGCCGTTTTGGTAACTGAAGAATACACAACACATACCTGGTGCAATGAATTGATCTTTTGTGTGGGAAATGTAGATGATGCTTAATTTGGCTTGCTTGTCTTCGGCACCCGATAGCTGATAGGGGATAAAAAGGAATCTGGTTGTTTCTTTTTTTCTTTTTTCCTTTTTTTTTTTAATCCAAACCCTTTGGCTTGCTATTTTTAGCTTTTTCTAGCTTCAGTACGAACAGAGATCTTTCTGTATCCTTGCAGATTGTGTACAGTGGTTTTTCTTTTCCTTCGTGTTCTTGTTGGCCTAATTTCTCTACTTGTTCTCTTAACAAAAAAATCCTATTTAGACGCCCGTTCCCTTCAACTTTGCGTCTTTGAAATACTGTTGTAAGAGGATTATTATTTCAAATGGTATTTACAGTATAGACAAG

Coding sequence (CDS)

ATGGCGGCTAAACCACTTACTACCGAGGCGATTGCCATAACTGAGAAGAAGATGGATATGGCTTTAGACGACATTATTAAAATGTCCAAAAATACGGGAAATAAAGGCACGAAGCAAAGAAGGATACCGAACAAAATGCAGAAATTTCCAAATAATGCTTCTCAAAATAGACCCAGAAAGTTGCAGCGTTTCATGGACTCTAGGTCTTCTCTGAGACAGGGGGCTTTGGCCAACAGAAGGTCAAGCTTTCAAGGGAATCAGTTTCCTTTGGCAACTGAGGTTGCAAGAAAGGCTGCAGTTGCTCCAATTCGCCCTAGAGCTTTTAATCGTAGGGCAACCAATTGGAATAAAACAAGGGTTGATACTCCACCGGTTCCAAGGAAGCCTTTCATTAATGGAACCTTTGTTCCCAAGGTATCTGCACCGGCCCAGCCGCAAACAAATGTCACGCCGAGACAGAGGCCACAAACTCTTGACTCACTGTTTGCCAACATGAAGGAACAGAGGTTGAGGGTGTTAGCACAGCGACAAAATAGCGGCGGGGCACAAAGGAATGGTGGTTGGCAGCAAAGACCTCCATGGGGGAAGAGGCCGTTTTGGTAA

Protein sequence

MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRKLQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRVDTPPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNSGGAQRNGGWQQRPPWGKRPFW
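
The protein sequence above corresponds to the CDS read codon by codon. As an illustration only, the sketch below translates a nucleotide string with the standard genetic code; that the database used the standard code for this nuclear gene is an assumption, and `translate` is a hypothetical helper, not a function of any particular library.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order
# for the first, second and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGCGGCTAAACCACTT"))
```

Passing the full CDS string from the section above to `translate` is the natural way to check it against the listed protein sequence.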
Homology
BLAST of Bhi01G002303 vs. TAIR 10
Match: AT4G10970.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23910.2); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 141.0 bits (354), Expect = 1.1e-33
Identity = 106/210 (50.48%), Positives = 129/210 (61.43%), Query Frame = 0

Query: 4   KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGTKQRRIPNKMQKFPNNASQNRPRKL 63
           KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG KQ R+ NK +KF + A++N   K 
Sbjct: 5   KPITTETVALTEKKMDMSLDEIIKMEKSNTNVNKGKKQ-RVLNKKEKF-SGAAKNSAVKA 64

Query: 64  QRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFN-RRATNWNKTRV 123
           QR+MDSRS +RQGA A +RS+FQGNQFP+ T VARKAA A  R R +N  R TN N++R 
Sbjct: 65  QRYMDSRSDVRQGAFAKKRSNFQGNQFPVTTTVARKAASATPRGRPYNGGRMTNTNQSRF 124

Query: 124 DTPPVPRKPFINGTFVPKVSAP-----AQPQTN---VTPRQRPQTLDSLFANMKEQRLRV 183
             PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+
Sbjct: 125 IAPPAQNRASQRG-FVGKQQQQQREKIVQQQANGGGGGQRQWPQTLDSRFANMKEERMRM 184

Query: 184 LAQRQNSGGAQRNGG---WQQRP--PWGKR 198
                N      NG     QQR   PW +R
Sbjct: 185 RRFADNRSNVGNNGAGSHQQQRSMVPWVRR 211
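
The Identity and Positives percentages in each HSP header are the listed counts divided by the alignment length. A minimal sketch of that arithmetic follows; `percent` is a hypothetical helper name, and rounding to two decimals is assumed from the way the figures are printed.

```python
def percent(count, aligned_length):
    """Share of aligned columns, as a percentage rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / aligned_length, 2)


# Counts taken from the HSP header above: 106 identities and
# 129 positives over 210 aligned columns.
print(percent(106, 210), percent(129, 210))
```

The denominator is the number of alignment columns, gaps included, rather than the length of the query or the subject.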

BLAST of Bhi01G002303 vs. TAIR 10
Match: AT4G10970.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23910.2); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 141.0 bits (354), Expect = 1.1e-33
Identity = 106/210 (50.48%), Positives = 129/210 (61.43%), Query Frame = 0

Query: 4   KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGTKQRRIPNKMQKFPNNASQNRPRKL 63
           KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG KQ R+ NK +KF + A++N   K 
Sbjct: 5   KPITTETVALTEKKMDMSLDEIIKMEKSNTNVNKGKKQ-RVLNKKEKF-SGAAKNSAVKA 64

Query: 64  QRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFN-RRATNWNKTRV 123
           QR+MDSRS +RQGA A +RS+FQGNQFP+ T VARKAA A  R R +N  R TN N++R 
Sbjct: 65  QRYMDSRSDVRQGAFAKKRSNFQGNQFPVTTTVARKAASATPRGRPYNGGRMTNTNQSRF 124

Query: 124 DTPPVPRKPFINGTFVPKVSAP-----AQPQTN---VTPRQRPQTLDSLFANMKEQRLRV 183
             PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+
Sbjct: 125 IAPPAQNRASQRG-FVGKQQQQQREKIVQQQANGGGGGQRQWPQTLDSRFANMKEERMRM 184

Query: 184 LAQRQNSGGAQRNGG---WQQRP--PWGKR 198
                N      NG     QQR   PW +R
Sbjct: 185 RRFADNRSNVGNNGAGSHQQQRSMVPWVRR 211

BLAST of Bhi01G002303 vs. TAIR 10
Match: AT4G10970.3 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23910.2); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 141.0 bits (354), Expect = 1.1e-33
Identity = 106/210 (50.48%), Positives = 129/210 (61.43%), Query Frame = 0

Query: 4   KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGTKQRRIPNKMQKFPNNASQNRPRKL 63
           KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG KQ R+ NK +KF + A++N   K 
Sbjct: 5   KPITTETVALTEKKMDMSLDEIIKMEKSNTNVNKGKKQ-RVLNKKEKF-SGAAKNSAVKA 64

Query: 64  QRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFN-RRATNWNKTRV 123
           QR+MDSRS +RQGA A +RS+FQGNQFP+ T VARKAA A  R R +N  R TN N++R 
Sbjct: 65  QRYMDSRSDVRQGAFAKKRSNFQGNQFPVTTTVARKAASATPRGRPYNGGRMTNTNQSRF 124

Query: 124 DTPPVPRKPFINGTFVPKVSAP-----AQPQTN---VTPRQRPQTLDSLFANMKEQRLRV 183
             PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+
Sbjct: 125 IAPPAQNRASQRG-FVGKQQQQQREKIVQQQANGGGGGQRQWPQTLDSRFANMKEERMRM 184

Query: 184 LAQRQNSGGAQRNGG---WQQRP--PWGKR 198
                N      NG     QQR   PW +R
Sbjct: 185 RRFADNRSNVGNNGAGSHQQQRSMVPWVRR 211

BLAST of Bhi01G002303 vs. TAIR 10
Match: AT4G10970.4 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23910.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 141.0 bits (354), Expect = 1.1e-33
Identity = 106/210 (50.48%), Positives = 129/210 (61.43%), Query Frame = 0

Query: 4   KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGTKQRRIPNKMQKFPNNASQNRPRKL 63
           KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG KQ R+ NK +KF + A++N   K 
Sbjct: 5   KPITTETVALTEKKMDMSLDEIIKMEKSNTNVNKGKKQ-RVLNKKEKF-SGAAKNSAVKA 64

Query: 64  QRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFN-RRATNWNKTRV 123
           QR+MDSRS +RQGA A +RS+FQGNQFP+ T VARKAA A  R R +N  R TN N++R 
Sbjct: 65  QRYMDSRSDVRQGAFAKKRSNFQGNQFPVTTTVARKAASATPRGRPYNGGRMTNTNQSRF 124

Query: 124 DTPPVPRKPFINGTFVPKVSAP-----AQPQTN---VTPRQRPQTLDSLFANMKEQRLRV 183
             PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+
Sbjct: 125 IAPPAQNRASQRG-FVGKQQQQQREKIVQQQANGGGGGQRQWPQTLDSRFANMKEERMRM 184

Query: 184 LAQRQNSGGAQRNGG---WQQRP--PWGKR 198
                N      NG     QQR   PW +R
Sbjct: 185 RRFADNRSNVGNNGAGSHQQQRSMVPWVRR 211

BLAST of Bhi01G002303 vs. TAIR 10
Match: AT4G10970.5 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23910.2); Has 52 Blast hits to 51 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 52; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 110.9 bits (276), Expect = 1.2e-24
Identity = 96/209 (45.93%), Positives = 114/209 (54.55%), Query Frame = 0

Query: 18  MDMALDDIIKMSK-NTG-NKGTKQRRIPNKMQKFPNNASQNRPRKLQRFMDSRSSLRQGA 77
           MDM+LD+IIKM K NT  NKG KQ R+ NK +KF + A++N   K QR+MDSRS +RQGA
Sbjct: 1   MDMSLDEIIKMEKSNTNVNKGKKQ-RVLNKKEKF-SGAAKNSAVKAQRYMDSRSDVRQGA 60

Query: 78  LANRRSSFQGNQFPLATEVARKAAVAPIRPRAFN-RRATN-------------WNKTRVD 137
            A +RS+FQGNQFP+ T VARKAA A  R R +N  R TN             W   R  
Sbjct: 61  FAKKRSNFQGNQFPVTTTVARKAASATPRGRPYNGGRMTNTNQSSWSIVGRLKWVDARFI 120

Query: 138 TPPVPRKPFINGTFVPKVSAP-----AQPQTN---VTPRQRPQTLDSLFANMKEQRLRVL 197
            PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+ 
Sbjct: 121 APPAQNRASQRG-FVGKQQQQQREKIVQQQANGGGGGQRQWPQTLDSRFANMKEERMRMR 180

BLAST of Bhi01G002303 vs. NCBI nr
Match: XP_038888349.1 (uncharacterized protein LOC120078194 isoform X1 [Benincasa hispida])

HSP 1 Score: 391.3 bits (1004), Expect = 4.8e-105
Identity = 200/200 (100.00%), Positives = 200/200 (100.00%), Query Frame = 0

Query: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60
           MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK
Sbjct: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60

Query: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120
           LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV
Sbjct: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120

Query: 121 DTPPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNSG 180
           DTPPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNSG
Sbjct: 121 DTPPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNSG 180

Query: 181 GAQRNGGWQQRPPWGKRPFW 201
           GAQRNGGWQQRPPWGKRPFW
Sbjct: 181 GAQRNGGWQQRPPWGKRPFW 200

BLAST of Bhi01G002303 vs. NCBI nr
Match: XP_004148996.1 (uncharacterized protein LOC101210049 [Cucumis sativus])

HSP 1 Score: 349.0 bits (894), Expect = 2.8e-92
Identity = 185/203 (91.13%), Positives = 191/203 (94.09%), Query Frame = 0

Query: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60
           MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRK
Sbjct: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRK 60

Query: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120
           LQRFMDSRSSLRQGALANRRS+FQGNQFPLATEVARKAAVAPIRPRAF RRA NWNKTRV
Sbjct: 61  LQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFTRRAPNWNKTRV 120

Query: 121 DT-PPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNS 180
           +  PPVPRKPF NG FVPKVSAPAQPQTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN 
Sbjct: 121 EAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNG 180

Query: 181 GGA-QRNGG-WQQRPPWGKRPFW 201
           GGA QRNGG  QQRPPWGKRPFW
Sbjct: 181 GGAQQRNGGRQQQRPPWGKRPFW 203

BLAST of Bhi01G002303 vs. NCBI nr
Match: KAE8649209.1 (hypothetical protein Csa_014401 [Cucumis sativus])

HSP 1 Score: 349.0 bits (894), Expect = 2.8e-92
Identity = 185/203 (91.13%), Positives = 191/203 (94.09%), Query Frame = 0

Query: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60
           MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRK
Sbjct: 15  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRK 74

Query: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120
           LQRFMDSRSSLRQGALANRRS+FQGNQFPLATEVARKAAVAPIRPRAF RRA NWNKTRV
Sbjct: 75  LQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFTRRAPNWNKTRV 134

Query: 121 DT-PPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNS 180
           +  PPVPRKPF NG FVPKVSAPAQPQTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN 
Sbjct: 135 EAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNG 194

Query: 181 GGA-QRNGG-WQQRPPWGKRPFW 201
           GGA QRNGG  QQRPPWGKRPFW
Sbjct: 195 GGAQQRNGGRQQQRPPWGKRPFW 217

BLAST of Bhi01G002303 vs. NCBI nr
Match: XP_008451954.1 (PREDICTED: uncharacterized protein LOC103493102 [Cucumis melo] >TYK16572.1 uncharacterized protein E5676_scaffold21G003930 [Cucumis melo var. makuwa])

HSP 1 Score: 341.7 bits (875), Expect = 4.4e-90
Identity = 181/205 (88.29%), Positives = 187/205 (91.22%), Query Frame = 0

Query: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60
           MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRK
Sbjct: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRK 60

Query: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120
           LQRFMDSRSSLRQGALANRRS+FQGNQF LATEVARKAAVAPIRPRAF RRA NWNKTRV
Sbjct: 61  LQRFMDSRSSLRQGALANRRSNFQGNQFALATEVARKAAVAPIRPRAFTRRAPNWNKTRV 120

Query: 121 DT-PPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNS 180
           D  PPVP+K F NG FVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN 
Sbjct: 121 DAPPPVPKKSFTNGNFVPKVSAPAQQQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNG 180

Query: 181 GGA----QRNGGWQQRPPWGKRPFW 201
           GG     QRNGG QQRPPWGKRPFW
Sbjct: 181 GGGGAQQQRNGGRQQRPPWGKRPFW 205

BLAST of Bhi01G002303 vs. NCBI nr
Match: KAA0044898.1 (Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa])

HSP 1 Score: 337.8 bits (865), Expect = 6.4e-89
Identity = 180/204 (88.24%), Positives = 186/204 (91.18%), Query Frame = 0

Query: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60
           MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRK
Sbjct: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRK 60

Query: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120
           LQRFMDSRSSLRQGALANRRS+FQGNQF LATEVARKAAVAPIRPRAF RRA NWNKTRV
Sbjct: 61  LQRFMDSRSSLRQGALANRRSNFQGNQFALATEVARKAAVAPIRPRAFTRRAPNWNKTRV 120

Query: 121 DT-PPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNS 180
           D  PPVP+K F NG FVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN 
Sbjct: 121 DAPPPVPKKSFTNGNFVPKVSAPAQQQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNG 180

Query: 181 GGA----QRNGGWQQRPPWGKRPF 200
           GG     QRNGG QQRPPWGKRPF
Sbjct: 181 GGGGAQQQRNGGRQQRPPWGKRPF 204

BLAST of Bhi01G002303 vs. ExPASy TrEMBL
Match: A0A0A0KUX2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G052600 PE=4 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 1.3e-92
Identity = 185/203 (91.13%), Positives = 191/203 (94.09%), Query Frame = 0

Query: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60
           MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRK
Sbjct: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRK 60

Query: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120
           LQRFMDSRSSLRQGALANRRS+FQGNQFPLATEVARKAAVAPIRPRAF RRA NWNKTRV
Sbjct: 61  LQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFTRRAPNWNKTRV 120

Query: 121 DT-PPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNS 180
           +  PPVPRKPF NG FVPKVSAPAQPQTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN 
Sbjct: 121 EAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNG 180

Query: 181 GGA-QRNGG-WQQRPPWGKRPFW 201
           GGA QRNGG  QQRPPWGKRPFW
Sbjct: 181 GGAQQRNGGRQQQRPPWGKRPFW 203

BLAST of Bhi01G002303 vs. ExPASy TrEMBL
Match: A0A5D3CYD8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G003930 PE=4 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 2.1e-90
Identity = 181/205 (88.29%), Positives = 187/205 (91.22%), Query Frame = 0

Query: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60
           MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRK
Sbjct: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRK 60

Query: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120
           LQRFMDSRSSLRQGALANRRS+FQGNQF LATEVARKAAVAPIRPRAF RRA NWNKTRV
Sbjct: 61  LQRFMDSRSSLRQGALANRRSNFQGNQFALATEVARKAAVAPIRPRAFTRRAPNWNKTRV 120

Query: 121 DT-PPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNS 180
           D  PPVP+K F NG FVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN 
Sbjct: 121 DAPPPVPKKSFTNGNFVPKVSAPAQQQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNG 180

Query: 181 GGA----QRNGGWQQRPPWGKRPFW 201
           GG     QRNGG QQRPPWGKRPFW
Sbjct: 181 GGGGAQQQRNGGRQQRPPWGKRPFW 205

BLAST of Bhi01G002303 vs. ExPASy TrEMBL
Match: A0A1S3BTH9 (uncharacterized protein LOC103493102 OS=Cucumis melo OX=3656 GN=LOC103493102 PE=4 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 2.1e-90
Identity = 181/205 (88.29%), Positives = 187/205 (91.22%), Query Frame = 0

Query: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60
           MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRK
Sbjct: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRK 60

Query: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120
           LQRFMDSRSSLRQGALANRRS+FQGNQF LATEVARKAAVAPIRPRAF RRA NWNKTRV
Sbjct: 61  LQRFMDSRSSLRQGALANRRSNFQGNQFALATEVARKAAVAPIRPRAFTRRAPNWNKTRV 120

Query: 121 DT-PPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNS 180
           D  PPVP+K F NG FVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN 
Sbjct: 121 DAPPPVPKKSFTNGNFVPKVSAPAQQQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNG 180

Query: 181 GGA----QRNGGWQQRPPWGKRPFW 201
           GG     QRNGG QQRPPWGKRPFW
Sbjct: 181 GGGGAQQQRNGGRQQRPPWGKRPFW 205

BLAST of Bhi01G002303 vs. ExPASy TrEMBL
Match: A0A5A7TPS5 (Pentatricopeptide repeat (PPR) superfamily protein isoform 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001910 PE=4 SV=1)

HSP 1 Score: 337.8 bits (865), Expect = 3.1e-89
Identity = 180/204 (88.24%), Positives = 186/204 (91.18%), Query Frame = 0

Query: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60
           MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRK
Sbjct: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRK 60

Query: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120
           LQRFMDSRSSLRQGALANRRS+FQGNQF LATEVARKAAVAPIRPRAF RRA NWNKTRV
Sbjct: 61  LQRFMDSRSSLRQGALANRRSNFQGNQFALATEVARKAAVAPIRPRAFTRRAPNWNKTRV 120

Query: 121 DT-PPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNS 180
           D  PPVP+K F NG FVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN 
Sbjct: 121 DAPPPVPKKSFTNGNFVPKVSAPAQQQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNG 180

Query: 181 GGA----QRNGGWQQRPPWGKRPF 200
           GG     QRNGG QQRPPWGKRPF
Sbjct: 181 GGGGAQQQRNGGRQQRPPWGKRPF 204

BLAST of Bhi01G002303 vs. ExPASy TrEMBL
Match: A0A6J1FPB6 (uncharacterized protein LOC111447019 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447019 PE=4 SV=1)

HSP 1 Score: 323.2 bits (827), Expect = 7.8e-85
Identity = 166/196 (84.69%), Positives = 177/196 (90.31%), Query Frame = 0

Query: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGTKQRRIPNKMQKFPNNASQNRPRK 60
           MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR PNKMQKFPNNA+Q+RPRK
Sbjct: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRFPNKMQKFPNNATQDRPRK 60

Query: 61  LQRFMDSRSSLRQGALANRRSSFQGNQFPLATEVARKAAVAPIRPRAFNRRATNWNKTRV 120
           LQRFMD+R+SLRQGALA RRS+FQGNQF LATEVAR AAVAPIRPRAFNRR  NW KTRV
Sbjct: 61  LQRFMDARTSLRQGALAKRRSNFQGNQFALATEVARTAAVAPIRPRAFNRRVPNWKKTRV 120

Query: 121 DTPPVPRKPFINGTFVPKVSAPAQPQTNVTPRQRPQTLDSLFANMKEQRLRVLAQRQNSG 180
           + PPV RKPF NGTF+PK++AP Q QTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN G
Sbjct: 121 EAPPVQRKPFNNGTFIPKITAPVQTQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNGG 180

Query: 181 GAQRNGGWQQRPPWGK 197
             QRNG  QQRPPWG+
Sbjct: 181 AQQRNGARQQRPPWGR 196

The following BLAST results are available for this feature:
TAIR 10
Match Name      E-value    Identity (%)  Description
AT4G10970.1     1.1e-33    50.48         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G10970.2     1.1e-33    50.48         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G10970.3     1.1e-33    50.48         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G10970.4     1.1e-33    50.48         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G10970.5     1.2e-24    45.93         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

NCBI nr
Match Name      E-value    Identity (%)  Description
XP_038888349.1  4.8e-105   100.00        uncharacterized protein LOC120078194 isoform X1 [Benincasa hispida]
XP_004148996.1  2.8e-92    91.13         uncharacterized protein LOC101210049 [Cucumis sativus]
KAE8649209.1    2.8e-92    91.13         hypothetical protein Csa_014401 [Cucumis sativus]
XP_008451954.1  4.4e-90    88.29         PREDICTED: uncharacterized protein LOC103493102 [Cucumis melo] >TYK16572.1 uncha...
KAA0044898.1    6.4e-89    88.24         Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. ...

ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A0A0KUX2      1.3e-92    91.13         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G052600 PE=4 SV=1
A0A5D3CYD8      2.1e-90    88.29         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BTH9      2.1e-90    88.29         uncharacterized protein LOC103493102 OS=Cucumis melo OX=3656 GN=LOC103493102 PE=...
A0A5A7TPS5      3.1e-89    88.24         Pentatricopeptide repeat (PPR) superfamily protein isoform 2 OS=Cucumis melo var...
A0A6J1FPB6      7.8e-85    84.69         uncharacterized protein LOC111447019 isoform X1 OS=Cucurbita moschata OX=3662 GN...
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term  IPR Description   Source       Source Term    Source Description          Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 175..200
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 30..59
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 29..59
None      No IPR available  PANTHER      PTHR36048:SF1  RIBOSOME MATURATION FACTOR  coord: 1..197
None      No IPR available  PANTHER      PTHR36048      RIBOSOME MATURATION FACTOR  coord: 1..197
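
The Alignment entries give residue ranges in the form `coord: start..end`. The sketch below shows one way to read such a range and compute how many residues it covers. The helper names are hypothetical, and treating the ranges as 1-based and inclusive on the protein sequence is an assumption.

```python
import re


def parse_coord(text):
    """Parse an entry such as 'coord: 175..200' into (start, end)."""
    match = re.fullmatch(r"\s*coord:\s*(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised coordinate entry: {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if start > end:
        raise ValueError("start must not exceed end")
    return start, end


def region_length(start, end):
    """Number of residues covered, ASSUMING 1-based inclusive positions."""
    return end - start + 1


for entry in ("coord: 175..200", "coord: 30..59", "coord: 1..197"):
    start, end = parse_coord(entry)
    print(entry, "->", region_length(start, end), "residues")
```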

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi01M002303  Bhi01M002303  mRNA