Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTACTCCAATGCTTCTCATGGCCGCCCAAAAATCCAACGCCAACGCATCACAAGAAACCAAGCCAGAGCCCACCAAAACCAGACCCTCAAGATCGTCAAAGTCCGCCCCCAAAGCCCCCGCCCACAAGAAGCCGCCGCAGCGCGGCCTCGGCGTCGCCCAGCTCGAGCGCCTCCGCCTCCAGGAACGCTGGAAGAAAATGACCGAAATGCCCCCGCCGCCCGTCCACGCCCACCACTTCGCCAACATTCCCGGCCTCCACTTCTCCGCCGTTGATGACTGTACTGGCGAAGGCGGTGGTCTCGTCTTTCAGGGGATGGGGAATGTTGGAGGGTTTGTCGCCGGCGCCGGAGGGTTCACGGTGGTGGAGCCGTACACGCACGGCGGCGGAGCCCTGGATCCGAGGGTTCTGATCGGAAGTTACGGCGAGGAGGATTTGAGAGAGCTCTCTTCAATCCCAAAAATGCCGATGCCGTGCGTTTCCGATCGCTGTGATATTTGCTTCAAGGTAATATTTATTATTCTTCGTATCAACTTGTTTGGATTCCGAGAAAATTTCCGTGACGAGAAAACGGTTATACATATTCATCTCCGTTTTCCGACGCCCAAATCGATTAAATTAATTACTCACGCCATTTTTCTGAGTTCACGTTTTAATCAAACAGAAGAAACGCGTCAACATTTCCACAGAACTGCCGGCGGCGGCGGCCATTATCAACACTGACAGTTACGATTTTCTCGGACTGAGCACAAACTCCGCCGCGAGAGGCGGCGGCGGAACCAACCAGGTGATCAAATCAAAACGAAAACCACACCAAAAACAAGAAAACAAAAAAACCCACTACGGAAAATTTGAAGAAGAACAAAATTAATTAATTTGTTTGGTGCAGGGCGGAGCAGAGATCAAGCAAATACACAGGAAATTAGTGGGGAACGGCGGAGGAGGAGGAGAAAATATATTAATGGAATACGAGTTTTTTCCTGGGAAAAATGGCAGAGGCACAGAGTTCAAGGAACTGGAAATGCCAAAGGAAGAAGAAGAAGAAGAATTTTCGTTTGCAGAAGAAGCAGTGGATCATGGAGAAGCTTCGTCTTGTATTACTACAACCTACAACACTGCCATTGTTAATAATAATGGCAGCAGTAGCAGTGGTTCCAGTGCAGTTGATTTGTCTCTCAAACTTTCATTT
mRNA sequence
ATGGCTACTCCAATGCTTCTCATGGCCGCCCAAAAATCCAACGCCAACGCATCACAAGAAACCAAGCCAGAGCCCACCAAAACCAGACCCTCAAGATCGTCAAAGTCCGCCCCCAAAGCCCCCGCCCACAAGAAGCCGCCGCAGCGCGGCCTCGGCGTCGCCCAGCTCGAGCGCCTCCGCCTCCAGGAACGCTGGAAGAAAATGACCGAAATGCCCCCGCCGCCCGTCCACGCCCACCACTTCGCCAACATTCCCGGCCTCCACTTCTCCGCCGTTGATGACTGTACTGGCGAAGGCGGTGGTCTCGTCTTTCAGGGGATGGGGAATGTTGGAGGGTTTGTCGCCGGCGCCGGAGGGTTCACGGTGGTGGAGCCGTACACGCACGGCGGCGGAGCCCTGGATCCGAGGGTTCTGATCGGAAGTTACGGCGAGGAGGATTTGAGAGAGCTCTCTTCAATCCCAAAAATGCCGATGCCGTGCGTTTCCGATCGCTGTGATATTTGCTTCAAGGTAATATTTATTATTCTTCGTATCAACTTGTTTGGATTCCGAGAAAATTTCCGTGACGAGAAAACGGTTATACATATTCATCTCCGTTTTCCGACGCCCAAATCGATTAAATTAATTACTCACGCCATTTTTCTGAGTTCACGTTTTAATCAAACAGAAGAAACGCTGGGGAACGGCGGAGGAGGAGGAGAAAATATATTAATGGAATACGAGTTTTTTCCTGGGAAAAATGGCAGAGGCACAGAGTTCAAGGAACTGGAAATGCCAAAGGAAGAAGAAGAAGAAGAATTTTCGTTTGCAGAAGAAGCAGTGGATCATGGAGAAGCTTCGTCTTGTATTACTACAACCTACAACACTGCCATTGTTAATAATAATGGCAGCAGTAGCAGTGGTTCCAGTGCAGTTGATTTGTCTCTCAAACTTTCATTT
Coding sequence (CDS)
ATGGCTACTCCAATGCTTCTCATGGCCGCCCAAAAATCCAACGCCAACGCATCACAAGAAACCAAGCCAGAGCCCACCAAAACCAGACCCTCAAGATCGTCAAAGTCCGCCCCCAAAGCCCCCGCCCACAAGAAGCCGCCGCAGCGCGGCCTCGGCGTCGCCCAGCTCGAGCGCCTCCGCCTCCAGGAACGCTGGAAGAAAATGACCGAAATGCCCCCGCCGCCCGTCCACGCCCACCACTTCGCCAACATTCCCGGCCTCCACTTCTCCGCCGTTGATGACTGTACTGGCGAAGGCGGTGGTCTCGTCTTTCAGGGGATGGGGAATGTTGGAGGGTTTGTCGCCGGCGCCGGAGGGTTCACGGTGGTGGAGCCGTACACGCACGGCGGCGGAGCCCTGGATCCGAGGGTTCTGATCGGAAGTTACGGCGAGGAGGATTTGAGAGAGCTCTCTTCAATCCCAAAAATGCCGATGCCGTGCGTTTCCGATCGCTGTGATATTTGCTTCAAGGTAATATTTATTATTCTTCGTATCAACTTGTTTGGATTCCGAGAAAATTTCCGTGACGAGAAAACGGTTATACATATTCATCTCCGTTTTCCGACGCCCAAATCGATTAAATTAATTACTCACGCCATTTTTCTGAGTTCACGTTTTAATCAAACAGAAGAAACGCTGGGGAACGGCGGAGGAGGAGGAGAAAATATATTAATGGAATACGAGTTTTTTCCTGGGAAAAATGGCAGAGGCACAGAGTTCAAGGAACTGGAAATGCCAAAGGAAGAAGAAGAAGAAGAATTTTCGTTTGCAGAAGAAGCAGTGGATCATGGAGAAGCTTCGTCTTGTATTACTACAACCTACAACACTGCCATTGTTAATAATAATGGCAGCAGTAGCAGTGGTTCCAGTGCAGTTGATTTGTCTCTCAAACTTTCATTT
Protein sequence
MATPMLLMAAQKSNANASQETKPEPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHHFANIPGLHFSAVDDCTGEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMPCVSDRCDICFKVIFIILRINLFGFRENFRDEKTVIHIHLRFPTPKSIKLITHAIFLSSRFNQTEETLGNGGGGGENILMEYEFFPGKNGRGTEFKELEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF
Homology
BLAST of MS021351 vs. NCBI nr
Match:
XP_022975733.1 (protein SPOROCYTELESS [Cucurbita maxima])
HSP 1 Score: 197.6 bits (501), Expect = 1.6e-46
Identity = 164/350 (46.86%), Postives = 201/350 (57.43%), Query Frame = 0
Query: 1 MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERL 60
MATPM+++ +ETKP EP KTR +R K+A K P KKPPQRGLGVAQLERL
Sbjct: 1 MATPMVVI----------EETKPGEPLKTRAAR--KTAAKNPHQKKPPQRGLGVAQLERL 60
Query: 61 RLQERWKKMTEMPPPPVHAHHFA---NIPGLHFSAVDDCTGEGGGLVFQGMGNVGGFVAG 120
RLQERWKKMT++ PP F + G + V + G G V +GN GF +G
Sbjct: 61 RLQERWKKMTQISPPHPFLLDFPLQFPVAGASSAPVGNDYGTG---VLGFIGNC-GFGSG 120
Query: 121 AGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM-----PMPCVSDRCDICFK 180
GG +EP+ HGGGA+ DPR+LIG+ E RELSSIP + P PCVSDRCDICFK
Sbjct: 121 GGGLMTMEPFPHGGGAMVDPRLLIGN-SVEASRELSSIPNLPPPPPPPPCVSDRCDICFK 180
Query: 181 VIFIILRINLFGFRENFRDEKTVIHIHLRFPTPKS---IKLITHA----IFLSSRFNQT- 240
R+N N EK + I P P S + L T++ S FNQ
Sbjct: 181 K----KRVNF----SNTMKEKNI--IGTAEPPPVSFDFLGLSTNSAATDFDFSLNFNQDG 240
Query: 241 -------EETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMP-----------KEEE 300
+ G GGGGGE + LMEYEFFP KNGRGTEF+EL+ P +EE+
Sbjct: 241 AGVKQIHRKVAGKGGGGGEGSTLMEYEFFPRKNGRGTEFEELKRPNEELGLFAEEEEEEQ 300
Query: 301 EEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF 314
EEE A +AVDHGE SCITT+ + I NG + + S+ +DLSLKLSF
Sbjct: 301 EEEEEEAVQAVDHGE-GSCITTSCSDII---NGGTRN-STVLDLSLKLSF 318
BLAST of MS021351 vs. NCBI nr
Match:
XP_022936226.1 (protein virilizer homolog [Cucurbita moschata])
HSP 1 Score: 190.7 bits (483), Expect = 2.0e-44
Identity = 167/362 (46.13%), Postives = 199/362 (54.97%), Query Frame = 0
Query: 1 MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERL 60
MA+PM+++ +ETKP EP KTR +R K+A K P HKKPPQRGLGVAQLERL
Sbjct: 1 MASPMVVI----------EETKPGEPLKTRAAR--KTAAKNPHHKKPPQRGLGVAQLERL 60
Query: 61 RLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV-----DDCTGEGGGLVFQGMGNVG 120
RLQERWKKMTE+ PP H+ + P L FSA D TG G F G G G
Sbjct: 61 RLQERWKKMTEISPPHPFQSHSPFLLDFP-LQFSAAAAGGNDYATGVLG---FIGNGGFG 120
Query: 121 GFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCV 180
G G GG +EP+ HGGGA+ DPR+LIG+ E RELSSIP + P PCV
Sbjct: 121 G-GGGGGGLMTMEPFPHGGGAMVDPRLLIGN-SVEASRELSSIPNLPPPPPPPPPPPPCV 180
Query: 181 SDRCDICFKVIFIILRINLFGFRENFRDEKTVIHIHLRFPTP-----KSIKLITHA---- 240
SDRCDICFK R+N N EK +I P P + L T++
Sbjct: 181 SDRCDICFKK----KRVNF----SNTMKEKNIIGTAEPPPPPAVASFDFLGLNTNSAATD 240
Query: 241 IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKE------- 300
S FNQ + G GGGGG + LMEYEFFP KNGRGTE +E
Sbjct: 241 FDFSLNFNQEGAGVKQIHRKVAGKGGGGGGEGSTLMEYEFFPRKNGRGTELEERKRANEE 300
Query: 301 ----LEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKL 314
E +EEEEEE A +AVDHGE SCITT+ + I NG + + S+ +DLSLKL
Sbjct: 301 LGLFAEEEEEEEEEEEEEAVQAVDHGE-GSCITTSCSDII---NGGTRN-STVLDLSLKL 331
BLAST of MS021351 vs. NCBI nr
Match:
KAG7024875.1 (hypothetical protein SDJN02_13694, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 187.6 bits (475), Expect = 1.7e-43
Identity = 166/362 (45.86%), Postives = 199/362 (54.97%), Query Frame = 0
Query: 1 MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERL 60
MA+PM+++ +ETKP EP KTR +R K+A K P HKKPPQRGLGVAQLERL
Sbjct: 1 MASPMVVI----------EETKPGEPHKTRAAR--KTAAKNPHHKKPPQRGLGVAQLERL 60
Query: 61 RLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV-----DDCTGEGGGLVFQGMGNVG 120
RLQERWKKMTE+ PP H+ + P L F A D TG G F G G G
Sbjct: 61 RLQERWKKMTEISPPHPFQSHSPFLLDFP-LQFPAAAAGGNDYATGVLG---FIGNGGFG 120
Query: 121 GFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCV 180
G G GG +EP+ HGGGA+ DPR+LIG+ E RELSSIP + P PCV
Sbjct: 121 G-GGGGGGLMTMEPFPHGGGAMVDPRLLIGN-SVEASRELSSIPNLPPPPPPPPPPPPCV 180
Query: 181 SDRCDICFKVIFIILRINLFGFRENFRDEKTVIHIHLRFPTPKS-----IKLITHA---- 240
SDRCDICFK R+N N EK +I P P + + L T++
Sbjct: 181 SDRCDICFKK----KRVNF----SNTMKEKNIIGT-AEPPPPAAASFDFLGLNTNSAATD 240
Query: 241 IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKE------- 300
S FNQ + G GGGGG + LMEYEFFP KNGRGTE +E
Sbjct: 241 FDFSLNFNQDGAGVKQIHRKVAGKGGGGGGEGSTLMEYEFFPRKNGRGTELEERKRANEE 300
Query: 301 ----LEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKL 314
E +EEEEEE A +AVDHGE SCITT+ + I NG + + S+ +DLSLKL
Sbjct: 301 LGLFAEEEEEEEEEEEEEAVQAVDHGE-GSCITTSCSDII---NGGTRN-STVLDLSLKL 330
BLAST of MS021351 vs. NCBI nr
Match:
KAG6591999.1 (hypothetical protein SDJN03_14345, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 186.0 bits (471), Expect = 4.8e-43
Identity = 165/365 (45.21%), Postives = 198/365 (54.25%), Query Frame = 0
Query: 1 MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERL 60
MA+PM+++ +ETKP EP KTR +R K+A K P HKKPPQRGLGVAQLERL
Sbjct: 1 MASPMVVI----------EETKPGEPLKTRAAR--KTAAKNPHHKKPPQRGLGVAQLERL 60
Query: 61 RLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV-----DDCTGEGGGLVFQGMGNVG 120
RLQERWKKMTE+ PP H+ + P L F A D TG G F G G G
Sbjct: 61 RLQERWKKMTEISPPHPFQSHSPFLLDFP-LQFPAAAAGGNDYATGVLG---FIGNGGFG 120
Query: 121 GFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCV 180
G G GG +EP+ HGGGA+ D R+LIG+ E RELSSIP + P PCV
Sbjct: 121 G--GGCGGLMTMEPFPHGGGAMVDHRLLIGN-SVEASRELSSIPNLPPPPPPPPPPPPCV 180
Query: 181 SDRCDICFKVIFIILRINLFGFRENFRDEKTVIHIHLRFPTPKS-----IKLITHA---- 240
SDRCDICFK R+N N EK +I P P + + L T++
Sbjct: 181 SDRCDICFK-----KRVNF----SNTMKEKNIIGTAEPPPPPAAASFDFLGLNTNSAATD 240
Query: 241 IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKEL------ 300
S FNQ + G GGGGG + LMEYEFFP KNGRGTE +E
Sbjct: 241 FDFSLNFNQDGAGVKQIHRKVAGKGGGGGGEGSTLMEYEFFPRKNGRGTELEERKRANEE 300
Query: 301 --------EMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLS 314
E +EEEEEE A +AVDHGE SCITT+ + I NG + + S+ +DLS
Sbjct: 301 LGLFAEEEEEEEEEEEEEEEEAVQAVDHGE-GSCITTSCSDII---NGGTRN-STVLDLS 332
BLAST of MS021351 vs. NCBI nr
Match:
XP_038900102.1 (protein SPOROCYTELESS-like [Benincasa hispida])
HSP 1 Score: 176.0 bits (445), Expect = 5.0e-40
Identity = 159/366 (43.44%), Postives = 186/366 (50.82%), Query Frame = 0
Query: 19 QETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVH 78
+ETKP EP KTRP R K+ + P KKPPQRGLGVAQLERLRLQ++W K+TEM PP
Sbjct: 10 EETKPHEPPKTRPGR--KTGARNPHQKKPPQRGLGVAQLERLRLQDKWNKITEMSPP--- 69
Query: 79 AHHF--------------ANIPGLHFSA------VDDCTGEGG-------GLVFQGMGNV 138
HHF N P L F A D G GG GLV Q +GN
Sbjct: 70 -HHFIQSHSPTFLLDNTLTNFP-LQFPAAAPVMVATDSGGGGGVLGFDHQGLVVQRIGNH 129
Query: 139 GGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMP-----CVSDRC 198
GGF+AG EPY+HGGG VLIG+ E RELSSIPK+P P C SD C
Sbjct: 130 GGFLAGG------EPYSHGGG-----VLIGNSSVEASRELSSIPKLPPPSLPPSCDSDHC 189
Query: 199 DICFKVIFIILRINLFGFRENFRDEKTVIHIHLRFP-------------TPKSIKLITHA 258
DICFK R+N N EK + P T S + ++
Sbjct: 190 DICFKK----KRVNF----SNLMKEKNINIAAAETPPLAAAGFDFLGLSTTNSTAELNNS 249
Query: 259 IFLSS------------RFNQ----TEETLGNGGGG----GENILMEYEFFPGKNGRGTE 314
++ FNQ G+GGGG G + LMEYEFFP KN RGTE
Sbjct: 250 TVVNHHANPDFGFSFNFNFNQGRSGGNSGSGSGGGGGNGEGSSRLMEYEFFPRKNCRGTE 309
BLAST of MS021351 vs. ExPASy TrEMBL
Match:
A0A6J1IK53 (protein SPOROCYTELESS OS=Cucurbita maxima OX=3661 GN=LOC111475949 PE=4 SV=1)
HSP 1 Score: 197.6 bits (501), Expect = 7.8e-47
Identity = 164/350 (46.86%), Postives = 201/350 (57.43%), Query Frame = 0
Query: 1 MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERL 60
MATPM+++ +ETKP EP KTR +R K+A K P KKPPQRGLGVAQLERL
Sbjct: 1 MATPMVVI----------EETKPGEPLKTRAAR--KTAAKNPHQKKPPQRGLGVAQLERL 60
Query: 61 RLQERWKKMTEMPPPPVHAHHFA---NIPGLHFSAVDDCTGEGGGLVFQGMGNVGGFVAG 120
RLQERWKKMT++ PP F + G + V + G G V +GN GF +G
Sbjct: 61 RLQERWKKMTQISPPHPFLLDFPLQFPVAGASSAPVGNDYGTG---VLGFIGNC-GFGSG 120
Query: 121 AGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM-----PMPCVSDRCDICFK 180
GG +EP+ HGGGA+ DPR+LIG+ E RELSSIP + P PCVSDRCDICFK
Sbjct: 121 GGGLMTMEPFPHGGGAMVDPRLLIGN-SVEASRELSSIPNLPPPPPPPPCVSDRCDICFK 180
Query: 181 VIFIILRINLFGFRENFRDEKTVIHIHLRFPTPKS---IKLITHA----IFLSSRFNQT- 240
R+N N EK + I P P S + L T++ S FNQ
Sbjct: 181 K----KRVNF----SNTMKEKNI--IGTAEPPPVSFDFLGLSTNSAATDFDFSLNFNQDG 240
Query: 241 -------EETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMP-----------KEEE 300
+ G GGGGGE + LMEYEFFP KNGRGTEF+EL+ P +EE+
Sbjct: 241 AGVKQIHRKVAGKGGGGGEGSTLMEYEFFPRKNGRGTEFEELKRPNEELGLFAEEEEEEQ 300
Query: 301 EEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF 314
EEE A +AVDHGE SCITT+ + I NG + + S+ +DLSLKLSF
Sbjct: 301 EEEEEEAVQAVDHGE-GSCITTSCSDII---NGGTRN-STVLDLSLKLSF 318
BLAST of MS021351 vs. ExPASy TrEMBL
Match:
A0A6J1F6X7 (protein virilizer homolog OS=Cucurbita moschata OX=3662 GN=LOC111442900 PE=4 SV=1)
HSP 1 Score: 190.7 bits (483), Expect = 9.5e-45
Identity = 167/362 (46.13%), Postives = 199/362 (54.97%), Query Frame = 0
Query: 1 MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERL 60
MA+PM+++ +ETKP EP KTR +R K+A K P HKKPPQRGLGVAQLERL
Sbjct: 1 MASPMVVI----------EETKPGEPLKTRAAR--KTAAKNPHHKKPPQRGLGVAQLERL 60
Query: 61 RLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV-----DDCTGEGGGLVFQGMGNVG 120
RLQERWKKMTE+ PP H+ + P L FSA D TG G F G G G
Sbjct: 61 RLQERWKKMTEISPPHPFQSHSPFLLDFP-LQFSAAAAGGNDYATGVLG---FIGNGGFG 120
Query: 121 GFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCV 180
G G GG +EP+ HGGGA+ DPR+LIG+ E RELSSIP + P PCV
Sbjct: 121 G-GGGGGGLMTMEPFPHGGGAMVDPRLLIGN-SVEASRELSSIPNLPPPPPPPPPPPPCV 180
Query: 181 SDRCDICFKVIFIILRINLFGFRENFRDEKTVIHIHLRFPTP-----KSIKLITHA---- 240
SDRCDICFK R+N N EK +I P P + L T++
Sbjct: 181 SDRCDICFKK----KRVNF----SNTMKEKNIIGTAEPPPPPAVASFDFLGLNTNSAATD 240
Query: 241 IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKE------- 300
S FNQ + G GGGGG + LMEYEFFP KNGRGTE +E
Sbjct: 241 FDFSLNFNQEGAGVKQIHRKVAGKGGGGGGEGSTLMEYEFFPRKNGRGTELEERKRANEE 300
Query: 301 ----LEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKL 314
E +EEEEEE A +AVDHGE SCITT+ + I NG + + S+ +DLSLKL
Sbjct: 301 LGLFAEEEEEEEEEEEEEAVQAVDHGE-GSCITTSCSDII---NGGTRN-STVLDLSLKL 331
BLAST of MS021351 vs. ExPASy TrEMBL
Match:
A0A0A0LDJ3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G850670 PE=4 SV=1)
HSP 1 Score: 171.0 bits (432), Expect = 7.8e-39
Identity = 146/327 (44.65%), Postives = 170/327 (51.99%), Query Frame = 0
Query: 24 EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHH-FA 83
EP KTR R K PK P KKPPQRGLGVAQLERLRLQE WK +TE+ PP H+
Sbjct: 16 EPPKTRAGR--KPGPKNPNQKKPPQRGLGVAQLERLRLQENWKTVTEISPPTFLLHNPLP 75
Query: 84 NIPGLHFSAV------DDCTG-EGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGALDP 143
N P LHF DC G + G V Q +GN GGF+ +G
Sbjct: 76 NFP-LHFPPAPAPILHTDCIGFDHHGFVVQRIGNNGGFLPASG----------------- 135
Query: 144 RVLIGSYGEEDLRELSSIPKMPMPCVSDRCDICFKVIFIILRINLFGFRENFRDEKTVI- 203
VLIG+ E RELSSIPK+P+ C SDRCD CFK R+N N EK +I
Sbjct: 136 -VLIGNTSVEASRELSSIPKLPLACDSDRCDHCFKK----KRVNY----SNRMKEKNIIV 195
Query: 204 -------HIHLRFPTPKSIKLITHAIFLSSRFNQTEETL--------------GNGGGGG 263
L T S +L TH + + T L G GG GG
Sbjct: 196 GAAETPSFDFLGLSTNSSAELNTHTHTHTVMNHHTNSDLDLDYDLSFNLKQGRGGGGDGG 255
Query: 264 E-NILMEYEFFPGKNGRGTEFKELEMPKE------EEEEEFSFAEEAVDHGEASSCITTT 314
E + LMEYEFFP KNGRGTE +EL+MPKE EE EE A+DHGE SCITT+
Sbjct: 256 EGSKLMEYEFFPRKNGRGTEIEELKMPKEELSLFREENEEEEEEVLAMDHGE-GSCITTS 308
BLAST of MS021351 vs. ExPASy TrEMBL
Match:
A0A1S3BZ18 (uncharacterized protein LOC103494987 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494987 PE=4 SV=1)
HSP 1 Score: 169.1 bits (427), Expect = 3.0e-38
Identity = 148/354 (41.81%), Postives = 178/354 (50.28%), Query Frame = 0
Query: 1 MATPMLLMAAQKSNANASQETKPEPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLR 60
MATP+ Q+ +N S KP + K PK P KKPPQRGLGVAQLERLR
Sbjct: 1 MATPL-----QQQTSNTSHPPKPRAGR-------KPGPKNPNQKKPPQRGLGVAQLERLR 60
Query: 61 LQERWKKMTEMPPPPVHAHH-FANIPGLHFSAV------DDCTGEGG---------GLVF 120
LQE WK +TE+ PP H+ N P LHF DC G G V
Sbjct: 61 LQENWKTVTEISPPTFLLHNTLPNFP-LHFPPAPPPILHTDCIAAGAGAVLGFDHHGFVV 120
Query: 121 QGMGNVGGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMPCVSDR 180
Q +GN GGF+ G VLIG+ E RELSSIPK+P+ C SDR
Sbjct: 121 QRIGNNGGFLPAGG------------------VLIGNTSVEASRELSSIPKLPLACDSDR 180
Query: 181 CDICFKVIFIILRINLFGFRENFRDEKTVIHIHLRFP--------TPKSIKLITHAIF-- 240
CD CFK R+N N EK +I P T + +L TH++
Sbjct: 181 CDHCFK-----KRVNF----SNRMKEKNIIAAAAETPSFDFLGLGTNSTAELNTHSVMNH 240
Query: 241 -------LSSRFN-QTEETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMPKE---- 300
L F+ ++ G GG GGE + LMEYEFFP KNGRGTE +EL+MPKE
Sbjct: 241 HTDSGWDLDYDFSLNLKQVRGGGGDGGEGSKLMEYEFFPRKNGRGTEIEELKMPKEELSL 300
Query: 301 --EEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF 314
EE EE A+DHGE SCITT+ N I NG + + S+A+DLSLKLSF
Sbjct: 301 FREENEEEEEEVLAMDHGE-GSCITTSCNDII---NGGTRN-STALDLSLKLSF 309
BLAST of MS021351 vs. ExPASy TrEMBL
Match:
A0A5A7T067 (Protein SPOROCYTELESS OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold110G001020 PE=4 SV=1)
HSP 1 Score: 168.7 bits (426), Expect = 3.9e-38
Identity = 148/354 (41.81%), Postives = 178/354 (50.28%), Query Frame = 0
Query: 1 MATPMLLMAAQKSNANASQETKPEPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLR 60
MATP+ Q+ +N S KP + K PK P KKPPQRGLGVAQLERLR
Sbjct: 1 MATPL-----QQQTSNTSHPPKPRAGR-------KPGPKNPNQKKPPQRGLGVAQLERLR 60
Query: 61 LQERWKKMTEMPPPPVHAHH-FANIPGLHFSAV------DDCTGEGG---------GLVF 120
LQE WK +TE+ PP H+ N P LHF DC G G V
Sbjct: 61 LQENWKTVTEISPPTFLLHNTLPNFP-LHFPPAPPPILHTDCIAAGAGAVLGFDHHGFVV 120
Query: 121 QGMGNVGGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMPCVSDR 180
Q +GN GGF+ G VLIG+ E RELSSIPK+P+ C SDR
Sbjct: 121 QRIGNNGGFLPAGG------------------VLIGNTSVEASRELSSIPKLPLACDSDR 180
Query: 181 CDICFKVIFIILRINLFGFRENFRDEKTVIHIHLRFP--------TPKSIKLITHAIF-- 240
CD CFK R+N N EK +I P T + +L TH++
Sbjct: 181 CDHCFKK----KRVNF----SNRMKEKNIIAAAAETPSFDFLGLGTNSTAELNTHSVMNH 240
Query: 241 -------LSSRFN-QTEETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMPKE---- 300
L F+ ++ G GG GGE + LMEYEFFP KNGRGTE +EL+MPKE
Sbjct: 241 HTDSGWDLDYDFSLNLKQVRGGGGDGGEGSKLMEYEFFPRKNGRGTEIEELKMPKEELSL 300
Query: 301 --EEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF 314
EE EE A+DHGE SCITT+ N I NG + + S+A+DLSLKLSF
Sbjct: 301 FREENEEEEEEVLAMDHGE-GSCITTSCNDII---NGGTRN-STALDLSLKLSF 310
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022975733.1 | 1.6e-46 | 46.86 | protein SPOROCYTELESS [Cucurbita maxima] | [more] |
XP_022936226.1 | 2.0e-44 | 46.13 | protein virilizer homolog [Cucurbita moschata] | [more] |
KAG7024875.1 | 1.7e-43 | 45.86 | hypothetical protein SDJN02_13694, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
KAG6591999.1 | 4.8e-43 | 45.21 | hypothetical protein SDJN03_14345, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_038900102.1 | 5.0e-40 | 43.44 | protein SPOROCYTELESS-like [Benincasa hispida] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1IK53 | 7.8e-47 | 46.86 | protein SPOROCYTELESS OS=Cucurbita maxima OX=3661 GN=LOC111475949 PE=4 SV=1 | [more] |
A0A6J1F6X7 | 9.5e-45 | 46.13 | protein virilizer homolog OS=Cucurbita moschata OX=3662 GN=LOC111442900 PE=4 SV=... | [more] |
A0A0A0LDJ3 | 7.8e-39 | 44.65 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G850670 PE=4 SV=1 | [more] |
A0A1S3BZ18 | 3.0e-38 | 41.81 | uncharacterized protein LOC103494987 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A5A7T067 | 3.9e-38 | 41.81 | Protein SPOROCYTELESS OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold11... | [more] |
Match Name | E-value | Identity | Description | |