Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATCTCACTCGGACCCTGCGCTTACCAAAATGGATCCTCATTTCCGTTCTACTCTTCAAATCTTCTAAATCTCCGTCCAATTCCTTCCAATTTTGCACTCCCACTCCCAACAAACTCAATAATCCTCGGAAAAAGAAACTTCAGATTCAGAGTTGTTCGCGCCGAACAGGCCGGAGGAAACAGCGCTCAAAGACGAACTTTCTTGACGGTCGAAGAGGCCGGCCTCGTCGAAGTATCCGGCCTCAGCACTCACGAGCGCTTTCTGTGTCGTTTGACGGTATTTCTCTCTCTGCCTCTGTGGCCGTCGTTAATTAATTTCCAACTTCCTTTCATTCTGTTTGTTTTTTTGCAGATATCGTCTCTGAATTTACTGCGAGTAATAGCGGAGGAAGAGAAATGTTCGATCGAGGAGTTGAATGCTGGCAGATTATGCGACTGGTTTCTCAAGGATAAGTTGAAAAGAGAGCAGAATTTGGACTCCGCTGTTCTTCAATGGGATGATTCTGACCTGACCTTC
mRNA sequence
ATGATCTCACTCGGACCCTGCGCTTACCAAAATGGATCCTCATTTCCGTTCTACTCTTCAAATCTTCTAAATCTCCGTCCAATTCCTTCCAATTTTGCACTCCCACTCCCAACAAACTCAATAATCCTCGGAAAAAGAAACTTCAGATTCAGAGTTGTTCGCGCCGAACAGGCCGGAGGAAACAGCGCTCAAAGACGAACTTTCTTGACGGTCGAAGAGGCCGGCCTCGTCGAAGTATCCGGCCTCAGCACTCACGAGCGCTTTCTGTGTCGTTTGACGGTATTTCTCTCTCTGCCTCTGTGGCCGTCGTTAATTAATTTCCAACTTCCTTTCATTCTGTTTGTTTTTTTGCAGATATCGTCTCTGAATTTACTGCGAGTAATAGCGGAGGAAGAGAAATGTTCGATCGAGGAGTTGAATGCTGGCAGATTATGCGACTGGTTTCTCAAGGATAAGTTGAAAAGAGAGCAGAATTTGGACTCCGCTGTTCTTCAATGGGATGATTCTGACCTGACCTTC
Coding sequence (CDS)
ATGATCTCACTCGGACCCTGCGCTTACCAAAATGGATCCTCATTTCCGTTCTACTCTTCAAATCTTCTAAATCTCCGTCCAATTCCTTCCAATTTTGCACTCCCACTCCCAACAAACTCAATAATCCTCGGAAAAAGAAACTTCAGATTCAGAGTTGTTCGCGCCGAACAGGCCGGAGGAAACAGCGCTCAAAGACGAACTTTCTTGACGGTCGAAGAGGCCGGCCTCGTCGAAGTATCCGGCCTCAGCACTCACGAGCGCTTTCTGTGTCGTTTGACGGTATTTCTCTCTCTGCCTCTGTGGCCGTCGTTAATTAATTTCCAACTTCCTTTCATTCTGTTTGTTTTTTTGCAGATATCGTCTCTGAATTTACTGCGAGTAATAGCGGAGGAAGAGAAATGTTCGATCGAGGAGTTGAATGCTGGCAGATTATGCGACTGGTTTCTCAAGGATAAGTTGAAAAGAGAGCAGAATTTGGACTCCGCTGTTCTTCAATGGGATGATTCTGACCTGACCTTC
Protein sequence
MISLGPCAYQNGSSFPFYSSNLLNLRPIPSNFALPLPTNSIILGKRNFRFRVVRAEQAGGNSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFILFVFLQISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF
Homology
BLAST of MS010816 vs. NCBI nr
Match:
XP_022142270.1 (uncharacterized protein LOC111012432 [Momordica charantia])
HSP 1 Score: 283.1 bits (723), Expect = 1.6e-72
Identity = 148/173 (85.55%), Postives = 148/173 (85.55%), Query Frame = 0
Query: 1 MISLGPCAYQNGSSFPFYSSNLLNLRPIPSNFALPLPTNSIILGKRNFRFRVVRAEQAGG 60
MISLGPCAYQNGSSFPFYSSNLLNLRPIPSNFALPLPTNSIILGKRNFRFRVVRAEQAGG
Sbjct: 1 MISLGPCAYQNGSSFPFYSSNLLNLRPIPSNFALPLPTNSIILGKRNFRFRVVRAEQAGG 60
Query: 61 NSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFILFVFLQIS 120
NSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLT IS
Sbjct: 61 NSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLT-------------------------IS 120
Query: 121 SLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 174
SLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF
Sbjct: 121 SLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 148
BLAST of MS010816 vs. NCBI nr
Match:
XP_038881387.1 (uncharacterized protein LOC120072922 [Benincasa hispida])
HSP 1 Score: 184.1 bits (466), Expect = 1.0e-42
Identity = 114/180 (63.33%), Postives = 119/180 (66.11%), Query Frame = 0
Query: 1 MISLGPCAYQNGSSFPFYSSNL-LNLRP----IPSNFALPLPTNSIILGKRNFRFRVVRA 60
MISL P + NGSSF + SSNL N P I NFA P T + FR +A
Sbjct: 1 MISLNPSTHHNGSSFHYNSSNLQQNFIPTIIRIKPNFAPPFHTKT-----TTFR---TQA 60
Query: 61 EQAG--GNSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFIL 120
EQAG GN AQRRTFLT+EEAGLVEVSGLSTHERFLCRLT
Sbjct: 61 EQAGSSGNGAQRRTFLTIEEAGLVEVSGLSTHERFLCRLT-------------------- 120
Query: 121 FVFLQISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 174
ISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSD F
Sbjct: 121 -----ISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDPLF 147
BLAST of MS010816 vs. NCBI nr
Match:
XP_023544671.1 (uncharacterized protein LOC111804184 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 183.0 bits (463), Expect = 2.3e-42
Identity = 114/178 (64.04%), Postives = 119/178 (66.85%), Query Frame = 0
Query: 1 MISLGPCAYQNGSSFPFYSSNL-LNLRPIP---SNFALPLPTNSIILGKRNFRFRVVRAE 60
MISL P + NGSSF F SSNL L PI SNF P T K F+ +RAE
Sbjct: 1 MISLKPSPHHNGSSFLFNSSNLHQKLSPITRHRSNFPPPFRT------KTTFK---IRAE 60
Query: 61 QA-GGNSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFILFV 120
QA GN AQRRTFLT+EEAGLVEVSGLSTHERFLCRLT
Sbjct: 61 QAESGNGAQRRTFLTLEEAGLVEVSGLSTHERFLCRLT---------------------- 120
Query: 121 FLQISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 174
ISSLNLL+VIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSD F
Sbjct: 121 ---ISSLNLLKVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDPLF 144
BLAST of MS010816 vs. NCBI nr
Match:
XP_022949732.1 (uncharacterized protein LOC111453037 [Cucurbita moschata] >KAG6603726.1 hypothetical protein SDJN03_04335, partial [Cucurbita argyrosperma subsp. sororia] >KAG7033905.1 hypothetical protein SDJN02_03630 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 182.6 bits (462), Expect = 3.0e-42
Identity = 115/178 (64.61%), Postives = 119/178 (66.85%), Query Frame = 0
Query: 1 MISLGPCAYQNGSSFPFYSSNL-LNLRPIP---SNFALPLPTNSIILGKRNFRFRVVRAE 60
MISL P + NGSSF SSNL L PI SNF PL T K FR +RAE
Sbjct: 1 MISLKPSPHHNGSSFLSNSSNLHQKLSPITRHRSNFPPPLRT------KTTFR---IRAE 60
Query: 61 QA-GGNSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFILFV 120
QA GN AQRRTFLT+EEAGLVEVSGLSTHERFLCRLT
Sbjct: 61 QAESGNGAQRRTFLTLEEAGLVEVSGLSTHERFLCRLT---------------------- 120
Query: 121 FLQISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 174
ISSLNLL+VIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSD F
Sbjct: 121 ---ISSLNLLKVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDPLF 144
BLAST of MS010816 vs. NCBI nr
Match:
XP_022977984.1 (uncharacterized protein LOC111478111 [Cucurbita maxima])
HSP 1 Score: 181.4 bits (459), Expect = 6.6e-42
Identity = 114/178 (64.04%), Postives = 118/178 (66.29%), Query Frame = 0
Query: 1 MISLGPCAYQNGSSFPFYSSNL-LNLRPIP---SNFALPLPTNSIILGKRNFRFRVVRAE 60
MISL P + NGSSF SSNL L PI SNF P T K FR +RAE
Sbjct: 1 MISLKPSPHHNGSSFLSNSSNLHQKLNPITRLRSNFPPPFRT------KTTFR---IRAE 60
Query: 61 QA-GGNSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFILFV 120
QA GN AQRRTFLT+EEAGLVEVSGLSTHERFLCRLT
Sbjct: 61 QAESGNGAQRRTFLTLEEAGLVEVSGLSTHERFLCRLT---------------------- 120
Query: 121 FLQISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 174
ISSLNLL+VIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSD F
Sbjct: 121 ---ISSLNLLKVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDPLF 144
BLAST of MS010816 vs. ExPASy TrEMBL
Match:
A0A6J1CKG5 (uncharacterized protein LOC111012432 OS=Momordica charantia OX=3673 GN=LOC111012432 PE=4 SV=1)
HSP 1 Score: 283.1 bits (723), Expect = 7.8e-73
Identity = 148/173 (85.55%), Postives = 148/173 (85.55%), Query Frame = 0
Query: 1 MISLGPCAYQNGSSFPFYSSNLLNLRPIPSNFALPLPTNSIILGKRNFRFRVVRAEQAGG 60
MISLGPCAYQNGSSFPFYSSNLLNLRPIPSNFALPLPTNSIILGKRNFRFRVVRAEQAGG
Sbjct: 1 MISLGPCAYQNGSSFPFYSSNLLNLRPIPSNFALPLPTNSIILGKRNFRFRVVRAEQAGG 60
Query: 61 NSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFILFVFLQIS 120
NSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLT IS
Sbjct: 61 NSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLT-------------------------IS 120
Query: 121 SLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 174
SLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF
Sbjct: 121 SLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 148
BLAST of MS010816 vs. ExPASy TrEMBL
Match:
A0A6J1GCY7 (uncharacterized protein LOC111453037 OS=Cucurbita moschata OX=3662 GN=LOC111453037 PE=4 SV=1)
HSP 1 Score: 182.6 bits (462), Expect = 1.4e-42
Identity = 115/178 (64.61%), Postives = 119/178 (66.85%), Query Frame = 0
Query: 1 MISLGPCAYQNGSSFPFYSSNL-LNLRPIP---SNFALPLPTNSIILGKRNFRFRVVRAE 60
MISL P + NGSSF SSNL L PI SNF PL T K FR +RAE
Sbjct: 1 MISLKPSPHHNGSSFLSNSSNLHQKLSPITRHRSNFPPPLRT------KTTFR---IRAE 60
Query: 61 QA-GGNSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFILFV 120
QA GN AQRRTFLT+EEAGLVEVSGLSTHERFLCRLT
Sbjct: 61 QAESGNGAQRRTFLTLEEAGLVEVSGLSTHERFLCRLT---------------------- 120
Query: 121 FLQISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 174
ISSLNLL+VIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSD F
Sbjct: 121 ---ISSLNLLKVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDPLF 144
BLAST of MS010816 vs. ExPASy TrEMBL
Match:
A0A6J1ISV2 (uncharacterized protein LOC111478111 OS=Cucurbita maxima OX=3661 GN=LOC111478111 PE=4 SV=1)
HSP 1 Score: 181.4 bits (459), Expect = 3.2e-42
Identity = 114/178 (64.04%), Postives = 118/178 (66.29%), Query Frame = 0
Query: 1 MISLGPCAYQNGSSFPFYSSNL-LNLRPIP---SNFALPLPTNSIILGKRNFRFRVVRAE 60
MISL P + NGSSF SSNL L PI SNF P T K FR +RAE
Sbjct: 1 MISLKPSPHHNGSSFLSNSSNLHQKLNPITRLRSNFPPPFRT------KTTFR---IRAE 60
Query: 61 QA-GGNSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFILFV 120
QA GN AQRRTFLT+EEAGLVEVSGLSTHERFLCRLT
Sbjct: 61 QAESGNGAQRRTFLTLEEAGLVEVSGLSTHERFLCRLT---------------------- 120
Query: 121 FLQISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 174
ISSLNLL+VIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSD F
Sbjct: 121 ---ISSLNLLKVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDPLF 144
BLAST of MS010816 vs. ExPASy TrEMBL
Match:
A0A0A0KHI4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G516880 PE=4 SV=1)
HSP 1 Score: 166.0 bits (419), Expect = 1.4e-37
Identity = 102/180 (56.67%), Postives = 110/180 (61.11%), Query Frame = 0
Query: 1 MISLGPCAYQNG--SSFPFYSSNLLNLRPIPSNFALPLPTNSIILGKRNF-----RFRVV 60
MISL P + NG +SFP Y+S+ PIP I + K NF FR
Sbjct: 1 MISLKPSTHHNGFDTSFP-YNSSKFQQNPIP----------IIRITKPNFSHTKTTFRTH 60
Query: 61 RAEQAGGNSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFIL 120
G Q+RTFLT+EEAGLVEVSGLSTHERFLCRLT
Sbjct: 61 AQPSGSGKGPQKRTFLTIEEAGLVEVSGLSTHERFLCRLT-------------------- 120
Query: 121 FVFLQISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 174
ISSLNLL+VIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSD F
Sbjct: 121 -----ISSLNLLKVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDPLF 144
BLAST of MS010816 vs. ExPASy TrEMBL
Match:
A0A1U8EE61 (uncharacterized protein LOC107844852 OS=Capsicum annuum OX=4072 GN=LOC107844852 PE=4 SV=1)
HSP 1 Score: 150.2 bits (378), Expect = 7.9e-33
Identity = 80/126 (63.49%), Postives = 90/126 (71.43%), Query Frame = 0
Query: 45 KRNFRFRVVRAEQAGGNSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSL 104
++ + VVRA G ++RR+FLT+EEAGLVE+SGLSTHERFLCRLT
Sbjct: 41 RKKMQSLVVRASSGGEKQSERRSFLTLEEAGLVELSGLSTHERFLCRLT----------- 100
Query: 105 INFQLPFILFVFLQISSLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVL 164
ISSLNLLRVIAE+E CSIEELNAGR+CDWFLKDKLKREQNLDSAVL
Sbjct: 101 --------------ISSLNLLRVIAEQEGCSIEELNAGRVCDWFLKDKLKREQNLDSAVL 141
Query: 165 QWDDSD 171
QWDDSD
Sbjct: 161 QWDDSD 141
BLAST of MS010816 vs. TAIR 10
Match:
AT4G21445.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, chloroplast stroma; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 130.2 bits (326), Expect = 1.6e-30
Identity = 66/113 (58.41%), Postives = 78/113 (69.03%), Query Frame = 0
Query: 61 NSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTVFLSLPLWPSLINFQLPFILFVFLQIS 120
N +RR+FL++ EAGLVE+SGL HE+FLCRLT IS
Sbjct: 67 NKPERRSFLSLAEAGLVEISGLGAHEKFLCRLT-------------------------IS 126
Query: 121 SLNLLRVIAEEEKCSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDLTF 174
SLNLLRVI+E+E CSIEELNAG++CDWFLKDKLKRE N++SAVLQWDD D F
Sbjct: 127 SLNLLRVISEQEGCSIEELNAGKICDWFLKDKLKREHNIESAVLQWDDPDFPF 154
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022142270.1 | 1.6e-72 | 85.55 | uncharacterized protein LOC111012432 [Momordica charantia] | [more] |
XP_038881387.1 | 1.0e-42 | 63.33 | uncharacterized protein LOC120072922 [Benincasa hispida] | [more] |
XP_023544671.1 | 2.3e-42 | 64.04 | uncharacterized protein LOC111804184 [Cucurbita pepo subsp. pepo] | [more] |
XP_022949732.1 | 3.0e-42 | 64.61 | uncharacterized protein LOC111453037 [Cucurbita moschata] >KAG6603726.1 hypothet... | [more] |
XP_022977984.1 | 6.6e-42 | 64.04 | uncharacterized protein LOC111478111 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CKG5 | 7.8e-73 | 85.55 | uncharacterized protein LOC111012432 OS=Momordica charantia OX=3673 GN=LOC111012... | [more] |
A0A6J1GCY7 | 1.4e-42 | 64.61 | uncharacterized protein LOC111453037 OS=Cucurbita moschata OX=3662 GN=LOC1114530... | [more] |
A0A6J1ISV2 | 3.2e-42 | 64.04 | uncharacterized protein LOC111478111 OS=Cucurbita maxima OX=3661 GN=LOC111478111... | [more] |
A0A0A0KHI4 | 1.4e-37 | 56.67 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G516880 PE=4 SV=1 | [more] |
A0A1U8EE61 | 7.9e-33 | 63.49 | uncharacterized protein LOC107844852 OS=Capsicum annuum OX=4072 GN=LOC107844852 ... | [more] |
Match Name | E-value | Identity | Description | |
AT4G21445.1 | 1.6e-30 | 58.41 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |