Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATTATTTTATACTTGAAAGAATTGGCTCTCAGTCAGAAATGAATGTGTGGTTCGACACTTGTTTTTCTATAATCAAGTTCAAGTCGGCCATGAATTGGTGACGGAAAAAGCTTCGGATTCTTATGTTTCTTCATCCGCTCCCGCCACCGTTCTAAAACCCACAAAACTTGTCGCCAATTCCTAATCAAAATCCAAATTCAAACATGATAACCAATATCAATCTTATCTCCTGCAACTCTTTCTCACCGTCTCCTCCTTCAAGATTCTCAAAGCTTGGAATTCTCCATAAAACCCAAACTCGAAACCCCCAAATCACCCCCTTCAAGTATTCGACATCGGTCTGCCCCGGCATCAACACTCGTCGAGATTCAAGCTACAGAAAAGTGGGTCTGCTCCAAAAATGGCGGAGTGCATCGGGAACTCAGAATGCGGGTGACCCAGTTGGGGAAAAAGCGACGCCGGTGGAAAGTGAGCGCGGCGGTAGCAGCGGCGGCGGGAATGGTGGGGAGGGAAGAGACTGGACGACTTCGATTTTACTGTTTGTATTGTGGGCTGGTCTTATGTTTTATGTGTTCATTCTCGCTCCAAATCAGACTCCGGTAATTACTTGATTGCCTTCTTCTTGTCTGTATGTTTGCTTAATGCATAAGCATCAAATGCTTATATGTTTGCCTGAATATTTGCAGTCAACGGACTTGTATTTCTTGAAGAAGCTTCTGAACTTGAAAAATGACGATGGTTTCAAAATGAATGAAGTGCTTGTCTCCCTTTGGTATATTATGGGGTTGTGGCCCCTTATCTACAGCATGCTGCTGCTTCCTTCTGGTAGAAGGTACCTTCATCTTCTTCCCCCAAATTTGGAATTCATTCATTCTGAGAATGTTTGAGCTACTGAGTTGATTGAACGCTTAAGCCTTTTATTTCCATGTCCAGCTTTTGGAGTTGATAATGAGTATCATAGATTGTAGTTGGGTAGCCTACATGTTAAGTGATTCAAGCTTTTTTCTCCAATGGAGTGTTTGAGATTATTAGCAAATCTTACATGTCATCTGCGAGATCTCACATTGGTTGGAGCGAGATCTCACATTGGTTGGAGAGTAGAACGAAACATTCCTTACAAGGGTGTGGAAACTTCTTCCTAGTAGACGCGTTTTAAAACCTTGAGGGGAAGCCCTGGAAGGGAAAATCGAAAGAGGACAATATCTGCTAGCGGTTGGTTTGTGTTGTTACAAATGGTATTAGAGCTTGACACCGGACGGTGTGCCAGCGAGGATGCTGGGCTCTTGTGAGATCCCACATTGGTTGGAGAGGGGAACGAAGCATTCCTTATAAGGGTGTGCAAACCTCTCCCTAGTAGACGCGTTTTAAAATCTTGAGGGGAAGCCCTTAAAGAGAAAACCCAAAGAGGACAATATCTGCTAGCGGTGCGTATGGGCTGTTACATCATCGAGGTCGATGCAGTGTGTGGTTAGCTTTAATCTGATGGTTACATGTAGCTTAAATACATTTTAGACTTCCAGTTCAATTCAAATTGCATCAGAGTGACATAATTTCTTGCAAGTGCAGTTCAAACAGCAATGTTCCTGTCTGGCCTTTCCTAGGACTGTCTTTCTTTTTGGGTGCTTATGGTCTTCTTCCATATTTTGTACTTTGGAAGCCGCCGCCGCCTCCTGTTGAAGAAGATGAGCTCGAGAGATGGCCTTTGAATTTTCTCGAGTCGAAATTTACTGCTGGGGTATGGTAAATTCTTCCCCTTACCATGAATGTATCCGGAAGAAGATTGTGATGGTGCTCTCTTTATCTTTACCGTTGGCATAGCCAAATATTATTTTCAGAACCATCTATGGCCTTTGATGGTTCTTATAATGGTGTGGAAACCTCTCCCTAATGAACGCGTTTTAAAATCTTGAGGGGAATCCTAGAAGGAAAAGCTCAAAGAAGACAATATCTGCTAGTGGAGGGCTTGAGTTGGTACAAATGGTATTAGGGCTAGACACCGGGCGGTGTGCCAACGAGGACGTTGGGCTCTCAAGGGGGTAGTTTGTGAGATCCCACGTCGGTTGGAGAGGAGAACGAAACATTCCTTATAAGAGTGTGGAAACCTCTCCCCAACGGTCTCAAGGGGGGTAGATTGTGAGATCTCAGGTCGGTTGGAGAGGGAACGAAACATTTCTTGTAAGAGTGTGGAAACCTCTCCCCAATGGATGTATTTTAAAACCTTGAGAGGAAGTTCAGAAGGGAAAGCCCAAAGAGGACAATATTTGCTAGCGGTGGGCTTGGGCTAGTACAAATGGTATTAGAGCCAGACATTGGGCATTGTGCCAACAAGGATGCTGGGCTCTCAAAGGGGGTGGATTGTGATATCCCACGTCGGTTGGAGAGGAAAACGAAACATTCCTTATAAGGGTGTGAAAACCTCTCCCTAATAGACGCGTTTTAGAACCTTGAGAACAATATCGGAAGGGAAAGCTCAAAGAGAACAATATCTGCTAGCGGTAGGCTTGAGCACACATCACTATTTTGCCCCAACCGTTCGATTAAAAATGAAAATTAACCAGGTTGGAATCAGATAACATTTGCTGCAGGACTAGGCTTATTATTCTACGCTGGATTAGCTGGTGAGAGTGCGTGGAAGGAATTCTATCAGTACTTCAGAGAAAGCAGATTTGTAAGTCTAGCAGTTTGTTTTTGGCATACCGTTATAGTTTTTTGCACCATGTCTTCATGTCTGTGGTGGGTACTCGCTATCTCGCGCTACCTATCAATTGTGTCGTTGAAACGCTTGATTTGTAGACTAGTTGAAGAACATTTTCTTGCTGCTGCTACTCCTGGAATCTGATTGATGATCAATAGTTTTGGTGTTTAGATATTTATATGAACTTTCTATCATTGCAACGTCATATAGCCTAACCTCGATCCCGAAACGTTGTTGATGCAGATCCATGCTACGAGCATTGATTTCATGCTGTTATCTTCATTTGCTCCGTTTTGGGTTTACAATGACATGTCTGCTCGAAAATGGTTCGATTACCATCTTGTTCTAATCCTTCTTCCTTATTCAATTCTATGTTAGCCTTTTGGACATTAATGTTTCATATCATTGCAATGAACTCATTTCAGGTATGACCAAGGTTCTTGGCTTCTTCCATTTTCGTTGGTGCCGTTCTTGGGTCCTGCCTTGTATCTCGTCCTACGACCAATGCCAACGACGACTCCCGTTCCACTCGACCGTGCTGCTTCTGAACCGAAATGATCTTGGAAAAGTGAACTGCAAAAAGGAAACTTGGCTGTAATGGTGGAAAGTAGCTTTCCAGGTAAAGGGAGAAGTTTTGTACTTCACGCACGGTAGATTAGGAAACAGGGACGTGAAATGATTTGTGTTGTTATGTTATTATGTTTATGGCACAATATATTATTCTTCCAACAGGGTTGATGTTGATGTTGATGCTGGCC
mRNA sequence
AATTATTTTATACTTGAAAGAATTGGCTCTCAGTCAGAAATGAATGTGTGGTTCGACACTTGTTTTTCTATAATCAAGTTCAAGTCGGCCATGAATTGGTGACGGAAAAAGCTTCGGATTCTTATGTTTCTTCATCCGCTCCCGCCACCGTTCTAAAACCCACAAAACTTGTCGCCAATTCCTAATCAAAATCCAAATTCAAACATGATAACCAATATCAATCTTATCTCCTGCAACTCTTTCTCACCGTCTCCTCCTTCAAGATTCTCAAAGCTTGGAATTCTCCATAAAACCCAAACTCGAAACCCCCAAATCACCCCCTTCAAGTATTCGACATCGGTCTGCCCCGGCATCAACACTCGTCGAGATTCAAGCTACAGAAAAGTGGGTCTGCTCCAAAAATGGCGGAGTGCATCGGGAACTCAGAATGCGGGTGACCCAGTTGGGGAAAAAGCGACGCCGGTGGAAAGTGAGCGCGGCGGTAGCAGCGGCGGCGGGAATGGTGGGGAGGGAAGAGACTGGACGACTTCGATTTTACTGTTTGTATTGTGGGCTGGTCTTATGTTTTATGTGTTCATTCTCGCTCCAAATCAGACTCCGTCAACGGACTTGTATTTCTTGAAGAAGCTTCTGAACTTGAAAAATGACGATGGTTTCAAAATGAATGAAGTGCTTGTCTCCCTTTGGTATATTATGGGGTTGTGGCCCCTTATCTACAGCATGCTGCTGCTTCCTTCTGGTAGAAGTTCAAACAGCAATGTTCCTGTCTGGCCTTTCCTAGGACTGTCTTTCTTTTTGGGTGCTTATGGTCTTCTTCCATATTTTGTACTTTGGAAGCCGCCGCCGCCTCCTGTTGAAGAAGATGAGCTCGAGAGATGGCCTTTGAATTTTCTCGAGTCGAAATTTACTGCTGGGATAACATTTGCTGCAGGACTAGGCTTATTATTCTACGCTGGATTAGCTGGTGAGAGTGCGTGGAAGGAATTCTATCAGTACTTCAGAGAAAGCAGATTTATCCATGCTACGAGCATTGATTTCATGCTGTTATCTTCATTTGCTCCGTTTTGGGTTTACAATGACATGTCTGCTCGAAAATGGTATGACCAAGGTTCTTGGCTTCTTCCATTTTCGTTGGTGCCGTTCTTGGGTCCTGCCTTGTATCTCGTCCTACGACCAATGCCAACGACGACTCCCGTTCCACTCGACCGTGCTGCTTCTGAACCGAAATGATCTTGGAAAAGTGAACTGCAAAAAGGAAACTTGGCTGTAATGGTGGAAAGTAGCTTTCCAGGTAAAGGGAGAAGTTTTGTACTTCACGCACGGTAGATTAGGAAACAGGGACGTGAAATGATTTGTGTTGTTATGTTATTATGTTTATGGCACAATATATTATTCTTCCAACAGGGTTGATGTTGATGTTGATGCTGGCC
Coding sequence (CDS)
ATGATAACCAATATCAATCTTATCTCCTGCAACTCTTTCTCACCGTCTCCTCCTTCAAGATTCTCAAAGCTTGGAATTCTCCATAAAACCCAAACTCGAAACCCCCAAATCACCCCCTTCAAGTATTCGACATCGGTCTGCCCCGGCATCAACACTCGTCGAGATTCAAGCTACAGAAAAGTGGGTCTGCTCCAAAAATGGCGGAGTGCATCGGGAACTCAGAATGCGGGTGACCCAGTTGGGGAAAAAGCGACGCCGGTGGAAAGTGAGCGCGGCGGTAGCAGCGGCGGCGGGAATGGTGGGGAGGGAAGAGACTGGACGACTTCGATTTTACTGTTTGTATTGTGGGCTGGTCTTATGTTTTATGTGTTCATTCTCGCTCCAAATCAGACTCCGTCAACGGACTTGTATTTCTTGAAGAAGCTTCTGAACTTGAAAAATGACGATGGTTTCAAAATGAATGAAGTGCTTGTCTCCCTTTGGTATATTATGGGGTTGTGGCCCCTTATCTACAGCATGCTGCTGCTTCCTTCTGGTAGAAGTTCAAACAGCAATGTTCCTGTCTGGCCTTTCCTAGGACTGTCTTTCTTTTTGGGTGCTTATGGTCTTCTTCCATATTTTGTACTTTGGAAGCCGCCGCCGCCTCCTGTTGAAGAAGATGAGCTCGAGAGATGGCCTTTGAATTTTCTCGAGTCGAAATTTACTGCTGGGATAACATTTGCTGCAGGACTAGGCTTATTATTCTACGCTGGATTAGCTGGTGAGAGTGCGTGGAAGGAATTCTATCAGTACTTCAGAGAAAGCAGATTTATCCATGCTACGAGCATTGATTTCATGCTGTTATCTTCATTTGCTCCGTTTTGGGTTTACAATGACATGTCTGCTCGAAAATGGTATGACCAAGGTTCTTGGCTTCTTCCATTTTCGTTGGTGCCGTTCTTGGGTCCTGCCTTGTATCTCGTCCTACGACCAATGCCAACGACGACTCCCGTTCCACTCGACCGTGCTGCTTCTGAACCGAAATGA
Protein sequence
MITNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQITPFKYSTSVCPGINTRRDSSYRKVGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLMFYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGRSSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITFAAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYDQGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK
Homology
BLAST of Carg16109 vs. NCBI nr
Match:
KAG7010788.1 (hypothetical protein SDJN02_27584 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 701.0 bits (1808), Expect = 4.9e-198
Identity = 341/341 (100.00%), Postives = 341/341 (100.00%), Query Frame = 0
Query: 1 MITNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQITPFKYSTSVCPGINTRRDSSYRK 60
MITNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQITPFKYSTSVCPGINTRRDSSYRK
Sbjct: 1 MITNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQITPFKYSTSVCPGINTRRDSSYRK 60
Query: 61 VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM 120
VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM
Sbjct: 61 VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM 120
Query: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR
Sbjct: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
Query: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF 240
SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF
Sbjct: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF 240
Query: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD
Sbjct: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
Query: 301 QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 342
QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK
Sbjct: 301 QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 341
BLAST of Carg16109 vs. NCBI nr
Match:
XP_023511914.1 (uncharacterized protein LOC111776784 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 696.8 bits (1797), Expect = 9.2e-197
Identity = 338/341 (99.12%), Postives = 340/341 (99.71%), Query Frame = 0
Query: 1 MITNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQITPFKYSTSVCPGINTRRDSSYRK 60
M+TNINLISCN+FSPSPPSRFSKLGILHKTQTRNPQI PFKYSTSVCPGINTRRDSSYRK
Sbjct: 1 MLTNINLISCNAFSPSPPSRFSKLGILHKTQTRNPQIIPFKYSTSVCPGINTRRDSSYRK 60
Query: 61 VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM 120
VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM
Sbjct: 61 VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM 120
Query: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR
Sbjct: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
Query: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF 240
SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF
Sbjct: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF 240
Query: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD
Sbjct: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
Query: 301 QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 342
QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK
Sbjct: 301 QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 341
BLAST of Carg16109 vs. NCBI nr
Match:
XP_022943893.1 (uncharacterized protein LOC111448482 [Cucurbita moschata])
HSP 1 Score: 693.3 bits (1788), Expect = 1.0e-195
Identity = 338/341 (99.12%), Postives = 338/341 (99.12%), Query Frame = 0
Query: 1 MITNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQITPFKYSTSVCPGINTRRDSSYRK 60
MI NINLISCNSFSPSPPSRFSKLGILHKTQTRNPQI PFKYSTSVCP INTRRDSSYRK
Sbjct: 1 MIPNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQIIPFKYSTSVCPRINTRRDSSYRK 60
Query: 61 VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM 120
VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM
Sbjct: 61 VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM 120
Query: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR
Sbjct: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
Query: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF 240
SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF
Sbjct: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF 240
Query: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD
Sbjct: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
Query: 301 QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 342
QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK
Sbjct: 301 QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 341
BLAST of Carg16109 vs. NCBI nr
Match:
XP_022985722.1 (uncharacterized protein LOC111483692 [Cucurbita maxima])
HSP 1 Score: 679.5 bits (1752), Expect = 1.5e-191
Identity = 331/341 (97.07%), Postives = 334/341 (97.95%), Query Frame = 0
Query: 1 MITNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQITPFKYSTSVCPGINTRRDSSYRK 60
MI+NINLISCNSFSPSPPSRFSKLGILHKTQTRNPQI PFKYSTSVCPGINTRRDSSYRK
Sbjct: 1 MISNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQIIPFKYSTSVCPGINTRRDSSYRK 60
Query: 61 VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM 120
VGLLQKWRSAS + NAG PVGEKATPVESERGGSS GGNGGEGRDWTTSILLFVLWAGLM
Sbjct: 61 VGLLQKWRSASASPNAGGPVGEKATPVESERGGSSNGGNGGEGRDWTTSILLFVLWAGLM 120
Query: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR
Sbjct: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
Query: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF 240
SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEE+ELERWPLNFLESKFTAGITF
Sbjct: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEEELERWPLNFLESKFTAGITF 240
Query: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD
Sbjct: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
Query: 301 QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 342
QGSWLLP SLVPFLGPALYLVLRPMPTTTPVPLD AASEPK
Sbjct: 301 QGSWLLPLSLVPFLGPALYLVLRPMPTTTPVPLDPAASEPK 341
BLAST of Carg16109 vs. NCBI nr
Match:
KAG6570949.1 (Protein NOI4, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 608.2 bits (1567), Expect = 4.3e-170
Identity = 297/313 (94.89%), Postives = 299/313 (95.53%), Query Frame = 0
Query: 29 KTQTRNPQITPFKYSTSVCPGINTRRDSSYRKVGLLQKWRSASGTQNAGDPVGEKATPVE 88
K + P+ P PGINTRRDSSYRKVGLLQKWRSASGTQNAGDPVGEKATPVE
Sbjct: 83 KPKLETPKSPPSSIRHRSAPGINTRRDSSYRKVGLLQKWRSASGTQNAGDPVGEKATPVE 142
Query: 89 SERGGSSGGGNGGEGRDWTTSILLFVLWAGLMFYVFILAPNQTPSTDLYFLKKLLNLKND 148
SERGGSSGGGNGGEGRDWTTSILLFVLWAGLMFYVFILAPNQTPSTDLYFLKKLLNLKND
Sbjct: 143 SERGGSSGGGNGGEGRDWTTSILLFVLWAGLMFYVFILAPNQTPSTDLYFLKKLLNLKND 202
Query: 149 DGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGRSSNSNVPVWPFLGLSFFLGAYGLLPYFV 208
DGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGRSSNSNVPVWPFLGLSFFLGAYGLLPYFV
Sbjct: 203 DGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGRSSNSNVPVWPFLGLSFFLGAYGLLPYFV 262
Query: 209 LWKPPPPPVEEDELERWPLNFLESKFTAGITFAAGLGLLFYAGLAGESAWKEFYQYFRES 268
LWKPPPPPVEEDELERWPLNFLESKFTAGITFAAGLGLLFYAGLAGESAWKEFYQYFRES
Sbjct: 263 LWKPPPPPVEEDELERWPLNFLESKFTAGITFAAGLGLLFYAGLAGESAWKEFYQYFRES 322
Query: 269 RFIHATSIDFMLLSSFAPFWVYNDMSARKWYDQGSWLLPFSLVPFLGPALYLVLRPMPTT 328
RFIHATSIDFMLLSSFAPFWVYNDMSARKWYDQGSWLLPFSLVPFLGPALYLVLRPMPTT
Sbjct: 323 RFIHATSIDFMLLSSFAPFWVYNDMSARKWYDQGSWLLPFSLVPFLGPALYLVLRPMPTT 382
Query: 329 TPVPLDRAASEPK 342
TPVPLDRAASEPK
Sbjct: 383 TPVPLDRAASEPK 395
BLAST of Carg16109 vs. ExPASy TrEMBL
Match:
A0A6J1FXH1 (uncharacterized protein LOC111448482 OS=Cucurbita moschata OX=3662 GN=LOC111448482 PE=4 SV=1)
HSP 1 Score: 693.3 bits (1788), Expect = 4.9e-196
Identity = 338/341 (99.12%), Postives = 338/341 (99.12%), Query Frame = 0
Query: 1 MITNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQITPFKYSTSVCPGINTRRDSSYRK 60
MI NINLISCNSFSPSPPSRFSKLGILHKTQTRNPQI PFKYSTSVCP INTRRDSSYRK
Sbjct: 1 MIPNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQIIPFKYSTSVCPRINTRRDSSYRK 60
Query: 61 VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM 120
VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM
Sbjct: 61 VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM 120
Query: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR
Sbjct: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
Query: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF 240
SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF
Sbjct: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF 240
Query: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD
Sbjct: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
Query: 301 QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 342
QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK
Sbjct: 301 QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 341
BLAST of Carg16109 vs. ExPASy TrEMBL
Match:
A0A6J1J5P7 (uncharacterized protein LOC111483692 OS=Cucurbita maxima OX=3661 GN=LOC111483692 PE=4 SV=1)
HSP 1 Score: 679.5 bits (1752), Expect = 7.3e-192
Identity = 331/341 (97.07%), Postives = 334/341 (97.95%), Query Frame = 0
Query: 1 MITNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQITPFKYSTSVCPGINTRRDSSYRK 60
MI+NINLISCNSFSPSPPSRFSKLGILHKTQTRNPQI PFKYSTSVCPGINTRRDSSYRK
Sbjct: 1 MISNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQIIPFKYSTSVCPGINTRRDSSYRK 60
Query: 61 VGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAGLM 120
VGLLQKWRSAS + NAG PVGEKATPVESERGGSS GGNGGEGRDWTTSILLFVLWAGLM
Sbjct: 61 VGLLQKWRSASASPNAGGPVGEKATPVESERGGSSNGGNGGEGRDWTTSILLFVLWAGLM 120
Query: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR
Sbjct: 121 FYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPSGR 180
Query: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGITF 240
SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEE+ELERWPLNFLESKFTAGITF
Sbjct: 181 SSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEEELERWPLNFLESKFTAGITF 240
Query: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD
Sbjct: 241 AAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKWYD 300
Query: 301 QGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 342
QGSWLLP SLVPFLGPALYLVLRPMPTTTPVPLD AASEPK
Sbjct: 301 QGSWLLPLSLVPFLGPALYLVLRPMPTTTPVPLDPAASEPK 341
BLAST of Carg16109 vs. ExPASy TrEMBL
Match:
A0A0A0KA76 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G113520 PE=4 SV=1)
HSP 1 Score: 571.6 bits (1472), Expect = 2.2e-159
Identity = 286/349 (81.95%), Postives = 303/349 (86.82%), Query Frame = 0
Query: 1 MITNINLISCNSFSPSPPSRFSKLGILH--KTQTRNPQ------ITPFKYSTSVCPGINT 60
MITN+NLISCN FSPS PSR SKL I H +TQTRNP+ ITPFK P N
Sbjct: 1 MITNLNLISCNFFSPSLPSRVSKLTITHQTQTQTRNPKTIRFPIITPFK----SYPNFN- 60
Query: 61 RRDSSYRKVGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILL 120
SS K+GL +KWRSASG+Q GDPV +PVE E GGS GGGNGGEGRDWTTSILL
Sbjct: 61 ---SSSSKMGLFRKWRSASGSQTTGDPVAANGSPVEGESGGSGGGGNGGEGRDWTTSILL 120
Query: 121 FVLWAGLMFYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYS 180
FVLWAGLMFYVF APNQTPSTDLYFLKKLLNLK+DDGFKMNEVLVSLWYIMGLWPL+YS
Sbjct: 121 FVLWAGLMFYVFNFAPNQTPSTDLYFLKKLLNLKSDDGFKMNEVLVSLWYIMGLWPLVYS 180
Query: 181 MLLLPSGRSSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLES 240
MLLLPSGRSSNSNVPVWPFL LSFFLGAYGLLPYFVLWKPPPPPVEED+L+RWPLNFLES
Sbjct: 181 MLLLPSGRSSNSNVPVWPFLVLSFFLGAYGLLPYFVLWKPPPPPVEEDDLKRWPLNFLES 240
Query: 241 KFTAGITFAAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYND 300
KFTAGITFAAGLG+LFY GLAGESAWKEFYQYFRESRFIHA SIDFMLLSSFAPFW+YND
Sbjct: 241 KFTAGITFAAGLGILFYGGLAGESAWKEFYQYFRESRFIHAMSIDFMLLSSFAPFWIYND 300
Query: 301 MSARKWYDQGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 342
MSARKWY+QGSWLLP SLVPFLGPALYLVLRP+P TP+PL+ AASEPK
Sbjct: 301 MSARKWYNQGSWLLPLSLVPFLGPALYLVLRPLPKVTPIPLNSAASEPK 341
BLAST of Carg16109 vs. ExPASy TrEMBL
Match:
A0A1S3CKU4 (uncharacterized protein LOC103502103 OS=Cucumis melo OX=3656 GN=LOC103502103 PE=4 SV=1)
HSP 1 Score: 567.0 bits (1460), Expect = 5.3e-158
Identity = 284/349 (81.38%), Postives = 301/349 (86.25%), Query Frame = 0
Query: 1 MITNINLISCNSFSPSPPSRFSKLGILHKTQT--------RNPQITPFKYSTSVCPGINT 60
MITN+NLISCN FSPS PSR SKL I H+TQT R+P I PFK+ P N
Sbjct: 1 MITNLNLISCNFFSPSLPSRLSKLTITHQTQTQTRNPKTLRSPSIIPFKFP----PNFN- 60
Query: 61 RRDSSYRKVGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILL 120
SS K+GL +KWRSASG+Q GDP EK +PVE E G GGGNGGEGRDWTTSILL
Sbjct: 61 ---SSSSKMGLFKKWRSASGSQTMGDPAAEKGSPVEGESG---GGGNGGEGRDWTTSILL 120
Query: 121 FVLWAGLMFYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYS 180
FVLWAGLMFYVF LAPNQTPSTDLYFLKKLLNLK+DDGFKMNEVLVSLWYIMGLWPL+YS
Sbjct: 121 FVLWAGLMFYVFNLAPNQTPSTDLYFLKKLLNLKSDDGFKMNEVLVSLWYIMGLWPLVYS 180
Query: 181 MLLLPSGRSSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLES 240
MLLLPSGRSSNSNVPVWPFL LSFFLGAYGLLPYFVLWKPPPPPVEED+L+RWPLNFLES
Sbjct: 181 MLLLPSGRSSNSNVPVWPFLVLSFFLGAYGLLPYFVLWKPPPPPVEEDDLKRWPLNFLES 240
Query: 241 KFTAGITFAAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYND 300
KFTAGITFAAGLG+L Y GLAGESAWKEFYQYFRESRFIHA SIDFMLLSSFAPFWVYND
Sbjct: 241 KFTAGITFAAGLGILCYGGLAGESAWKEFYQYFRESRFIHAMSIDFMLLSSFAPFWVYND 300
Query: 301 MSARKWYDQGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 342
MSARKWYDQGSWLLP SLVPFLGP+LYLVLRP+P TP+PL+ AASEPK
Sbjct: 301 MSARKWYDQGSWLLPLSLVPFLGPSLYLVLRPLPKATPIPLNSAASEPK 338
BLAST of Carg16109 vs. ExPASy TrEMBL
Match:
A0A6J1CIN4 (uncharacterized protein LOC111011318 OS=Momordica charantia OX=3673 GN=LOC111011318 PE=4 SV=1)
HSP 1 Score: 546.6 bits (1407), Expect = 7.4e-152
Identity = 271/343 (79.01%), Postives = 293/343 (85.42%), Query Frame = 0
Query: 1 MITNINLISCNSFSPSPPSRFSKLGILHKTQTR--NPQITPFKYSTSVCPGINTRRDSSY 60
MITN NLISCN SPS P R KLGI H+ QT+ QI PFK+ V P N RRDSS
Sbjct: 1 MITNFNLISCNFLSPSLPLRIPKLGIAHQNQTQKLRSQIIPFKFPIWVSPNFNNRRDSSS 60
Query: 61 RKVGLLQKWRSASGTQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVLWAG 120
++ LLQK R+ AGDP GEK+T VE+ER GGGNGGEGRDWTTSILLFV WA
Sbjct: 61 SRMSLLQKCRT------AGDPDGEKSTAVETER----GGGNGGEGRDWTTSILLFVFWAA 120
Query: 121 LMFYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLLLPS 180
LM+YVF LAPNQTPSTDLYFLKKLL LK+DDGFKMNEVLVSLWY+MGLWPL+Y MLLLPS
Sbjct: 121 LMYYVFNLAPNQTPSTDLYFLKKLLRLKSDDGFKMNEVLVSLWYLMGLWPLVYGMLLLPS 180
Query: 181 GRSSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFTAGI 240
GRSSNSNVPVWPFL LS FLGAYGLLPYFVLWKPPPPP+EED+L+RWPLNFLESKFTAGI
Sbjct: 181 GRSSNSNVPVWPFLVLSVFLGAYGLLPYFVLWKPPPPPIEEDDLKRWPLNFLESKFTAGI 240
Query: 241 TFAAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSARKW 300
TFAAGLG++FYAGLAGES WKEFYQYFRESRFIHA SIDFMLLSSFAPFWVYNDM+ARKW
Sbjct: 241 TFAAGLGIIFYAGLAGESVWKEFYQYFRESRFIHAMSIDFMLLSSFAPFWVYNDMTARKW 300
Query: 301 YDQGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEPK 342
Y+QGSWLLPFSL+P LGPALYLVLRP PTTTPVP++ A SEPK
Sbjct: 301 YNQGSWLLPFSLLPLLGPALYLVLRPSPTTTPVPVNTAPSEPK 333
BLAST of Carg16109 vs. TAIR 10
Match:
AT2G04360.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 361.3 bits (926), Expect = 8.5e-100
Identity = 183/345 (53.04%), Postives = 233/345 (67.54%), Query Frame = 0
Query: 1 MITNINLISCNSFSPSPPSRFSKLGILHKTQTRNPQITPFKYSTSVCPGINTRRDSSYRK 60
M+ +I+LISCN FS+L L K TR +T S+S+ + +R SS +
Sbjct: 1 MLGSISLISCN---------FSRLPRLLKPSTRPQTLT---QSSSLLLLLQSRASSSPHR 60
Query: 61 VGLLQKWRSASG-----TQNAGDPVGEKATPVESERGGSSGGGNGGEGRDWTTSILLFVL 120
+ L K+ ++ +P + T ++ EGRDW++SILLF L
Sbjct: 61 IALFPKYEKKVTFFHICKSSSNNPEEPEKTQIQD------------EGRDWSSSILLFAL 120
Query: 121 WAGLMFYVFILAPNQTPSTDLYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLIYSMLL 180
W L++Y F LAP+QTP+ DLYFLKKLLNLK DDGF+MN++LV LWYIMGLWPL+Y+MLL
Sbjct: 121 WGALLYYCFNLAPDQTPTQDLYFLKKLLNLKGDDGFRMNQILVGLWYIMGLWPLVYAMLL 180
Query: 181 LPSGRSSNSNVPVWPFLGLSFFLGAYGLLPYFVLWKPPPPPVEEDELERWPLNFLESKFT 240
LP+G S P WPF+ LSFF G Y LLPYF LW PP PPV E EL +WPLN LESK T
Sbjct: 181 LPTG---TSKTPAWPFVVLSFFGGVYALLPYFALWNPPSPPVSETELRQWPLNVLESKVT 240
Query: 241 AGITFAAGLGLLFYAGLAGESAWKEFYQYFRESRFIHATSIDFMLLSSFAPFWVYNDMSA 300
AG+T AGLG++ Y+ + W EFYQYFRES+FIH TS+DF LLS+FAPFWVYNDM+
Sbjct: 241 AGVTLVAGLGIILYSVVGNAGDWTEFYQYFRESKFIHVTSLDFCLLSAFAPFWVYNDMTT 300
Query: 301 RKWYDQGSWLLPFSLVPFLGPALYLVLRPMPTTTPVPLDRAASEP 341
RKW+D+GSWLLP S++PFLGP+LYL+LRP + T P D A+S+P
Sbjct: 301 RKWFDKGSWLLPVSVIPFLGPSLYLLLRPAVSETIAPKDTASSDP 318
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
KAG7010788.1 | 4.9e-198 | 100.00 | hypothetical protein SDJN02_27584 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_023511914.1 | 9.2e-197 | 99.12 | uncharacterized protein LOC111776784 [Cucurbita pepo subsp. pepo] | [more] |
XP_022943893.1 | 1.0e-195 | 99.12 | uncharacterized protein LOC111448482 [Cucurbita moschata] | [more] |
XP_022985722.1 | 1.5e-191 | 97.07 | uncharacterized protein LOC111483692 [Cucurbita maxima] | [more] |
KAG6570949.1 | 4.3e-170 | 94.89 | Protein NOI4, partial [Cucurbita argyrosperma subsp. sororia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1FXH1 | 4.9e-196 | 99.12 | uncharacterized protein LOC111448482 OS=Cucurbita moschata OX=3662 GN=LOC1114484... | [more] |
A0A6J1J5P7 | 7.3e-192 | 97.07 | uncharacterized protein LOC111483692 OS=Cucurbita maxima OX=3661 GN=LOC111483692... | [more] |
A0A0A0KA76 | 2.2e-159 | 81.95 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G113520 PE=4 SV=1 | [more] |
A0A1S3CKU4 | 5.3e-158 | 81.38 | uncharacterized protein LOC103502103 OS=Cucumis melo OX=3656 GN=LOC103502103 PE=... | [more] |
A0A6J1CIN4 | 7.4e-152 | 79.01 | uncharacterized protein LOC111011318 OS=Momordica charantia OX=3673 GN=LOC111011... | [more] |
Match Name | E-value | Identity | Description | |
AT2G04360.1 | 8.5e-100 | 53.04 | unknown protein; FUNCTIONS IN: molecular_function unknown; LOCATED IN: chloropla... | [more] |