Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDSinitialstart_codonintronterminalstop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATTCCGCTACTCAAATTCTCAAAAAAGAGCACAAAACCTTCGTTAGACGACGATTACCGCCAACCCCACTACCGCCGCCGCCGCCTTCTCTTTCTTCTTCCTCTTCGCCTCCTCTCAACCCAACTCTACCGGCCAAGCCCTCAGCCCCTTGTCCGGTAAAGAAAACTCGGGATTTGCCTAATTTTTCGGAATGTCATGCTTGTGGCTTTCGTGTTGACGCCGTTGACGGTCGTTCCAGGCTTAATTCTCTCTACAGCGAGTGGCGGATTGTTCTTCTTTGTAAGAAGTGCTTCTCTCTTGTCGAATCTTCTCAGGTTTGTTCTTACTGTTTTGCTGACTCGAGAGGCGACTCTTTTAATTGCTCTGAGTGTAACCGTCGGGTTCATCGGGAGTGTTTTTCCCTGTATAGTAGAGTTGCTCCCTGGTCGTACTCCTCTTCGGGTTCTGAGTTTTCTGTGTGTATTGATTGTTGGGTTCCTAAACCAATTGTGACAGCGAGGGCGGTTTTGAGAAGTAGGAAAATTAGGAGGAAGAACATCAATGTTTCGGATTTGCGGAGTTCTAAGGTTTCAACTAGCGGAAACTGCAAATCATTCAGTTCATTAGTGAAGGATGCAAATTGTTTGGCGGATAAGAATGTTGATGCTGCGGTTAGAGGTAGGGAACATGCGCTGAAAAAGGCTGCTGTGGTTAGGAGGGCTTCTGAGTTGGCTAGCGATGCCTTAAATTTGGTTGCCCAAAGGGATGAAACTGCTGCCAAAGAGTCTGGGGAATCTGCTGACGATGCCGAGTTGGCAATTCAGTTGCACCGGGCTATGAATAGTTCTCCAAGGTTATCCAAGAATTTTTGTTCGGCAAATTCGAACTACATGGCTTTTGAGAATACAAGGGTGGTGGATGATGGGGACACATCAATAGGAGAACTATGTAGTGAGGAGTTTGATTTCTTTAAAACCCCACAAGTTTTGATTAATAACAGCTTATGCAATTCTCCTGACAATGCTGCTTCTGAGCCTTCGGTCACGGCTAAAGATCATGTCGCACCGTTAGAAATTAATCATTTAGAATCCATGGGAAACAATCCTATACCGGCGAAGGGCAATGGTTGCTCGGTAAAGTGTGATACTGAGTCTGAGAATGAGGAATTGTCTCCTAAAGAAGATCTGAGGAGCAGGTCAATTGAATTAATGACTGCCGGTTGCAATCATGATAGGTTATGCTCTGAAGATAAGTACACTCTACCCAGGGATGAGAGATGCATTGCAAAGCCTTATCACTATTTCTTCAAGTATAAAAGGAGAGATACAACGAAACGCTATCTGTTGAAGTATAGCAAGAGAAAATCAAGATTGAAGAGAATGCCTGATTGCAAACCTAAAATTCTTGTTGATGGAATGTGCTTGGGCGTTCCGTCTTCCTCTGCAGCAATCCTAATTAGTACTGAGAAATTTCCAGTTATTTCAAATGCCTCATTTGGCTGCTGTGCTGTTCCTCTGCAAGCTTCTAGCCTGGGAGTGCATGCAGTTCAAGAAATCAGTGACCAAGGAGGCAGGTAATCCTCATGCTTTTTTCTGCAGCTGATTATTTGAGGTTTCCTTTATACCGTCTAGCATGCACTATAGTATGAACTGACAACTTTTGGAGGTAGTATTTAAATGAAAAATCTATGGATAATGTAATGACCTGATTTTCGATACCTCGAATCACATGTTGCCATGTACAATCAAGGGTTAAAAGGGTGGTAAAATTTTCTTTGGTAAAATGGGAGACACATAGACTTTAAAATTTAAAGATGTATTCAAAATCAACATGAAAGTAGTTTAAAGGTAATTTCTTGGGAGTCAAATCAAAACAGTTTATAAAAAAAATACACAAGTTTTGAGAAGTAGCATTTAAAATAATAATAAAATGACAAGAAAGACGACTTGATCTAAAGGGCACCCCTTGCAACAGTGGTCTTCACTCCATCTGAAAAAAAAAAATAATAAAAAAATAACATGGAATAATTTTAAAAATACTCCGTAAACAACCTACTTTTAGGCTTTTATCAGACTTAATCCTACGCATGGAAACTTGGTTATGTGCTCATAACTAGGCCTAGTGATAGTCAAACTGTGGCTTAATGTCTCTACCAAACTTTGTAGGTCATTGTGTTACTAACCCTCTGCATACCTCGTTTAAGAAGTATTCAATCCTTCTCAAGTTCACAATCTAGGGGTTTCCTGTTTTCTCTTGGCTCTAGGTTTCATGTATGTCTAGGTCCTTCTTCAAGTCGCTGTGGCCCATGGAAAGGCCCTAGGTCGTACTCTTAAGTCTGGAGGAGCCCTTTTGCTCTAAAGCGATGACCGTACTCCATGCATGTCTACCTATTGAGCTCTAATCGATGGATTCTTCCTAATGGTGTCGGACAACATCTACATGTTCTTAAGATCAATAAACTACTTGTCACTGTGCATTTCCTGCCTGAAATGTAAACTACCTGCTCATGTTCTCTCCTCAGGTTATAACAATAAAACTACTTGTCTTAGGAACAGTAAATACCTATTACTTCGTCTTTTTAGGCTAGAACAGTACACCACTTAGGATGGAACAGTAAAAAGACCATAGTTCTCCCCACTAGGCACCATCGTTAGTACTTCAATATTATACCCTTTATGACTCTTGAAATCCTTAGGTGATAACTAAGCCATGTGTAACAAGTAAACACTAACCTCTATTGGTTAGTTCACGAATACGGGTTGCGCCCTATCCGTCCCTACACGGTACTTGGTCAACCCAAGGAGGCTATGGAACTCATGACTAGGTACTAGGACTATACAGGAAGCAATATCTCATGGTCTATGGAAACCATAACTAGCACTCTATGACACATAGCAATAGTCCAACACATAATTGGGCAATAACGGGCTATCCAAACCCTAAATCATGCATAATAAACGTATAAAGTCATACAACATGCTCGTCATGCTCATTAAGTGATAATAATCCAACCTCAACATATTGGAAAGCATAAAAGCCTAATGATCTATCAGAGTTCACATATCATGTATGCAAAAGCTATATAACTGTAAATAGGGAAGTTGTAAGCTATTCATATTATCTAAGCACAATTATCATGAGACTAGGTCAAATTAGGCTCCTAAACATAATACTTAATCATTGTTCTCAATCCCAATCGACTCCGGCAAGTATTTCTTATAAACTTGCTCCATATGGGTACTTGATTGTAGACTTCCTGAGCTTGTTTTGGGTCTTCGAGGATATTCCAAATTTCTCCAAAATTCCTCAATATTCCCTAAATTAACTGAAAAGAACAAAAGGGGTGAAAATGACTACCTTCAAATTCGTTAAAAACAAGGTTGAGAAACTAACCGTTGACTGTACAGTTATTGTCTCCAACTTGTGTGCATTCCATTCTACATTCCTTCCTTCCTTTCTTTTTAATTTATACTTCGAAAAAAAATCTTTTATTTATTTTCTTAAAATTCTAGGTGTTACAGGTAATCACTTATTTGTATTTAAGCTAGCTCTAGAGTTGCATACAAATTTTAGTGAGTTTGCTTAGGGTAGAAGTACTCTGTTAATGTTTCACCTCTTTCAATTCAAATTTTGTTCAATATATGATTCGTGATGAGTACTATCTGTATTGTAAATAGTTTCTGCTAGTTCAATGTAATGAGCATACGTTACTAGTCTAGTACCACCAAAATCACCTTCAACCTCTTGGACAGTAAGGGGTCCTTCCATAAAACTTATTAATATATCTTCCCCTGCTCAGTACTGGGGTTGTTGGTGGCTGCAAAAAGTGTCACAGGTCATATAGCCAACTACTTACCTTGTTGGAAGGTTGTACAATTTCCTTGTTGCTTGATGATACTCTTAAAAAATTCTTGTATGTTCACCCATCAAACTATCATGGAAATCATGTAAAATCAATCTGAATAGTAGCTAATGTTAAGGTATTACTTTTTTTTCCTTTTTTGTTTTTGATGAATAAATATGCTTTCATTGAGAATGAATAAAAGAATAGCTACGTCACCAAAGCCTACATAGAAGCATTAAATCTAATAAGGGACCAAAGATCGCTAGGAGAATATTTGATCCTTCTAAAAACTTTATTGTTCCTCTCGCCCTAGAGGCCCCATAAATAGCAAACACCTCAGCCTGCCACAAAAAACTGCCCCTATGAATATCTAACATGATGCCAAACTCCTAAAATGGCTTGCTCCACATGAAATAATCAATATCACAGCCCCAAAGAGCTCTAATGCCTTCGTTCTCTGAGTAACACCTACCTAGAGGTCACGTATTTGAATTTTTGAGTGAGCTTAATATGGAAAAGCTTTGATGTCTCCAAAGTCTAGGTCATGGAGCAGGTGCAGACTACACAATATTAGGAAAAAGGAGTTGCAAGAAATTTGGAATCATCTTGGCTAGCATTGGGTGTTCAATAAGGGTCATGGGTTTATAGAGGGAATAAGTCCCATCCATGGTGGCTACCTACCTAAGATTCACTATCCTGCGAGTTTCTTGGACACCAACATGTTGTTGGGTTAGGTGAGGTGTTCGTGAGATTAGTCGAGGTGTGCACAAACTGTCCTGGACACTCATGAATTAAAAAAATTAGTTAACCAAGAGCATGACAGGCTTTTTTGACTTCAAACTTTTCTTTTCTTTTTAAATGTTTATTCCATGAGAGCAACTAAAGAATGAACTGTGATCTTTGCTCCACAATTTTTTTTTTTTTTTTTTAATTGAAAAAAGTAGATCAGCAAATATCTTAAGGAAACATTTTGTGTGCCCAATTCCAAAGAAATCTAAAGTTTTCTTGATAATCAAGCTTTGACCAATTTGTCATTTTGATTCGGTCAGCCTTTGGCCTATGAGCCCTTTTGATAACACTAAAGTAGGGACAAGATCAGATACTCCTTTAAGGCCTTGAAGAGGTTGCCTTGCAGTTTATCCCAAATGTAGTTGGTACTTCCTCTTAAGTATCTTAAAGAACACCAAACACTCGTTGGACAACTTTGAGTTGAACTAACACACCAGTTAATGATGAATTCATCCTAGAGATACTAAGTGGAAGGAAAAAGCTCATTTCTTGGATTTTCTTTGAATTCTCTTTCACTCAGGTTGCCAAAAGGCTTAAAGAACAAAGATCTGAAAGGTTTGAACAAGTTAATTGGTAACTACACTTCTCCAAACAAAATTGAAAGCCACAATTTACAGGGCTATAGATGTTCCAAATATTTGTAAAAACTGATAAACACACTATTTCTGCTATGGAAAAAGCAACTTGATCAATCTAGACCGCCCATAACTCTCTTTTGAAAACGTTGAATGGTATATTAATTGTCTAGTTAGGCTTACATGTTATGAGACTTGGAAAATTAAGCAGTTTCTTTGTCAATACCTCTCCTTTTGAAGATGAATCCATATTAGATCACTTTCTTTGACTTAAATGTCTTTGATGCATGTTGTCATCCTTCCTGTCCTTGTTTTTTTTTTTAAAAAAATATCTTATCTCAACTCGTTCTAATCTTGAAATTCTGGGCATATTTATCATTACTATAATATGAGCAATAGAATGCAGAACAAAGTCCAGCGTACTGATAGGTCTTTAAATCTAAGGACATAAACCACTTTAGGACCATAATGTGTAGTTTGCTTGTAGATTGATCATAAGAAAATTCCATAAGAGGTTGGATGAGATCCCATTGACGAGTATTCTTTCCTGCATAATGTCTTAGAAGATACCTCAAACTTCAGTTGACAACTCAGGATTTACCATTAGTTCTTTTGAGGATGAAAAGAAGAACTAAGACGTGTGCATAAATGACTTAAAAATTTTGAACTTTTCATCAAATAACAATTTTCATTGATAAAATGAATAATTACAAAGAAAGGATGGTTCATTTTTCTGCACTTCTTTGGTTCTTTTTGTTGTTCTTGTAGGCTTTTGAGTGTTGGAGTTTGGAGTAAGCTCTTTATATTTCTGGTTTTTTGGGATGGATGGGCTGTGATTGTTTTATTTTTGGTAGTTTATTTATTTATTTTATTTTGGTATCTGTAATCAATATTTTTAACCTATTCTAAACCATAAACTAGAAATTCTTAGAACTCTTGACAACATGACAATTGTGTATATATTCTCATACCCTATATCTCTTTCGACCTATTGCCAATAAAAGTTTTGTTAAATAAGGGTCAAAGTTTTATCCTTCCCAAATTTCCTCTAAGCTACCTCACTTGAAGTGAGAATGAATTCTCTTACTGGCAATGTGGAACATAAAGATGGTTAGCTTTAAAGAGAAACCCCACATGAAAATTATTTTTGTTAATAGGAAACAATAACCGTTCATTGATATGATGAAATGGGATACAAAAGAAGCTCCTATTGAGAAATAATTATAAAAAAAACACCTCCCTAGCAAGGGTGTCGAGGTTATAATACAAAAGGGGAGCAAGCTAAAAGAGAAGTAGAATCAATAAATAGGCATTGCCACGATTCTTGTCCAAGAAGATGCGAGTGTTTTGTTCCGACCAAAGACTCCAACAGAAAACACGAACATGGGTAAGCCAAAGGATCTTTTTGTTCGTTTTGAAAGAGGGTGCCCCAAAAAAGAAGATTATTATTCTAGTAAGGAAAGCTCCAACCAAAAACACCTTGTAAGAAATTCCTGAATTTGGTGGCAAAGGAAATTACTGAAGAGGTGGAATGATGTTTCATTGTTGCTATAGCATATAGGGCAGCAGCTGGGGTATAAGTATAGCCATTGAGGACGGCATTGGAGCTTAACAAAAGTATTAATGCTAAGAGTGGCTAAGCTCCCATATAATGAATTTCACCTTTCTTGGACAACGACCAACCCAAATATGGTTATATAGCTGATTTCGTGCACACATTGGCTGTTGATAAAGCCCTTGAGAATGATTTCGATGGGAAAATACCATCCTTTTCCAAAGCCGAGTGCCGAGAGTCTTGGGTTGAATCATGAAGGGGCAATTAAATAGTGGATAAAGTGGCGCGATCTTCAATTTCAGGCTTCATGAGATTTCTACGGAGATGAAGGATCCCAAAGTACAGCAATAGGTGTGTCTTTCTTGCTCGATAAGTACTAAAGCCCAGGGAAAGTTTCTGACGATTTGAGACTCCTATGCAAACATCCTTCCTAAAGGATGTGCGGTGATTGTTACCGGCCTAGGAAATATGTGCATAAAGCATTTGTTTCAAGTTATTTGTGCATTTTAACGACTTGTAGGAAGAATGCATGGTTTTGGAATCCAGTTTAAGGTCAAAGTGGATGGAACCATATTTGGAGGCAATAACCTAACACCAATGAGCCTCCTGCTTGTTGATACTAATTTATTGGCAAGATGGGACAACTGCAAGTTGATACTGATGTCCATCAAATTTTCGGCAATTTATGTTGAGGCTCGTTGCTTCTTCAAAGCGTTTGATAATGTTGAAAAGGTTACGAAAAGAGTAGTGCCATCGACAAACTGTAGATGAGTAATAGGGATGACATTGGGCTCTTTACCGACTTTGAATCCTTTGATCCAACCATCTTGTTCAGCATAGTAAGCGATGGGTTGTAGTTCCCTCCACTCGGGATATAGTAACACCGCTGTACCTTCATCGGCCACTCAGCTTGGCTGAACCCAGGATTCACCTTCCACAACATAATATCCTCTTTTGTGTTATTTAAGGAACAGTTCTCGGTCCTTCCATATTGGAATCAACCATATTCCTCGATTCTAGATATGCATTATGCTTTCTTAATACTTGAGCTTTGTTGAGTTCATCTCTCACTTCTTATCATAGTTCTCAACTTGATTGTCAAAGTTACATTGGTAAAGACGACAATGGTTTAAACATCCTTTTTCTTTTGTACATAGGCGATTGTTAATGATATGTTCTTCCCTTCTATAAACCGTATAAAGCCAAGCCATTCGATAAATTCATTCGGCATCATTCTTCTTTCAATGTCTAGGTTCCGATAATATTGAAAGAACGATATATGATGGCAGTTACTAGTAGTCTTTTTGGGATGATAAATCCAAGTTTCATCCAGCTTGGCGGTTCTATTCTATAACTCTATTTGGTTGCATCTTCTCTATTATGGTTTACTCAAATCTTGAATCTACATGGATGATGTCCCATTATTTTTATTTTGTTTAATCAGATGTTCTTTCCTCCCTTGGGTTTTTACGTTTAGTTATTTCGAGGCGTTTAGACATTCTTAGGCTCGCTAACTGAGACTAGTTCCAATTTGCAAAGACGCCTCGATGAGAAGAATGTTCGAAAATGTTATCTCTGATTTACAGTTGTTGTTACAATGTAATTATGTGTTGCTCATATGAACAGGTAA
mRNA sequence
ATGGATTCCGCTACTCAAATTCTCAAAAAAGAGCACAAAACCTTCGTTAGACGACGATTACCGCCAACCCCACTACCGCCGCCGCCGCCTTCTCTTTCTTCTTCCTCTTCGCCTCCTCTCAACCCAACTCTACCGGCCAAGCCCTCAGCCCCTTGTCCGGTAAAGAAAACTCGGGATTTGCCTAATTTTTCGGAATGTCATGCTTGTGGCTTTCGTGTTGACGCCGTTGACGGTCGTTCCAGGCTTAATTCTCTCTACAGCGAGTGGCGGATTGTTCTTCTTTGTAAGAAGTGCTTCTCTCTTGTCGAATCTTCTCAGGTTTGTTCTTACTGTTTTGCTGACTCGAGAGGCGACTCTTTTAATTGCTCTGAGTGTAACCGTCGGGTTCATCGGGAGTGTTTTTCCCTGTATAGTAGAGTTGCTCCCTGGTCGTACTCCTCTTCGGGTTCTGAGTTTTCTGTGTGTATTGATTGTTGGGTTCCTAAACCAATTGTGACAGCGAGGGCGGTTTTGAGAAGTAGGAAAATTAGGAGGAAGAACATCAATGTTTCGGATTTGCGGAGTTCTAAGGTTTCAACTAGCGGAAACTGCAAATCATTCAGTTCATTAGTGAAGGATGCAAATTGTTTGGCGGATAAGAATGTTGATGCTGCGGTTAGAGGTAGGGAACATGCGCTGAAAAAGGCTGCTGTGGTTAGGAGGGCTTCTGAGTTGGCTAGCGATGCCTTAAATTTGGTTGCCCAAAGGGATGAAACTGCTGCCAAAGAGTCTGGGGAATCTGCTGACGATGCCGAGTTGGCAATTCAGTTGCACCGGGCTATGAATAGTTCTCCAAGGTTATCCAAGAATTTTTGTTCGGCAAATTCGAACTACATGGCTTTTGAGAATACAAGGGTGGTGGATGATGGGGACACATCAATAGGAGAACTATGTAGTGAGGAGTTTGATTTCTTTAAAACCCCACAAGTTTTGATTAATAACAGCTTATGCAATTCTCCTGACAATGCTGCTTCTGAGCCTTCGGTCACGGCTAAAGATCATGTCGCACCGTTAGAAATTAATCATTTAGAATCCATGGGAAACAATCCTATACCGGCGAAGGGCAATGGTTGCTCGGTAAAGTGTGATACTGAGTCTGAGAATGAGGAATTGTCTCCTAAAGAAGATCTGAGGAGCAGGTCAATTGAATTAATGACTGCCGGTTGCAATCATGATAGGTTATGCTCTGAAGATAAGTACACTCTACCCAGGGATGAGAGATGCATTGCAAAGCCTTATCACTATTTCTTCAAGTATAAAAGGAGAGATACAACGAAACGCTATCTGTTGAAGTATAGCAAGAGAAAATCAAGATTGAAGAGAATGCCTGATTGCAAACCTAAAATTCTTGTTGATGGAATGTGCTTGGGCGTTCCGTCTTCCTCTGCAGCAATCCTAATTAGTACTGAGAAATTTCCAGTTATTTCAAATGCCTCATTTGGCTGCTGTGCTGTTCCTCTGCAAGCTTCTAGCCTGGGAGTGCATGCAGTTCAAGAAATCAGTGACCAAGGAGGCAGGTAA
Coding sequence (CDS)
ATGGATTCCGCTACTCAAATTCTCAAAAAAGAGCACAAAACCTTCGTTAGACGACGATTACCGCCAACCCCACTACCGCCGCCGCCGCCTTCTCTTTCTTCTTCCTCTTCGCCTCCTCTCAACCCAACTCTACCGGCCAAGCCCTCAGCCCCTTGTCCGGTAAAGAAAACTCGGGATTTGCCTAATTTTTCGGAATGTCATGCTTGTGGCTTTCGTGTTGACGCCGTTGACGGTCGTTCCAGGCTTAATTCTCTCTACAGCGAGTGGCGGATTGTTCTTCTTTGTAAGAAGTGCTTCTCTCTTGTCGAATCTTCTCAGGTTTGTTCTTACTGTTTTGCTGACTCGAGAGGCGACTCTTTTAATTGCTCTGAGTGTAACCGTCGGGTTCATCGGGAGTGTTTTTCCCTGTATAGTAGAGTTGCTCCCTGGTCGTACTCCTCTTCGGGTTCTGAGTTTTCTGTGTGTATTGATTGTTGGGTTCCTAAACCAATTGTGACAGCGAGGGCGGTTTTGAGAAGTAGGAAAATTAGGAGGAAGAACATCAATGTTTCGGATTTGCGGAGTTCTAAGGTTTCAACTAGCGGAAACTGCAAATCATTCAGTTCATTAGTGAAGGATGCAAATTGTTTGGCGGATAAGAATGTTGATGCTGCGGTTAGAGGTAGGGAACATGCGCTGAAAAAGGCTGCTGTGGTTAGGAGGGCTTCTGAGTTGGCTAGCGATGCCTTAAATTTGGTTGCCCAAAGGGATGAAACTGCTGCCAAAGAGTCTGGGGAATCTGCTGACGATGCCGAGTTGGCAATTCAGTTGCACCGGGCTATGAATAGTTCTCCAAGGTTATCCAAGAATTTTTGTTCGGCAAATTCGAACTACATGGCTTTTGAGAATACAAGGGTGGTGGATGATGGGGACACATCAATAGGAGAACTATGTAGTGAGGAGTTTGATTTCTTTAAAACCCCACAAGTTTTGATTAATAACAGCTTATGCAATTCTCCTGACAATGCTGCTTCTGAGCCTTCGGTCACGGCTAAAGATCATGTCGCACCGTTAGAAATTAATCATTTAGAATCCATGGGAAACAATCCTATACCGGCGAAGGGCAATGGTTGCTCGGTAAAGTGTGATACTGAGTCTGAGAATGAGGAATTGTCTCCTAAAGAAGATCTGAGGAGCAGGTCAATTGAATTAATGACTGCCGGTTGCAATCATGATAGGTTATGCTCTGAAGATAAGTACACTCTACCCAGGGATGAGAGATGCATTGCAAAGCCTTATCACTATTTCTTCAAGTATAAAAGGAGAGATACAACGAAACGCTATCTGTTGAAGTATAGCAAGAGAAAATCAAGATTGAAGAGAATGCCTGATTGCAAACCTAAAATTCTTGTTGATGGAATGTGCTTGGGCGTTCCGTCTTCCTCTGCAGCAATCCTAATTAGTACTGAGAAATTTCCAGTTATTTCAAATGCCTCATTTGGCTGCTGTGCTGTTCCTCTGCAAGCTTCTAGCCTGGGAGTGCATGCAGTTCAAGAAATCAGTGACCAAGGAGGCAGGTAA
Protein sequence
MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDLPNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSFNCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKNINVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELASDALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVVDDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMGNNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDERCIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILISTEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR
Homology
BLAST of Csor.00g156540 vs. NCBI nr
Match:
KAG6571814.1 (hypothetical protein SDJN03_28542, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1026 bits (2652), Expect = 0.0
Identity = 519/519 (100.00%), Postives = 519/519 (100.00%), Query Frame = 0
Query: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL
Sbjct: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
Query: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF
Sbjct: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
Query: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN
Sbjct: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
Query: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS
Sbjct: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
Query: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV
Sbjct: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
Query: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG
Sbjct: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
Query: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDER 420
NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDER
Sbjct: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDER 420
Query: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI
Sbjct: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
Query: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR
Sbjct: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
BLAST of Csor.00g156540 vs. NCBI nr
Match:
KAG7011508.1 (hypothetical protein SDJN02_26414, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1023 bits (2644), Expect = 0.0
Identity = 518/519 (99.81%), Postives = 518/519 (99.81%), Query Frame = 0
Query: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL
Sbjct: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
Query: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF
Sbjct: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
Query: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN
Sbjct: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
Query: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS
Sbjct: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
Query: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV
Sbjct: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
Query: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG
Sbjct: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
Query: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDER 420
NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDE
Sbjct: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDEI 420
Query: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI
Sbjct: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
Query: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR
Sbjct: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
BLAST of Csor.00g156540 vs. NCBI nr
Match:
XP_022952090.1 (uncharacterized protein LOC111454852 [Cucurbita moschata])
HSP 1 Score: 1017 bits (2629), Expect = 0.0
Identity = 515/519 (99.23%), Postives = 516/519 (99.42%), Query Frame = 0
Query: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLP KPSAPCPVKKTRDL
Sbjct: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPVKPSAPCPVKKTRDL 60
Query: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF
Sbjct: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
Query: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN
Sbjct: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
Query: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS
Sbjct: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
Query: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV
Sbjct: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
Query: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEIN LESMG
Sbjct: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINRLESMG 360
Query: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDER 420
NNPIPAKGNGCSVKCDTESEN ELSPKEDLRSRSIELM+AGCNHDRLCSEDKYTLPRDER
Sbjct: 361 NNPIPAKGNGCSVKCDTESENVELSPKEDLRSRSIELMSAGCNHDRLCSEDKYTLPRDER 420
Query: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI
Sbjct: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
Query: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR
Sbjct: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
BLAST of Csor.00g156540 vs. NCBI nr
Match:
XP_023554148.1 (uncharacterized protein LOC111811499 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1013 bits (2618), Expect = 0.0
Identity = 512/519 (98.65%), Postives = 515/519 (99.23%), Query Frame = 0
Query: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL
Sbjct: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
Query: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF
Sbjct: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
Query: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN
Sbjct: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
Query: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
INVSDLRSSKVSTSGNCKS SSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS
Sbjct: 181 INVSDLRSSKVSTSGNCKSLSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
Query: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV
Sbjct: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
Query: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
DDGDTSIGELCSEEFDFFKTPQV INNSLCNSPDNAASEPSVTAKDHVAPLEIN LESMG
Sbjct: 301 DDGDTSIGELCSEEFDFFKTPQVFINNSLCNSPDNAASEPSVTAKDHVAPLEINRLESMG 360
Query: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDER 420
NNP+PAKGNGCSVKCDTESEN ELSPKEDLRSRSIELM+AGCNHDRLCSEDKY+LPRDER
Sbjct: 361 NNPVPAKGNGCSVKCDTESENVELSPKEDLRSRSIELMSAGCNHDRLCSEDKYSLPRDER 420
Query: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI
Sbjct: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
Query: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR
Sbjct: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
BLAST of Csor.00g156540 vs. NCBI nr
Match:
XP_023554146.1 (uncharacterized protein LOC111811499 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1013 bits (2618), Expect = 0.0
Identity = 512/519 (98.65%), Postives = 515/519 (99.23%), Query Frame = 0
Query: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL
Sbjct: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
Query: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF
Sbjct: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
Query: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN
Sbjct: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
Query: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
INVSDLRSSKVSTSGNCKS SSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS
Sbjct: 181 INVSDLRSSKVSTSGNCKSLSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
Query: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV
Sbjct: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
Query: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
DDGDTSIGELCSEEFDFFKTPQV INNSLCNSPDNAASEPSVTAKDHVAPLEIN LESMG
Sbjct: 301 DDGDTSIGELCSEEFDFFKTPQVFINNSLCNSPDNAASEPSVTAKDHVAPLEINRLESMG 360
Query: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDER 420
NNP+PAKGNGCSVKCDTESEN ELSPKEDLRSRSIELM+AGCNHDRLCSEDKY+LPRDER
Sbjct: 361 NNPVPAKGNGCSVKCDTESENVELSPKEDLRSRSIELMSAGCNHDRLCSEDKYSLPRDER 420
Query: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI
Sbjct: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
Query: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR
Sbjct: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
BLAST of Csor.00g156540 vs. ExPASy TrEMBL
Match:
A0A6J1GKS5 (uncharacterized protein LOC111454852 OS=Cucurbita moschata OX=3662 GN=LOC111454852 PE=4 SV=1)
HSP 1 Score: 1017 bits (2629), Expect = 0.0
Identity = 515/519 (99.23%), Postives = 516/519 (99.42%), Query Frame = 0
Query: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLP KPSAPCPVKKTRDL
Sbjct: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPVKPSAPCPVKKTRDL 60
Query: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF
Sbjct: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
Query: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN
Sbjct: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
Query: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS
Sbjct: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
Query: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV
Sbjct: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
Query: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEIN LESMG
Sbjct: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINRLESMG 360
Query: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDER 420
NNPIPAKGNGCSVKCDTESEN ELSPKEDLRSRSIELM+AGCNHDRLCSEDKYTLPRDER
Sbjct: 361 NNPIPAKGNGCSVKCDTESENVELSPKEDLRSRSIELMSAGCNHDRLCSEDKYTLPRDER 420
Query: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI
Sbjct: 421 CIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAILI 480
Query: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR
Sbjct: 481 STEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
BLAST of Csor.00g156540 vs. ExPASy TrEMBL
Match:
A0A6J1I871 (uncharacterized protein LOC111470557 OS=Cucurbita maxima OX=3661 GN=LOC111470557 PE=4 SV=1)
HSP 1 Score: 995 bits (2573), Expect = 0.0
Identity = 506/520 (97.31%), Postives = 510/520 (98.08%), Query Frame = 0
Query: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPP-SLSSSSSPPLNPTLPAKPSAPCPVKKTRD 60
MDSATQILKKEHKTFVRRRLPPTPLPPPPP SLSSSSSPPLNPTLPAKPSAPCPVKKTRD
Sbjct: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRD 60
Query: 61 LPNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDS 120
LPNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDS
Sbjct: 61 LPNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDS 120
Query: 121 FNCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRK 180
FNC ECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRK
Sbjct: 121 FNCCECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRK 180
Query: 181 NINVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELA 240
N+NVSDLRSSKVSTSGNCKS SSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASEL
Sbjct: 181 NVNVSDLRSSKVSTSGNCKSLSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELV 240
Query: 241 SDALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRV 300
SDALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRV
Sbjct: 241 SDALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRV 300
Query: 301 VDDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESM 360
VDD DTSIGELCSEEFDFFKTPQVL+NNSLCNSPDNAASEPSVTAKDHVAPLEIN LESM
Sbjct: 301 VDDRDTSIGELCSEEFDFFKTPQVLVNNSLCNSPDNAASEPSVTAKDHVAPLEINRLESM 360
Query: 361 GNNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSEDKYTLPRDE 420
GNNPIP KGNG SVKCDTESEN ELSPKEDLRSRSIELM+AGCNHDRLCSEDKYTLPRDE
Sbjct: 361 GNNPIPEKGNGWSVKCDTESENVELSPKEDLRSRSIELMSAGCNHDRLCSEDKYTLPRDE 420
Query: 421 RCIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPSSSAAIL 480
RCIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDC+PKILVDGMCLGVPSSSAAIL
Sbjct: 421 RCIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCRPKILVDGMCLGVPSSSAAIL 480
Query: 481 ISTEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
ISTEKFPVISNASFGCCAVPLQAS LGVHAVQEISDQGGR
Sbjct: 481 ISTEKFPVISNASFGCCAVPLQASGLGVHAVQEISDQGGR 520
BLAST of Csor.00g156540 vs. ExPASy TrEMBL
Match:
A0A1S3BG91 (uncharacterized protein LOC103489686 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103489686 PE=4 SV=1)
HSP 1 Score: 840 bits (2169), Expect = 4.94e-304
Identity = 431/527 (81.78%), Postives = 467/527 (88.61%), Query Frame = 0
Query: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
M+SA +I+KKEHKTFVRRRLPPTP PPPPSLSSSSS PLNPTL PS PCP KKTRDL
Sbjct: 1 MESAVKIVKKEHKTFVRRRLPPTP--PPPPSLSSSSSAPLNPTLIPNPSLPCPQKKTRDL 60
Query: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
PNFSECH+CGFR+D VDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADS GDSF
Sbjct: 61 PNFSECHSCGFRIDTVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSTGDSF 120
Query: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
C ECNRRVHRECFS YSRVAPWSYSSSGS FSVCIDCWVPKPIVTARAVLRSRKIRRKN
Sbjct: 121 ICCECNRRVHRECFSQYSRVAPWSYSSSGSVFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
Query: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
+NVSDLRSSKVSTSGNCKS S+LVKDANCL +K VDAAVR REHALKKAAV RRAS LAS
Sbjct: 181 VNVSDLRSSKVSTSGNCKSLSALVKDANCLVEKKVDAAVRAREHALKKAAVARRASALAS 240
Query: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
DALNLVAQRDE+AAKESG+SA+DAELAIQLHRAMNSSPR SKN CS NSNYM F+NTRV
Sbjct: 241 DALNLVAQRDESAAKESGDSAEDAELAIQLHRAMNSSPRFSKNLCSTNSNYMDFDNTRV- 300
Query: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
DDG+TS G L S EFDFFK P VL+NN++CNSPDN ASEPSVTAKDHV+PLE NHLE +G
Sbjct: 301 DDGETSAGALFSGEFDFFKAPPVLVNNNICNSPDNTASEPSVTAKDHVSPLENNHLEFLG 360
Query: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSE-------DKY 420
+ + KGNGC VKCD+ES N EL+P+++++S SI+L AGCN+D LCSE DKY
Sbjct: 361 KDLMRVKGNGCPVKCDSESVNVELTPEKEMKSSSIKLTNAGCNYDSLCSESQLSPTQDKY 420
Query: 421 TLPRDERCIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPS 480
LPRDERCIAKPYHYFFKY+RRDTTKRYLLKYSKR S+LKRMPDC PKI VDGMC+GVPS
Sbjct: 421 DLPRDERCIAKPYHYFFKYRRRDTTKRYLLKYSKRNSKLKRMPDCNPKIRVDGMCVGVPS 480
Query: 481 SSAAILIS-TEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
SSAAI+IS TE FPVISNASFGCCAVPLQAS LGV+AVQEIS++GGR
Sbjct: 481 SSAAIVISSTENFPVISNASFGCCAVPLQASGLGVNAVQEISNKGGR 524
BLAST of Csor.00g156540 vs. ExPASy TrEMBL
Match:
A0A1S4DWI3 (uncharacterized protein LOC103489686 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103489686 PE=4 SV=1)
HSP 1 Score: 838 bits (2164), Expect = 2.96e-303
Identity = 430/526 (81.75%), Postives = 466/526 (88.59%), Query Frame = 0
Query: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
M+SA +I+KKEHKTFVRRRLPPTP PPPPSLSSSSS PLNPTL PS PCP KKTRDL
Sbjct: 1 MESAVKIVKKEHKTFVRRRLPPTP--PPPPSLSSSSSAPLNPTLIPNPSLPCPQKKTRDL 60
Query: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
PNFSECH+CGFR+D VDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADS GDSF
Sbjct: 61 PNFSECHSCGFRIDTVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSTGDSF 120
Query: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
C ECNRRVHRECFS YSRVAPWSYSSSGS FSVCIDCWVPKPIVTARAVLRSRKIRRKN
Sbjct: 121 ICCECNRRVHRECFSQYSRVAPWSYSSSGSVFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
Query: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
+NVSDLRSSKVSTSGNCKS S+LVKDANCL +K VDAAVR REHALKKAAV RRAS LAS
Sbjct: 181 VNVSDLRSSKVSTSGNCKSLSALVKDANCLVEKKVDAAVRAREHALKKAAVARRASALAS 240
Query: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
DALNLVAQRDE+AAKESG+SA+DAELAIQLHRAMNSSPR SKN CS NSNYM F+NTRV
Sbjct: 241 DALNLVAQRDESAAKESGDSAEDAELAIQLHRAMNSSPRFSKNLCSTNSNYMDFDNTRV- 300
Query: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
DDG+TS G L S EFDFFK P VL+NN++CNSPDN ASEPSVTAKDHV+PLE NHLE +G
Sbjct: 301 DDGETSAGALFSGEFDFFKAPPVLVNNNICNSPDNTASEPSVTAKDHVSPLENNHLEFLG 360
Query: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSE-------DKY 420
+ + KGNGC VKCD+ES N EL+P+++++S SI+L AGCN+D LCSE DKY
Sbjct: 361 KDLMRVKGNGCPVKCDSESVNVELTPEKEMKSSSIKLTNAGCNYDSLCSESQLSPTQDKY 420
Query: 421 TLPRDERCIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPS 480
LPRDERCIAKPYHYFFKY+RRDTTKRYLLKYSKR S+LKRMPDC PKI VDGMC+GVPS
Sbjct: 421 DLPRDERCIAKPYHYFFKYRRRDTTKRYLLKYSKRNSKLKRMPDCNPKIRVDGMCVGVPS 480
Query: 481 SSAAILIS-TEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGG 518
SSAAI+IS TE FPVISNASFGCCAVPLQAS LGV+AVQEIS++GG
Sbjct: 481 SSAAIVISSTENFPVISNASFGCCAVPLQASGLGVNAVQEISNKGG 523
BLAST of Csor.00g156540 vs. ExPASy TrEMBL
Match:
A0A1S3BHF1 (uncharacterized protein LOC103489686 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489686 PE=4 SV=1)
HSP 1 Score: 838 bits (2166), Expect = 3.46e-303
Identity = 430/527 (81.59%), Postives = 467/527 (88.61%), Query Frame = 0
Query: 1 MDSATQILKKEHKTFVRRRLPPTPLPPPPPSLSSSSSPPLNPTLPAKPSAPCPVKKTRDL 60
M+SA +I+KKEHKTFVRRRLPPTP PPPPSLSSSSS PLNPTL PS PCP KKTRDL
Sbjct: 1 MESAVKIVKKEHKTFVRRRLPPTP--PPPPSLSSSSSAPLNPTLIPNPSLPCPQKKTRDL 60
Query: 61 PNFSECHACGFRVDAVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSRGDSF 120
PNFSECH+CGFR+D VDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADS GDSF
Sbjct: 61 PNFSECHSCGFRIDTVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFADSTGDSF 120
Query: 121 NCSECNRRVHRECFSLYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
C ECNRRVHRECFS YSRVAPWSYSSSGS FSVCIDCWVPKPIVTARAVLRSRKIRRKN
Sbjct: 121 ICCECNRRVHRECFSQYSRVAPWSYSSSGSVFSVCIDCWVPKPIVTARAVLRSRKIRRKN 180
Query: 181 INVSDLRSSKVSTSGNCKSFSSLVKDANCLADKNVDAAVRGREHALKKAAVVRRASELAS 240
+NVSDLRSSKVSTSGNCKS S+LVKDANCL +K VDAAVR REHALKKAAV RRAS LAS
Sbjct: 181 VNVSDLRSSKVSTSGNCKSLSALVKDANCLVEKKVDAAVRAREHALKKAAVARRASALAS 240
Query: 241 DALNLVAQRDETAAKESGESADDAELAIQLHRAMNSSPRLSKNFCSANSNYMAFENTRVV 300
DALNLVAQRDE+AAKESG+SA+DAELAIQLHRAMNSSPR SKN CS NSNYM F+NTRV
Sbjct: 241 DALNLVAQRDESAAKESGDSAEDAELAIQLHRAMNSSPRFSKNLCSTNSNYMDFDNTRV- 300
Query: 301 DDGDTSIGELCSEEFDFFKTPQVLINNSLCNSPDNAASEPSVTAKDHVAPLEINHLESMG 360
DDG+TS G L S EFDFFK P VL+NN++CNSPDN ASEPSVTAKDHV+PLE NHLE +G
Sbjct: 301 DDGETSAGALFSGEFDFFKAPPVLVNNNICNSPDNTASEPSVTAKDHVSPLENNHLEFLG 360
Query: 361 NNPIPAKGNGCSVKCDTESENEELSPKEDLRSRSIELMTAGCNHDRLCSE-------DKY 420
+ + KGNGC VKCD+ES N EL+P+++++S SI+L AGCN+D LCSE DKY
Sbjct: 361 KDLMRVKGNGCPVKCDSESVNVELTPEKEMKSSSIKLTNAGCNYDSLCSESQLSPTQDKY 420
Query: 421 TLPRDERCIAKPYHYFFKYKRRDTTKRYLLKYSKRKSRLKRMPDCKPKILVDGMCLGVPS 480
LPRDERCIAKPYHYFFKY+RRDTTKRYLLKYSKR S+LKRMPDC PKI VDGMC+GVPS
Sbjct: 421 DLPRDERCIAKPYHYFFKYRRRDTTKRYLLKYSKRNSKLKRMPDCNPKIRVDGMCVGVPS 480
Query: 481 SSAAILIS-TEKFPVISNASFGCCAVPLQASSLGVHAVQEISDQGGR 519
SSAAI+IS TE FPVISNASFGCCAVPLQAS LGV+AVQEIS++GG+
Sbjct: 481 SSAAIVISSTENFPVISNASFGCCAVPLQASGLGVNAVQEISNKGGK 524
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
KAG6571814.1 | 0.0 | 100.00 | hypothetical protein SDJN03_28542, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7011508.1 | 0.0 | 99.81 | hypothetical protein SDJN02_26414, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_022952090.1 | 0.0 | 99.23 | uncharacterized protein LOC111454852 [Cucurbita moschata] | [more] |
XP_023554148.1 | 0.0 | 98.65 | uncharacterized protein LOC111811499 isoform X2 [Cucurbita pepo subsp. pepo] | [more] |
XP_023554146.1 | 0.0 | 98.65 | uncharacterized protein LOC111811499 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1GKS5 | 0.0 | 99.23 | uncharacterized protein LOC111454852 OS=Cucurbita moschata OX=3662 GN=LOC1114548... | [more] |
A0A6J1I871 | 0.0 | 97.31 | uncharacterized protein LOC111470557 OS=Cucurbita maxima OX=3661 GN=LOC111470557... | [more] |
A0A1S3BG91 | 4.94e-304 | 81.78 | uncharacterized protein LOC103489686 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S4DWI3 | 2.96e-303 | 81.75 | uncharacterized protein LOC103489686 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S3BHF1 | 3.46e-303 | 81.59 | uncharacterized protein LOC103489686 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
Match Name | E-value | Identity | Description | |