Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
AAGAAAAGAAAAGGATTACGTAGGTCGGTGACGACGCTCCAGCTCTCGCGCAATCGCCTCCGGCGATGGCGACAGTGCAATCCGACATTCGCTGTGCTTATCACATCGCCTCCTCCACCTTCCGCTCCCTCTCCCCTACTTTTAGGCCCCGCCGAACCACTGCCGTTCCATGTAAACTGCTCTTCCCCTCCTTCAAATTGAGAGCCTTCTCCACATGTGCTGCCGTGAGAGTTCCCGAGAAGCCGTCGATTTGTACGGCCGACGAGCTTCATTATGCCTCTGTACCGAATTCTGACTGGAGGCTTGCTCTCTGGCGCTATCGTCCCTCCCCTCAGGTGATCTTTAATTCCGTTTTGATTATGTACTCTTCGAGATTCGAGATGAGAACAATTTTTTTTTTTTTTTTTTTTTTTAATTTGTGGATTTGCGATGTGAATAAGTACGCCGCTTAGTTTTGAATTGCTTTTCTGTCCTTTACGCGATTGAACAGGCGCCTCCGAGGAATCATCCGCTTTTGCTATTATCCGGAGTCGGAACTAATGCTATTGGTTATGATCTTGCCCCTGGGGTAAATTTTCTTCCGCTTAGGATTGTAGCTGTACTTTGAAGTTTATTGAGAAGTGGTTTTGATTTTTATGGTCTTTTATGATTTGTAAATGCTCTAGGTTGCTATGACATTAATGTAGATTACTTATTATTCTTATTATTCTTATTTGTTTTTGAGGAACATTTGATATAGATTACTGTATAATCATGTTCTCTCTGGTCCTTTGTCTGTCTATTTTCTTTCCTTGAGCCACACTGGTCATATTCCCATGCTATGAAAAGTTGCATAGTCGTATTTTTCTTTATCCATCAGATTGTCTGTTCTAACGTATTCAAAGATGACATCATTCACTAAGAACTAATCTTCTAAGCTTCATTGGATTGAACAACCCATATATTCCTTTCTTTTATGTGCTATTCCACGTTTGCAGTGTTCTTTTGCGCGGCATATGTCTGGTCAGGGGTATGATACATGGATTCTTGAAGTTCGAGGTGCAGGACTGAGCTTGCAGGAACCAAATTTGAAAGAAATTGAGCATTCAGCTAAAGTTAAATCAGAAAAAATGGAAGCAATCTCTGAGAGCAAAATTAATGGAACTGTACATGTGGAAGAAGAGTCAACAAAAATTCTTGATGACCTCTCAAAATCAGATACTTGCATCAATGGCATCAATGGGAAAGAATCTGACTCTTCTATGGTTGAGGAAGAAGACTTCATAGGAATAGCAACAATTTGGGATGAGTCAAGTGTAGTTTCGGAGTTAACAGAAACTTTTATGCGTTTGTCAGAACGGCTATCTGGCTTTTTAAGTGAAGGCCAATCAAGGATTATGTCCGCCACACTATTTGATCAAATTTCAAAACTTTTAGTTGATTCCCAATTATCTGAACGTTTTAATGAGGTAAGGGAAAAGCTTTCATATTTGTTGGAAACAGGACAGACCTCAGTCATTGCTGGCCAGATTAGGGATTTGAGTCAAAGGCTTGTAGAAATTATTGAAGAAGGTCAACGATCTGTTTCACCTCCATTATTCAATTTGCAAGATCGATTTTCTTCAACGATTGACGATTTTCAGAAACAACTTGATTTAATAGTAAAATATGACTGGGACTTTGATCATTACCTGTTGGAGGACGTTCCTGCTGCGGTAAGGTGACATTTTATCTTGTTGAAATGACCATGTCTACAATCACTAATTGTGGATAGTTTCTGAACTTAACTTCAACTTTTTTCATCTTAGATTTTGATTAATTTTAAATTTGAATTTTTTATACTCACTAAGAGAAATAATCTGTCAACTTCGACTGCAGATGGATTATATCATGGCTATAAGCAAGCCAAAGGATGGCAAATTGCTTGCTATTGGGCACTCCATGGGTGGTATCTTGCTTTATGCAAGACTTTCTCGTTGTGGTAAATTTGTGTACTTTCTCATGCATTTCACGTTTCTATG
ACTAAATGTTATATTTGTGTATCCTTTAGTTTCTGTGTCTAAATGGTATTGTGCTTTGTTTTTTGAATTATTGCAGCAGCTTTTCCTCTCTCCCTCTCTCTCTAGCACACACACTGCGCACACTAAACTCTCTTTGGTATTTGACCCTTTTTAGGTGGTTTTCTATGCTTTCTACATTATCTCTATCATACTTAAGATGCTAGCATATGTAATATAGACTAGAAGTCCTTTGAAGTTGACAATGGAAATGCCATTAATCAAGTGACAGAAAAACTATTGGGAAAAAATTGGTAGTACTGTTTGTAGCATTTTCATGATTACAGAAGAAAAGTAAAACGGGACCTTCTCAAAGAAAACAAAGAAATGGGAAGGAAAGGTTTTTATGAAAGTATTATATCCCCTGTTGTGATGAAGGTTGATCCATTGAACATCTATTAAGTTTTAGAGTTTTGTTTTTTTGTCAATTCTTTCTCAACAATTTTGACTCTTACTAGTGCTGTTTGTACTACATGTTTGTTGTTTTTTCTTGCATATAATGAGGAATGAGCTCAGGATGACTGCTTCTGTTATGCCTCATTATAAATTATTGAAGTTGATACAATATTTGTGAATTTCTTGAACATTACAGCTGCATAACAACGTTTTAGCTGCCACTTAAGAGTTTTTATATTTTAGTTGTTGTAAGATGTATGATTTCTTTATCATCGTTATTTTATAATGTTAGCATTTGCTAGGATATTCCTATCCTGTAATAATCTTTAATACGTAAAGTTACATATGGGAGAAAGAAATCCTTTACTATGCACTACTACTTTGCGTCTCTTTTTCTTTCTGTTAGCTGGGAATAATTTTGTATGCATTTGTCTGCTCTCTTATAAGGTTTTGAAGGAAGAGACCCCAGATTGGCTGCTATTGTTACTTTGGCATCATCTCTTGACTATACTTCTTCAAAATCAGCCTTGAAAATGCTATTGCCCCTTGTGAGTGAAAGCAAGTTTTTAAGCAAATGTTTTCAACTGAGTAATTAGCGAACGTCAATTGTTGGGAAATATCTTTGATGTTGTTTTCCTTTTGTTAGTTCAACACTTCACTCGTCATAAATATCTTGATTATGCAGTAATATTACTGTATTTGGCTTACAGGCAGATCCTGCACAGGCTCTTAATGTTCCAGTTGTTCCATTAGGAGCTTTACTATCAGCTTCATATCCTCTTTCATCTCGTTCTCCTTATGTCTTATCATGGCTAAATAGTCTTATTTCCGCAGAAGATATGATGGATCCTGAAATGTTAAAAAAGCTCGTCTTGAACAACTTCTGTAAGTGATCCTTCTTGAGCTTTTCTCCGTGGTTAGATGCTATTAGCATTAAGTTAAGAGAGTCAGCCATGTCTAAGATATCACATGCATGAAATTTGTTTGTTTTTCATCTTGTAGCAAGAATGATATAATGCATGCTTGCGTTATATGGTATAGCTTACAACTTAGAACTGTCTGTTTCAAGTCAACTAGTCTATCAAAGGTGAAGGAGTTGTTTGAGTTATCGTTACTTAAAAAAAGAAAAGAAAAAGAGAACACCCTAGTAAAAAGAGACCACCTCAATATATTGAATTGAAACGGTGTGTCACTTGCACAAAAAAAAAGGGTACTTATACTTTTGATTTCTAAGTTTTGAGTTTGGTTTTTATTTAGTCCCAAAATTTGGAGAATAATTTTAAATAGTTCTTGTACTAGTTAATCACTAGTTAACCTATTGGAAAGATTCTGGGGCAGTTAAGTTGGCTAATTAGATTATGTCAGGCTCAGCTTGCTGCAATGAGTTGGGATAACGAGCTCAACAATTCAATCAACAAAAAAAGAAATGATATGTTAACATAATTCAATCAACAACTTAACCGTCAAATGATTTTTTTTCATTTGGTCAATAGTATTCTTTATCTACAACACTATTAGGAATTACAGCCTATCAACTTGTGACGAATTCACGAGGGTCAACTTTCTCA
CGACACCTTTACAAACTGATTTAACACAAAACAATGTAATATCCAGAAAACAGTACAGAAAGCAACACAAAAGAAAGAAACACACAGAACTTCTCTCAACTTCCTATTCAAACATTCAACTCAAATTACCAACTACAACTAAACCTTCTAATCTAAGTTCGGAACTAAATTCCCCTGTTACCTTTTCTTAGAATCGTGATTGGAGGTCTAACAGTCAACTATCAGTAAAATTGGACTTTCTAATACTAATTTTCGAATTTTAGGGACTAAACTGTTAATTTTAAAACCTCTAGGATTAAATAGAAACCAGACCAAAACTCATGGACCAAAAGTGTAATTTGTCCCCAAAAAAAATTACCTTTGAGTTACTTGTTCAAATAGGGAGTATTTGTTTTTTAATTTTTAATGAGGAATTTCATTATAGAATGAAATAACAAAAAGGGGGAGGATAACCCAAAAATCGCGTGCAAGGTCAGTTTGCACATGGAGAAGATAGCCTGTAAGTACTGAAAGGAGCTTAAAATTTACTCCAAGAGAGACTAATAAGGTGATTGAAGAACTTCCAAAAGGATTTGGACGTCTTGTTGAAGACTATCATTTCTCAAGCCGTAGACACTCAGCTGCAACCCTACCAAGATAAGTTAAGCTGGAGAATTTCCTTCTTTGAAAACGGATGGGGTTCAAGGTACAGAGATTAGGCTTCTTGCTTCTGGATCTGTTTAGGTGGACAGTGTACCAACTGAAACGTTGCAAAAGGCGATTCCAAAGTGGAGCCCAAGGAATAGTCACGAAAGATAAGGTCTTACCATAAACCACGATTGCATTTCCTGACTCCAAGAGTGGGGTTACTTATTTTAAATACTCAAACCACATTATTATTCTTCTAGGGAAAAATAAGACTCCAATATACAGATGACGATACCAGCTAAGGAGGCCATGGAAAGAAAGTCAGGACAGTGAAATTTATGCATATGTTGTAGCCTTAGGTTAGGGTAGGGCAATTGCTTTCATTTTCTAATTTTGATAGGAAAGGATGTTTTCCCTCTTTTCAATTATACTATTTTCTATGATGCTACCAATATATTTGAAATGATTTTATGAATATTTAAGTTTTCAATTATACTATTTTCTATGATTCTACCAATATATTTGAAATGAATTTATGAATATTTAAGTCCATGGCAACGTTTAAAGCGAACTATTTAAACAATTTTTTTACTTATAGTAGTCATGACATGCATGTAGTGGCGGTCTTATTTTAAAAGAAAAAGTATTCTTTAATACCATCTTGCATTGTTAATATCCTATCTTAAGATCATCCCCCTTTGATACTTGATGCTCCAAGAACTAAAGTCCTTTCAAAGACCTTAATCCCCTTATAAGACCCTTTTTCAAACCACCCTTAGCATAATTTGGCATATTGGTAATAGCAAGTTAGAGGAAATAGGTTTAAACTATTGTGGTCACATACCTAAGATTTAATATTCTAAATGGAGTAGGATCATGTGGTTGTCATGTAAGCATTGTTGTGGTGCACGCAAGCTAGCCTAAATACTCACGAAGGTATATCAACACTTCTAGAAAATGGGGGAGCGATAGCTTTTCTGGCAGTTATGTTTTTCTTTAAATGATATTTAAAAGAAAAAGTTCCGGAATTGTATGGTGCCTGGTAAATGGCCATCCACTGTCCAAAGTCTTTAGAGATTTGGAGTGAAGGTTCCTGGCTTAACAGAAATGGCCTACCACTGTTCGTCCTCTCTATCGGTTAGATTTCTCCTCATATTATACATGCTCCAAATCCCAAGATCTCATCCCAGCAAGCCTGAACACTCCTAATCTTGTTCACGGACACAGCAAACACTTTGAGGAGAACAGAACACAATAGGGAAGCATTTACGAAATAGAATTTGGATGTAAAAAAGGACTGAAGGTAATGGTGCTGATAAAATTTTTGGTTCACTTATTGAAAAGTCTTGCTCAACTTACGTCAGAGGATTGGA
CGTGCATTAAGGTATCCGAAGTCAATTATGCAAGGATATACTTTTGTAAAAAGCCACGACACCTCCTTGGCCTTTTCAATGTAGTTTGTAGATCTGTTTAATGCCAAGAAGTTTCTAGATTTTCACTGCAAGTTCAAGTAATGTTAAAAAAAAAAAGTGTCTTGAGTTGTAAGTCTGATCTCTACTGTCAGTTATTGAAAATTTAATTCTCTTGAGAACCAACCAATGAAACCATGTCCAGCCAATAACTCCATCTTCCTATTTATTGTTATATTATTATTTTCTCGTGTTCATAATATAATTACTATCTCACATAGCACAAATTCAACTATGAGTGTAGGTACGATTCCCGCTAAACTTATCTTACAACTCACAACCGCTTTTCGAGAGGGTGGTTTGCGAGACAGAAGTAATACATTTTATTACAAGGACCATTTACATAAATGCCATGTCCCAGTCTTAGCGCTTGCTGGAGACCAAGATCTAATCTGTCCACCTGTAGCTGTAGAAGGTACATTCATTGTGTTCATGTGGAATATAACTACCTTTTCCATCTTGCTTATATTGATTTAATTGAAGCAGAAACAGCAAAGCTTATTCCCGAGCACCTTGTTTCCTATAAATGTTTTGGAGAACCGGGAGGTCCGCATTATGCACATTATGATTTAGTTGGAGGACGCCTGGTATTATTTTCTTCCCTTCCGCCGTCTATTCACTATCATGGACTTTTCATAGCAGATTCGAGTTATATTTAATGGGAAAAATTTTTATTTTGTTTTCCAAATCTATTTGATGTTGCCATGATTGAATTCCTCCAAAATATGCAGGCAGCGGAGCAGGTCTATCCATGTATTATCGATTTTATTAGCAAGCATGATGCAGTTTGATTATCCACATCTCAATAAATTTTGTTTTTGCATGATTCATTCAAATGTAACACATCCTAGTCGATGTTGGTTACCGTTCATACATGCCTAGGGATCGAAATTTCCTTGCAGAACGTGTAAATTTTGGTGGATTGAGATCATATGCAAAGAGGGTTATGGAGACTTGAGGACTCCTTATTCGTCCAAGCTTCATTAACCAACGCTGAATCAAGATCTCCTTGTTCTGGTTTCAACGAAAATTATACTGGAAAAGGTGAAGATAATCAATATGTACGAGTTCCGGATCACACAATAACATGATATATATATATATATATATATAAAAATGCTTGAAGAACTTGTACTCTTTTCAGCGTTTACACATCACTAGAATTTGTAAATTACTATGATTTTCTTTGAAGGACCCTGAGTTGGTTCAGTAGTCAAAGGAATACGTTGCAATTTCAAGTATCTTTGAGGTAATTAGCTCAGAACCAAGTGGAGTTTGATGGGCCGATTTAGACTATTATGAAATAGACTAGTTTTTTTTTTTTTTTTCTTTTTCTCATTCTTCGTATCAGGACTGTTTTCTTGTGCTTAAAAAGCCCATATCTTCCTGAC
mRNA sequence
AAGAAAAGAAAAGGATTACGTAGGTCGGTGACGACGCTCCAGCTCTCGCGCAATCGCCTCCGGCGATGGCGACAGTGCAATCCGACATTCGCTGTGCTTATCACATCGCCTCCTCCACCTTCCGCTCCCTCTCCCCTACTTTTAGGCCCCGCCGAACCACTGCCGTTCCATGTAAACTGCTCTTCCCCTCCTTCAAATTGAGAGCCTTCTCCACATGTGCTGCCGTGAGAGTTCCCGAGAAGCCGTCGATTTGTACGGCCGACGAGCTTCATTATGCCTCTGTACCGAATTCTGACTGGAGGCTTGCTCTCTGGCGCTATCGTCCCTCCCCTCAGGCGCCTCCGAGGAATCATCCGCTTTTGCTATTATCCGGAGTCGGAACTAATGCTATTGGTTATGATCTTGCCCCTGGGTGTTCTTTTGCGCGGCATATGTCTGGTCAGGGGTATGATACATGGATTCTTGAAGTTCGAGGTGCAGGACTGAGCTTGCAGGAACCAAATTTGAAAGAAATTGAGCATTCAGCTAAAGTTAAATCAGAAAAAATGGAAGCAATCTCTGAGAGCAAAATTAATGGAACTGTACATGTGGAAGAAGAGTCAACAAAAATTCTTGATGACCTCTCAAAATCAGATACTTGCATCAATGGCATCAATGGGAAAGAATCTGACTCTTCTATGGTTGAGGAAGAAGACTTCATAGGAATAGCAACAATTTGGGATGAGTCAAGTGTAGTTTCGGAGTTAACAGAAACTTTTATGCGTTTGTCAGAACGGCTATCTGGCTTTTTAAGTGAAGGCCAATCAAGGATTATGTCCGCCACACTATTTGATCAAATTTCAAAACTTTTAGTTGATTCCCAATTATCTGAACGTTTTAATGAGGTAAGGGAAAAGCTTTCATATTTGTTGGAAACAGGACAGACCTCAGTCATTGCTGGCCAGATTAGGGATTTGAGTCAAAGGCTTGTAGAAATTATTGAAGAAGGTCAACGATCTGTTTCACCTCCATTATTCAATTTGCAAGATCGATTTTCTTCAACGATTGACGATTTTCAGAAACAACTTGATTTAATAGTAAAATATGACTGGGACTTTGATCATTACCTGTTGGAGGACGTTCCTGCTGCGATGGATTATATCATGGCTATAAGCAAGCCAAAGGATGGCAAATTGCTTGCTATTGGGCACTCCATGGGTGGTATCTTGCTTTATGCAAGACTTTCTCGTTGTGGTTTTGAAGGAAGAGACCCCAGATTGGCTGCTATTGTTACTTTGGCATCATCTCTTGACTATACTTCTTCAAAATCAGCCTTGAAAATGCTATTGCCCCTTGCAGATCCTGCACAGGCTCTTAATGTTCCAGTTGTTCCATTAGGAGCTTTACTATCAGCTTCATATCCTCTTTCATCTCGTTCTCCTTATGTCTTATCATGGCTAAATAGTCTTATTTCCGCAGAAGATATGATGGATCCTGAAATGTTAAAAAAGCTCGTCTTGAACAACTTCTGTAACAAGAATGATATAATGCATGCTTGCGTTATATGGTATAGCTTACAACTTAGAACTGTCTGTTTCAAGTCAACTAGTACGATTCCCGCTAAACTTATCTTACAACTCACAACCGCTTTTCGAGAGGGTGGTTTGCGAGACAGAAGTAATACATTTTATTACAAGGACCATTTACATAAATGCCATGTCCCAGTCTTAGCGCTTGCTGGAGACCAAGATCTAATCTGTCCACCTGTAGCTGTAGAAGAAACAGCAAAGCTTATTCCCGAGCACCTTGTTTCCTATAAATGTTTTGGAGAACCGGGAGGTCCGCATTATGCACATTATGATTTAGTTGGAGGACGCCTGGCAGCGGAGCAGGTCTATCCATGTATTATCGATTTTATTAGCAAGCATGATGCAGTTTGATTATCCACATCTCAATAAATTTTGTTTTTGCATGATTCATTCAAATGTAACACATCCTAGTCGATGTTGGTTACCGTTCAT
ACATGCCTAGGGATCGAAATTTCCTTGCAGAACGTGTAAATTTTGGTGGATTGAGATCATATGCAAAGAGGGTTATGGAGACTTGAGGACTCCTTATTCGTCCAAGCTTCATTAACCAACGCTGAATCAAGATCTCCTTGTTCTGGTTTCAACGAAAATTATACTGGAAAAGGTGAAGATAATCAATATGTACTCTTTTCAGCGTTTACACATCACTAGAATTTGTAAATTACTATGATTTTCTTTGAAGGACCCTGAGTTGGTTCAGTAGTCAAAGGAATACGTTGCAATTTCAAGTATCTTTGAGGTAATTAGCTCAGAACCAAGTGGAGTTTGATGGGCCGATTTAGACTATTATGAAATAGACTAGTTTTTTTTTTTTTTTTCTTTTTCTCATTCTTCGTATCAGGACTGTTTTCTTGTGCTTAAAAAGCCCATATCTTCCTGAC
Coding sequence (CDS)
ATGGCGACAGTGCAATCCGACATTCGCTGTGCTTATCACATCGCCTCCTCCACCTTCCGCTCCCTCTCCCCTACTTTTAGGCCCCGCCGAACCACTGCCGTTCCATGTAAACTGCTCTTCCCCTCCTTCAAATTGAGAGCCTTCTCCACATGTGCTGCCGTGAGAGTTCCCGAGAAGCCGTCGATTTGTACGGCCGACGAGCTTCATTATGCCTCTGTACCGAATTCTGACTGGAGGCTTGCTCTCTGGCGCTATCGTCCCTCCCCTCAGGCGCCTCCGAGGAATCATCCGCTTTTGCTATTATCCGGAGTCGGAACTAATGCTATTGGTTATGATCTTGCCCCTGGGTGTTCTTTTGCGCGGCATATGTCTGGTCAGGGGTATGATACATGGATTCTTGAAGTTCGAGGTGCAGGACTGAGCTTGCAGGAACCAAATTTGAAAGAAATTGAGCATTCAGCTAAAGTTAAATCAGAAAAAATGGAAGCAATCTCTGAGAGCAAAATTAATGGAACTGTACATGTGGAAGAAGAGTCAACAAAAATTCTTGATGACCTCTCAAAATCAGATACTTGCATCAATGGCATCAATGGGAAAGAATCTGACTCTTCTATGGTTGAGGAAGAAGACTTCATAGGAATAGCAACAATTTGGGATGAGTCAAGTGTAGTTTCGGAGTTAACAGAAACTTTTATGCGTTTGTCAGAACGGCTATCTGGCTTTTTAAGTGAAGGCCAATCAAGGATTATGTCCGCCACACTATTTGATCAAATTTCAAAACTTTTAGTTGATTCCCAATTATCTGAACGTTTTAATGAGGTAAGGGAAAAGCTTTCATATTTGTTGGAAACAGGACAGACCTCAGTCATTGCTGGCCAGATTAGGGATTTGAGTCAAAGGCTTGTAGAAATTATTGAAGAAGGTCAACGATCTGTTTCACCTCCATTATTCAATTTGCAAGATCGATTTTCTTCAACGATTGACGATTTTCAGAAACAACTTGATTTAATAGTAAAATATGACTGGGACTTTGATCATTACCTGTTGGAGGACGTTCCTGCTGCGATGGATTATATCATGGCTATAAGCAAGCCAAAGGATGGCAAATTGCTTGCTATTGGGCACTCCATGGGTGGTATCTTGCTTTATGCAAGACTTTCTCGTTGTGGTTTTGAAGGAAGAGACCCCAGATTGGCTGCTATTGTTACTTTGGCATCATCTCTTGACTATACTTCTTCAAAATCAGCCTTGAAAATGCTATTGCCCCTTGCAGATCCTGCACAGGCTCTTAATGTTCCAGTTGTTCCATTAGGAGCTTTACTATCAGCTTCATATCCTCTTTCATCTCGTTCTCCTTATGTCTTATCATGGCTAAATAGTCTTATTTCCGCAGAAGATATGATGGATCCTGAAATGTTAAAAAAGCTCGTCTTGAACAACTTCTGTAACAAGAATGATATAATGCATGCTTGCGTTATATGGTATAGCTTACAACTTAGAACTGTCTGTTTCAAGTCAACTAGTACGATTCCCGCTAAACTTATCTTACAACTCACAACCGCTTTTCGAGAGGGTGGTTTGCGAGACAGAAGTAATACATTTTATTACAAGGACCATTTACATAAATGCCATGTCCCAGTCTTAGCGCTTGCTGGAGACCAAGATCTAATCTGTCCACCTGTAGCTGTAGAAGAAACAGCAAAGCTTATTCCCGAGCACCTTGTTTCCTATAAATGTTTTGGAGAACCGGGAGGTCCGCATTATGCACATTATGATTTAGTTGGAGGACGCCTGGCAGCGGAGCAGGTCTATCCATGTATTATCGATTTTATTAGCAAGCATGATGCAGTTTGA
Protein sequence
MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKPSICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFARHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEESTKILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSGFLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQRLVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIMAISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKMLLPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNNFCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHLHKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAAEQVYPCIIDFISKHDAV
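The protein sequence above should correspond to a translation of the coding sequence under the standard genetic code. The following is a minimal sketch of that check, assuming the standard nuclear code applies to this gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database or any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGGCGACAGTGCAATCCGACATTCGC"))  # MATVQSDIR
```

Passing the full CDS string to `translate` and comparing the result with the protein sequence is a quick way to confirm that the two records agree.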
Homology
BLAST of Cp4.1LG01g18190 vs. NCBI nr
Match:
XP_023552896.1 (uncharacterized protein LOC111810420 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1156 bits (2990), Expect = 0.0
Identity = 591/617 (95.79%), Positives = 591/617 (95.79%), Query Frame = 0
Query: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP
Sbjct: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
Query: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA
Sbjct: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
Query: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST
Sbjct: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
Query: 181 KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG
Sbjct: 181 KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
Query: 241 FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR
Sbjct: 241 FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
Query: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM
Sbjct: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
Query: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML
Sbjct: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
Query: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN
Sbjct: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
Query: 481 FCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
FC TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL
Sbjct: 481 FC--------------------------TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
Query: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 600
HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA
Sbjct: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 591
Query: 601 EQVYPCIIDFISKHDAV 617
EQVYPCIIDFISKHDAV
Sbjct: 601 EQVYPCIIDFISKHDAV 591
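The percentages in each HSP summary line are the matched count divided by the alignment length (for example, 591/617). The sketch below shows how such a line could be parsed and the percentages recomputed; the function name `parse_hsp_stats` is hypothetical, and the example input is written out by hand rather than read from any BLAST tool.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matched, total, percent)} for each 'Label = m/n' pair."""
    stats = {}
    for label, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        m, n = int(matched), int(total)
        stats[label] = (m, n, round(100 * m / n, 2))
    return stats

summary = "Identity = 587/617 (95.14%), Positives = 588/617 (95.30%), Query Frame = 0"
for label, (m, n, pct) in parse_hsp_stats(summary).items():
    print(f"{label}: {m}/{n} = {pct:.2f}%")
```

Fields without an `m/n` count, such as `Query Frame = 0`, are skipped by the pattern.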
BLAST of Cp4.1LG01g18190 vs. NCBI nr
Match:
XP_022921386.1 (uncharacterized protein LOC111429673 [Cucurbita moschata])
HSP 1 Score: 1148 bits (2969), Expect = 0.0
Identity = 587/617 (95.14%), Positives = 588/617 (95.30%), Query Frame = 0
Query: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
MATVQSDIRCAYHIASSTFRSLSPTFRPRRT AVPCKLLFPSFKLRAFSTCAAVRVPEKP
Sbjct: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTAAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
Query: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA
Sbjct: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
Query: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVE+EST
Sbjct: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEKEST 180
Query: 181 KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
KILDDLSKSDTCINGINGKESD SMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG
Sbjct: 181 KILDDLSKSDTCINGINGKESDFSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
Query: 241 FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
FLSEGQSRIMSA LFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR
Sbjct: 241 FLSEGQSRIMSAKLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
Query: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM
Sbjct: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
Query: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML
Sbjct: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
Query: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN
Sbjct: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
Query: 481 FCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
FC TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL
Sbjct: 481 FC--------------------------TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
Query: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 600
HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA
Sbjct: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 591
Query: 601 EQVYPCIIDFISKHDAV 617
EQVYPCIIDFISKHDAV
Sbjct: 601 EQVYPCIIDFISKHDAV 591
BLAST of Cp4.1LG01g18190 vs. NCBI nr
Match:
KAG7032528.1 (hypothetical protein SDJN02_06577, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1139 bits (2947), Expect = 0.0
Identity = 583/617 (94.49%), Positives = 585/617 (94.81%), Query Frame = 0
Query: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
MATVQSDIRCAYHIASSTFRSLSPTFRPRRT AVPCKLLFPSFKLRAFSTCAAVRVPEKP
Sbjct: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTAAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
Query: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA
Sbjct: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
Query: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISE KINGTV+ EEEST
Sbjct: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISEIKINGTVNAEEEST 180
Query: 181 KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
KILDDLSKSDTCINGINGKESD SMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG
Sbjct: 181 KILDDLSKSDTCINGINGKESDFSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
Query: 241 FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
FLSEGQSRIMSA LFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVI GQIRDLSQR
Sbjct: 241 FLSEGQSRIMSAKLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVITGQIRDLSQR 300
Query: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
LVEIIEEGQRSVSPPLF+LQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM
Sbjct: 301 LVEIIEEGQRSVSPPLFDLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
Query: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML
Sbjct: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
Query: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN
Sbjct: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
Query: 481 FCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
FC TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL
Sbjct: 481 FC--------------------------TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
Query: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 600
HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA
Sbjct: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 591
Query: 601 EQVYPCIIDFISKHDAV 617
EQVYPCIIDFISKHDAV
Sbjct: 601 EQVYPCIIDFISKHDAV 591
BLAST of Cp4.1LG01g18190 vs. NCBI nr
Match:
KAG6601823.1 (hypothetical protein SDJN03_07056, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1137 bits (2941), Expect = 0.0
Identity = 582/617 (94.33%), Positives = 584/617 (94.65%), Query Frame = 0
Query: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
MATVQSDIRCAYHIASSTFRSLSPTFRPRRT AVPCKLLFPSFKLRAFSTCAAVRVPEKP
Sbjct: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTAAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
Query: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA
Sbjct: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
Query: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISE KINGTV+ EEEST
Sbjct: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISEIKINGTVNAEEEST 180
Query: 181 KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
KILDDLSKSDTCINGINGKESD SMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG
Sbjct: 181 KILDDLSKSDTCINGINGKESDFSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
Query: 241 FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
FLSEGQSRIMSA LFDQISKLLVDSQLSERFNEVREKLSYLLETGQTS IAGQIRDLSQR
Sbjct: 241 FLSEGQSRIMSAKLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSAIAGQIRDLSQR 300
Query: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
LVEIIEEGQRSVSPPLF+LQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM
Sbjct: 301 LVEIIEEGQRSVSPPLFDLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
Query: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML
Sbjct: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
Query: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN
Sbjct: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
Query: 481 FCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
FC TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL
Sbjct: 481 FC--------------------------TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
Query: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 600
HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA
Sbjct: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 591
Query: 601 EQVYPCIIDFISKHDAV 617
EQVYPCIIDFIS HDAV
Sbjct: 601 EQVYPCIIDFISMHDAV 591
BLAST of Cp4.1LG01g18190 vs. NCBI nr
Match:
XP_022971785.1 (uncharacterized protein LOC111470463 [Cucurbita maxima])
HSP 1 Score: 1129 bits (2921), Expect = 0.0
Identity = 579/617 (93.84%), Positives = 583/617 (94.49%), Query Frame = 0
Query: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
M TVQSDIRCAYHIASSTFRSLSP+FRPRRT A PCKLLFPSFKLRAFSTCAAVRVPEKP
Sbjct: 1 MPTVQSDIRCAYHIASSTFRSLSPSFRPRRTAAFPCKLLFPSFKLRAFSTCAAVRVPEKP 60
Query: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA
Sbjct: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
Query: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
RHMSGQGYDTWILEVRGAGLSLQEPN+KEIEHSAKVKSEKMEAISESKINGTVHVEEEST
Sbjct: 121 RHMSGQGYDTWILEVRGAGLSLQEPNVKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
Query: 181 KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
KIL+DLSKSDTCING KESD SMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG
Sbjct: 181 KILEDLSKSDTCING---KESDFSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
Query: 241 FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
FLSEGQSRIMSA LFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR
Sbjct: 241 FLSEGQSRIMSAKLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
Query: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM
Sbjct: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
Query: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML
Sbjct: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
Query: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN
Sbjct: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
Query: 481 FCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
FC TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL
Sbjct: 481 FC--------------------------TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
Query: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 600
HKCHVP+LALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA
Sbjct: 541 HKCHVPILALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 588
Query: 601 EQVYPCIIDFISKHDAV 617
EQVYPCIIDFISKHDAV
Sbjct: 601 EQVYPCIIDFISKHDAV 588
BLAST of Cp4.1LG01g18190 vs. ExPASy TrEMBL
Match:
A0A6J1E189 (uncharacterized protein LOC111429673 OS=Cucurbita moschata OX=3662 GN=LOC111429673 PE=4 SV=1)
HSP 1 Score: 1148 bits (2969), Expect = 0.0
Identity = 587/617 (95.14%), Positives = 588/617 (95.30%), Query Frame = 0
Query: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
MATVQSDIRCAYHIASSTFRSLSPTFRPRRT AVPCKLLFPSFKLRAFSTCAAVRVPEKP
Sbjct: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTAAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
Query: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA
Sbjct: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
Query: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVE+EST
Sbjct: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEKEST 180
Query: 181 KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
KILDDLSKSDTCINGINGKESD SMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG
Sbjct: 181 KILDDLSKSDTCINGINGKESDFSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
Query: 241 FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
FLSEGQSRIMSA LFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR
Sbjct: 241 FLSEGQSRIMSAKLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
Query: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM
Sbjct: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
Query: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML
Sbjct: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
Query: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN
Sbjct: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
Query: 481 FCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
FC TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL
Sbjct: 481 FC--------------------------TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
Query: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 600
HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA
Sbjct: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 591
Query: 601 EQVYPCIIDFISKHDAV 617
EQVYPCIIDFISKHDAV
Sbjct: 601 EQVYPCIIDFISKHDAV 591
BLAST of Cp4.1LG01g18190 vs. ExPASy TrEMBL
Match:
A0A6J1I6P7 (uncharacterized protein LOC111470463 OS=Cucurbita maxima OX=3661 GN=LOC111470463 PE=4 SV=1)
HSP 1 Score: 1129 bits (2921), Expect = 0.0
Identity = 579/617 (93.84%), Positives = 583/617 (94.49%), Query Frame = 0
Query: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
M TVQSDIRCAYHIASSTFRSLSP+FRPRRT A PCKLLFPSFKLRAFSTCAAVRVPEKP
Sbjct: 1 MPTVQSDIRCAYHIASSTFRSLSPSFRPRRTAAFPCKLLFPSFKLRAFSTCAAVRVPEKP 60
Query: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA
Sbjct: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
Query: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
RHMSGQGYDTWILEVRGAGLSLQEPN+KEIEHSAKVKSEKMEAISESKINGTVHVEEEST
Sbjct: 121 RHMSGQGYDTWILEVRGAGLSLQEPNVKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
Query: 181 KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
KIL+DLSKSDTCING KESD SMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG
Sbjct: 181 KILEDLSKSDTCING---KESDFSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
Query: 241 FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
FLSEGQSRIMSA LFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR
Sbjct: 241 FLSEGQSRIMSAKLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
Query: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM
Sbjct: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
Query: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML
Sbjct: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
Query: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN
Sbjct: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
Query: 481 FCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
FC TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL
Sbjct: 481 FC--------------------------TIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
Query: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 600
HKCHVP+LALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA
Sbjct: 541 HKCHVPILALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 588
Query: 601 EQVYPCIIDFISKHDAV 617
EQVYPCIIDFISKHDAV
Sbjct: 601 EQVYPCIIDFISKHDAV 588
BLAST of Cp4.1LG01g18190 vs. ExPASy TrEMBL
Match:
A0A0A0KPN7 (AB hydrolase-1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G576750 PE=4 SV=1)
HSP 1 Score: 1021 bits (2639), Expect = 0.0
Identity = 524/617 (84.93%), Positives = 554/617 (89.79%), Query Frame = 0
Query: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
MATVQSDIR A+HIASSTF S++P RRT ++ KLLFPSFKLRAFST AAVRVP+KP
Sbjct: 1 MATVQSDIRGAFHIASSTFLSITPNLMLRRTASLSGKLLFPSFKLRAFSTGAAVRVPDKP 60
Query: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
SICTADELHY SVPNSDWRLALWRY PSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA
Sbjct: 61 SICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
Query: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
RHMSGQG+DTWILEVRGAGLSLQEPNLKEIEHSAKVKS+KMEA SE KINGT +EST
Sbjct: 121 RHMSGQGFDTWILEVRGAGLSLQEPNLKEIEHSAKVKSDKMEAASEIKINGTSKEVKEST 180
Query: 181 KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
KIL DL+KSD+CING KES SSMVEEEDFIGI TIWDESS+VSELTETFMRLSERLSG
Sbjct: 181 KILSDLAKSDSCING---KESASSMVEEEDFIGITTIWDESSLVSELTETFMRLSERLSG 240
Query: 241 FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
FLSEGQSRIMSA LFDQISKLLVDSQLSERFNEVR +LS LLETGQTSVIAGQIRDLSQR
Sbjct: 241 FLSEGQSRIMSAKLFDQISKLLVDSQLSERFNEVRGRLSNLLETGQTSVIAGQIRDLSQR 300
Query: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
LVEII++GQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAA+DYI
Sbjct: 301 LVEIIDDGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIR 360
Query: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
+SKP+DGKLLAIGHSMGGILLYA LSRCG EGRDPR AAIVTLASSLDYT SKSALK+L
Sbjct: 361 DVSKPRDGKLLAIGHSMGGILLYAELSRCGCEGRDPRFAAIVTLASSLDYTPSKSALKLL 420
Query: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
LPLADPAQALNVPVVPLGALLSASYPLSSRSPYV SWLN+LISAEDMM PEMLKKLVLNN
Sbjct: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVFSWLNNLISAEDMMHPEMLKKLVLNN 480
Query: 481 FCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
FC TIPAKL+LQLTTAFREGGLRDRSNTF+YKDH+
Sbjct: 481 FC--------------------------TIPAKLVLQLTTAFREGGLRDRSNTFFYKDHI 540
Query: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 600
HKC+VPVLA+AGDQDLICPPVAVEETAKLIP+HLV+YKCFGEP GPHYAHYDLVGGRLA
Sbjct: 541 HKCNVPVLAIAGDQDLICPPVAVEETAKLIPQHLVTYKCFGEPEGPHYAHYDLVGGRLAV 588
Query: 601 EQVYPCIIDFISKHDAV 617
EQVYPCII+FIS+HDA+
Sbjct: 601 EQVYPCIIEFISQHDAI 588
BLAST of Cp4.1LG01g18190 vs. ExPASy TrEMBL
Match:
A0A6J1DEH2 (uncharacterized protein LOC111020046 OS=Momordica charantia OX=3673 GN=LOC111020046 PE=4 SV=1)
HSP 1 Score: 1016 bits (2628), Expect = 0.0
Identity = 525/618 (84.95%), Positives = 553/618 (89.48%), Query Frame = 0
Query: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAA-VRVPEK 60
MATVQSD+RCAYHIASSTFRSLSP RRT AVP + PSFKLRAFST AA V+V EK
Sbjct: 1 MATVQSDLRCAYHIASSTFRSLSPNLVLRRTAAVPGHVFSPSFKLRAFSTAAANVKVSEK 60
Query: 61 PSICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSF 120
PSICTADELHY SVPNSDWRLALWRY SPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSF
Sbjct: 61 PSICTADELHYVSVPNSDWRLALWRYHSSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSF 120
Query: 121 ARHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEES 180
ARHMSGQGYDTWILEVRGAGLSLQEP+ KEIEHSA VKSE+MEA+SESK+NGT+ + EES
Sbjct: 121 ARHMSGQGYDTWILEVRGAGLSLQEPDSKEIEHSANVKSEQMEAVSESKVNGTLQMAEES 180
Query: 181 TKILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLS 240
TKIL+DLSKS++C NG KESD SMVEEE F GI TIWDESS+V+ELTETFMRLSERLS
Sbjct: 181 TKILNDLSKSNSCTNG---KESDLSMVEEEYFPGIGTIWDESSLVTELTETFMRLSERLS 240
Query: 241 GFLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQ 300
GFLSEGQS+IMSA LFDQ+SKLLVDSQLSERFNEVR +L LLETGQTSVIAGQIRDLSQ
Sbjct: 241 GFLSEGQSKIMSAKLFDQMSKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDLSQ 300
Query: 301 RLVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYI 360
RLVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAA+DYI
Sbjct: 301 RLVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYI 360
Query: 361 MAISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKM 420
A+SKPKDGKLLAIGHSMGGILLYA+LSRCGFEG+DPRLAAIVTLASSLDYTSSKSALK+
Sbjct: 361 RAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGQDPRLAAIVTLASSLDYTSSKSALKL 420
Query: 421 LLPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLN 480
LLPLADPAQALNVPVVPLGALLSASYPLSSR PYVLSWLN+LISAEDMM PEMLKKLVLN
Sbjct: 421 LLPLADPAQALNVPVVPLGALLSASYPLSSRPPYVLSWLNNLISAEDMMQPEMLKKLVLN 480
Query: 481 NFCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDH 540
NFC TIPAKLILQLTTAFREGGLRDRSNTF+Y DH
Sbjct: 481 NFC--------------------------TIPAKLILQLTTAFREGGLRDRSNTFFYNDH 540
Query: 541 LHKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLA 600
+HKCHVPVLALAGDQDLICPP AVE TAKLIPEHLV+YK FGE GGPHYAHYDLVGGRLA
Sbjct: 541 IHKCHVPVLALAGDQDLICPPEAVEATAKLIPEHLVTYKVFGEAGGPHYAHYDLVGGRLA 589
Query: 601 AEQVYPCIIDFISKHDAV 617
AEQVYPCII+FISKHDA+
Sbjct: 601 AEQVYPCIIEFISKHDAI 589
BLAST of Cp4.1LG01g18190 vs. ExPASy TrEMBL
Match:
A0A1S3CT47 (uncharacterized protein LOC103504592 OS=Cucumis melo OX=3656 GN=LOC103504592 PE=4 SV=1)
HSP 1 Score: 1016 bits (2628), Expect = 0.0
Identity = 525/617 (85.09%), Positives = 552/617 (89.47%), Query Frame = 0
Query: 1 MATVQSDIRCAYHIASSTFRSLSPTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPEKP 60
MATVQSDIR A+HIASSTF SL+P RRT ++ KLLFPSFKLRAFST AAVRVPEKP
Sbjct: 1 MATVQSDIRGAFHIASSTFLSLTPNLMLRRTASLSGKLLFPSFKLRAFSTGAAVRVPEKP 60
Query: 61 SICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
SICTADELHY SVPNSDWRLALWRY PSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA
Sbjct: 61 SICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFA 120
Query: 121 RHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEEST 180
R+MSGQG+DTWILEVRGAGLSLQEPNLKEIEHS+KVKS+KMEA SE KINGT EEST
Sbjct: 121 RYMSGQGFDTWILEVRGAGLSLQEPNLKEIEHSSKVKSDKMEASSEIKINGTSKEVEEST 180
Query: 181 KILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSG 240
K+L+DL+KSD+CING KE SSMVEEEDFIGI TIWDESS+VSELTETFMRLSERLSG
Sbjct: 181 KVLNDLAKSDSCING---KEYGSSMVEEEDFIGITTIWDESSLVSELTETFMRLSERLSG 240
Query: 241 FLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQR 300
FLSEGQSRIMSA LFDQISKLLVDSQLSERFNEVR +LS LLETGQTSVIAGQIRDLSQR
Sbjct: 241 FLSEGQSRIMSAKLFDQISKLLVDSQLSERFNEVRGRLSNLLETGQTSVIAGQIRDLSQR 300
Query: 301 LVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIM 360
LVEIIE+GQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAA+DYI
Sbjct: 301 LVEIIEDGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIR 360
Query: 361 AISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKML 420
AISKP+DGKLLAIGHSMGGILLYA LSRCGFE RDP AA+VTLASSLDYTSSKSALK+L
Sbjct: 361 AISKPRDGKLLAIGHSMGGILLYAELSRCGFEERDPGFAAVVTLASSLDYTSSKSALKLL 420
Query: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNN 480
LPLADPAQALNVPVVPLGALLSASYPLSSRSP V SWLN+LISAEDMM PEMLKKLVLNN
Sbjct: 421 LPLADPAQALNVPVVPLGALLSASYPLSSRSPIVFSWLNNLISAEDMMHPEMLKKLVLNN 480
Query: 481 FCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHL 540
FC TIPAKLILQLTTAFREGGLRDRSNTF+YKDH+
Sbjct: 481 FC--------------------------TIPAKLILQLTTAFREGGLRDRSNTFFYKDHI 540
Query: 541 HKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAA 600
HKC VPVLA+AGDQDLICPPVAVEETA LIPEHLV+YKCFGEP GPHYAHYDLVGGRLA
Sbjct: 541 HKCTVPVLAIAGDQDLICPPVAVEETANLIPEHLVTYKCFGEPEGPHYAHYDLVGGRLAV 588
Query: 601 EQVYPCIIDFISKHDAV 617
EQVYPCII+FIS+HDA+
Sbjct: 601 EQVYPCIIEFISQHDAI 588
BLAST of Cp4.1LG01g18190 vs. TAIR 10
Match:
AT1G15060.1 (Uncharacterised conserved protein UCP031088, alpha/beta hydrolase )
HSP 1 Score: 733.8 bits (1893), Expect = 1.1e-211
Identity = 388/619 (62.68%), Positives = 468/619 (75.61%), Query Frame = 0
Query: 7 DIRCAYHIASST---FRSLS-----PTFRPRRTTAVPCKLLFPSFKLRAFSTCAAVRVPE 66
+IR A ASST RS+S P+FR R T P RAFS+ ++V++P
Sbjct: 11 EIRSALRRASSTVYLHRSISTVTTTPSFRHRTTLLRP----------RAFSS-SSVKLPT 70
Query: 67 KPSICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCS 126
KPS+CTADELHY SVPN+DWRLALWRY P PQAP RNHPLLLLSGVGTNAIGYDL+PGCS
Sbjct: 71 KPSLCTADELHYVSVPNTDWRLALWRYLPPPQAPTRNHPLLLLSGVGTNAIGYDLSPGCS 130
Query: 127 FARHMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKI-NGTVHVEE 186
FARHMSGQG++TWILEVRGAGLS + +LK++E SA S ++E+ + + T E+
Sbjct: 131 FARHMSGQGFETWILEVRGAGLSTRVSDLKDVEESAHELSNQIESTARAAAGKETCSDEK 190
Query: 187 ESTKILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSER 246
++T I+D + + SD S+V G A+ WDES +V+ LT TFM LSER
Sbjct: 191 QTTDIMDSSAPAPA---------SDVSVV------GEASAWDESQLVARLTSTFMSLSER 250
Query: 247 LSGFLSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDL 306
LSGFLSEGQS MSA LFD+I+ L+ D+QL ERFN++R KL L+E+ Q S + Q+RDL
Sbjct: 251 LSGFLSEGQSVFMSAKLFDKIAMLVDDTQLYERFNDIRSKLLSLIESKQNSGLVNQVRDL 310
Query: 307 SQRLVEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMD 366
+QRLV + ++GQRSVSPPL +LQ+R ++TI+DFQKQLDLIVKYDWDFDHYL EDVPAA++
Sbjct: 311 AQRLVNLFDDGQRSVSPPLIDLQERLTATIEDFQKQLDLIVKYDWDFDHYLEEDVPAAIE 370
Query: 367 YIMAISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSAL 426
Y+ A SKPKDGKL AIGHSMGGILLYA LSRC FEGR+P +AA+ TLASS+DYT+S SAL
Sbjct: 371 YVRAQSKPKDGKLFAIGHSMGGILLYAMLSRCAFEGREPSVAAVATLASSVDYTTSNSAL 430
Query: 427 KMLLPLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLV 486
K+L+PLA+PA+AL+VPVVPLGALL+A++PLS+R PYVLSWLN LIS+ DMM PEML+KLV
Sbjct: 431 KLLIPLANPAEALSVPVVPLGALLAAAFPLSTRPPYVLSWLNDLISSTDMMHPEMLEKLV 490
Query: 487 LNNFCNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYK 546
LNNFC TIPAKL++QLTTAFREGGLRDRS FYYK
Sbjct: 491 LNNFC--------------------------TIPAKLLIQLTTAFREGGLRDRSGKFYYK 550
Query: 547 DHLHKCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGR 606
DHL + VPVLALAGD+DLICPP AVE+T KL PE+LV+YK GEP GPHYAHYDLVGGR
Sbjct: 551 DHLPRTSVPVLALAGDRDLICPPAAVEDTVKLFPENLVTYKLLGEPDGPHYAHYDLVGGR 577
Query: 607 LAAEQVYPCIIDFISKHDA 617
LA EQVYPCI +F+S HD+
Sbjct: 611 LAVEQVYPCITEFLSHHDS 577
BLAST of Cp4.1LG01g18190 vs. TAIR 10
Match:
AT1G73750.1 (Uncharacterised conserved protein UCP031088, alpha/beta hydrolase )
HSP 1 Score: 441.0 bits (1133), Expect = 1.5e-123
Identity = 245/554 (44.22%), Positives = 306/554 (55.23%), Query Frame = 0
Query: 62 ICTADELHYASVPNSDWRLALWRYRPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFAR 121
ICTADELHY VPNSDWR+ALWRY PSP+AP RNHPLLLLSG+GTNA+ YDL+P CSFAR
Sbjct: 56 ICTADELHYVPVPNSDWRVALWRYLPSPKAPKRNHPLLLLSGIGTNAVTYDLSPECSFAR 115
Query: 122 HMSGQGYDTWILEVRGAGLSLQEPNLKEIEHSAKVKSEKMEAISESKINGTVHVEEESTK 181
MSG G+DTWILE+RGAGLS S
Sbjct: 116 SMSGSGFDTWILELRGAGLS-------------------------------------SLS 175
Query: 182 ILDDLSKSDTCINGINGKESDSSMVEEEDFIGIATIWDESSVVSELTETFMRLSERLSGF 241
+ +L K + ++ +VS L E F+ +SERL
Sbjct: 176 VDTNLGKGN----------------------------NQQRIVSNLLENFISVSERLENV 235
Query: 242 LSEGQSRIMSATLFDQISKLLVDSQLSERFNEVREKLSYLLETGQTSVIAGQIRDLSQRL 301
L G SK+L
Sbjct: 236 LDGG-------------SKIL--------------------------------------- 295
Query: 302 VEIIEEGQRSVSPPLFNLQDRFSSTIDDFQKQLDLIVKYDWDFDHYLLEDVPAAMDYIMA 361
+QDR S DF+++ +LI Y+WDFD+YL EDVP+AMDY+
Sbjct: 296 ----------------GMQDRLSKRAGDFKQRFELIPHYNWDFDNYLEEDVPSAMDYVRT 355
Query: 362 ISKPKDGKLLAIGHSMGGILLYARLSRCGFEGRDPRLAAIVTLASSLDYTSSKSALKMLL 421
+K KDGKLLA+GHSMGGILLYA LSRCGF+G D LA + TLAS+ DY+SS + LK LL
Sbjct: 356 QTKSKDGKLLAVGHSMGGILLYALLSRCGFKGMDSGLAGVTTLASTFDYSSSGTLLKYLL 415
Query: 422 PLADPAQALNVPVVPLGALLSASYPLSSRSPYVLSWLNSLISAEDMMDPEMLKKLVLNNF 481
P+ +PAQA+N+P++P+ +L+ ++PL R PY LSWL + ISA MMDPE+++KLVLN+
Sbjct: 416 PMKEPAQAINLPIMPIDTMLAMAHPLMCRPPYSLSWLTANISAPQMMDPEVIEKLVLNSL 450
Query: 482 CNKNDIMHACVIWYSLQLRTVCFKSTSTIPAKLILQLTTAFREGGLRDRSNTFYYKDHLH 541
C T+P KL+LQLTTA GGLRDR+ TF YKDH+
Sbjct: 476 C--------------------------TVPVKLLLQLTTAVDHGGLRDRTGTFCYKDHIS 450
Query: 542 KCHVPVLALAGDQDLICPPVAVEETAKLIPEHLVSYKCFGEPGGPHYAHYDLVGGRLAAE 601
K +VP+LALAGD D+ICPP AV +T KLIPEHL +YK G PGGPHY H DL+ GR A
Sbjct: 536 KTNVPILALAGDWDIICPPDAVYDTVKLIPEHLATYKVVGSPGGPHYGHQDLISGRTARN 450
Query: 602 QVYPCIIDFISKHD 616
+VYP I F+ + D
Sbjct: 596 EVYPLITRFLQQQD 450
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_023552896.1 | 0.0 | 95.79 | uncharacterized protein LOC111810420 [Cucurbita pepo subsp. pepo]
XP_022921386.1 | 0.0 | 95.14 | uncharacterized protein LOC111429673 [Cucurbita moschata]
KAG7032528.1 | 0.0 | 94.49 | hypothetical protein SDJN02_06577, partial [Cucurbita argyrosperma subsp. argyro...
KAG6601823.1 | 0.0 | 94.33 | hypothetical protein SDJN03_07056, partial [Cucurbita argyrosperma subsp. sorori...
XP_022971785.1 | 0.0 | 93.84 | uncharacterized protein LOC111470463 [Cucurbita maxima]
Match Name | E-value | Identity (%) | Description
A0A6J1E189 | 0.0 | 95.14 | uncharacterized protein LOC111429673 OS=Cucurbita moschata OX=3662 GN=LOC1114296...
A0A6J1I6P7 | 0.0 | 93.84 | uncharacterized protein LOC111470463 OS=Cucurbita maxima OX=3661 GN=LOC111470463...
A0A0A0KPN7 | 0.0 | 84.93 | AB hydrolase-1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G576...
A0A6J1DEH2 | 0.0 | 84.95 | uncharacterized protein LOC111020046 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A1S3CT47 | 0.0 | 85.09 | uncharacterized protein LOC103504592 OS=Cucumis melo OX=3656 GN=LOC103504592 PE=...
Match Name | E-value | Identity (%) | Description
AT1G15060.1 | 1.1e-211 | 62.68 | Uncharacterised conserved protein UCP031088, alpha/beta hydrolase
AT1G73750.1 | 1.5e-123 | 44.22 | Uncharacterised conserved protein UCP031088, alpha/beta hydrolase
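The Identity values in these tables are the percentages reported in the HSP summary lines of the alignments, i.e. identical positions divided by alignment length (for example 525/618 gives 84.95). The following is a minimal Python sketch for recomputing those percentages from a summary line. The helper name `parse_hsp_summary` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Assumed line layout, copied from the HSP summary lines on this page.
# The pattern also tolerates the misspelling "Postives" seen in some exports.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return recomputed and reported percentages for one HSP summary line."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Percent identity = identical positions / alignment length.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "reported_identity_pct": float(ident_pct),
        # Percent positives = conservative-or-identical positions / length.
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    example = (
        "Identity = 525/618 (84.95%), Positives = 553/618 (89.48%), "
        "Query Frame = 0"
    )
    print(parse_hsp_summary(example))
```

Comparing the recomputed and reported values is a quick consistency check when such lines are copied by hand or scraped from a page.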