Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, exon, CDS, three_prime_UTR
TATGATTTTGTAGACTTCCGCGAAAAGCTATATATATGTAAGTACGGACCGCCACTCTCTCTCTCTCCTTCTCTCTCGAATACAAACCGGAACAGAGCTTCCAATTTCTTAGCTCTACATGGACTAATGGAAACCACACCCACTCTCTGCAGCTTTCCTCCATATCCATTCACTTCTCACCGCCATTACCGCCACCTCCGCTGCCGCATCTACTCGCCGGAGCCCCGACAACTCTTTACCACTCTCTCCTGCTTCAAACCACGCCGTCGGCCTCGCCGGAAAAACAAGCTCGCTAAGTTCCACACCATCCAATCACCCCTCGAATCCTCCTCTGACTCGAAGCTCCAAACTGTAATTGAAATTGATCAATTCACCGCCGAAGCCTCCTCTCTTGTTTACTCCGTCTACTACTACTCCCGTTCCCAATTTCGCCAGTTTCTGTCGTCTGGATTGGACGCTTTCCATGATTTGCGGACGTTGATTGCTTTCGATGACCAGAATCGCACATTGACCGTCTCGTGTCGGCGTTCCACTGTGGAATTCCTAGGTCAGTTGGTGCTATTCAGCTTCGTTGTGGTCTTTCTAGTTAGGGTTTTAATTGGGATCGGATCTCGTTTTCGTAACCAATTTAGTTATGGGTATACTTCTCCTGTGGTGAGGAGGGACCGCAGCCTCGGTGGACGAGAGGTTGTTGTTGGAACTGTGCGAGATAAAGCTATGTCGAAGAAGAACAATCGTTTTGGGATATTGGGTAGTCCCATATCCATGACTTCAATGGCTCTGACTGATGTTTCAGATGAAGTATCGAGGAATGGGGCTTGGGTTGGAGATAGATTGCCAAAATGGTGGCCTCCGGCAGTTCCTAGACGGATTTCCACGGCGAATAGGCAGGGATATCAGATAGAAGCTGACAGATTAGTACGAGGTTAGTGCACAAAGAATTCTTATATGTACAAATTTCACATTGCCATTACGTTATACTCTTGATTGCAATTTGAAGCTGAACTTACCCTCTTGATTTCCAGCCCTCGTGGACAGTAGAATGAGTGGGCGGGATTTCATGGAGGATGATATTCTACATGTGAGTGCCGTTCTTTAACTATATTTGGCTAATTTTTTATACAGTAATTTGCTGGTATACTGAGTTTACTTCCCTTTCATCTATGGTTTATCATGCCTCCTACAATGCAATTATTGAACATTCATGAATAGCACTTGACCTTTTGCCATAGCTGATGTTTTTATGATTGCCTGCACTTTTGTTTTCCTGAATGGATGAGCAGTTGCGTCAAATATGCAGGATGTCTGGAGTAAAAGTGTCCTTCAACACAGAGAATATGCGCGATTCATTCTATCGAGCATCTGTTGACTCTGTTTTTAATATCTATAGCAGGTGAACTTTATACTGGTGCTCCTATAAATATTTGCTCTATAAAACGATGAATCATGCCAAATTTCAAATTCTAGTTCTGGATTTTATTAATGTGATCACACTGAAATCTTTGGTGTTCATATGACTGTATTTGTAACTTATTTACAGGACTCCAATTAACCCTAACTCAGTTCTCATTAATGGTGAGAATGGTCCGAGTTTTCTTGCTGGACTCGCTGAGGACATTGGCATTGAGAATACTCGTGCTGCTAGGATAGTTTCGGCTGCTGTTGCTGCTAGAATGCGTTCATGCTTTTTACAGGCCTGGGTAAATACTTTTACAATTTCCACTGCATTCACTATATCAAATGCTTGTATCTTGTAATGCCTGGGGTAACTCTCACATTTTTGTTGCTAAAATCTTGCAATGAGAAAACATACAAAGATGGAATTATGTGCAAGAATCTTGAAGAACGTTGGTCTATTTTTCTTTAGTTCAAGCTTATTTTTAACATGCTGGACAAAAGCTTTAGGATGATCTGTTAGCATGCTAACATATTTTGGGTTTGAACCGTGGCGAAGGCTTAGACAGTGTGTATATTTATATGTCTTAAATTTGGTAATTGAT
TCAAGAACCAGAGGTGTTGCTGAAGATCCTCGAGTTCCTTTGGCACCAAAGTAAAAAGAAGAGCGGTTAAAAGCCACCCAAAGAGTTTTTGCATCACTTAGCCACGGAGAAACTTTCTGCTCTCTTCTTTGATGTGTTTCCCCTGTAATTTATATGATTTGTTATACCATAAAGAGGATGTACTGAGTTAATTACATCTTACTATAGGCTCTAGTGATGCAAGACAGACATTCTGAAGCAGATGCGGAGTTGTTGAAGATCTGTCACGTTGTCCAGACATTTCCTCCGGACGAGTCGTCTGTATGTATTCTGAACCATCTTTGTAAAGGGTTGTTTCTGAAATGTTCTTATGTTATCTTTGTTGATTCTCCAGCCTGAGATGGAGATGTTGACTCTAGGCTTGAAGAAACACTTGAAGGTAGAGCAGAGAGAATGTTTGATGAATATGTTCATTAGTGTCTGTGGTAAAGATAGTCATAGGACCGCTGCAGAAGCTCTTGGTTTGGTATGTCCTTCTTAATCCCGTCTCTCTCTCTCTCTCTCTATTTTTCTCTCATCAAGCTGCTTCCTCCAAATATTGGCGATTCATATTGTCAAATCCTTTGCGCCTCAAAATCTTCATACTTGTTTATTAGTACACTTCCTCCAAATGTTCATACTCCATCTACATATTCATCAACTTTAGCATTATTTATTAATTGCCTCATCCATCCAATAATTCATTAATTAATGTGTTCTTTCTTTTCTTTCTGGTAAAAGAATCAAGGTGAATCTTTGAACCATATAGCTCTGAATTTAATTTACATTCAATTTTTTTAATAAATTGTTTGAAATGGGCTTTGTTTTTCCTAATTTTATTTGGAAATTGCAAAATTAATCTAAGAAGCTTAAAAAATTGGCCCGTAATACAATGTTTGAATGCAGGCCGATTACATCACAAAAAAATGAAGGTGAAACTTTACTAATGCTGGGACAGTTGTCCAATCTTGAGGTTCTTAGAGTTAGTTGGCTAATCTTCTGAAATTATAAAGTTTGAGTGATTCCAGCTTGAACTTAGTGAAGGGAGCGTTGAACATGCCAAAGTTTCTCATGTAGCCTGTTTTCATATATTTCTAAAACGGTTATTTTTACATTTGTTTCTTTTCTTTGTACTTTTTTTAGTATTTACATTAGCCTAAGTTAAATTTTAATTGTATTCCATTAATTTTGTCACTGTATTTATTATCGTTTTCACGGTTTGGTTTGAAAGCATAATCGACACTTAATTAACAATTCAATTCAATTTGATTTGATTTTTTTCTTATTTTGGTTTTGATTGTGTTTATAAAAGCTTCATTATTTGTTTATATCTTCTAATAATTTTCTTTATAACACTTTGATTGAAGTACCAAGTAGCGTGAACGTTCATAATTTCATCTTATTAGAGTAACAAAGACACATATCATAAATGAATAATTCAAACTTGATTTCTTTAGACAAATCGAATAAAATAAGTATGGAAGAAGTCGTATAGAATTTTGGTATAGATAGTCGAAGTGCCAACAAGCAAATTGGCCTATCATCGCGAAGTTGGCAGCACTAAACCATAGTATTGATACCCCATGTAGGACCTATGGTGCCTGGGAGAATAGAGGATAGAGGGCAGGGGCTGCATGAAAGGTTAAAGGAAGAGCCATAGCATAACCCTTCTCGAAGATAAGGGGATCAACAGCCAATTTCTTTGCCACCAACCGAATAAGAATAATATCCAACATAACATAGCAACAGCTGTAGGCCATGGATTAGACAAAAGCTTATGAAAGGCAGAGTTGAAAGGCAAAGATGATGGATGACGTAAAAAGATGAGCACTTATTCGCTCAACAATTTTCTTAAAAGTCGAGTTGTCCTCAATCACACCCCACTACTCACAGGAACATTCTGTCTTTGAGAGAGTTCGATTTTTAGAGGGAATCTCAGTAATCATTTAACTAACGAAGGCTTTTTTATGAAATTTTGAAAC
TAAAAAATTTTGAATAGTTTAAAATGAGTACTCCTGAAAAAAAGTGCCCCTTTAGGGTATAGAAGCTCCGTCAATTACTTGCTGTGATCCATTCACGGATGGAAAACAGTATCAGAGGTAAAGAGTTTCATGTAAGGGAGAACTTATTAACAGTTTCAGACACACCTTTCCTTTCTGAGAGGCCAATGATATTAAGTAAAACACGCAGAAAACCCTACTCTCTGAGTGTTGAGATCCCGAGAGGTTGTGTAAGGGTTTCTATACCCTAAAGGGGCACGTTTTATGTAGAGTACCCCTAAAGGGGCACTTTTTTTCAGGAGTACTTTTTTTAGTTTTCTTACGAAATATAAATAGAAACGTAGGAAGTTCACGTTTTACATAACGAAATAGTTAAGTTTAATATTTCTATGGTCAAAGTCGGTCTGAAAAGCTCGTTTGTATTGAAGGAATACATTCTTGGAATGCAGACCGACAGCATCACAAAAAATAAAGTCTTGAAGCATCAATTATTTTATTCTCATTAGTGTTCCTTTGGAGGCCACCTTGCTAAACCTGCTCAATCTCGATAGTCAGTCTAGATGTTAGATTCATTCGGGTGCACACAAATCCTCTGGAATTGCAAACTTTGAGAATCCAAGAAGAGAAGAGAAACATGGCGGAATTCCTGTGGACCTTTGCTGTACAGGAATCGTTGAAGAAGACGGTGATTATGGCGGCTGAGCAGATCGGCCTGGCATGGGGATTCCAGGAGGAACTCTCTAAGCTGGAGGAAGCTTTATCCAGGGCGCGAGCCATCCTAAGGAATGTTGACAGAACGAAGGCAGACCATGAATACCTCAACCTATGGGTGAAGAAGCTTGAAGATATTGTCTTTGAAGCTGATAATTTACTCGATGAGCTCGCTTATGAAGATGTTCGACGCAAAGTGGAAATCGAAACGGTAAGCATCATTTCATTCTCCACATTTTATATCAAAATGGCAAATAAAATCCCAACGGTTGCTAAAAAGTTACAAGAGTTCATTTCTACAAATCCTCCTCCGCTATGTGTTGCTACAACATCCAATGAAGCTGAAATTGATCTTAACCAAACTCGAGAGACGGACTCATTTCCTGAGGAAATTGAGGTCATTGGGAGGAAAACGGAAGTATCATATATAGTGGATAAGCTACTTGCCCTTGAGAGTCAGAAAACTCTAGTTGTTTTAACGATTGTTGGCACAGGTCGATTAGGAAAAACAACACTGGTGAAGAAAGTTTTCCATCATGATATGATAAGGAAGAATTTCGATACAACCATATGGGTATGTGTGTCTCATCCTTTTAAAATCAACAAAATTTTGAGAGCAATCGTGGGATCCTTTAATCCTACATTTGGTGGCTCAGATGAAAGGGAAGTCATTCTTCGGGAGCTCCAAAATTTGTTGACTGCCAAAAAGTATTTGCTTGTGCTTGATGATGTTTGGAACGAGGAACCCATTCTATGGAACGAGTTGAGGGCATGTTTGGTAAAGATCAATCAAAATGTTGGAAGTGCATACATCATTTGAGGCAGTTACCGGATGATGATTGTTGGGCTTTATTTCAAAAATGTGTCTTTGGAAGTGATATACCAATAATTCCTGATGTCATTCGAGGACAACTTGTTAAAAAATTTGGTGGCATACCATTGGTTGTGAAAGTGCTGGGAGGAATGGTGAAATCATACAAGAACGATGAGGAATTGCAATCGACTTTGGAAAATCTAGTGAGAATTGAATTACCAAAGGAAGATCTTATTTTATTCACAATCAAATTAAGCGTGGACCGTCTACCGTCTTCCTCATTGAAACAATGTTTTGCTTATTGTTCAAATTTTCCACCCGACTTTCACTTCTACAAGGAAGCACTTGTTCGGATGTGGATAGCACAAGGGTTTATTCAACTACCTAATGGAAGCAATGTAACAATGGAGGATATTGGAGAGAAGTATTTCGATATGTTGTTGTCTCGCTCCTTGTT
TCAAGACGTTGAAAAGGATTATAGAGGAAAAATTGAATATTGTAAGATGCATGATCATATACACGAGGTCGCATGTGCTATTTCAAATGATAAAAATTTGAGAGAGGACCTTGTAACGGATGAAAAATATGAAGGTGGTGAAGTTCTTTCGATCCGTCAAAGAAGAAGAACAGTTTATTGTTGTGAAAATGTATCATTTGATATGATCACCAACTTCATCTACTTGCGTGTTTTAATTATGGATCATGTGTTCATAACTGAATTGCCGGATACAATTGGTAAGTTGAAGCATTTGAGGTATCTTGACATTTCCGGGTGTGGAATAAGTAATCTCCCAGAATCTATTGTTTTGCTCTACCATTTGCAAACTTTGAAGCTTGATGAGTGTACAAAGCTTCCGACAGAGTTGACAAAGTTGGTGAACTTAAGGCATTTAGAATTTGATTGGTATGATATAAATCCTAAGCAAATGCCTAAACATTTAAGTCGAATGACTGAACTTCAAACGCTTTCTAGTTTTATAGTTGGGTGTGATGACGGATGTAAGATCGAGGAGCTTGGACCCTTAAAAAACCTTAAAGGTGATTTAAGCTTGTTATATCTCGAGCGAGTCAAAAGTAAAGAGGAGGCCATGACTGCAAATTTAGTAGAAAAGGAAAATATTTCTCAACTATGTTTGAAATGGAGTATTGAAAGGGAAGAATGTAACGACAGTGATTTGAATGTGTTGGAAGGGCTTCAACCACACAAAAACCTTCGACAATTGAGAATTCAAAATTTTGCCGGTGAACTTCTACCCAACGGTATTTTTGTTGACAACTTGGTTGAGATACATCTATTCGAGTGCACAAAATGTGAAACTTTACCAATGCTCGGACAGTTGTCCAAACTTAAGGCTCTTCAAATTAGCAAGTTGAGTGCACTAAAAAGTATTGGAGACGAATTTTATGGAAATTATCGCGACAGCAGAACTTTATTCCCTAAATTGAAAGCATTTCATATTTGGTTCATGGACAGTTTAGAGCAATGGGAAGAAGTTGATACTCTGACGAATTGTACAACTTTTCCTCATCTCGAAAGTTTAATCATTCGTTATTGTCTCAAACTAACGAATATTCCAAACATTTTTGCAACTCATGGACAGAGGTTGGAAGTTGACACTATGAATGCAAGGTTATTTTCAAGCTTTCAAAGTCCTCCAAAGCTTCGATCTCTACACATTACAAATTGTACAAGTTTGATAAAGCTACCAAACTGGTTAGAGTTTTGTAGCTCACTTGAAGATTTGGAGATAGACAACTTTTGTGATGATATTTCCCTTCCAAATTTGCAAAATCTCCGAAACTTGTCTACGTTAGAAATCAAAAAGTTTGAAAATTTGCCAAAGGGGCTTCGTGGTATGCATAACTTGAAAAAATTGGTGGTTGAAGGGCCAATGAAGGATTATGATTGGAGTCGATTCGTACCCTTAAATTCGCTTGAAGAGCTTGTATTGTCGGAAACAAGAACTAATAGTTTAACACAACTTCCTCGACAACTTGAACTCCTCCCTGCTTTAAAATATTTGTATATTGAGAATTTTGATGGTGTTGAATCTTTGCCGGAATGGTTGGGAAACGTTATATCTTTGGAGACATTATCATTACGTAAGTGAAGAAGCACTGTCAAATTTAACTAAATTGAATCGGTTGTTCGTATATGGTCGTTCCCAGATTGATGCGGTTGAACGGGAAAGAGTATTCCATGAAACCGGTATCCATGTGTCCCGCTACTAACAGGTCAGTTCCTCTGTATTTAAAGCCCGTCCCCAATTTCGCCAGTTTCTGACGCTTTCCATGATTTGCGGACCTTGATTGCTTTCGATGACCAGAATCGCACATTGACCGTCTCGTGTCGGCGTTCCACTGTGGAGTTCCTAGGTCAGTTGGTGCTATTCAGCTTCGTCGTGGTCTTTCTGGTTAGGGTTTTAATTGGGATCGGATCTCGTTTTCGTAACCAAT
TTAGTCCCCTATACATGATGACTTCAATGGCTCTGACTGATGTTTCAGATGAAGTTTCGAGGAATGGGGCTTGGGTTGGAGATAGATTGCCGAAATGGTGGCCTCCGGCAGTTCCTAGACGAATTTCCATTGCGAACATGGCCTGTAATTCAATCTTTGAAATCAGACTATAAACTTTTCGAACATGATTTCATACTTTTCAAACATCACTTTTGTCCGGGTTATTTCGTTGGAGCTCACAACTATTTTATGCCTGTTTCTTTTGATTCTAATACTTTTATTAGTTGTATTTGTTGTATTTGAAGAATTGTATTAAATTTGATATTATATTAATTAGTTACTACTACGGCTTTGTTGATTCAACAATTTTGTTATTCTCATTCTTCGCCCTTTCCCTTTTGGGATCCTCTTCCTTTCTCGCCTCCTGGAATCAACAAGATGACTCCCCTTCTTCTTGGAACTTCATTCATTAAATACAAATGTGATCAGCTTAAAGGGTTACT
mRNA sequence
TATGATTTTGTAGACTTCCGCGAAAAGCTATATATATGTAAGTACGGACCGCCACTCTCTCTCTCTCCTTCTCTCTCGAATACAAACCGGAACAGAGCTTCCAATTTCTTAGCTCTACATGGACTAATGGAAACCACACCCACTCTCTGCAGCTTTCCTCCATATCCATTCACTTCTCACCGCCATTACCGCCACCTCCGCTGCCGCATCTACTCGCCGGAGCCCCGACAACTCTTTACCACTCTCTCCTGCTTCAAACCACGCCGTCGGCCTCGCCGGAAAAACAAGCTCGCTAAGTTCCACACCATCCAATCACCCCTCGAATCCTCCTCTGACTCGAAGCTCCAAACTGTAATTGAAATTGATCAATTCACCGCCGAAGCCTCCTCTCTTGTTTACTCCGTCTACTACTACTCCCGTTCCCAATTTCGCCAGTTTCTGTCGTCTGGATTGGACGCTTTCCATGATTTGCGGACGTTGATTGCTTTCGATGACCAGAATCGCACATTGACCGTCTCGTGTCGGCGTTCCACTGTGGAATTCCTAGGTCAGTTGGTGCTATTCAGCTTCGTTGTGGTCTTTCTAGTTAGGGTTTTAATTGGGATCGGATCTCGTTTTCGTAACCAATTTAGTTATGGGTATACTTCTCCTGTGGTGAGGAGGGACCGCAGCCTCGGTGGACGAGAGGTTGTTGTTGGAACTGTGCGAGATAAAGCTATGTCGAAGAAGAACAATCGTTTTGGGATATTGGGTAGTCCCATATCCATGACTTCAATGGCTCTGACTGATGTTTCAGATGAAGTATCGAGGAATGGGGCTTGGGTTGGAGATAGATTGCCAAAATGGTGGCCTCCGGCAGTTCCTAGACGGATTTCCACGGCGAATAGGCAGGGATATCAGATAGAAGCTGACAGATTAGTACGAGCCCTCGTGGACAGTAGAATGAGTGGGCGGGATTTCATGGAGGATGATATTCTACATTTGCGTCAAATATGCAGGATGTCTGGAGTAAAAGTGTCCTTCAACACAGAGAATATGCGCGATTCATTCTATCGAGCATCTGTTGACTCTGTTTTTAATATCTATAGCAGGACTCCAATTAACCCTAACTCAGTTCTCATTAATGGTGAGAATGGTCCGAGTTTTCTTGCTGGACTCGCTGAGGACATTGGCATTGAGAATACTCGTGCTGCTAGGATAGTTTCGGCTGCTGTTGCTGCTAGAATGCGTTCATGCTTTTTACAGGCCTGGGCTCTAGTGATGCAAGACAGACATTCTGAAGCAGATGCGGAGTTGTTGAAGATCTGTCACGTTGTCCAGACATTTCCTCCGGACGAGTCGTCTCCTGAGATGGAGATGTTGACTCTAGGCTTGAAGAAACACTTGAAGGTAGAGCAGAGAGAATGTTTGATGAATATGTTCATTAGTGTCTGTGGTAAAGATAGTCATAGGACCGCTGCAGAAGCTCTTGGTTTGAATCGCACATTGACCGTCTCGTGTCGGCGTTCCACTGTGGAGTTCCTAGATGAAGTTTCGAGGAATGGGGCTTGGGTTGGAGATAGATTGCCGAAATGGTGGCCTCCGGCAGTTCCTAGACGAATTTCCATTGCGAACATGGCCTGTAATTCAATCTTTGAAATCAGACTATAAACTTTTCGAACATGATTTCATACTTTTCAAACATCACTTTTGTCCGGGTTATTTCGTTGGAGCTCACAACTATTTTATGCCTGTTTCTTTTGATTCTAATACTTTTATTAGTTGTATTTGTTGTATTTGAAGAATTGTATTAAATTTGATATTATATTAATTAGTTACTACTACGGCTTTGTTGATTCAACAATTTTGTTATTCTCATTCTTCGCCCTTTCCCTTTTGGGATCCTCTTCCTTTCTCGCCTCCTGGAATCAACAAGATGACTCCCCTTCTTCTTGGAACTTCATTCATTAAATACAAATGTGATCAGCTTAAAGGGTTACT
Coding sequence (CDS)
TATGATTTTGTAGACTTCCGCGAAAAGCTATATATATGTAAGTACGGACCGCCACTCTCTCTCTCTCCTTCTCTCTCGAATACAAACCGGAACAGAGCTTCCAATTTCTTAGCTCTACATGGACTAATGGAAACCACACCCACTCTCTGCAGCTTTCCTCCATATCCATTCACTTCTCACCGCCATTACCGCCACCTCCGCTGCCGCATCTACTCGCCGGAGCCCCGACAACTCTTTACCACTCTCTCCTGCTTCAAACCACGCCGTCGGCCTCGCCGGAAAAACAAGCTCGCTAAGTTCCACACCATCCAATCACCCCTCGAATCCTCCTCTGACTCGAAGCTCCAAACTGTAATTGAAATTGATCAATTCACCGCCGAAGCCTCCTCTCTTGTTTACTCCGTCTACTACTACTCCCGTTCCCAATTTCGCCAGTTTCTGTCGTCTGGATTGGACGCTTTCCATGATTTGCGGACGTTGATTGCTTTCGATGACCAGAATCGCACATTGACCGTCTCGTGTCGGCGTTCCACTGTGGAATTCCTAGGTCAGTTGGTGCTATTCAGCTTCGTTGTGGTCTTTCTAGTTAGGGTTTTAATTGGGATCGGATCTCGTTTTCGTAACCAATTTAGTTATGGGTATACTTCTCCTGTGGTGAGGAGGGACCGCAGCCTCGGTGGACGAGAGGTTGTTGTTGGAACTGTGCGAGATAAAGCTATGTCGAAGAAGAACAATCGTTTTGGGATATTGGGTAGTCCCATATCCATGACTTCAATGGCTCTGACTGATGTTTCAGATGAAGTATCGAGGAATGGGGCTTGGGTTGGAGATAGATTGCCAAAATGGTGGCCTCCGGCAGTTCCTAGACGGATTTCCACGGCGAATAGGCAGGGATATCAGATAGAAGCTGACAGATTAGTACGAGCCCTCGTGGACAGTAGAATGAGTGGGCGGGATTTCATGGAGGATGATATTCTACATTTGCGTCAAATATGCAGGATGTCTGGAGTAAAAGTGTCCTTCAACACAGAGAATATGCGCGATTCATTCTATCGAGCATCTGTTGACTCTGTTTTTAATATCTATAGCAGGACTCCAATTAACCCTAACTCAGTTCTCATTAATGGTGAGAATGGTCCGAGTTTTCTTGCTGGACTCGCTGAGGACATTGGCATTGAGAATACTCGTGCTGCTAGGATAGTTTCGGCTGCTGTTGCTGCTAGAATGCGTTCATGCTTTTTACAGGCCTGGGCTCTAGTGATGCAAGACAGACATTCTGAAGCAGATGCGGAGTTGTTGAAGATCTGTCACGTTGTCCAGACATTTCCTCCGGACGAGTCGTCTCCTGAGATGGAGATGTTGACTCTAGGCTTGAAGAAACACTTGAAGGTAGAGCAGAGAGAATGTTTGATGAATATGTTCATTAGTGTCTGTGGTAAAGATAGTCATAGGACCGCTGCAGAAGCTCTTGGTTTGAATCGCACATTGACCGTCTCGTGTCGGCGTTCCACTGTGGAGTTCCTAGATGAAGTTTCGAGGAATGGGGCTTGGGTTGGAGATAGATTGCCGAAATGGTGGCCTCCGGCAGTTCCTAGACGAATTTCCATTGCGAACATGGCCTGTAATTCAATCTTTGAAATCAGACTATAA
Protein sequence
YDFVDFREKLYICKYGPPLSLSPSLSNTNRNRASNFLALHGLMETTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHTIQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIAFDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRDRSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKWWPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFNTENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVSAAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHLKVEQRECLMNMFISVCGKDSHRTAAEALGLNRTLTVSCRRSTVEFLDEVSRNGAWVGDRLPKWWPPAVPRRISIANMACNSIFEIRL
Homology
BLAST of Cp4.1LG08g05870 vs. NCBI nr
Match:
XP_023539150.1 (uncharacterized protein LOC111799886 isoform X3 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 887 bits (2292), Expect = 0.0
Identity = 451/455 (99.12%), Positives = 452/455 (99.34%), Query Frame = 0
Query: 43 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 102
METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT
Sbjct: 1 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 60
Query: 103 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 162
IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA
Sbjct: 61 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 120
Query: 163 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 222
FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD
Sbjct: 121 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 180
Query: 223 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 282
RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW
Sbjct: 181 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 240
Query: 283 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 342
WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN
Sbjct: 241 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 300
Query: 343 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 402
TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS
Sbjct: 301 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 360
Query: 403 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 462
AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL
Sbjct: 361 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 420
Query: 463 KVEQRECLMNMFISVCGKDSHRTAAEALGLNRTLT 497
KVEQRECLMNMFISVCGKDSHRTAAEALGL +T
Sbjct: 421 KVEQRECLMNMFISVCGKDSHRTAAEALGLADYIT 455
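The percentages in an HSP header are the matching positions divided by the alignment length. The sketch below recomputes them from the counts as a consistency check. It is a hedged illustration in Python: the regular expression and the function name `check_percentages` are assumptions made for this example, not part of any BLAST tool.

```python
import re

# Matches fragments such as "451/455 (99.12%)" in an HSP header line.
HSP_COUNT = re.compile(r"(\d+)/(\d+) \((\d+\.\d+)%\)")


def check_percentages(header: str):
    """Return (matches, length, reported, recomputed) for each count in the header."""
    results = []
    for matches, length, reported in HSP_COUNT.findall(header):
        recomputed = f"{100 * int(matches) / int(length):.2f}"
        results.append((int(matches), int(length), reported, recomputed))
    return results


line = "Identity = 451/455 (99.12%), Positives = 452/455 (99.34%), Query Frame = 0"
for matches, length, reported, recomputed in check_percentages(line):
    print(matches, length, reported, recomputed, reported == recomputed)
```

For the hit above, both recomputed values agree with the reported ones (99.12 and 99.34).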
BLAST of Cp4.1LG08g05870 vs. NCBI nr
Match:
XP_023539149.1 (uncharacterized protein LOC111799886 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 887 bits (2291), Expect = 0.0
Identity = 450/450 (100.00%), Positives = 450/450 (100.00%), Query Frame = 0
Query: 43 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 102
METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT
Sbjct: 1 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 60
Query: 103 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 162
IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA
Sbjct: 61 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 120
Query: 163 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 222
FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD
Sbjct: 121 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 180
Query: 223 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 282
RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW
Sbjct: 181 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 240
Query: 283 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 342
WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN
Sbjct: 241 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 300
Query: 343 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 402
TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS
Sbjct: 301 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 360
Query: 403 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 462
AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL
Sbjct: 361 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 420
Query: 463 KVEQRECLMNMFISVCGKDSHRTAAEALGL 492
KVEQRECLMNMFISVCGKDSHRTAAEALGL
Sbjct: 421 KVEQRECLMNMFISVCGKDSHRTAAEALGL 450
BLAST of Cp4.1LG08g05870 vs. NCBI nr
Match:
XP_022932631.1 (uncharacterized protein LOC111439130 isoform X2 [Cucurbita moschata])
HSP 1 Score: 875 bits (2260), Expect = 0.0
Identity = 444/455 (97.58%), Positives = 448/455 (98.46%), Query Frame = 0
Query: 43 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 102
METTPTLCSFPPYPFTSH HYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT
Sbjct: 1 METTPTLCSFPPYPFTSHLHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 60
Query: 103 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 162
IQSPLESSSDSKLQT+IEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA
Sbjct: 61 IQSPLESSSDSKLQTIIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 120
Query: 163 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 222
FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFL RVLIGIGSRFRNQFSYGYTSPVVRRD
Sbjct: 121 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLGRVLIGIGSRFRNQFSYGYTSPVVRRD 180
Query: 223 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 282
RSLGGREVVVGTVRDKAM+KKNN FGIL SP+SMTSMALTDVSDEVSRNGAWVGDRLPKW
Sbjct: 181 RSLGGREVVVGTVRDKAMAKKNNHFGILDSPLSMTSMALTDVSDEVSRNGAWVGDRLPKW 240
Query: 283 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 342
WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN
Sbjct: 241 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 300
Query: 343 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 402
TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS
Sbjct: 301 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 360
Query: 403 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 462
AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL
Sbjct: 361 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 420
Query: 463 KVEQRECLMNMFISVCGKDSHRTAAEALGLNRTLT 497
KVEQRECLMNMFISVCGKDSHRTAAEALGL +T
Sbjct: 421 KVEQRECLMNMFISVCGKDSHRTAAEALGLADYIT 455
BLAST of Cp4.1LG08g05870 vs. NCBI nr
Match:
KAG6597642.1 (hypothetical protein SDJN03_10822, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 875 bits (2260), Expect = 0.0
Identity = 444/450 (98.67%), Positives = 446/450 (99.11%), Query Frame = 0
Query: 43 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 102
METTPTLCSFPPYPFTSHRHY HLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT
Sbjct: 1 METTPTLCSFPPYPFTSHRHYHHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 60
Query: 103 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 162
IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA
Sbjct: 61 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 120
Query: 163 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 222
FDDQNRTLTVSCRRSTVEFLGQLVL SFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD
Sbjct: 121 FDDQNRTLTVSCRRSTVEFLGQLVLLSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 180
Query: 223 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 282
RSLGGREVVVGTVRDKAM+KKNN FGIL SPISMTSMALTDVSDEVSRNGAWVGDRLPKW
Sbjct: 181 RSLGGREVVVGTVRDKAMAKKNNHFGILDSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 240
Query: 283 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 342
WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN
Sbjct: 241 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 300
Query: 343 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 402
TENMRDSFYRASVDSVFNIYSRTPINPNSVLI+GENGPSFLAGLAEDIGIENTRAARIVS
Sbjct: 301 TENMRDSFYRASVDSVFNIYSRTPINPNSVLIHGENGPSFLAGLAEDIGIENTRAARIVS 360
Query: 403 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 462
AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL
Sbjct: 361 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 420
Query: 463 KVEQRECLMNMFISVCGKDSHRTAAEALGL 492
KVEQRECLMNMFISVCGKDSHRTAAEALGL
Sbjct: 421 KVEQRECLMNMFISVCGKDSHRTAAEALGL 450
BLAST of Cp4.1LG08g05870 vs. NCBI nr
Match:
XP_022932633.1 (uncharacterized protein LOC111439130 isoform X3 [Cucurbita moschata] >XP_022932634.1 uncharacterized protein LOC111439130 isoform X3 [Cucurbita moschata])
HSP 1 Score: 874 bits (2259), Expect = 0.0
Identity = 443/450 (98.44%), Positives = 446/450 (99.11%), Query Frame = 0
Query: 43 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 102
METTPTLCSFPPYPFTSH HYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT
Sbjct: 1 METTPTLCSFPPYPFTSHLHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 60
Query: 103 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 162
IQSPLESSSDSKLQT+IEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA
Sbjct: 61 IQSPLESSSDSKLQTIIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 120
Query: 163 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 222
FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFL RVLIGIGSRFRNQFSYGYTSPVVRRD
Sbjct: 121 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLGRVLIGIGSRFRNQFSYGYTSPVVRRD 180
Query: 223 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 282
RSLGGREVVVGTVRDKAM+KKNN FGIL SP+SMTSMALTDVSDEVSRNGAWVGDRLPKW
Sbjct: 181 RSLGGREVVVGTVRDKAMAKKNNHFGILDSPLSMTSMALTDVSDEVSRNGAWVGDRLPKW 240
Query: 283 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 342
WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN
Sbjct: 241 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 300
Query: 343 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 402
TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS
Sbjct: 301 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 360
Query: 403 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 462
AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL
Sbjct: 361 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 420
Query: 463 KVEQRECLMNMFISVCGKDSHRTAAEALGL 492
KVEQRECLMNMFISVCGKDSHRTAAEALGL
Sbjct: 421 KVEQRECLMNMFISVCGKDSHRTAAEALGL 450
BLAST of Cp4.1LG08g05870 vs. ExPASy TrEMBL
Match:
A0A6J1EWV7 (uncharacterized protein LOC111439130 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439130 PE=4 SV=1)
HSP 1 Score: 875 bits (2260), Expect = 0.0
Identity = 444/455 (97.58%), Positives = 448/455 (98.46%), Query Frame = 0
Query: 43 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 102
METTPTLCSFPPYPFTSH HYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT
Sbjct: 1 METTPTLCSFPPYPFTSHLHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 60
Query: 103 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 162
IQSPLESSSDSKLQT+IEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA
Sbjct: 61 IQSPLESSSDSKLQTIIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 120
Query: 163 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 222
FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFL RVLIGIGSRFRNQFSYGYTSPVVRRD
Sbjct: 121 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLGRVLIGIGSRFRNQFSYGYTSPVVRRD 180
Query: 223 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 282
RSLGGREVVVGTVRDKAM+KKNN FGIL SP+SMTSMALTDVSDEVSRNGAWVGDRLPKW
Sbjct: 181 RSLGGREVVVGTVRDKAMAKKNNHFGILDSPLSMTSMALTDVSDEVSRNGAWVGDRLPKW 240
Query: 283 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 342
WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN
Sbjct: 241 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 300
Query: 343 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 402
TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS
Sbjct: 301 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 360
Query: 403 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 462
AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL
Sbjct: 361 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 420
Query: 463 KVEQRECLMNMFISVCGKDSHRTAAEALGLNRTLT 497
KVEQRECLMNMFISVCGKDSHRTAAEALGL +T
Sbjct: 421 KVEQRECLMNMFISVCGKDSHRTAAEALGLADYIT 455
BLAST of Cp4.1LG08g05870 vs. ExPASy TrEMBL
Match:
A0A6J1F2A1 (uncharacterized protein LOC111439130 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111439130 PE=4 SV=1)
HSP 1 Score: 874 bits (2259), Expect = 0.0
Identity = 443/450 (98.44%), Positives = 446/450 (99.11%), Query Frame = 0
Query: 43 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 102
METTPTLCSFPPYPFTSH HYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT
Sbjct: 1 METTPTLCSFPPYPFTSHLHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 60
Query: 103 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 162
IQSPLESSSDSKLQT+IEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA
Sbjct: 61 IQSPLESSSDSKLQTIIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 120
Query: 163 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 222
FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFL RVLIGIGSRFRNQFSYGYTSPVVRRD
Sbjct: 121 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLGRVLIGIGSRFRNQFSYGYTSPVVRRD 180
Query: 223 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 282
RSLGGREVVVGTVRDKAM+KKNN FGIL SP+SMTSMALTDVSDEVSRNGAWVGDRLPKW
Sbjct: 181 RSLGGREVVVGTVRDKAMAKKNNHFGILDSPLSMTSMALTDVSDEVSRNGAWVGDRLPKW 240
Query: 283 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 342
WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN
Sbjct: 241 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 300
Query: 343 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 402
TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS
Sbjct: 301 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 360
Query: 403 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 462
AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL
Sbjct: 361 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 420
Query: 463 KVEQRECLMNMFISVCGKDSHRTAAEALGL 492
KVEQRECLMNMFISVCGKDSHRTAAEALGL
Sbjct: 421 KVEQRECLMNMFISVCGKDSHRTAAEALGL 450
BLAST of Cp4.1LG08g05870 vs. ExPASy TrEMBL
Match:
A0A6J1F2P4 (uncharacterized protein LOC111439130 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439130 PE=4 SV=1)
HSP 1 Score: 874 bits (2259), Expect = 0.0
Identity = 443/450 (98.44%), Positives = 446/450 (99.11%), Query Frame = 0
Query: 43 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 102
METTPTLCSFPPYPFTSH HYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT
Sbjct: 1 METTPTLCSFPPYPFTSHLHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 60
Query: 103 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 162
IQSPLESSSDSKLQT+IEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA
Sbjct: 61 IQSPLESSSDSKLQTIIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 120
Query: 163 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 222
FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFL RVLIGIGSRFRNQFSYGYTSPVVRRD
Sbjct: 121 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLGRVLIGIGSRFRNQFSYGYTSPVVRRD 180
Query: 223 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 282
RSLGGREVVVGTVRDKAM+KKNN FGIL SP+SMTSMALTDVSDEVSRNGAWVGDRLPKW
Sbjct: 181 RSLGGREVVVGTVRDKAMAKKNNHFGILDSPLSMTSMALTDVSDEVSRNGAWVGDRLPKW 240
Query: 283 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 342
WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN
Sbjct: 241 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 300
Query: 343 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 402
TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS
Sbjct: 301 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 360
Query: 403 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 462
AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL
Sbjct: 361 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 420
Query: 463 KVEQRECLMNMFISVCGKDSHRTAAEALGL 492
KVEQRECLMNMFISVCGKDSHRTAAEALGL
Sbjct: 421 KVEQRECLMNMFISVCGKDSHRTAAEALGL 450
BLAST of Cp4.1LG08g05870 vs. ExPASy TrEMBL
Match:
A0A6J1I3V3 (uncharacterized protein LOC111470737 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111470737 PE=4 SV=1)
HSP 1 Score: 860 bits (2223), Expect = 8.30e-313
Identity = 435/455 (95.60%), Positives = 445/455 (97.80%), Query Frame = 0
Query: 43 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 102
METTPTLCSFPPYPFT HRHYR LRCRIYSPEPRQLFTTLSCFKPRRRPR+KNKLAKFHT
Sbjct: 1 METTPTLCSFPPYPFTPHRHYRQLRCRIYSPEPRQLFTTLSCFKPRRRPRQKNKLAKFHT 60
Query: 103 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 162
IQSPLESSSDSKLQTVIEIDQFTAEASSLVY VYYYSRSQFRQFLSSGLDAFHDLRTLIA
Sbjct: 61 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYFVYYYSRSQFRQFLSSGLDAFHDLRTLIA 120
Query: 163 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 222
FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD
Sbjct: 121 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 180
Query: 223 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 282
RSLGGREVVVGTVRDKA++KKNN FGIL SP+SMTSMALTDVSDEVSRNGAWVGDRLPKW
Sbjct: 181 RSLGGREVVVGTVRDKAVAKKNNHFGILDSPLSMTSMALTDVSDEVSRNGAWVGDRLPKW 240
Query: 283 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 342
WPPAVPRRISTANRQ YQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN
Sbjct: 241 WPPAVPRRISTANRQEYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 300
Query: 343 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 402
TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGI+NTRAARI+S
Sbjct: 301 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIQNTRAARIIS 360
Query: 403 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 462
AAVAARMRSCFLQAWALVMQDRHSEADAELLK+CH+VQ FPPDESSPEMEMLTLGLKKHL
Sbjct: 361 AAVAARMRSCFLQAWALVMQDRHSEADAELLKMCHIVQIFPPDESSPEMEMLTLGLKKHL 420
Query: 463 KVEQRECLMNMFISVCGKDSHRTAAEALGLNRTLT 497
KVEQRECLMNMFISVCGKDSH+TAAEALGL +T
Sbjct: 421 KVEQRECLMNMFISVCGKDSHKTAAEALGLADYIT 455
BLAST of Cp4.1LG08g05870 vs. ExPASy TrEMBL
Match:
A0A6J1I516 (uncharacterized protein LOC111470737 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470737 PE=4 SV=1)
HSP 1 Score: 860 bits (2222), Expect = 1.66e-312
Identity = 434/450 (96.44%), Positives = 443/450 (98.44%), Query Frame = 0
Query: 43 METTPTLCSFPPYPFTSHRHYRHLRCRIYSPEPRQLFTTLSCFKPRRRPRRKNKLAKFHT 102
METTPTLCSFPPYPFT HRHYR LRCRIYSPEPRQLFTTLSCFKPRRRPR+KNKLAKFHT
Sbjct: 1 METTPTLCSFPPYPFTPHRHYRQLRCRIYSPEPRQLFTTLSCFKPRRRPRQKNKLAKFHT 60
Query: 103 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYYYSRSQFRQFLSSGLDAFHDLRTLIA 162
IQSPLESSSDSKLQTVIEIDQFTAEASSLVY VYYYSRSQFRQFLSSGLDAFHDLRTLIA
Sbjct: 61 IQSPLESSSDSKLQTVIEIDQFTAEASSLVYFVYYYSRSQFRQFLSSGLDAFHDLRTLIA 120
Query: 163 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 222
FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD
Sbjct: 121 FDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVRVLIGIGSRFRNQFSYGYTSPVVRRD 180
Query: 223 RSLGGREVVVGTVRDKAMSKKNNRFGILGSPISMTSMALTDVSDEVSRNGAWVGDRLPKW 282
RSLGGREVVVGTVRDKA++KKNN FGIL SP+SMTSMALTDVSDEVSRNGAWVGDRLPKW
Sbjct: 181 RSLGGREVVVGTVRDKAVAKKNNHFGILDSPLSMTSMALTDVSDEVSRNGAWVGDRLPKW 240
Query: 283 WPPAVPRRISTANRQGYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 342
WPPAVPRRISTANRQ YQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN
Sbjct: 241 WPPAVPRRISTANRQEYQIEADRLVRALVDSRMSGRDFMEDDILHLRQICRMSGVKVSFN 300
Query: 343 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIENTRAARIVS 402
TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGI+NTRAARI+S
Sbjct: 301 TENMRDSFYRASVDSVFNIYSRTPINPNSVLINGENGPSFLAGLAEDIGIQNTRAARIIS 360
Query: 403 AAVAARMRSCFLQAWALVMQDRHSEADAELLKICHVVQTFPPDESSPEMEMLTLGLKKHL 462
AAVAARMRSCFLQAWALVMQDRHSEADAELLK+CH+VQ FPPDESSPEMEMLTLGLKKHL
Sbjct: 361 AAVAARMRSCFLQAWALVMQDRHSEADAELLKMCHIVQIFPPDESSPEMEMLTLGLKKHL 420
Query: 463 KVEQRECLMNMFISVCGKDSHRTAAEALGL 492
KVEQRECLMNMFISVCGKDSH+TAAEALGL
Sbjct: 421 KVEQRECLMNMFISVCGKDSHKTAAEALGL 450
BLAST of Cp4.1LG08g05870 vs. TAIR 10
Match:
AT2G43235.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 41 Blast hits to 41 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )
HSP 1 Score: 300.4 bits (768), Expect = 2.9e-81
Identity = 183/419 (43.68%), Positives = 265/419 (63.25%), Query Frame = 0
Query: 79 FTTLSCFKPR-RRPRRKNKLAKFHTIQSPLESSSDSKLQTVIEIDQFTAEASSLVYSVYY 138
F+T +PR RR R+ N + HT + L S+ S D + + V+ +
Sbjct: 32 FSTPKRRRPRPRRNRKSNGSSYDHTDGNLLSISTSSPSGA----DDQSLSLTLDVHRIST 91
Query: 139 YSRSQFRQFLSSGLDAFHDLRTLIAFDDQNRTLTVSCRRSTVEFLGQLVLFSFVVVFLVR 198
+ +F+ FL S DAF DL+TLI+ DD NR + VSC++ST++F+G +V+ FV F +R
Sbjct: 92 LANYRFQLFLDSSKDAFSDLQTLISLDD-NRRVVVSCKKSTMQFVGGVVILGFVFGFAIR 151
Query: 199 VLIGIGSRFRNQFSYGYTSP--VVRRDRSLGGREVVVGTVRDKAMSKKNNRFGILGSPIS 258
VL+ +GS + F ++P VVRRDRSLGG+EVVV ++ S+ + F I S
Sbjct: 152 VLVKLGSALKGNFQ---SNPKFVVRRDRSLGGKEVVVSVDNIRSSSRDSKSF-IASDQAS 211
Query: 259 MTSMALTDVSDEVSRNGAWVGDRLPKWWPPAV-PRRISTANRQGYQIEADRLVRALVDSR 318
++ ++ + N LPKWWP ++ + +++ YQ EA+R+VRA+VD+R
Sbjct: 212 RSNSTPRNLHLKAQNN-------LPKWWPTSLTSQSFDVVDKEDYQREANRIVRAIVDNR 271
Query: 319 MSGRDFMEDDILHLRQICRMSGVKVSFNTENMRDSFYRASVDSVFNIYSRTPINPNSVLI 378
SG+D +DDI+ LR++CR+SGV+V+F +N DSFYR S+D V N SR P +SV I
Sbjct: 272 TSGKDITDDDIIQLRRVCRISGVQVTFEPKNTGDSFYRTSIDFVLNACSRAPWESSSVEI 331
Query: 379 NGENGPSFLAGLAEDIGIENTRAARIVSAAVAARMRSCFLQAWALVMQDRHSEADAELLK 438
E+ F+AGLAE+IG+ AAR+VSAAVAAR RS FLQAWAL +Q +HSE+ AEL K
Sbjct: 332 CSEDAREFIAGLAENIGLAKIDAARMVSAAVAARTRSWFLQAWALEIQGKHSESVAELSK 391
Query: 439 ICHVVQTFPPDESSPEMEMLTLGLKKHLKVEQRECLMNMFISV-CGKDSHRTAAEALGL 493
IC + + FPP+E S EMEM+ GL+K +K+E+R+ L+ F+ + C +DS R+AAEALGL
Sbjct: 392 ICLIHRIFPPNEYSAEMEMVARGLEKLMKLEERQSLLKTFVGMCCSEDSQRSAAEALGL 434
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023539150.1 | 0.0 | 99.12 | uncharacterized protein LOC111799886 isoform X3 [Cucurbita pepo subsp. pepo]
XP_023539149.1 | 0.0 | 100.00 | uncharacterized protein LOC111799886 isoform X2 [Cucurbita pepo subsp. pepo]
XP_022932631.1 | 0.0 | 97.58 | uncharacterized protein LOC111439130 isoform X2 [Cucurbita moschata]
KAG6597642.1 | 0.0 | 98.67 | hypothetical protein SDJN03_10822, partial [Cucurbita argyrosperma subsp. sorori...
XP_022932633.1 | 0.0 | 98.44 | uncharacterized protein LOC111439130 isoform X3 [Cucurbita moschata] >XP_0229326...
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1EWV7 | 0.0 | 97.58 | uncharacterized protein LOC111439130 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1F2A1 | 0.0 | 98.44 | uncharacterized protein LOC111439130 isoform X3 OS=Cucurbita moschata OX=3662 GN...
A0A6J1F2P4 | 0.0 | 98.44 | uncharacterized protein LOC111439130 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1I3V3 | 8.30e-313 | 95.60 | uncharacterized protein LOC111470737 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1I516 | 1.66e-312 | 96.44 | uncharacterized protein LOC111470737 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G43235.1 | 2.9e-81 | 43.68 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...