Cp4.1LG02g08670 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG02g08670
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Localized to the inner membrane of the chloroplast
Location: Cp4.1LG02: 128035 .. 136358 (+)
RNA-Seq Expression: Cp4.1LG02g08670
Synteny: Cp4.1LG02g08670
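
The Location field packs the linkage group, start, end, and strand into a single string. A minimal Python sketch for splitting it apart is given here; the function name `parse_location` is hypothetical (not part of any database API), and the sketch assumes the `seqid: start .. end (strand)` layout shown in this record with 1-based, inclusive coordinates.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: both coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG02: 128035 .. 136358 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the reported span covers 8,324 positions on the plus strand of Cp4.1LG02.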
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TAGGATTGACATCAGATCACAGGGCAGCAAAAAAGAGCTCAAGGGATTATCGTGATCGCAAAGCACGGAATAATTTCCTCAAGGCCATCGATATTATCGAACAAGAAAAATATTTTCCTAGTAATAATCTGTTTCCCGTTTCCCGTTTCCACGTTCACCTCCAAACCAAAACCGCACTCTCTCGCTATCTTCTGTTCATCTTTGGGACTCAACACAACGATGTCTGCTATATCCAATTCGTTGATTTTAACGAAAACCCCTCAACTCCAGTTATCAGCTGGTAATCCTCACTCTCTGATTTCTTCAGCATTTGATCTTGTGTTAGATTTTCGTTGTTAAGTACACAAGCTCCCCATTTGTTATTTCTTATCCTGTATTTTTTTTCTTGTATCGATTCAATTTTAGCTGTTGCTCTTGAGTTTCCTTTCCAATTTACACATCTTAGTAACACTTCTCTCTCCGGTTTAATTTTGGGGATGATCTTGTAGTGCGGGAGATTTAGGACTTGTGTTCTCGATTTTTCCATCGCGAAATGCCCTCATTTCCATCGGTAGATATCGGGTTTTCTTCTAATTTGTATATTGAGTTACAGCGAGCTTCAAATGTTAGTTACCTGCTACTTTTGACCATCCTCTTAGCTCATAAATCGGTTTTGTGTTCTATGTAGAAACGGCCTTTTCTAGGCGAAATTTGCGAGGGGAATGTGTATTTTCCTCCATTTTGGTTCCGTTTATGGTGGGCTGTTTATTCTTTCTTGATGATGTCTAAAGGCGCTATTAATCTTATCGTTTTTATTACATCGTGATGATGTCTTGATTTCCAAATTAATGAAATTTTTTGTAATTGTTGGAAGATTGAATTATGACTAGTAGGTAGTTGCTCTTATTTTCATAATTGGGAAGAACTAAGGAGAGTGCACATGTATGTAACCATAAATTGTATTTGTCTAAGTGTACTCTCGCCAATTTTCCTGCCAGGATCAAATCTGAAAACTGCTGATAAACGATTTGGGAGTGTCTCCCCTACCCGCTTGTTACTCAATCCTGGTTGGGTGGGAAAGGCCAAGCTTTCTACATTTAGGCGGCCACTTTCAGTTCAAGCTGCATATAGGTATAATCTTTGAAATCACTACTAAAACAATATGCCATTTAGTCATCTCATATTATAGCATTACAGTCTAGGGAGTCTATTTTTTGAAGAAATTATTATCGATCACTACGTTGAGAAACTATTTACGTGGTAGGAATGCATGTTTAGAATCTTTGTGTGGGTAGGCTTATCATTGGGGTGCACAATAACTGGAAAAGGAACTGCCGTGCTGAGCTGAACTGAATTTCTTAATCTCAGGTAGTCAACAACCATGAACATAACTAATTTTTAGCTGGTTCAAACAGATTTGCAACCAAAGGATGAGCCTGTTTGGTTTGCTATGCTCGAAACTGAGTCTCGTTGATTTGGTCAAAGACTGATCAAATCAACATAATGAAGGCATATAGGTTTCCTACCCAAAAGAAGGGGAAAAAATAATAATGAATGTATGTACTCATCCCATTTTAGATAACAAATTCAAATGAAACAGTTGTTGAATTTGATCTGTTTACTCTGTATGATTACGGTCTGGATTTGTGCCTCCCTATGAGATTTGGCCTTTTGCTCCTAAAATATTTTGGGGGATCAGTGTTTAGAGGCAAGGCCAACATTTTGTGGAATTGTAGTCCATTCTCTCCTTTGGTTAATTTGGAAGGAGAGAACAAATAGGATTTTTAAAGTCAAGGAGACTACGGTGGATTCTTTTTGTAATCGTGTACAACACTTAGCATTGTGGTGGTGTTCCAAACACACGAGCTTTTTTTTTGGTTGTAATTTTAGTCTTTTGATGATTACTAATGATTGGAAGGCTCCAATGGTCTATTTTCGTGAGGGGGATTTTTCTTCCCGCACTCAGGTTGTTTCTCTCTCTCAATCTGTCTATAAAGCATCATTCATCATCCCCAAGCCTA
GATTGTTTCTCTATCTCTTTGTTTATAAAGCCTCATTTCTTATACCAAAAAATTACTAATTAATAAATAAACAATAACTCTTAGGGTTGAATAATGTCTAGTCTGTAACAATTGTTAATAAAGCTTTTCCGAGTGGAAACTAATAATAGTATTCCTATATGTTTCAAATGGACATAGCACAAGTTGATGTCTATTTTATTGTATATGTCTTGATGGGTCCGTCCTATATGCTTCATTTGAAATACTCTGACGGCAGTTTATTATGCACCTTGTGGCAGTGATGGTGGACGGTCAAGCAGTGCAGGCATTTTTATTGGAGGTTTTGTATTGGGAGGGCTCGTAGTTGGTACGCTTGGCTGTGTATATGCCCCTCAGGTTTTTAACTTTTTTCATCTCTTATCTAACCTCTAAATTGGACATATGGGAAATTTCACCTAGTATATCTGTTACTTGTGTGCTATGGTTACTATCTTAGGGAAGGTATGTAGAAATATGCACAACTATTGTCTCTGAGCTTCAAGTGCAGAATCCTCGTCTTGTTATGACTGAGAGTTGATCTTGTGTTTGAATTATAGATTAGCAAGGCACTTGCTGGAACAGACCGAAAAGATTTGATGAGGAAACTTCCCAAGTTCATATATGATGAAGAAAAAGCCTTAGAGGTGCAAAATCACTCATCTCTTTTTATTTGTTGCGAGTGATCTTGTGAATGAACATTTCATCTTCACTTGACATTCATTTACCCAATCGTGGATTTTATTCGTAACATTGCACTTTCTTCTTAAATACCATGAGTTGGCAGAATGATAATTGGAGCTTGGGAAAGAAAAGGGTTAAGAGGGAATGAGTTCAAATCATGTGGGCACCTACGTATGATTTTAAAATTCTGTTAGTTTTCTTGGCAACCACTTGAACTAGGATCAAGTAGTTGTCCCATGAGATAAGGTGGCTCGAACATTTGCAGATATAAAATAATAATAATAATAGTAATAAATATATATTTTTTCCAACCTTCATCAACTTCTCATCAAAGATGCTAAAATGTGTTCGTGTAACTTCAGTTTTGACAATTCTGCGTGGGGGTATTTGAATTTTGCATAAAATTTGTGTAGTATAAGGTCAAAAGAGTACATAAACTGGAATACGAATTTATATGTATGGATTTACTTATCCTGGGATACGAGTTTTCTAATTTATGGAGCACATAAACTCATATCACATGAATCTAAAACCTTAAATATCCCATCATCAATAATAGTGAAAATAANAAATATACAGATTACTGATTCAATTGCACAATTCAAATAAGCTTATCGACTAAATATCCCACTTTCATTTGGTTTTTGAGATATGGTTTGTCGGTATTTAGTTTAAAATATGACTATAATATGTTGTATGTATGTAGTCTAGAAGATTAGCAAGCTTGTGGATCAGGATAAGCAAATCTACACCCATCTAGATTATACAAAACTAATCTATGAACTAAACACAGGGCCCTTTCCAGTAGGATTTGTCTTAAATTTTTAGATAACAATATTTTGTTTTTGATTCTGGATTTTTTTCTTTTCAACTCTGAACTTTCTTTATAGAAGAAATACAATATTACGTGCTTTTCTTTAACAGTTCCTGTTCGGTTTTTGATGTGCGAACATGTACTTGCTTTTTATTGAAATTGCAGAAAACAAGAAAAGTGCTGGCACAGAAGATTGAACAGTTGAACTCTGCCATTGATGAGGTTTCTTCTCAGCTCCGCACAGAGGATTCCCCAAATGGAGTGGCCGTAAACTCTGATGAAGTTGAACCTGCCATTTGAAAGTATCGTTACCTTGGTTTTCACCGCCCACGATTCGGGGACTAAGAATAATAGTTAATTTTCTAGCTATTATCTTTGATTTTCTGTCTTAATACGTTTGGAACTGAACTTTGATGTGCATTACACATCATCGCGTAATGTGGTGCTATGATCACTCAGATGCTCCTTCTAAAGTTGAATAGTTTGT
AGAAGAGCATTCTTGCTGTTTCCTACTTTGTAAACGAACAGCCATTGTTTTGTATTTGAAAATCACATATTTCATTCCTTCCTCTTATTTGCATTTTTTAACTTGTCTGGCAGTCTTAAGCCAAATGATCCCTTGTTGCTGGGGCGTCCATTTTGTCTTCCAGCTAAAAACTAAATCTTAACAGGGAAAAGCGAGGTAGCAAAGCTTAATATTTGGTTTTTGGCGAGATTCTGGAGATTACAGGGGAAGAAATAGTGGAACATTGGAGGAGAAAAAAACCAAAGAATGAAACGTTATTATTTAAGCAAAAGTCAGGTTGGATCGAGTGAAGGCACTGGCCAATGTCTTCTCTGTATTTGCTGTTCCAAGCCAAGGGCGAAAATGACGTGGATGGATAGGCTATGACTCATTAACTTGGAACAATGAGAGGCAGACATTAGAGAGGGAAACGTCGGTGACTTGTCAGACTGAACAGGTGCCAGCTTGGACTACTACCAACAATCCTCTTGTAACAACATATAGCTTGAGGGCATTATTGGAAAGTTACATAGGTGAAAGATCTGTGGTTCAGAGGCTTCCTGAAATTGATAGGCGAAAAAGACTGCCTTATCTTATGTATCTGCCCCTCGCAGCTTTCTCACCACAAAACACTCTCTAGATCACTTGATTGCAAGTTGCCACAGTCCTTCCATCTATGATTAAAAAAGGGCAGCTTCTGTATAACCCAAATGCCAATTTCAGAGTTCTACAATTGATATACATTTGTGAAGTGGAGTTACGTAGCCCCTGAGTTAGCAGCCATGGCCACTCACGCAGCTCTGGCTTCAACAAGGATCCCCACCAACACAAGGCTTCCTTCCAAGACTTCCAATTCTTTCCGACCTCGATGCTCATCCAAGGTTTGTCTCTTCATCTCTCAAACTACTTAATGGTAGCTGATACTTCATTAAATTGTCCTCATAATTTAGTTTTGAATTTGCAATGTGCAGAGGCTGGATGTTGCTGAGTTCAATGGGCTTAGATCTGCTTGCCTGACCTTCTCTAACAATGGCAGAGAAGGATCGCTCTTCGACGTCGTGGCTGCTCAACTGACTCCCAAGGTTCTTCCTTTTCCCTTCGTTTTGTGGTTTAAAATTCTAGATGATTTCATGGGATGAAAAGAGATCGCAGTAGTTTGAGAAATTTGATATAGTTTGAAGAAATCTATATGTTTGACTAATCTTAGTAATTATGAGTAACTCATCTTGATCGTGGTTCTTACTAACAGAATAGGAACATTTTCTTCATACAGGCAGTAGGATCATTTCCGGTTAGGGGAGAAACAGTGGCTAAACTCAAGGTGGCAATCAATGGATTTGGACGTATTGGCCGAAACTTCCTGCGGTGCTGGCATGGCAGGAAAGACTCGCCTCTCGATGTTGTCGTTGTCAACGACAGTGGTGGTGTCAAGAATGTAAGCAATATATATCTTCTCAATACATTGATCAAGTGCCATTTTCAAGGATTAATCTACTTGAACATAGTTAGCAAAAGTCTCGCCACTTAAGATGAATTATACCAAAAATTTCAGGCCTCCCACTTGCTAAAATATGATTCCATGCTGGGAACTTTCAAGGCAGAGGTGAAAATTGTAGACAATGAAACCATATCTGTTGATGGGAAGCCAATCAAGGTTGTCTCAGGCAGGGACCCCCTCAAGCTTCCTTGGGGCGAGTTAGGCATTGACATTGTTATTGAGGTAGATTTAAGTCCATCCCAATCAATTATCAGATTATCCGCCCCGATCAATACCAATATCCATGGCCTTCGCAGGGTACAGGAGTGTTCGTGGATGGCCCAGGGGCAGGGAAACACATCCAAGCAGGTGCAAAGAAGGTTCTCATCACTGCTCCTGCAAAAGGTGCTGATATTCCAACATATGTTGTTGGGGTAAACGAAAAAGATTACTACCATGATGTTGCTAACATCATTAGGTCAGTAACAGTTTCATTCTTGGAGCT
GAATTTGTTTTGACCTTTGAGCTGCTAAGTAATTGTTTTTGGTTGGTTAGTAAGCTTCTGATTGTTGAAGTTTTTAGCTATTGCTCTTAAACTTTTATGCAGCAATGCTTCTTGCACCACCAATTGCCTTGCTCCTTTTGTGAAGATCATTGATGAGGAATTTGGTAGGCAATCAGTTTCATTTTGGGCAACGAGCATTGTTTCCCGATTTTAACCACTTGGTGTTTGATTGAATTCAGGCATTGTGAAGGGAACGATGACAACCACTCACTCCTACACTGGAGACCAGGTAAAGAAAATATATGAACAATTAATTACATTATGAATGTGTGTAGAATCCTATGGCATTGATGATTTTTGAGATAGAGGAAATCTTATAACCTCGAGTCACGTTTTGGGGGTTATGCAGAGACTTTTGGATGCATCACACCGTGACTTGAGACGAGCCAGGGCCGCAGCTTTGAACATTGTCCCCACAAGCACTGGTGCAGCAAAGGCCGTATCCCTTGTGCTGCCCCAGCTTAAGGGCAAGCTCAATGGTATTGCACTTCGTGTGCCTACTCCAAATGTTTCAGTTGTTGACCTCGTTGTGAACGTTGCGAAGAAAGGCATCTCCGCAGATGATGTCAATGCTGCCTTCAGAAAGGCAGCTGATGGACCATTGAAGGGCGTGCTAGCCGTGTGCGATATTCCTCTTGTTTCTGTGGACTTTAAGTGCACTGATGTTTCCTCCACAATTGACTCTTCATTAACAATGGTAATGGGAGATGATATGCTCAAGGTGGTCGCCTGGTACGACAATGAATGGGGATACAGGTGAGGACTTTCATTCATGGGAATGGATGATTGATTCCCTTCAACATTATAAACCAACGTATATGATGTTTCATTAACTTTTTGTTGTCTGCAGCCAAAGAGTTGTGGACTTGGCTCATTTGGTGGCAAGCAAATGGCCGGGAGTGGGATCGGGAACAAGTGGAGATCCGTTGGAGGATTTCTGCCAGACTAACCCAGCTGATGAAGAGTGCAAAGTTTATGAAGCTTAGGCATCATCATGTCTTTTGTTGGTTGCTTTGAGTCTAGGTCGTTTTTTAGCATTGCTGTACAATGCTTGTTTAGAATTCAAATTCAAGGAAGAGACTTTTTATCTATCCATTTGTAAAATTTGAAGACAGTGTTCTAAAGCTGCAATAACCTTTGAAAAGGGACAAAAATGAAGGAAAAACATGAACCACCCCCTTTACTACACCTTTTTCCCTCATATTTCCTCTGCAAATTAGTCCATTTTGTTCTTTTAGATGCTTTCCTCTGATTTCTTGTTCTAAACTTGAAGTGTACAAACTGGATTTAGACAGACTTAAATGTATTGATGAGGAGACGAAATTAATAAACCTGGCTGGTTGCTTCAATTTCTGTCTACAAACTTCTCAAGGATTTACCTCCAAATTACCATCATTCTTATCATTTTCGGACGATTTTTCAACTTTCCTTCGTTTGCTTTGCCTTGGTTCTTTCTTCTCATCAAACAACGACATGATTCGTCGAAGACGGTTCGGATGCGATCCACTACTGCTAGACAGATCAGATTCTTTTGAATCAGAATGGTATAGCTGACGAATAATCTGAAAGCAGTATATGGCAAGGGCAATGAAACACACTGTCCCACAGATAAGAAAGAGACCCCAGAAGCTCTTAAGTTGAAGCGAGTCTGATTCGAGATCTGTACTGTCTGAGGTGCAAGCACTTTTCACAACCCATTTGTCGTGTATCCGTTGCAGATCGCCGTTCTCGGATAGCTGCAAAATGGCAGTTGACATGTCTACAGCCAATGGAGAGTCTCGTGGGAATGCCTGAATTACCAAAAGGAGCAACAGGAGAATAGTTATTGATGTGTTCAATTTATCAAATACTAAATAAGAACGAAAACACTCACGAAACCCCAGCCGCTTTTGGTAAACTCTTGACCAACAACTCTGAATTTACACTCTCTAGAGATGAAA
TTTTCTACATATAGAAGTTCATCAACTACAGCAGCAACGCCTCCCTCCTTGCCAGGGCCAAGATCAAGTGCCTTGGCATATTCTTCCGGTGATCCAAGAGGAATAAGCCTAGATCTAGACACGTTCAGCTCCTCACTCAGATAACGTTCAGCAAAAGATCCAACTTGGAAACCAATCGGTTCATCAATTTCCCTCAAGGTTTCTATTCCTGTGATTGGAGAATATAGCTGCTGCACCGTGAGAATGGATGTTAAGCTTGCAGTGTAGCTAGAATTTATAATCAAAACCACAAACAGCCATATGATCAGCACGAAGCGGCCGAGA

mRNA sequence

TAGGATTGACATCAGATCACAGGGCAGCAAAAAAGAGCTCAAGGGATTATCGTGATCGCAAAGCACGGAATAATTTCCTCAAGGCCATCGATATTATCGAACAAGAAAAATATTTTCCTAGTAATAATCTGTTTCCCGTTTCCCGTTTCCACGTTCACCTCCAAACCAAAACCGCACTCTCTCGCTATCTTCTGTTCATCTTTGGGACTCAACACAACGATGTCTGCTATATCCAATTCGTTGATTTTAACGAAAACCCCTCAACTCCAGTTATCAGCTGGATCAAATCTGAAAACTGCTGATAAACGATTTGGGAGTGTCTCCCCTACCCGCTTGTTACTCAATCCTGGTTGGGTGGGAAAGGCCAAGCTTTCTACATTTAGGCGGCCACTTTCAGTTCAAGCTGCATATAGTGATGGTGGACGGTCAAGCAGTGCAGGCATTTTTATTGGAGGTTTTGTATTGGGAGGGCTCGTAGTTGGTACGCTTGGCTGTGTATATGCCCCTCAGATTAGCAAGGCACTTGCTGGAACAGACCGAAAAGATTTGATGAGGAAACTTCCCAAGTTCATATATGATGAAGAAAAAGCCTTAGAGAAAACAAGAAAAGTGCTGGCACAGAAGATTGAACAGTTGAACTCTGCCATTGATGAGGTTTCTTCTCAGCTCCGCACAGAGGATTCCCCAAATGGAGTGGCCGTAAACTCTGATGAAGTTGAACCTGCCATTTGAAAGTATCGTTACCTTGGTTTTCACCGCCCACGATTCGGGGACTAAGAATAATAGTTAATTTTCTAGCTATTATCTTTGATTTTCTGTCTTAATACGTTTGGAACTGAACTTTGATGTGCATTACACATCATCGCGTAATGTGGTGCTATGATCACTCAGATGCTCCTTCTAAAGTTGAATAGTTTGTAGAAGAGCATTCTTGCTGTTTCCTACTTTGTAAACGAACAGCCATTGTTTTGTATTTGAAAATCACATATTTCATTCCTTCCTCTTATTTGCATTTTTTAACTTGTCTGGCAGTCTTAAGCCAAATGATCCCTTGTTGCTGGGGCGTCCATTTTGTCTTCCAGCTAAAAACTAAATCTTAACAGGGAAAAGCGAGGTAGCAAAGCTTAATATTTGGTTTTTGGCGAGATTCTGGAGATTACAGGGGAAGAAATAGTGGAACATTGGAGGAGAAAAAAACCAAAGAATGAAACGTTATTATTTAAGCAAAAGTCAGGTTGGATCGAGTGAAGGCACTGGCCAATGTCTTCTCTGTATTTGCTGTTCCAAGCCAAGGGCGAAAATGACGTGGATGGATAGGCTATGACTCATTAACTTGGAACAATGAGAGGCAGACATTAGAGAGGGAAACGTCGGTGACTTGTCAGACTGAACAGGTGCCAGCTTGGACTACTACCAACAATCCTCTTGTAACAACATATAGCTTGAGGGCATTATTGGAAAGTTACATAGGTGAAAGATCTGTGGTTCAGAGGCTTCCTGAAATTGATAGGCGAAAAAGACTGCCTTATCTTATGTATCTGCCCCTCGCAGCTTTCTCACCACAAAACACTCTCTAGATCACTTGATTGCAAGTTGCCACAGTCCTTCCATCTATGATTAAAAAAGGGCAGCTTCTGTATAACCCAAATGCCAATTTCAGAGTTCTACAATTGATATACATTTGTGAAGTGGAGTTACGTAGCCCCTGAGTTAGCAGCCATGGCCACTCACGCAGCTCTGGCTTCAACAAGGATCCCCACCAACACAAGGCTTCCTTCCAAGACTTCCAATTCTTTCCGACCTCGATGCTCATCCAAGAGGCTGGATGTTGCTGAGTTCAATGGGCTTAGATCTGCTTGCCTGACCTTCTCTAACAATGGCAGAGAAGGATCGCTCTTCGACGTCGTGGCTGCTCAACTGACTCCCAAGGCAGTAGGATCATTTCCGGTTAGGGGAGAAACAGTGGCTAAACTCAAGGTGGCAATCAATGGATTTGGACG
TATTGGCCGAAACTTCCTGCGGTGCTGGCATGGCAGGAAAGACTCGCCTCTCGATGTTGTCGTTGTCAACGACAGTGGTGGTGTCAAGAATGCCTCCCACTTGCTAAAATATGATTCCATGCTGGGAACTTTCAAGGCAGAGGTGAAAATTGTAGACAATGAAACCATATCTGTTGATGGGAAGCCAATCAAGGTTGTCTCAGGCAGGGACCCCCTCAAGCTTCCTTGGGGCGAGTTAGGCATTGACATTGTTATTGAGGGTACAGGAGTGTTCGTGGATGGCCCAGGGGCAGGGAAACACATCCAAGCAGGTGCAAAGAAGGTTCTCATCACTGCTCCTGCAAAAGGTGCTGATATTCCAACATATGTTGTTGGGGTAAACGAAAAAGATTACTACCATGATGTTGCTAACATCATTAGCAATGCTTCTTGCACCACCAATTGCCTTGCTCCTTTTGTGAAGATCATTGATGAGGAATTTGGCATTGTGAAGGGAACGATGACAACCACTCACTCCTACACTGGAGACCAGAGACTTTTGGATGCATCACACCGTGACTTGAGACGAGCCAGGGCCGCAGCTTTGAACATTGTCCCCACAAGCACTGGTGCAGCAAAGGCCGTATCCCTTGTGCTGCCCCAGCTTAAGGGCAAGCTCAATGGTATTGCACTTCGTGTGCCTACTCCAAATGTTTCAGTTGTTGACCTCGTTGTGAACGTTGCGAAGAAAGGCATCTCCGCAGATGATGTCAATGCTGCCTTCAGAAAGGCAGCTGATGGACCATTGAAGGGCGTGCTAGCCGTGTGCGATATTCCTCTTGTTTCTGTGGACTTTAAGTGCACTGATGTTTCCTCCACAATTGACTCTTCATTAACAATGGTAATGGGAGATGATATGCTCAAGGTGGTCGCCTGGTACGACAATGAATGGGGATACAGCCAAAGAGTTGTGGACTTGGCTCATTTGGTGGCAAGCAAATGGCCGGGAGTGGGATCGGGAACAAGTGGAGATCCGTTGGAGGATTTCTGCCAGACTAACCCAGCTGATGAAGAGTGCAAAGTTTATGAAGCTTAGGCATCATCATGTCTTTTGTTGGTTGCTTTGAGTCTAGGTCGTTTTTTAGCATTGCTGTACAATGCTTGTTTAGAATTCAAATTCAAGGAAGAGACTTTTTATCTATCCATTTGTAAAATTTGAAGACAGTGTTCTAAAGCTGCAATAACCTTTGAAAAGGGACAAAAATGAAGGAAAAACATGAACCACCCCCTTTACTACACCTTTTTCCCTCATATTTCCTCTGCAAATTAGTCCATTTTGTTCTTTTAGATGCTTTCCTCTGATTTCTTGTTCTAAACTTGAAGTGTACAAACTGGATTTAGACAGACTTAAATGTATTGATGAGGAGACGAAATTAATAAACCTGGCTGGTTGCTTCAATTTCTGTCTACAAACTTCTCAAGGATTTACCTCCAAATTACCATCATTCTTATCATTTTCGGACGATTTTTCAACTTTCCTTCGTTTGCTTTGCCTTGGTTCTTTCTTCTCATCAAACAACGACATGATTCGTCGAAGACGGTTCGGATGCGATCCACTACTGCTAGACAGATCAGATTCTTTTGAATCAGAATGGTATAGCTGACGAATAATCTGAAAGCAGTATATGGCAAGGGCAATGAAACACACTGTCCCACAGATAAGAAAGAGACCCCAGAAGCTCTTAAGTTGAAGCGAGTCTGATTCGAGATCTGTACTGTCTGAGGTGCAAGCACTTTTCACAACCCATTTGTCGTGTATCCGTTGCAGATCGCCGTTCTCGGATAGCTGCAAAATGGCAGTTGACATGTCTACAGCCAATGGAGAGTCTCGTGGGAATGCCTGAATTACCAAAAGGAGCAACAGGAGAATAGTTATTGATGTGTTCAATTTATCAAATACTAAATAAGAACGAAAACACTCACGAAACCCCAGCCGCTTTTGGTAAACTCTTGACCAACAA
CTCTGAATTTACACTCTCTAGAGATGAAATTTTCTACATATAGAAGTTCATCAACTACAGCAGCAACGCCTCCCTCCTTGCCAGGGCCAAGATCAAGTGCCTTGGCATATTCTTCCGGTGATCCAAGAGGAATAAGCCTAGATCTAGACACGTTCAGCTCCTCACTCAGATAACGTTCAGCAAAAGATCCAACTTGGAAACCAATCGGTTCATCAATTTCCCTCAAGGTTTCTATTCCTGTGATTGGAGAATATAGCTGCTGCACCGTGAGAATGGATGTTAAGCTTGCAGTGTAGCTAGAATTTATAATCAAAACCACAAACAGCCATATGATCAGCACGAAGCGGCCGAGA

Coding sequence (CDS)

ATGTCTGCTATATCCAATTCGTTGATTTTAACGAAAACCCCTCAACTCCAGTTATCAGCTGGATCAAATCTGAAAACTGCTGATAAACGATTTGGGAGTGTCTCCCCTACCCGCTTGTTACTCAATCCTGGTTGGGTGGGAAAGGCCAAGCTTTCTACATTTAGGCGGCCACTTTCAGTTCAAGCTGCATATAGTGATGGTGGACGGTCAAGCAGTGCAGGCATTTTTATTGGAGGTTTTGTATTGGGAGGGCTCGTAGTTGGTACGCTTGGCTGTGTATATGCCCCTCAGATTAGCAAGGCACTTGCTGGAACAGACCGAAAAGATTTGATGAGGAAACTTCCCAAGTTCATATATGATGAAGAAAAAGCCTTAGAGAAAACAAGAAAAGTGCTGGCACAGAAGATTGAACAGTTGAACTCTGCCATTGATGAGGTTTCTTCTCAGCTCCGCACAGAGGATTCCCCAAATGGAGTGGCCGTAAACTCTGATGAAGTTGAACCTGCCATTTGA

Protein sequence

MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSVQAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYDEEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI
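
The protein sequence above is the conceptual translation of the CDS. As a rough check, the sketch below translates the first and last codons of the CDS with the standard genetic code; the helper `translate` and the table construction are illustrative assumptions, not the pipeline used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 15 nucleotides of the CDS listed above.
cds_start = "ATGTCTGCTATATCCAATTCGTTGATTTTA"
cds_end = "GAACCTGCCATTTGA"
print(translate(cds_start))  # MSAISNSLIL
print(translate(cds_end))    # EPAI
```

Both fragments agree with the ends of the listed protein (MSAISNSLIL... and ...EPAI), with TGA acting as the stop codon.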
Homology
BLAST of Cp4.1LG02g08670 vs. NCBI nr
Match: XP_023521457.1 (uncharacterized protein LOC111785280 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 320 bits (820), Expect = 6.87e-110
Identity = 170/170 (100.00%), Positives = 170/170 (100.00%), Query Frame = 0

Query: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60
           MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV
Sbjct: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60

Query: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120
           QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD
Sbjct: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120

Query: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
           EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI
Sbjct: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170

BLAST of Cp4.1LG02g08670 vs. NCBI nr
Match: XP_022941515.1 (uncharacterized protein LOC111446794 [Cucurbita moschata])

HSP 1 Score: 320 bits (819), Expect = 9.76e-110
Identity = 169/170 (99.41%), Positives = 170/170 (100.00%), Query Frame = 0

Query: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60
           MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV
Sbjct: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60

Query: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120
           QAAYSDGGRSSSAGIFIGGFVLGGL+VGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD
Sbjct: 61  QAAYSDGGRSSSAGIFIGGFVLGGLIVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120

Query: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
           EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI
Sbjct: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
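
The Identity figure in each HSP header is the number of identical aligned positions divided by the alignment length. A small sketch, using the middle block of the Cucurbita moschata alignment above as input, shows how that count can be reproduced; the function names are hypothetical, and Positives is not computed because it depends on the substitution matrix used by BLAST.

```python
def alignment_stats(query, sbjct):
    """Return (identical positions, alignment length) for two aligned strings."""
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have the same length")
    # A gap aligned to anything never counts as an identity.
    identical = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    return identical, len(query)

def percent(count, total):
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * count / total, 2)

# Residues 61-120 of the query and of XP_022941515.1, copied from the alignment.
query = "QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD"
sbjct = "QAAYSDGGRSSSAGIFIGGFVLGGLIVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD"

identical, length = alignment_stats(query, sbjct)
print(length - identical)   # mismatches in this block
print(percent(169, 170))    # whole-alignment identity from the header counts
```

This block contains the single V-to-I substitution that accounts for the 169/170 identity reported for this hit.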

BLAST of Cp4.1LG02g08670 vs. NCBI nr
Match: XP_022981122.1 (uncharacterized protein LOC111480366 [Cucurbita maxima])

HSP 1 Score: 318 bits (815), Expect = 3.97e-109
Identity = 167/170 (98.24%), Positives = 170/170 (100.00%), Query Frame = 0

Query: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60
           MSAISNSL+LTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV
Sbjct: 1   MSAISNSLVLTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60

Query: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120
           QAAYSDGGRSSSAGIFIGGFVLGGL+VGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD
Sbjct: 61  QAAYSDGGRSSSAGIFIGGFVLGGLIVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120

Query: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
           EEKA+EKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI
Sbjct: 121 EEKAVEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170

BLAST of Cp4.1LG02g08670 vs. NCBI nr
Match: KAG6608647.1 (hypothetical protein SDJN03_01989, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 317 bits (813), Expect = 8.02e-109
Identity = 168/170 (98.82%), Positives = 169/170 (99.41%), Query Frame = 0

Query: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60
           MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTR LLNPGWVGKAKLSTFRRPLSV
Sbjct: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRSLLNPGWVGKAKLSTFRRPLSV 60

Query: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120
           QAAYSDGGRSSSAGIFIGGFVLGGL+VGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD
Sbjct: 61  QAAYSDGGRSSSAGIFIGGFVLGGLIVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120

Query: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
           EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI
Sbjct: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170

BLAST of Cp4.1LG02g08670 vs. NCBI nr
Match: XP_022936063.1 (uncharacterized protein LOC111442778 [Cucurbita moschata])

HSP 1 Score: 296 bits (757), Expect = 2.76e-100
Identity = 154/170 (90.59%), Positives = 163/170 (95.88%), Query Frame = 0

Query: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60
           M+AISNSL+ TK PQLQLS+GSNLKTADK  GSVSPT LLLNPGWVGKAKLST RRPLSV
Sbjct: 1   MAAISNSLVSTKNPQLQLSSGSNLKTADKLLGSVSPTSLLLNPGWVGKAKLSTSRRPLSV 60

Query: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120
           QAAYSDGGRSS+AGIF+GGFVLGGL+VGTLGCVYAPQISKALAGTDRK+LMRKLPKFIYD
Sbjct: 61  QAAYSDGGRSSNAGIFVGGFVLGGLIVGTLGCVYAPQISKALAGTDRKELMRKLPKFIYD 120

Query: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
           EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRT+D PNGVAVNSDE+EPAI
Sbjct: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTDDPPNGVAVNSDEIEPAI 170

BLAST of Cp4.1LG02g08670 vs. ExPASy TrEMBL
Match: A0A6J1FTW3 (uncharacterized protein LOC111446794 OS=Cucurbita moschata OX=3662 GN=LOC111446794 PE=4 SV=1)

HSP 1 Score: 320 bits (819), Expect = 4.72e-110
Identity = 169/170 (99.41%), Positives = 170/170 (100.00%), Query Frame = 0

Query: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60
           MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV
Sbjct: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60

Query: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120
           QAAYSDGGRSSSAGIFIGGFVLGGL+VGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD
Sbjct: 61  QAAYSDGGRSSSAGIFIGGFVLGGLIVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120

Query: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
           EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI
Sbjct: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170

BLAST of Cp4.1LG02g08670 vs. ExPASy TrEMBL
Match: A0A6J1IT36 (uncharacterized protein LOC111480366 OS=Cucurbita maxima OX=3661 GN=LOC111480366 PE=4 SV=1)

HSP 1 Score: 318 bits (815), Expect = 1.92e-109
Identity = 167/170 (98.24%), Positives = 170/170 (100.00%), Query Frame = 0

Query: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60
           MSAISNSL+LTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV
Sbjct: 1   MSAISNSLVLTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60

Query: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120
           QAAYSDGGRSSSAGIFIGGFVLGGL+VGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD
Sbjct: 61  QAAYSDGGRSSSAGIFIGGFVLGGLIVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120

Query: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
           EEKA+EKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI
Sbjct: 121 EEKAVEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170

BLAST of Cp4.1LG02g08670 vs. ExPASy TrEMBL
Match: A0A6J1FC82 (uncharacterized protein LOC111442778 OS=Cucurbita moschata OX=3662 GN=LOC111442778 PE=4 SV=1)

HSP 1 Score: 296 bits (757), Expect = 1.34e-100
Identity = 154/170 (90.59%), Positives = 163/170 (95.88%), Query Frame = 0

Query: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60
           M+AISNSL+ TK PQLQLS+GSNLKTADK  GSVSPT LLLNPGWVGKAKLST RRPLSV
Sbjct: 1   MAAISNSLVSTKNPQLQLSSGSNLKTADKLLGSVSPTSLLLNPGWVGKAKLSTSRRPLSV 60

Query: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120
           QAAYSDGGRSS+AGIF+GGFVLGGL+VGTLGCVYAPQISKALAGTDRK+LMRKLPKFIYD
Sbjct: 61  QAAYSDGGRSSNAGIFVGGFVLGGLIVGTLGCVYAPQISKALAGTDRKELMRKLPKFIYD 120

Query: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
           EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRT+D PNGVAVNSDE+EPAI
Sbjct: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTDDPPNGVAVNSDEIEPAI 170

BLAST of Cp4.1LG02g08670 vs. ExPASy TrEMBL
Match: A0A6J1IGB6 (uncharacterized protein LOC111474436 OS=Cucurbita maxima OX=3661 GN=LOC111474436 PE=4 SV=1)

HSP 1 Score: 291 bits (745), Expect = 9.03e-99
Identity = 152/170 (89.41%), Positives = 161/170 (94.71%), Query Frame = 0

Query: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60
           M+AISNSL+ TK PQL LS+GSNLKTADK  GSVSPT LLLNPGWVGKAKLST RRPLSV
Sbjct: 1   MAAISNSLVSTKNPQLPLSSGSNLKTADKLLGSVSPTSLLLNPGWVGKAKLSTSRRPLSV 60

Query: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120
           QAAYSD GRSS+AGIF+GGFVLGGL+VGTLGCVYAPQISKALAGTDRK+LMRKLPKFIYD
Sbjct: 61  QAAYSDSGRSSNAGIFVGGFVLGGLIVGTLGCVYAPQISKALAGTDRKELMRKLPKFIYD 120

Query: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
           EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRT+D PNGVAVNSDE+EPAI
Sbjct: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTDDPPNGVAVNSDEIEPAI 170

BLAST of Cp4.1LG02g08670 vs. ExPASy TrEMBL
Match: A0A1S3BV81 (uncharacterized protein LOC103493844 OS=Cucumis melo OX=3656 GN=LOC103493844 PE=4 SV=1)

HSP 1 Score: 289 bits (740), Expect = 5.22e-98
Identity = 150/170 (88.24%), Positives = 161/170 (94.71%), Query Frame = 0

Query: 1   MSAISNSLILTKTPQLQLSAGSNLKTADKRFGSVSPTRLLLNPGWVGKAKLSTFRRPLSV 60
           M+AISNSL+LTK PQLQL+ GSNLKT DKR GS+SPT L+L+PGWVGKAKLSTFRRPLSV
Sbjct: 1   MTAISNSLVLTKNPQLQLTHGSNLKTVDKRLGSLSPTSLVLSPGWVGKAKLSTFRRPLSV 60

Query: 61  QAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKFIYD 120
           QAAYSDGGRSS+AGIFIGGFVLGG++VGTLGCVYAPQISKA+AG DRKDLMRKLPKFIYD
Sbjct: 61  QAAYSDGGRSSNAGIFIGGFVLGGIIVGTLGCVYAPQISKAIAGADRKDLMRKLPKFIYD 120

Query: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVEPAI 170
           EEKALEKTRKVLAQKIEQLNSAIDEVS+QLR ED PNGVAVNSDEVE AI
Sbjct: 121 EEKALEKTRKVLAQKIEQLNSAIDEVSAQLRPEDPPNGVAVNSDEVESAI 170

BLAST of Cp4.1LG02g08670 vs. TAIR 10
Match: AT1G42960.1 (expressed protein localized to the inner membrane of the chloroplast. )

HSP 1 Score: 171.8 bits (434), Expect = 4.8e-43
Identity = 84/113 (74.34%), Positives = 102/113 (90.27%), Query Frame = 0

Query: 55  RRPLSVQAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKL 114
           +R L++Q+AY D   S S G+F+GGF+LGGL+VG LGCVYAPQISKA+AG DRKDLMRKL
Sbjct: 53  KRILTIQSAYRDDDGSGSTGLFVGGFILGGLIVGALGCVYAPQISKAIAGADRKDLMRKL 112

Query: 115 PKFIYDEEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEVE 168
           PKFIYDEEKALEKTRKVLA+KI QLNSAID+VSSQL++ED+PNG A+++DE+E
Sbjct: 113 PKFIYDEEKALEKTRKVLAEKIAQLNSAIDDVSSQLKSEDTPNGAALSTDEIE 165

BLAST of Cp4.1LG02g08670 vs. TAIR 10
Match: AT5G16660.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast thylakoid membrane, chloroplast, membrane, chloroplast envelope; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G02900.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 79.7 bits (195), Expect = 2.5e-15
Identity = 56/122 (45.90%), Positives = 73/122 (59.84%), Query Frame = 0

Query: 50  KLSTFRRPLSVQAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKD 109
           K S   R  SV A Y DG RS S+G FI GF+LGG V G +  ++APQI +++   + + 
Sbjct: 45  KKSNRTRKFSVSAGYRDGSRSGSSGDFIAGFLLGGAVFGAVAYIFAPQIRRSVLNEEDEY 104

Query: 110 LMRKLPKFIYDEEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTED---SPNGVAVNSD-E 168
              K  +  Y +E  LEKTR+ L +KI QLNSAID VSS+LR  +   S   V V +D E
Sbjct: 105 GFEKPKQPTYYDE-GLEKTRETLNEKIGQLNSAIDNVSSRLRGREKNTSSLNVPVETDPE 164

BLAST of Cp4.1LG02g08670 vs. TAIR 10
Match: AT5G16660.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G02900.1); Has 106 Blast hits to 106 proteins in 32 species: Archae - 0; Bacteria - 28; Metazoa - 0; Fungi - 0; Plants - 78; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 72.4 bits (176), Expect = 3.9e-13
Identity = 55/122 (45.08%), Positives = 72/122 (59.02%), Query Frame = 0

Query: 50  KLSTFRRPLSVQAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKD 109
           K S   R  SV A   DG RS S+G FI GF+LGG V G +  ++APQI +++   + + 
Sbjct: 45  KKSNRTRKFSVSA--GDGSRSGSSGDFIAGFLLGGAVFGAVAYIFAPQIRRSVLNEEDEY 104

Query: 110 LMRKLPKFIYDEEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTED---SPNGVAVNSD-E 168
              K  +  Y +E  LEKTR+ L +KI QLNSAID VSS+LR  +   S   V V +D E
Sbjct: 105 GFEKPKQPTYYDE-GLEKTRETLNEKIGQLNSAIDNVSSRLRGREKNTSSLNVPVETDPE 163

BLAST of Cp4.1LG02g08670 vs. TAIR 10
Match: AT3G02900.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G16660.1); Has 80 Blast hits to 80 proteins in 21 species: Archae - 0; Bacteria - 4; Metazoa - 0; Fungi - 0; Plants - 76; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 67.8 bits (164), Expect = 9.7e-12
Identity = 42/109 (38.53%), Positives = 62/109 (56.88%), Query Frame = 0

Query: 58  LSVQAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKF 117
           LSV A Y  G +   +  F+ GF+LG  V GTL  ++APQI +++   +     +     
Sbjct: 44  LSVSAGYRGGSKGGGSSDFVTGFLLGSAVFGTLAYIFAPQIRRSVLSENEYGFKKPEQPM 103

Query: 118 IYDEEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEV 167
            YDE   LE+ R++L +KI QLNSAID+VSS+L+   S +    +S  V
Sbjct: 104 YYDE--GLEERREILNEKIGQLNSAIDKVSSRLKGGRSGSSKNTSSPSV 150

BLAST of Cp4.1LG02g08670 vs. TAIR 10
Match: AT3G02900.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G16660.1). )

HSP 1 Score: 67.8 bits (164), Expect = 9.7e-12
Identity = 42/109 (38.53%), Positives = 62/109 (56.88%), Query Frame = 0

Query: 58  LSVQAAYSDGGRSSSAGIFIGGFVLGGLVVGTLGCVYAPQISKALAGTDRKDLMRKLPKF 117
           LSV A Y  G +   +  F+ GF+LG  V GTL  ++APQI +++   +     +     
Sbjct: 43  LSVSAGYRGGSKGGGSSDFVTGFLLGSAVFGTLAYIFAPQIRRSVLSENEYGFKKPEQPM 102

Query: 118 IYDEEKALEKTRKVLAQKIEQLNSAIDEVSSQLRTEDSPNGVAVNSDEV 167
            YDE   LE+ R++L +KI QLNSAID+VSS+L+   S +    +S  V
Sbjct: 103 YYDE--GLEERREILNEKIGQLNSAIDKVSSRLKGGRSGSSKNTSSPSV 149

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value    Identity (%)  Description
XP_023521457.1  6.87e-110  100.00        uncharacterized protein LOC111785280 [Cucurbita pepo subsp. pepo]
XP_022941515.1  9.76e-110  99.41         uncharacterized protein LOC111446794 [Cucurbita moschata]
XP_022981122.1  3.97e-109  98.24         uncharacterized protein LOC111480366 [Cucurbita maxima]
KAG6608647.1    8.02e-109  98.82         hypothetical protein SDJN03_01989, partial [Cucurbita argyrosperma subsp. sorori...
XP_022936063.1  2.76e-100  90.59         uncharacterized protein LOC111442778 [Cucurbita moschata]

ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A6J1FTW3      4.72e-110  99.41         uncharacterized protein LOC111446794 OS=Cucurbita moschata OX=3662 GN=LOC1114467...
A0A6J1IT36      1.92e-109  98.24         uncharacterized protein LOC111480366 OS=Cucurbita maxima OX=3661 GN=LOC111480366...
A0A6J1FC82      1.34e-100  90.59         uncharacterized protein LOC111442778 OS=Cucurbita moschata OX=3662 GN=LOC1114427...
A0A6J1IGB6      9.03e-99   89.41         uncharacterized protein LOC111474436 OS=Cucurbita maxima OX=3661 GN=LOC111474436...
A0A1S3BV81      5.22e-98   88.24         uncharacterized protein LOC103493844 OS=Cucumis melo OX=3656 GN=LOC103493844 PE=...

TAIR 10
Match Name      E-value    Identity (%)  Description
AT1G42960.1     4.8e-43    74.34         expressed protein localized to the inner membrane of the chloroplast.
AT5G16660.1     2.5e-15    45.90         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G16660.2     3.9e-13    45.08         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G02900.1     9.7e-12    38.53         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G02900.2     9.7e-12    38.53         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description                Alignment
None      No IPR available  COILS        Coil           Coil                              coord: 122..149
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction               coord: 147..170
None      No IPR available  PANTHER      PTHR34048      LOW-DENSITY RECEPTOR-LIKE PROTEIN coord: 1..170
None      No IPR available  PANTHER      PTHR34048:SF9  SUBFAMILY NOT NAMED               coord: 1..170

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g08670.1  Cp4.1LG02g08670.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0009706      chloroplast inner membrane
cellular_component  GO:0016021      integral component of membrane