Cp4.1LG15g01170 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG15g01170
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionGlutamate--tRNA ligase, chloroplastic/mitochondrial
LocationCp4.1LG15 : 889128 .. 899363 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGCGTAATTTGCGCGCGAAAGTTCATCGTTCTTTAAAACTGAAATCGCTCTGACGAGAAGTTTGATGATTTTAGGGCTCGAAGTTAATGGCGGCTCACGATGCTGTAAGATGACGCAGGTATGTTTCGGTGATTTTCTTATAGGGAAACCTCATATTATCCTAATTACTTTAAGCTAATATCGTTCATTAAACTAAGGATCGTTATCGGTCAGTCTCATAATCAGAAGTATCAGATTCTTTTTGTTAGCCGATTAGATTGGATTAGGTGTTTTTGTTGATCGATTCGATTCGTATATTCAATTTACTTCTTGTTTTATGCACGACAGTCCGATGATTCAAAGAATATATCTCGACATTTTTATTTCAGATTTTGTACGAGTTGCTACAGTCTTCTTTAAACTAAGGATCGTTATCGGTCAGTCTCATAATCAGGAGTATCAGATTCTTTTTGTTCGCCGATTAGATTGGATTAGATGTTTTTGTTGATCGATTCGATTCGTATATTCAATTTACCTCTTGTTTTATGCACGACGATCCGATGATTCAAAGAAAATATCTCGATATTTTTATTTCAGATTTTGTACGAGTTCCTACAGTCTTCTTTTTACTTGATTTCAGAATTTCGAGGATGATTTCGTGTAAGCAGTGGTTTATGCTGATGCGTGAACCGAGTATCTTCTGAACAAAATTTATCGGTATGTATTTTGATGAACCTTTTAGTTTTACTGAGTTGAACTTGTGAATCAATTTATTAAATAACCATACAACGAACTTGTTCTTGTCTGGATTCCATACTTCTAATATCTTTCATTTTTAAGTATTATAATTATAGTGATGTGTTCTTTATTTAATGAATACTATGTTTTGTTACAGATTTTCCTAATTCATAAAATAATATACATTCAAGAATCAAGTTTGCCTCCCCTAAGAACATTAGATCTATACTTGAGTTTTATTCATTTACAGATTGTTACATGCAATTAGGATCCCCCTGGCNATTTTTAGGAGATTATGTTCATTAGTTTGGTTTCTTACTGTTTCTTAGTAGTCTTCATTTCATGCAGGCCTAAGTTATTCTATTTGTTTATTTGTTTTTGGTCGTTACTTCACTGCAATGTACTCTGTTTAGTTTATACATTATGATATTTCTGGAATCATAGGTTTTTTTAAGTAAAATAAATGTTATCTAAGAGAAAATTCAATGAATAAAATAGATAAACAAATTTTTATATTAAATTGTTAATAATTAGTGGTTATAGGGCATGAACATCAATAGCAGTAATACTAGTTAAGGAATGAAAAAATAGCATTGCAATATTATAGTTAGATTTAGTGAGAATGAAGTAGTCACTTTTTTGTGATCCAAATACATCTTTTAAGGTATGTTTACTACAGAGAAGGTTAGGAATCACGACTCTCCACTATGATATTATCCACTTTGAGCATAAACTCTCATGGCTTTGCTTTGGGCTTCCTCAAAAGGCCTCATACCAATCGAGATGTATTTCTTACGTATAAACCCATGATCATTCCCTAAATTAGCTAATGTGGGACTCCCTCCCAACAATTCTCAACAATCCTTCCCTCGAACATACCATAGAGTCTCTCCTAAGGCCTATAAACAACCGAATTTTGCTTTTTACTAAACCAAGAAACTAATGAACTACCAAGAAATTGACAATTGACAAGTACTGACAAGTACTGGCAAGTTTCACTAGTACTTTTACGATCAAGTAAGCTTTCGATAAAATTTTCATGGGAATATCTTACTGATTTAAACTCAACATTTTTAGGATACCACAATCTTAAATCAATAGTTTCAAATAAATATTTAAATATCCTTTTAGCAAATTGAAACTTAGCACAAATACATACACTAAACATAATATCTGGTCTACTAGTGGTTAAATAAAGTAGAGATCCAATCATACCTCAATAAGATTTAATGTCAACATATTTATCTTTTTCATCTTTGTCAAGCTACGTGCTCATGGTAGTTTTTGCAATCTTGTGTTCTTCAACCTTTTAAGTAAATCTTTAGTGTATTTCTCTTGATTGTTAAAGATACCATTTTTGAGTTGTTTGATTTGTAGGCGAGAGAAACTAAGCACTTTCATCATACTATCTCAACTCATTACGCATACATTTAGAAAATTATTCACCTAAAAAAGGAAGTAGATCTAAATATGATATCATCTACATATATTTGCACTAATAACATATCATTTTTTTTAATTTTAATAAAAAGAGTAGTGTTGAGTTTACCTATTTTAAAATCATTCCCAATAAGAAAATTACTAAGTTTATCGTACCAAGCTCGAGCTTGTTTTAAGCCATAATGAGCATTTTTCAACTTGATTTATGATGATACCATCCTTGAGTTGAAAAAGTCATGTCTATGGTATGATAGACTGGGTAATTTTCTTACCGGGAATGATTTTAAAATAGGTAAACTCGACACTGCTCTTTTTATTAAGATTAAAGAAAATGAGATGTTATTAGTGCAAGTATATGAAGATGATATCATATTTGGTTTTACTAATCTTCTTTATGTGAAGAATTTTTTAAATATATGCATGAGTTTGAGACGAGTATGATGGGAGAGTTTAGTTTCTTTGTTGGACTTCAAATCAAACATCTTAAGGATGGTATCATCGTAAATCGAGAGAAATACACACTAAAGATTTAATCAAAAGGTTCAACTTCGATGTAGGTACGATTGTAAAAACTCCCGTGAGCACGTCCACTAAGCTTGACAAAGATGAAAAAGGTAAATGTGTAGAGAAATACACACTAAAAATTTAATCAAAAGGTTCAACTTCGATGTAGGTACGATTGTAAAAACTCCCGTGAGCACGTCCACTAAGCTTCACAAAGATGAAAAAGGTAAATGTGTGGATATTAAATTTTATCGAAGTATGATTGGATCTCCTACTTTATTTAACCGCTAGTAAACTCGATATTATGTTTAGTGTATGATGTCTTCGTGCTCGGTTTCAATCTTGTCCTAAGGAATCTCATTTACCTGTTAAAAAAATATTTAAATATTTGCTTGGAACTATTGATTTAAGATTGTTGCATCCTAGAAATGTTAGTTTAAATTAGTAGGATATTCTTATGTGAATTTTGCGGAAAGCTTACTTGATCGTAAAAGTGGAATGTCATTCATTAGTTTCTTGGTTTAGTCAAAAGCTAAATTCGGTTGTCTTATCTAATACGAATGCAGAATATATTTCGATTGCTAATTCGGTTGTCATTCATTAGTTTTTTGTGATTTTGGATTAAAATTTGATAGTGCGCCTATATTTTGTGATAACACTAGTGCTTATCTACTACGGGAATATATTTTGGTTGCTAGTTGTTGTGCTCAAATATTTTGTGATAACACTAGTGCTTATCTACTACGGGAATATATTTTGGTTGCTAGTTGTGCTCAAATTTTTTGGATGAAACAAAGTTTTTGTGATTTTGAATTAAAATTTATAGTATGTATTTTGTGATAACACTAGTGCTTATCTACGGCGGAAGCGGAATATATTTTGGTTGCTAGTTTGTGCTCAAATTCTTTGGATGAAATAAAATATTTGTGATTTTGGAATAAAAATTGATGGTGTTCCTATGTTTTGTGATAACACTAGTGCTATTAATTTAACAAAAAAAAATTATTCATCATTTTAGAACTAAACAAACATATTGACATTAGACATTATTTTATTAGGGAGCGAGTGCAAAATGGACTATAATACAAATGGATATATTATTATTGATTTTGTAAAACTCTAATAATCAATTAGCTGATATTTTTACAAGGAGTAAGCTAGGTATTATTCGTTGATATATCTTGAATTTTATAATATTATTTATAATTTTTTAAGGCAAGTTCGTAGAGGGAGCTATTACGTCATTTTTAATTTATTATTTTTCAATTGTTTATTTTGATGATTTCAAAATTGGAGAAAATTGGAGAAAATTAAAAAAAAAAATGCTCTAAATTTGTTGTTTTATAATTTATGAATCATGTGATGGTTCATAATTTATTGAGGCTAATATATGGGGAGAATATGCTCTTGCATTACTCCGTTGTTGTGATAGTTCTATTTTTATTTATTTGGACATGTGTATTATTTTAGGTTTTTATGAATTTTTCTTGAAATGTTTATCTCATACAAAAAGGTAGGTAGATGGTTGGTTTATTGACTCTCAATATCAAATTAATTAATTGTTGGTTTGATGATAACAAACCTACTATTAATTATTAATAATTATCAGTGTTTTGAATTGTCCACAATATAAAGTTGTGAATAACGATTTCAATACTTTCTTTTAATCACTCAAATCAGAGTTAAATTGAAGAACTTAAAATAAGGTACTCGACGTGATGATGTGGCAGCTAAATTGACATTATGGACTAATAGAACTTTGGAAAGGTAACAAATGGCAGGGTTATTAATTATCTTTCAGAAATAGAAATGTATTTATACATCATAGGTCATTTGAATTATAAAAAATAAAATTAAAAAAATTGTTATTCATGTTATTTTTATTAATATTATTATATTTATAATAAGTTTTTAATTTTTTTATCATTTTTATATTATATTATTATAATTTTAAAAAAAATCAATTTAATTTATTTTTAATATTTACTTTTAAATCTGCCGTTTCCTTCTCTCTTATCTTATAGATTGAGATTCTCTCCTACAAACATTTTTATTGTAGTGTCAGGAAATGGTAAGGGATTAAATTGTATTCAATATTGTGATGTGTCCCAACAAAATTTAAGTTGGTATAGCTTTGATCTTTGAGTTTGCTCTTGAACTCTTGATATAGCGTTTGACCAAACAACTATAAAATTGTATTATTTTCTAAGTTTTTTTTTATAACTGTTTATCGTTTTGACTTCAACTACTATATTATATTAATTATCCTCTACTTATTTCTAAAAGGTTTCATAATTAAAAAAATTAATCACGCATAAATATTTTTATTGTTAAAACTCTATTCATCCCTTACTCGAATTCGTATGCCCATCCAACAACTCACTCTATATTTTTCTCAAACCCATCTTTATTGGATGGGCATGCGAATTCGAGAAAGGGATGAATAGAGTTTTTTATGGAATCCATGAGAGATTGTGATGCTTGTCCTCTTTTCCAGCTCAATTAGGTTCCGACACTGAAATTGTGAAACCGAATTAAATAGGTAAATGAGTAAATTCTTGAAGTTGGCTAAATTGGCTAAATGACGAATTCTTGAAATTGAGGTGTTAGTCTTCAAAAAAAAAAAAAATAATAATAATAATAAATAAATAAAATAATAATAATAATATTTTTTTTCTTTCATTCTTCAACCTTCGTCATCTGAGCTTTATGGTGCCGCAAATTCTGGGTATCGTGGCGCCATCGAGACCTTCCTTCGAAATTTGCTTCCATTGATAATCTGGGAAACTACCTTATATATCAACTGCAAGTATCTGGAGTGGTCAATCATGGTGGTTTCCATGGCAATCATTCCCTCCTGCAACAATTTTGGTCGACCTCCACATTGCTCTCGCACTCGCAGTCATAACGCTCGCAATCTCAAATTCTGCCGCCGTATCTGTTGCCATGTTCATCGGAATCGGGTCTCGTTTAATTTTGGCCCTAGTGTAGCAGATTCGCTTCGTTATCTTCAATACCGGAGACGGAATTTAGCTTCCGTCACTTGCTCGGCTTCTGATAAACCGGAAATCAGGTTTTTCTCTCTTTCTCTGTTATTATTGTAGCGTTTGGATGGAAATAGAACAACAGTTGCTGATGATTTTAATGTTGTTATGCTTTTTGTATCTCATTACTTGATTTTCGACCATAGCTTGATTTTCGTAGTTTTGCGTTATTTTGCGTTTGTTCATGAGACTGCTTTTGGATGGCGAAAATCGACGTAAATTTATGGAAACTCAGAGGATCTTGCCTGACTTCTTATGTGAATTTTCTCATTTGTGTATTTCAATTGAGTATTTAAGATGGTGAGTTCTTCATACTTCATGCCTTGAGCCACAAAGCTTGTCTGGTTTTCTCATATATGTTATCTGATCACAACTTGATTTGTATGCTCAAACCTATAGACAGCCAAGCTGTTTGATGTGATCAGCGTATAATGATTCCAAATTGAGCCAGAATGTCCTGTTTATGGACTAATTAAACTAAAAAAGTAACCCTTAGATTAACAACTGCATTGGAATGCTTAATTGGAAGAATAACTTATCCTAGTCTCCCTTCTAATTACTAACATTGGTGTTGACACTCTCGTTTGATAATCATTTAGTCTTTTGTTTTTGTTTTTTTTGAAAGTTGTGTTTGCTTTCTCACCCTACAATGATCCTCATCTTCCTTAAGGTAACATTTGAATTTTTAGCCTAATTCTAAAATGAAAATAACTTCTAAAATCTACTCTTTTTTAGTTAAAACTTGGTATGGATTTTAAAAATATTGTTAGAAAGTATTTGCCTGGATTGACTCCCCATAACATGGGAGAGCGCTTGTGAGACTTGGCGCTCGACTTTGTCCGAGAGAAAATTTACGAACTCCTAATGGGTTTGTTGAATCTTGATTTTAATATGCCTATCGAGCAAGATTATCTATTCCCCACTGGGTTTCTATGTTTTTGACAGCCCATCTTTGTCTCAGTAGAGTCTTTCAGTGACATGTTTCGGTCCTCTTTCCCATTACTTAGAAAAAATGAGCTACCGGTTCAGGTATAAGATACTATCATTACTGCTTGGACAATTAGATATTCAACCAGTAATCGCAACGACCCAATTGCAATTCAAATCCTTGATTTTCGAATTGAGAGATAGCAAGAATTCTCACTATTTCTTAGATTCATGGACCCAATTGAGAAACAAAACAAAGAAAGTCATAGGTATAAATAGTGTTTATAAGCTTAATTTTCAAAAATAAAAAATCGAATGGTTATCAAATCAGACCTACGGTTTTTCTTTTTTGGGGGCGGAGGGTTGAGAGTTGAGTACGGAGCCTGGTTGTGTTCTAACCCAGTGTCTTTTTTTTCAGTACACCTGTTATTCAATCTGAAATGAGGCTTTGTGTAACAGTATCTGTTTTTTTTCTTAGTTCCACAGCCAAGATAAGAAGTGAAGTTCTGTCTCCATTTCGGTCTGTTCGGATGTTCTTTTATCTCACTTTCATTGCCAGTGGTACATTGGGAGGACTGATAGCAACCACTCAACTGCTTGCTGCATTGGCAAATTCATCAAGAGCTGAAGAAGTCCCTGATATTCTAAATGGACTTGGAATAGACTTCGGAGCCGTAGCCTTTTTCGCATTTCTTTACTTCAGAGAGAACAATGCAAAAAATGCTCAGTTGGCAAGGCTGTCAAGAGAAGAAAGCCTTTCCAATTTAAAGCTTCGAGTGGACCAAAACAAAGTTATTACCATCAGCACTCTGCGTGGGATCGCTCGTCTTGTAATTTGTGCTGGCCCCGAATCCTTTATCATGGAAGCTTTTAAAACAAGTGAACCTTTCACCGAACGACTTCTAGAACGAGGGGTATTAGTCATACCCTTTGCCACAGATGCTAGTTCACTGAATTTTGAGTTCGATGAACGTGAAGAGATGAAGGATATAACCACCAAAAGGAAAAGACTCTGGCGCTTGACTCCGGTATACATGTCCCAGTGGTCGGCGTAAGTAACTATAATTAACATATATTTTCACATTGCTCCCCTCCTCCTTCCTCCATGTGATTTGCCTTTTCTTGTGAACAATAGGTGGTTAGATGATCAAAAGAAGTTGGCTGGAGTCTCCTCTGATTCGCCTGTGTAAGTTATCCTCGTCGTTGAACTTCCATTTCAAATTGATTGATGAACTACAATGGTTGCCTCTTTAATTCTTAAACTGCCAGTCTAAGCTGTGCTGTGGTAGTAAATAAATACTGCTTTGGTCCCATTTTTGCTCTTATACTTTCGATATATGTTCATTTTGATTCTTGTACTTTCAAAATGTCCATTATGATTATTGTAGTTTTAAAAAGTGACCATTTTGTTCCCTATTTGCAAAACTTAAACATAAGTTCTATACATGATAGAAATCCTTTAGTACAAGCTTATGGTTACATATTAAGAAATTGACTAAAAATGGATACAAAAATGGTTACTTTTGAAGTATAGGAACCAAAATGAACATTTTAAAAGTACATGAACCAAAAACTTTTCGGGTTTAATTTTGTAATGAAATATACTGTTTCGTTAACCATAATTCATGAGCTATCCAATTTGAAGTCCCCATAGTTGGATCTTATGTAAGCTGCACAGAGAAAAAACTTTTTTTTGCTGAATCCATGCCAAAGCTAGTTTACCTTGCAAAAGTTAATCAATTAATCTTTTTTTTTTTTTCTTCTTATCTTCCATAGCTTGCTGTGTTAATATTTGTTCTTGCAGGTATTTATCTCTTCGAATGGATGGCCGTGTTCGTGGTAGTGGGGTTGGCTATCCTCCATGGAATGCTCTTGTTGCACAATTACCACCTGTGAAAGGACTATGGTCAGGTCTTCTAGATGGGATGGATGGCAGAGTTCTTTGAAGAAAAACACTAGCTCTTCTTCCAAGTTAGTGCTCTTTTCACACGCTTCCTGTTAGTTATTAGCAATTATTTTCACTTCATTTCATCTAAAATCAACTCACAATCACTTCAATAAGTAGGGAAAAAAATAGTGGAAAACAAGAAGATGAAAAGTATTCGTATTAATGAAAAATCACCACGATCCATCTGATTATAACATTTCTCTCTATTATTATGTTAATCCGTGCATTTAAACGAGTCTTATACACACACATGTGAAGATTATGCACTAATAGCTGTGAAACGAAATCTTAATGAATGATTTCACTTGATTCACTTACGAAAGTTTAGGGCTTTTGTTACACAATTATTAATTTGGAGTCTTAAGTAATACAACCTCCAAAGTCTTGAGGTATAAATTGATTTTTTTCCCTATTTATTTGAAACTATAATCATGCCTTTTGTATTACCTTTTGAGATGGCATCAAATTTCCCATATCTAATTGAAGTCCTATTAGTTGAACTACAGAATCTCTTATCACATATACATACATATATACATATATAAATATACATACATATATACATATATATATATACATATGTATGTATATAGATAAATATTGGTATCATTATTGGCCTAATTTATAAGGTGACAACATTATTGGCTTAATTTCTCAGATTCTATATATACTCTCAAAACAGTTGATTCTATATATACATCTATATATATACATATGTATGTATATATTTATCTATATACATACATATGTATATATATATTTATCTATATACATACATATGTATATATACATATATGTATGTATGTATATATATATTTATCTATATACATATGTATGTATATAGATAAATATATACATATTACATACATATGTATGTATATAGATAAATATATACATATTACATACATATGTATGTATATAGATAAATATATACATATTACATACATATGCATACATATAGATATATATATATGGAATCAACTGTTTTGAGAAATTAGGCCAATAATGTTGTCACTTTAGAAATTTTGGCCCTTTTTTGTATAATTCAACTGTTACAAAATTTCTTGCTCCAAATGGGAAAATTGTATTAGTTTGTTCTTGTTTTCAACTTTATGCTTAAACAGATATTTACGATGGAGTGGAGTAGGCGTTTTGTAATAATCTTTAAGCCATCCAGGATGTTGCCCTTGTGAAATCATCTCCAACACCATTGTCATGCTCGACTGCCTGAGAAATGGAACCGAATCTCATGACAGCCGAAATACAGGTCAGAGATTTACTTCTTACTTTTTGGGTGTTCTAACAATATAGGTCGATAATATATATCTAAACCAGTTGAGCTATACTCAATTTGGCAAAGGTTTAACGTTCTTCGACATCGTAACTTTTATATTGGATCTCGAATCCAATATTCTTTGTTATTAGTATTTGCTCCGGCTATTACATTGTGTGGATGCAAATCTGTTAATAGTTAGTTGAATATGTATTTAGATACACCCAGTATATGGTAATAAAGCGATTGTACACTTTCATTCAATGTAAATAGCTATGTTTTCTTAGGAGTAATTTGGCATTAGATTGATTCATTTATGAGAGTTTCATTCCGTGTGTCTATTTAGTCTCTGACGTTTTTAAAAGTGTCTAATAATTTGTTTA

mRNA sequence

CGCGTAATTTGCGCGCGAAAGTTCATCGTTCTTTAAAACTGAAATCGCTCTGACGAGAAGTTTGATGATTTTAGGGCTCGAAGTTAATGGCGGCTCACGATGCTGTAAGATGACGCAGAATTTCGAGGATGATTTCGTGTAAGCAGTGGTTTATGCTGATGCGTGAACCGAGTATCTTCTGAACAAAATTTATCGGTACGATTGTAAAAACTCCCGTGAGCACGTCCACTAAGCTTCACAAAGATGAAAAAGCTCAATTAGGTTCCGACACTGAAATTGTGAAACCGAATTAAATAGGTAAATGAGTAAATTCTTGAAGTTGGCTAAATTGGCTAAATGACGAATTCTTGAAATTGAGGTGTTAGTCTTCAAAAAAAAAAAAAATAATAATAATAATAAATAAATAAAATAATAATAATAATATTTTTTTTCTTTCATTCTTCAACCTTCGTCATCTGAGCTTTATGGTGCCGCAAATTCTGGGTATCGTGGCGCCATCGAGACCTTCCTTCGAAATTTGCTTCCATTGATAATCTGGGAAACTACCTTATATATCAACTGCAAGTATCTGGAGTGGTCAATCATGGTGGTTTCCATGGCAATCATTCCCTCCTGCAACAATTTTGGTCGACCTCCACATTGCTCTCGCACTCGCAGTCATAACGCTCGCAATCTCAAATTCTGCCGCCGTATCTGTTGCCATGTTCATCGGAATCGGGTCTCGTTTAATTTTGGCCCTAGTGTAGCAGATTCGCTTCGTTATCTTCAATACCGGAGACGGAATTTAGCTTCCGTCACTTGCTCGGCTTCTGATAAACCGGAAATCAGTACACCTGTTATTCAATCTGAAATGAGGCTTTGTGTAACAGTATCTGTTTTTTTTCTTAGTTCCACAGCCAAGATAAGAAGTGAAGTTCTGTCTCCATTTCGGTCTGTTCGGATGTTCTTTTATCTCACTTTCATTGCCAGTGGTACATTGGGAGGACTGATAGCAACCACTCAACTGCTTGCTGCATTGGCAAATTCATCAAGAGCTGAAGAAGTCCCTGATATTCTAAATGGACTTGGAATAGACTTCGGAGCCGTAGCCTTTTTCGCATTTCTTTACTTCAGAGAGAACAATGCAAAAAATGCTCAGTTGGCAAGGCTGTCAAGAGAAGAAAGCCTTTCCAATTTAAAGCTTCGAGTGGACCAAAACAAAGTTATTACCATCAGCACTCTGCGTGGGATCGCTCGTCTTGTAATTTGTGCTGGCCCCGAATCCTTTATCATGGAAGCTTTTAAAACAAGTGAACCTTTCACCGAACGACTTCTAGAACGAGGGGTATTAGTCATACCCTTTGCCACAGATGCTAGTTCACTGAATTTTGAGTTCGATGAACGTGAAGAGATGAAGGATATAACCACCAAAAGGAAAAGACTCTGGCGCTTGACTCCGGTATACATGTCCCAGTGGTCGGCGTGGTTAGATGATCAAAAGAAGTTGGCTGGAGTCTCCTCTGATTCGCCTGTGTATTTATCTCTTCGAATGGATGGCCGTGTTCGTGGTAGTGGGGTTGGCTATCCTCCATGGAATGCTCTTGTTGCACAATTACCACCTGTGAAAGGACTATGGTCAGGTCTTCTAGATGGGATGGATGGCAGAGTTCTTTGAAGAAAAACACTAGCTCTTCTTCCAAATATTTACGATGGAGTGGAGTAGGCGTTTTGTAATAATCTTTAAGCCATCCAGGATGTTGCCCTTGTGAAATCATCTCCAACACCATTGTCATGCTCGACTGCCTGAGAAATGGAACCGAATCTCATGACAGCCGAAATACAGGTCAGAGATTTACTTCTTACTTTTTGGGTGTTCTAACAATATAGGTCGATAATATATATCTAAACCAGTTGAGCTATACTCAATTTGGCAAAGGTTTAACGTTCTTCGACATCGTAACTTTTATATTGGATCTCGAATCCAATATTCTTTGTTATTAGTATTTGCTCCGGCTATTACATTGTGTGGATGCAAATCTGTTAATAGTTAGTTGAATATGTATTTAGATACACCCAGTATATGGTAATAAAGCGATTGTACACTTTCATTCAATGTAAATAGCTATGTTTTCTTAGGAGTAATTTGGCATTAGATTGATTCATTTATGAGAGTTTCATTCCGTGTGTCTATTTAGTCTCTGACGTTTTTAAAAGTGTCTAATAATTTGTTTA

Coding sequence (CDS)

ATGGTGGTTTCCATGGCAATCATTCCCTCCTGCAACAATTTTGGTCGACCTCCACATTGCTCTCGCACTCGCAGTCATAACGCTCGCAATCTCAAATTCTGCCGCCGTATCTGTTGCCATGTTCATCGGAATCGGGTCTCGTTTAATTTTGGCCCTAGTGTAGCAGATTCGCTTCGTTATCTTCAATACCGGAGACGGAATTTAGCTTCCGTCACTTGCTCGGCTTCTGATAAACCGGAAATCAGTACACCTGTTATTCAATCTGAAATGAGGCTTTGTGTAACAGTATCTGTTTTTTTTCTTAGTTCCACAGCCAAGATAAGAAGTGAAGTTCTGTCTCCATTTCGGTCTGTTCGGATGTTCTTTTATCTCACTTTCATTGCCAGTGGTACATTGGGAGGACTGATAGCAACCACTCAACTGCTTGCTGCATTGGCAAATTCATCAAGAGCTGAAGAAGTCCCTGATATTCTAAATGGACTTGGAATAGACTTCGGAGCCGTAGCCTTTTTCGCATTTCTTTACTTCAGAGAGAACAATGCAAAAAATGCTCAGTTGGCAAGGCTGTCAAGAGAAGAAAGCCTTTCCAATTTAAAGCTTCGAGTGGACCAAAACAAAGTTATTACCATCAGCACTCTGCGTGGGATCGCTCGTCTTGTAATTTGTGCTGGCCCCGAATCCTTTATCATGGAAGCTTTTAAAACAAGTGAACCTTTCACCGAACGACTTCTAGAACGAGGGGTATTAGTCATACCCTTTGCCACAGATGCTAGTTCACTGAATTTTGAGTTCGATGAACGTGAAGAGATGAAGGATATAACCACCAAAAGGAAAAGACTCTGGCGCTTGACTCCGGTATACATGTCCCAGTGGTCGGCGTGGTTAGATGATCAAAAGAAGTTGGCTGGAGTCTCCTCTGATTCGCCTGTGTATTTATCTCTTCGAATGGATGGCCGTGTTCGTGGTAGTGGGGTTGGCTATCCTCCATGGAATGCTCTTGTTGCACAATTACCACCTGTGAAAGGACTATGGTCAGGTCTTCTAGATGGGATGGATGGCAGAGTTCTTTGA

Protein sequence

MVVSMAIIPSCNNFGRPPHCSRTRSHNARNLKFCRRICCHVHRNRVSFNFGPSVADSLRYLQYRRRNLASVTCSASDKPEISTPVIQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERGVLVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
BLAST of Cp4.1LG15g01170 vs. Swiss-Prot
Match: LPA1_ARATH (Protein LOW PSII ACCUMULATION 1, chloroplastic OS=Arabidopsis thaliana GN=LPA1 PE=1 SV=1)

HSP 1 Score: 137.9 bits (346), Expect = 2.2e-31
Identity = 101/345 (29.28%), Postives = 152/345 (44.06%), Query Frame = 1

Query: 33  FCRRICCHVHRNRVSFNFGPSVADSLRYLQYRRRNLASVT-------CSASDKPEISTPV 92
           +  + CCH +R       G    D LR +  R  NL   T        S    PE     
Sbjct: 115 YYNKACCHAYRGE-----GKKAVDCLR-IALRDYNLKFATILNDPDLASFRALPEFKE-- 174

Query: 93  IQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAAL 152
           +Q E RL             K+ SEV +PFR VR FFY  F A+  +       +L+ A+
Sbjct: 175 LQEEARLGGEDIGDNFRRDLKLISEVRAPFRGVRKFFYFAFAAAAGISMFFTVPRLVQAI 234

Query: 153 ANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQN 212
                A  + +      I+ G +     L+  EN  +  Q+ +++R+E+LS L LR+  N
Sbjct: 235 RGGDGAPNLLETTGNAAINIGGIVVMVSLFLWENKKEEEQMVQITRDETLSRLPLRLSTN 294

Query: 213 KVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERGVLVIPF------------ 272
           +V+ +  LR   R VI AG +  +  A + ++ F   LL RGVL++P             
Sbjct: 295 RVVELVQLRDTVRPVILAGKKETVTLAMQKADRFRTELLRRGVLLVPVVWGERKTPEIEK 354

Query: 273 ---------ATDASSLNFEFDEREEMKDITTKRKR--LWRLTPVYMSQWSAWLDDQKKLA 332
                    AT   S+  +FD R +     +K K    ++   V   +W  W+ DQ+   
Sbjct: 355 KGFGASSKAATSLPSIGEDFDTRAQSVVAQSKLKGEIRFKAETVSPGEWERWIRDQQISE 414

Query: 333 GVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGL 348
           GV+    VY+ LR+DGRVR SG G P W  +  +LPP+  + S L
Sbjct: 415 GVNPGDDVYIILRLDGRVRRSGRGMPDWAEISKELPPMDDVLSKL 451

BLAST of Cp4.1LG15g01170 vs. TrEMBL
Match: A0A0A0L902_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G621430 PE=4 SV=1)

HSP 1 Score: 538.5 bits (1386), Expect = 6.2e-150
Identity = 278/349 (79.66%), Postives = 297/349 (85.10%), Query Frame = 1

Query: 8   IPSCNNFGRPPHCSRTRSHNARNLKFCRRICCHVHRNRVSFNFGPSVADSLRYLQYRRRN 67
           I S NN   PPH S T SHNA + KF RR  C VHR  VSF+  P +  SLR+L Y RRN
Sbjct: 10  ISSFNNIAPPPHFSPTPSHNASDFKFRRRSYCRVHRKTVSFSSSPRLPVSLRFLVYGRRN 69

Query: 68  LASVTCSASDKPEISTPVIQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFI 127
           LA+  CSA+DKPEIS                    STAKIRSEVLSPFRSVRMFFYLTFI
Sbjct: 70  LANFICSAADKPEIS--------------------STAKIRSEVLSPFRSVRMFFYLTFI 129

Query: 128 ASGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLA 187
           ASGTLGGLIATTQLL ALANSSRA+EVPDIL GLG+DFGAVA FAFLYFRENNAKNAQLA
Sbjct: 130 ASGTLGGLIATTQLLGALANSSRADEVPDILEGLGVDFGAVALFAFLYFRENNAKNAQLA 189

Query: 188 RLSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERG 247
           RLSREESLSNLKLRVDQNKVI IS LRGIARLVICAGPESFI+EAFK+SEPFTERLLERG
Sbjct: 190 RLSREESLSNLKLRVDQNKVIPISILRGIARLVICAGPESFIIEAFKSSEPFTERLLERG 249

Query: 248 VLVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSD 307
           VLV+P ATD ++LNFEFD+REE+KDITTKRKRLWRLTPVYM++WSAWLD+QKKLAGV+SD
Sbjct: 250 VLVVPLATDVTTLNFEFDDREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVTSD 309

Query: 308 SPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 357
           SPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 310 SPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 338

BLAST of Cp4.1LG15g01170 vs. TrEMBL
Match: A0A061DQ88_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_003773 PE=4 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 1.1e-117
Identity = 216/288 (75.00%), Postives = 247/288 (85.76%), Query Frame = 1

Query: 69  ASVTCSASDKPEISTPVIQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFIA 128
           +SV CSA++KP  S+ +                SS AKIRSEVLSPFRSVRMFFYL FIA
Sbjct: 73  SSVVCSAANKPSSSSEI----------------SSAAKIRSEVLSPFRSVRMFFYLAFIA 132

Query: 129 SGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLAR 188
           SG LGGLIA TQL+AAL N +R+ EVPD+L  LGID  AV+ FAFLYFREN AKNAQ+AR
Sbjct: 133 SGALGGLIAFTQLIAALTNPARSSEVPDLLTSLGIDVAAVSIFAFLYFRENTAKNAQIAR 192

Query: 189 LSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERGV 248
           LSREESLSNLKLRVDQNK+I++S+LRGIARLVICAGP SFI+E+FK+SEPFTE LL+RGV
Sbjct: 193 LSREESLSNLKLRVDQNKIISVSSLRGIARLVICAGPASFILESFKSSEPFTEGLLQRGV 252

Query: 249 LVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSDS 308
           LVIPFATD +SL+ +FD+ E+MK+ITTKRKRLW+LTPVY+S+WS WLD+QKKLAGVS +S
Sbjct: 253 LVIPFATDGNSLSLDFDDSEDMKEITTKRKRLWQLTPVYVSEWSEWLDEQKKLAGVSPES 312

Query: 309 PVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 357
           PVYLSLR+DGRVRGSGVGYPPWNA VAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 313 PVYLSLRLDGRVRGSGVGYPPWNAFVAQLPPVKGLWSGLLDGMDGRVL 344

BLAST of Cp4.1LG15g01170 vs. TrEMBL
Match: A0A0D2Q2X4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G235900 PE=4 SV=1)

HSP 1 Score: 430.6 bits (1106), Expect = 1.8e-117
Identity = 214/288 (74.31%), Postives = 249/288 (86.46%), Query Frame = 1

Query: 69  ASVTCSASDKPEISTPVIQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFIA 128
           +S+ CSA++KP  S+ V                SS AKIRSEVLSPFRSVRMFFYLTFIA
Sbjct: 55  SSIVCSAANKPSSSSQV----------------SSAAKIRSEVLSPFRSVRMFFYLTFIA 114

Query: 129 SGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLAR 188
           SG+LGGLIATTQL+++L N +R+ EVPDIL GLGID GAV+ FAFLYFREN AKNAQLAR
Sbjct: 115 SGSLGGLIATTQLISSLTNPARSSEVPDILTGLGIDIGAVSIFAFLYFRENTAKNAQLAR 174

Query: 189 LSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERGV 248
           LSREESLSNLKLRV+QNK+I++S+LRGIARLVIC+GP SFI+E+FK SEPFTE LLERGV
Sbjct: 175 LSREESLSNLKLRVNQNKIISVSSLRGIARLVICSGPASFILESFKLSEPFTESLLERGV 234

Query: 249 LVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSDS 308
           LV+PFATD +S + +FDE E+MK+IT KRKRLW+L PVY+S+W+ WLD+QKKLAG+S +S
Sbjct: 235 LVVPFATDGNSPSLDFDESEDMKEITEKRKRLWQLAPVYVSEWTEWLDEQKKLAGISPES 294

Query: 309 PVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 357
           PVYLSLR+DGRVRGSGVG+PPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 295 PVYLSLRLDGRVRGSGVGFPPWNALVAQLPPVKGLWSGLLDGMDGRVL 326

BLAST of Cp4.1LG15g01170 vs. TrEMBL
Match: A0A0B0PHJ8_GOSAR (Glutamate--tRNA ligase, chloroplastic/mitochondrial OS=Gossypium arboreum GN=F383_04136 PE=4 SV=1)

HSP 1 Score: 428.3 bits (1100), Expect = 9.0e-117
Identity = 215/288 (74.65%), Postives = 251/288 (87.15%), Query Frame = 1

Query: 70  SVTCSASDKPEISTPVIQSEMRLCV-TVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFIA 129
           ++ CSA++KP  S+ V + E  +    V  +FL   AKIRSEVLSPFRSVRMFFYL FIA
Sbjct: 56  TIVCSAANKPSSSSQVREGEEDMEQGNVEDYFLIR-AKIRSEVLSPFRSVRMFFYLAFIA 115

Query: 130 SGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLAR 189
           SG+LGGLIATTQL+A+L N +R+ EVPDIL GLGID GAV+ FAFLYFREN AKNAQL R
Sbjct: 116 SGSLGGLIATTQLIASLTNPARSSEVPDILTGLGIDIGAVSIFAFLYFRENTAKNAQLTR 175

Query: 190 LSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERGV 249
           LSREESLSNLKLRV+QNK+I++S+LRGIARLVIC+GP SFI+E+FK SEPFTE LLERGV
Sbjct: 176 LSREESLSNLKLRVNQNKIISVSSLRGIARLVICSGPASFILESFKLSEPFTESLLERGV 235

Query: 250 LVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSDS 309
           LV+PFATD +S + +FDE E+MK+IT KRKRLW+L PVY+S+W+ WLD+QKKLAG+S +S
Sbjct: 236 LVVPFATDGNSPSLDFDESEDMKEITEKRKRLWQLAPVYVSEWTEWLDEQKKLAGISPES 295

Query: 310 PVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 357
           PVYLSLR+DGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 296 PVYLSLRLDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 342

BLAST of Cp4.1LG15g01170 vs. TrEMBL
Match: F6HZU3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g04900 PE=4 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 1.7e-115
Identity = 214/289 (74.05%), Postives = 244/289 (84.43%), Query Frame = 1

Query: 68  LASVTCSASDKPEISTPVIQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFI 127
           L+ +TCSAS+KP  S+P                +SSTAKIRSEVLSPFR+VRMFFYL FI
Sbjct: 54  LSIITCSASNKPSSSSPSP--------------ISSTAKIRSEVLSPFRTVRMFFYLAFI 113

Query: 128 ASGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLA 187
           ASG LGGLIATTQL+AAL NSSRA  VPDIL GLGID GAVA FAFLY RE++AKNAQLA
Sbjct: 114 ASGALGGLIATTQLIAALTNSSRAPLVPDILKGLGIDIGAVAIFAFLYSRESSAKNAQLA 173

Query: 188 RLSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERG 247
           RL+REESLSNLKLRVD+ KVI+++ LRGIARLVICAGP  FI E+FK S+PFT+ LL+RG
Sbjct: 174 RLTREESLSNLKLRVDEKKVISVNDLRGIARLVICAGPAPFIAESFKLSQPFTQGLLDRG 233

Query: 248 VLVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSD 307
           VLV+PF TD    +FE++E EEMKDITTKRKRLW+L PVY+S+WS WLD+QKKLAGVS +
Sbjct: 234 VLVVPFVTDGKLPSFEYEESEEMKDITTKRKRLWQLVPVYVSEWSKWLDEQKKLAGVSPE 293

Query: 308 SPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 357
           SPVYLSLRMDGRVRGSGVGYPPWNA VAQLPP+KG+W+GLLDGMDGRVL
Sbjct: 294 SPVYLSLRMDGRVRGSGVGYPPWNAFVAQLPPIKGMWTGLLDGMDGRVL 328

BLAST of Cp4.1LG15g01170 vs. TAIR10
Match: AT4G28740.1 (AT4G28740.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 358.2 bits (918), Expect = 5.8e-99
Identity = 194/353 (54.96%), Postives = 248/353 (70.25%), Query Frame = 1

Query: 5   MAIIPSCNNFGRPPHCSRTRSHNARNLKFCRRICCHVHRNRVSFNFGPSVADSLRYLQYR 64
           MA + S   +    H S+   + A+     RRI  H HR R+ F+         R  Q  
Sbjct: 17  MATLVSSQTYIYHCHISKQALYQAKESYSHRRISRHNHRERLDFSHRNHRLTITRKQQPL 76

Query: 65  RRNLASVTCSASDKPEISTPVIQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYL 124
             N     C A+D+P        SE           +S+ A+IRSEVLSPFRSVRMFFYL
Sbjct: 77  SFN---TVCFAADEP--------SE-----------ISADARIRSEVLSPFRSVRMFFYL 136

Query: 125 TFIASGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNA 184
            FIASG+LGGLIAT++L+ ALAN +R+ EV +I+ GLG+D GA + FAFLYF EN  KNA
Sbjct: 137 AFIASGSLGGLIATSRLIGALANPARSGEVLEIVKGLGVDIGAASLFAFLYFNENKTKNA 196

Query: 185 QLARLSREESLSNLKLRVDQ-NKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERL 244
           Q+ARLSREE+L  LK+RV++ NKVI++  LRG+ARLVICAGP  FI EAFK S+ +T+ L
Sbjct: 197 QMARLSREENLGKLKMRVEENNKVISVGDLRGVARLVICAGPAEFIEEAFKRSKEYTQGL 256

Query: 245 LERGVLVIPFATDASSLNFEFDERE-EMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLA 304
           +ERGV+V+ +ATD +S   EFDE +   ++++ +RK+LWR+TPV++ +W  WL++QKKLA
Sbjct: 257 VERGVVVVAYATDGNSPVLEFDETDIADEEMSQRRKKLWRVTPVFVPEWEKWLNEQKKLA 316

Query: 305 GVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRV 356
            VSSDSPVYLSLR+DGRVR SGVGYPPW A VAQLPPVKG+W+GLLDGMDGRV
Sbjct: 317 NVSSDSPVYLSLRLDGRVRASGVGYPPWQAFVAQLPPVKGMWTGLLDGMDGRV 347

BLAST of Cp4.1LG15g01170 vs. TAIR10
Match: AT1G02910.1 (AT1G02910.1 tetratricopeptide repeat (TPR)-containing protein)

HSP 1 Score: 137.9 bits (346), Expect = 1.2e-32
Identity = 101/345 (29.28%), Postives = 152/345 (44.06%), Query Frame = 1

Query: 33  FCRRICCHVHRNRVSFNFGPSVADSLRYLQYRRRNLASVT-------CSASDKPEISTPV 92
           +  + CCH +R       G    D LR +  R  NL   T        S    PE     
Sbjct: 115 YYNKACCHAYRGE-----GKKAVDCLR-IALRDYNLKFATILNDPDLASFRALPEFKE-- 174

Query: 93  IQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAAL 152
           +Q E RL             K+ SEV +PFR VR FFY  F A+  +       +L+ A+
Sbjct: 175 LQEEARLGGEDIGDNFRRDLKLISEVRAPFRGVRKFFYFAFAAAAGISMFFTVPRLVQAI 234

Query: 153 ANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQN 212
                A  + +      I+ G +     L+  EN  +  Q+ +++R+E+LS L LR+  N
Sbjct: 235 RGGDGAPNLLETTGNAAINIGGIVVMVSLFLWENKKEEEQMVQITRDETLSRLPLRLSTN 294

Query: 213 KVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERGVLVIPF------------ 272
           +V+ +  LR   R VI AG +  +  A + ++ F   LL RGVL++P             
Sbjct: 295 RVVELVQLRDTVRPVILAGKKETVTLAMQKADRFRTELLRRGVLLVPVVWGERKTPEIEK 354

Query: 273 ---------ATDASSLNFEFDEREEMKDITTKRKR--LWRLTPVYMSQWSAWLDDQKKLA 332
                    AT   S+  +FD R +     +K K    ++   V   +W  W+ DQ+   
Sbjct: 355 KGFGASSKAATSLPSIGEDFDTRAQSVVAQSKLKGEIRFKAETVSPGEWERWIRDQQISE 414

Query: 333 GVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGL 348
           GV+    VY+ LR+DGRVR SG G P W  +  +LPP+  + S L
Sbjct: 415 GVNPGDDVYIILRLDGRVRRSGRGMPDWAEISKELPPMDDVLSKL 451

BLAST of Cp4.1LG15g01170 vs. NCBI nr
Match: gi|659093061|ref|XP_008447350.1| (PREDICTED: protein LOW PSII ACCUMULATION 1, chloroplastic [Cucumis melo])

HSP 1 Score: 542.0 bits (1395), Expect = 8.0e-151
Identity = 279/354 (78.81%), Postives = 301/354 (85.03%), Query Frame = 1

Query: 3   VSMAIIPSCNNFGRPPHCSRTRSHNARNLKFCRRICCHVHRNRVSFNFGPSVADSLRYLQ 62
           +S  I+ S NN   PPH S T SHNA + KFCRR C HVHR  VSF+  P +  SLR+  
Sbjct: 5   LSTLIVSSFNNIAPPPHFSPTPSHNAPDFKFCRRSCFHVHRKTVSFSSSPRLPVSLRFHV 64

Query: 63  YRRRNLASVTCSASDKPEISTPVIQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFF 122
           Y RRNLA+   SA+DKPEIS                    STAKIRSEVLSPFRSVRMFF
Sbjct: 65  YGRRNLANYIYSAADKPEIS--------------------STAKIRSEVLSPFRSVRMFF 124

Query: 123 YLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAK 182
           YLTFIASGTLGGLIATTQLL ALANSSRA+EVPDIL GLGIDFGAVA FAFLYFRENNAK
Sbjct: 125 YLTFIASGTLGGLIATTQLLGALANSSRADEVPDILKGLGIDFGAVALFAFLYFRENNAK 184

Query: 183 NAQLARLSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTER 242
           NAQLARLSREESLSNLKLRVDQNKVI ISTLRGIARLVICAGPESF++EAFK+SEPFTE+
Sbjct: 185 NAQLARLSREESLSNLKLRVDQNKVIPISTLRGIARLVICAGPESFVIEAFKSSEPFTEQ 244

Query: 243 LLERGVLVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLA 302
           LLERGVLV+P ATD ++LNFEFDEREE+KDIT+KRK+LWRLTPVYM++WSAWLD+QKKLA
Sbjct: 245 LLERGVLVVPLATDVTTLNFEFDEREEVKDITSKRKKLWRLTPVYMTEWSAWLDEQKKLA 304

Query: 303 GVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 357
           GVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 305 GVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 338

BLAST of Cp4.1LG15g01170 vs. NCBI nr
Match: gi|449468602|ref|XP_004152010.1| (PREDICTED: protein LOW PSII ACCUMULATION 1, chloroplastic [Cucumis sativus])

HSP 1 Score: 538.5 bits (1386), Expect = 8.8e-150
Identity = 278/349 (79.66%), Postives = 297/349 (85.10%), Query Frame = 1

Query: 8   IPSCNNFGRPPHCSRTRSHNARNLKFCRRICCHVHRNRVSFNFGPSVADSLRYLQYRRRN 67
           I S NN   PPH S T SHNA + KF RR  C VHR  VSF+  P +  SLR+L Y RRN
Sbjct: 10  ISSFNNIAPPPHFSPTPSHNASDFKFRRRSYCRVHRKTVSFSSSPRLPVSLRFLVYGRRN 69

Query: 68  LASVTCSASDKPEISTPVIQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFI 127
           LA+  CSA+DKPEIS                    STAKIRSEVLSPFRSVRMFFYLTFI
Sbjct: 70  LANFICSAADKPEIS--------------------STAKIRSEVLSPFRSVRMFFYLTFI 129

Query: 128 ASGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLA 187
           ASGTLGGLIATTQLL ALANSSRA+EVPDIL GLG+DFGAVA FAFLYFRENNAKNAQLA
Sbjct: 130 ASGTLGGLIATTQLLGALANSSRADEVPDILEGLGVDFGAVALFAFLYFRENNAKNAQLA 189

Query: 188 RLSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERG 247
           RLSREESLSNLKLRVDQNKVI IS LRGIARLVICAGPESFI+EAFK+SEPFTERLLERG
Sbjct: 190 RLSREESLSNLKLRVDQNKVIPISILRGIARLVICAGPESFIIEAFKSSEPFTERLLERG 249

Query: 248 VLVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSD 307
           VLV+P ATD ++LNFEFD+REE+KDITTKRKRLWRLTPVYM++WSAWLD+QKKLAGV+SD
Sbjct: 250 VLVVPLATDVTTLNFEFDDREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVTSD 309

Query: 308 SPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 357
           SPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 310 SPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 338

BLAST of Cp4.1LG15g01170 vs. NCBI nr
Match: gi|590715027|ref|XP_007050079.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 431.4 bits (1108), Expect = 1.5e-117
Identity = 216/288 (75.00%), Postives = 247/288 (85.76%), Query Frame = 1

Query: 69  ASVTCSASDKPEISTPVIQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFIA 128
           +SV CSA++KP  S+ +                SS AKIRSEVLSPFRSVRMFFYL FIA
Sbjct: 73  SSVVCSAANKPSSSSEI----------------SSAAKIRSEVLSPFRSVRMFFYLAFIA 132

Query: 129 SGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLAR 188
           SG LGGLIA TQL+AAL N +R+ EVPD+L  LGID  AV+ FAFLYFREN AKNAQ+AR
Sbjct: 133 SGALGGLIAFTQLIAALTNPARSSEVPDLLTSLGIDVAAVSIFAFLYFRENTAKNAQIAR 192

Query: 189 LSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERGV 248
           LSREESLSNLKLRVDQNK+I++S+LRGIARLVICAGP SFI+E+FK+SEPFTE LL+RGV
Sbjct: 193 LSREESLSNLKLRVDQNKIISVSSLRGIARLVICAGPASFILESFKSSEPFTEGLLQRGV 252

Query: 249 LVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSDS 308
           LVIPFATD +SL+ +FD+ E+MK+ITTKRKRLW+LTPVY+S+WS WLD+QKKLAGVS +S
Sbjct: 253 LVIPFATDGNSLSLDFDDSEDMKEITTKRKRLWQLTPVYVSEWSEWLDEQKKLAGVSPES 312

Query: 309 PVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 357
           PVYLSLR+DGRVRGSGVGYPPWNA VAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 313 PVYLSLRLDGRVRGSGVGYPPWNAFVAQLPPVKGLWSGLLDGMDGRVL 344

BLAST of Cp4.1LG15g01170 vs. NCBI nr
Match: gi|823127258|ref|XP_012434346.1| (PREDICTED: protein LOW PSII ACCUMULATION 1, chloroplastic [Gossypium raimondii])

HSP 1 Score: 430.6 bits (1106), Expect = 2.6e-117
Identity = 214/288 (74.31%), Postives = 249/288 (86.46%), Query Frame = 1

Query: 69  ASVTCSASDKPEISTPVIQSEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFIA 128
           +S+ CSA++KP  S+ V                SS AKIRSEVLSPFRSVRMFFYLTFIA
Sbjct: 55  SSIVCSAANKPSSSSQV----------------SSAAKIRSEVLSPFRSVRMFFYLTFIA 114

Query: 129 SGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLAR 188
           SG+LGGLIATTQL+++L N +R+ EVPDIL GLGID GAV+ FAFLYFREN AKNAQLAR
Sbjct: 115 SGSLGGLIATTQLISSLTNPARSSEVPDILTGLGIDIGAVSIFAFLYFRENTAKNAQLAR 174

Query: 189 LSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERGV 248
           LSREESLSNLKLRV+QNK+I++S+LRGIARLVIC+GP SFI+E+FK SEPFTE LLERGV
Sbjct: 175 LSREESLSNLKLRVNQNKIISVSSLRGIARLVICSGPASFILESFKLSEPFTESLLERGV 234

Query: 249 LVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSDS 308
           LV+PFATD +S + +FDE E+MK+IT KRKRLW+L PVY+S+W+ WLD+QKKLAG+S +S
Sbjct: 235 LVVPFATDGNSPSLDFDESEDMKEITEKRKRLWQLAPVYVSEWTEWLDEQKKLAGISPES 294

Query: 309 PVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 357
           PVYLSLR+DGRVRGSGVG+PPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 295 PVYLSLRLDGRVRGSGVGFPPWNALVAQLPPVKGLWSGLLDGMDGRVL 326

BLAST of Cp4.1LG15g01170 vs. NCBI nr
Match: gi|728845017|gb|KHG24460.1| (Glutamate--tRNA ligase, chloroplastic/mitochondrial [Gossypium arboreum])

HSP 1 Score: 428.3 bits (1100), Expect = 1.3e-116
Identity = 215/288 (74.65%), Postives = 251/288 (87.15%), Query Frame = 1

Query: 70  SVTCSASDKPEISTPVIQSEMRLCV-TVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFIA 129
           ++ CSA++KP  S+ V + E  +    V  +FL   AKIRSEVLSPFRSVRMFFYL FIA
Sbjct: 56  TIVCSAANKPSSSSQVREGEEDMEQGNVEDYFLIR-AKIRSEVLSPFRSVRMFFYLAFIA 115

Query: 130 SGTLGGLIATTQLLAALANSSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLAR 189
           SG+LGGLIATTQL+A+L N +R+ EVPDIL GLGID GAV+ FAFLYFREN AKNAQL R
Sbjct: 116 SGSLGGLIATTQLIASLTNPARSSEVPDILTGLGIDIGAVSIFAFLYFRENTAKNAQLTR 175

Query: 190 LSREESLSNLKLRVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERGV 249
           LSREESLSNLKLRV+QNK+I++S+LRGIARLVIC+GP SFI+E+FK SEPFTE LLERGV
Sbjct: 176 LSREESLSNLKLRVNQNKIISVSSLRGIARLVICSGPASFILESFKLSEPFTESLLERGV 235

Query: 250 LVIPFATDASSLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSDS 309
           LV+PFATD +S + +FDE E+MK+IT KRKRLW+L PVY+S+W+ WLD+QKKLAG+S +S
Sbjct: 236 LVVPFATDGNSPSLDFDESEDMKEITEKRKRLWQLAPVYVSEWTEWLDEQKKLAGISPES 295

Query: 310 PVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 357
           PVYLSLR+DGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 296 PVYLSLRLDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 342

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
LPA1_ARATH2.2e-3129.28Protein LOW PSII ACCUMULATION 1, chloroplastic OS=Arabidopsis thaliana GN=LPA1 P... [more]
Match NameE-valueIdentityDescription
A0A0A0L902_CUCSA6.2e-15079.66Uncharacterized protein OS=Cucumis sativus GN=Csa_3G621430 PE=4 SV=1[more]
A0A061DQ88_THECC1.1e-11775.00Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_003773 PE=4 SV=1[more]
A0A0D2Q2X4_GOSRA1.8e-11774.31Uncharacterized protein OS=Gossypium raimondii GN=B456_001G235900 PE=4 SV=1[more]
A0A0B0PHJ8_GOSAR9.0e-11774.65Glutamate--tRNA ligase, chloroplastic/mitochondrial OS=Gossypium arboreum GN=F38... [more]
F6HZU3_VITVI1.7e-11574.05Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g04900 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
AT4G28740.15.8e-9954.96 FUNCTIONS IN: molecular_function unknown[more]
AT1G02910.11.2e-3229.28 tetratricopeptide repeat (TPR)-containing protein[more]
Match NameE-valueIdentityDescription
gi|659093061|ref|XP_008447350.1|8.0e-15178.81PREDICTED: protein LOW PSII ACCUMULATION 1, chloroplastic [Cucumis melo][more]
gi|449468602|ref|XP_004152010.1|8.8e-15079.66PREDICTED: protein LOW PSII ACCUMULATION 1, chloroplastic [Cucumis sativus][more]
gi|590715027|ref|XP_007050079.1|1.5e-11775.00Uncharacterized protein isoform 1 [Theobroma cacao][more]
gi|823127258|ref|XP_012434346.1|2.6e-11774.31PREDICTED: protein LOW PSII ACCUMULATION 1, chloroplastic [Gossypium raimondii][more]
gi|728845017|gb|KHG24460.1|1.3e-11674.65Glutamate--tRNA ligase, chloroplastic/mitochondrial [Gossypium arboreum][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR021883LPA1-like
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009658 chloroplast organization
biological_process GO:0006098 pentose-phosphate shunt
biological_process GO:0008150 biological_process
cellular_component GO:0009507 chloroplast
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016874 ligase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG15g01170.1Cp4.1LG15g01170.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021883Protein of unknown function DUF3493PFAMPF11998DUF3493coord: 103..179
score: 6.0
NoneNo IPR availablePANTHERPTHR35498FAMILY NOT NAMEDcoord: 101..356
score: 1.1E
NoneNo IPR availablePANTHERPTHR35498:SF1SUBFAMILY NOT NAMEDcoord: 101..356
score: 1.1E

The following gene(s) are paralogous to this gene:

None