CmoCh19G003090 (gene) Cucurbita moschata (Rifu)

NameCmoCh19G003090
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionPentatricopeptide repeat (PPR) superfamily protein
LocationCmo_Chr19 : 2301762 .. 2313897 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTGAGCGTTTTGCCTCGTTACCAGTCGGGACTGTGGCGGTACGACATTTGAAATCAGGGTTTTGCTCTACCATGCTCATCCGGAGGTTTCATCGAGCAGCGACATGGGCGACGCCTCTGTTGCGAGACACAACTGTAAGTTTTCGAACTTGTTTTTCTCTTCGTTTGTCTTCTCGAATCTTTGAATGCTTGGATTTCCTGTTTTATGGTTTATCTGTTCGTTAATTGTCTTCATATGCATTACCATTTTCTGATTGAATCGGAAACGAAGTTCTTAGTTTCTCTTTATTTCTGCTTATATTATGATTATTTTTTATGTATTCAACAAACGGCTAATGGAAGGAAGAACTAGGTACCCTTTTTTTAACTTTCCGTATTGCGATTCTTGCATCAGAATTTTTTGGATTACAATCATCTGTTAGTTCTCCTTTGGTCCTGTTCTGCTTAATTCTCTGTTTTGTGTTATTTGTGAAATAATGAACTGTGGTTAAAAGCTCTCCCCATCACTTATTTCTTATTTTTCTATAGTAGGCAATCCAGGATTTACCCTGAACTATGCCTTTGTTGGTCTAGTTTGATTTTGGAACGTCAAGACATTGGTTGGATATGTTTATAGAAGAATTGCGCATTAACTTCCTGTATACTATATTAACGTACGTTAACTTGTTTCAAACTTCTTTTAAGGAATCTCAATTTATCAATATATGAGCACACAGGTAGGACAAATCATGGAGCTTGGAGTCAACAAGCTGCAAATTGGGAACTCTTGTTACTGCACAATGTTACAAAATCAAATGTCTAAACGTTTTGGTGATAAAGATATGACAGATAAGGTATCTGAGATTAAAAGAAAATTGTTCATGCATAGAAAGTATGACGTATTCTTTAGTTTTTTGATGCTGCAACAGAGTAAAATTAGTTTTGTTGATATTTTAAAGGTTAACCTTTTCTTTTCTTCTTTTGGTATAACTATCTTAGCTCTCTCTATTTATATTTCACTATTTGGCGGCAGAAATGATATTAAGGTTGATGAATTATATTTTATTGAAGGGTTTGAAAAAGGAAGAGAGTCCTATTATCTTTGTCACAATTAGGGTTTAGTGCTGGGTCCTTACTTCTCAAGTTTTTATGCTCTTGAATCTTGTAGTTGTCAGTTTCTTCTACCATTGTGCATTATATCTTGACTTACAAAATGTACTTTCTTTCTTTATTTTCGTTTGTTTCTGTTATCCTGTCAAACATTCCACATATTTGATATGCCTCGAGTACTAAAATAGAGTAATAATGATTATGGAAATTACCCTAAATGTATTTGCTGGTAATTTAGTTAACTCTTGAATTGTCTACCAATGGTCCAGGATGTTAACAATAGTAAACCTTTGTACCAGACTTCAGAGCGAAATATTGGAGACATTAGAAAGCACCAAATTGGGGAAAACGTTTCACGGAAGGACAAAATTGACTTTCTTGTAAATACTGTAAGTATCAATGGGAAAATATCTTCCCTCTTTCTTCTTTCTTTTTTTTGGGGGGTGGGGGTGGTTGGTAGTCAATTAAGTCTTCTCAAAATCAGGTATAGGTGTATTTTCATTTGATGGGTAGCTATTATATTTTCTGATTCCAAAAAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAAGAAAAGAAAAGAAAAAAGAAAGACACTTCAAATAATCAAATGCTAAAACTGATTGCTTTGATAATAGTTGCATAAACCAAACACTAATTGACGAGTCTTTTTGTTTCCTATCAAAAGTAATATCAAGATGCCTTTCGCAAGTCCTATTTCCTTCTGTCATTTAATTTATTTACTTCTATAATATATATTTCTTTTTATTCACATTGGGTAAGAAAGCATCAAGAAGTGGTAAACATGAATTTCAGCGAGCTATGATTGTATCTAGATTGTTTGCTCATAACAACTAGAGGGATATTAAAGGTGCTTTGGAAGATCATTTTGAAGTCTCTGTTTTGTTTAATTCTTTCATGGCTGACAAGACTTTACTCAAACTTGATGATGGTAACTCTCCATTGGTTTTAAATCCCGCCGGAAATTGGAAGCTCTGTTGTGATTTTCATTTGCTTATCAAGAAGTGGTCTAGAGATAAGCACGCTCATCTGAATTTCATTGAAAGGTATGGTGGGTGGATCAGAATCAAGAACTTACCTCAACTTATTGGGAACACAATATCTTTGATGCTGTTAAAAATTACTAACACTTAGTATATTGGATTTTACTAAGAGGGACATGGAAGTCAAAAGGCATTTGTGTGAGTTTTTCCTGACTGCTACTGGAATTAATGATAGGAACAGAGGAAATGTGTTTCTTAGATTAGGAGGTGCATTTTTTGATGTCCCAGGGAATGGAGTTTGTATGAGTCTCAATTTTTAGTCATAAAACTTTGAATAAGCTTAAGTGATATTGTAGTTTACCTCAGTGATGCTCAGGGTAACAAAATGGCCAAGGTACTCGGAGGTAGTAGGAACCCGAGATGAAGAGATAATTTTAGTGATAACTACATCATTCCCAATAGTTGGCAACATATTAAACGTTTCATTTCGAGCTTTAGGGCCATTTAAAGACATGGAGTTTCTGAGATATTCAGAAATGGATTTGTACATTCCTCAGATATGGAAGCTTGCTGATCTGTTGACTGAAATAGGGAGTTGAAGGCATTTGCATAAAAATCTTCGAACTATTAGATCATCCAATGTATAGTATAGTAATCGGGCTGAATATCTAATTCAAATTGACATCTGACTCATCATCTGAATCATCGAAAGGAACTGCATTGGAAGATGAATGATCATTCGATGACTTTAAAATATCATGAGAAAATTTGGAACTACCTAGACCATTGTCGTATTCAATCAATACTGATGGGGTATTTTTCTTCAATCTTCTTCCTGGGATAACCTCCTGCTTTTCTAAATGAATCAAAGGGATTGAAGATATTTTCAAAGAACATTTGGTCCGAAGGTACAATATCTTCATTGAATGTGAATGAGATGGGTCTGCAGAATACAAAGGGATTGAAGATTTTTTCCCTGCATAGAATTTTGAATAGGTTACATTAAAACAATCATTTACTTTCTTATTGCTCAAAATTTCTAAAGATTCAAAGACAGTAGGTACTCATCAGTTTGTGCATTGTTTGATAGACTTCCTCATTTTATGCCTTCGTTACATGGGAAGATTGGAGGATTGGTCAGGTTGATCTATTAAAAGTGGGCCCACAGGAATTTCCTCAAGAGGCGCGAGTAAAGGCTCCAAGGGATTCGATAAAGAGCAATTGAACGTATCCTTCTCATTTAGACCTTGATTATAAAATGCAACATCCCCCCGACTCATTTAAGACTATCTTCTAACTCATTGAAGGCAGCCTTCACATGAAGATCACTATCATTAAATGCAACATTCCCTGACTCATTTAAAACCGCATTTTCTGATCCAACTAACCTATGAACTGGTGAGTGTAAAATTTCTTCCTTGGTCATGGTGGTGACGGCTTTATCTTCCATAAATTGCTCTATCCTATTTAAATCCAATGAATTTGAAAAGTCATTTAAAGATAATTCACGGTTGATGATATTAGGAGATTCTAAAGAAGAAGCATCCGAAGGGAAATGCTTCCTTTTTTTCATATCCTTGATCACAATGGTTGCATGAAAGAAAACCACTTAGGGTTTTCTTAACTTCAATACAAGCTCGTTGAAATCAATCAAATTTAGAATATTAGAAGAAATGCGAATAAGGCCCCTGAAATACTCTCCAATTGCTTCAATCGTACTTCGTCTCCAGTTTATTCAAGGTAAGTTGTTGATAGAAATCCAACCTCCATACCCTTCAATAAGCTCCGGATGAGAGTGCTTCTTTTGTGACAGTTTTTCAATTTTCAATAGCAAGTTTCCATAAAATCTCCATTTTCCATCACTAATGTTTACTAATGTTTCCTTGGTTTCCTTCCATTTATTATGAACAAGCAATTTGGAGACAACAAAACGCTCACCGAAATTTTATGTATCACTTCTTTTGCTTCTCTCACCCAAAATGATGGTTCAATCAAGTTGTCAATACAAGAAATCCCTGCAAGCTGATTACTCTTTGGATTATTCCTCTGTTTCATCTCCACAAAGTGGTTATTCTTCAGATTATTCCTCTAATTTGACTTCACAACCTCTGCAGATCTTCTGGTAAATGACTTTTCATTGATCTCTATCTTCATATAATGATCTTCGGGCATTGCCATATTAGGTAGTTGCATTGCGGGCTCACTGTTTTTACTCATGTATCTTTAGGAAATCCCATAGCAATTCCCAGAAAATGACCCATCCATGTTTGGCAAAACCAGCAGACCTTGAGCAATTTCCTTCCTCTTGAAGGTGGCCAAACCACACATCCAATGTGCTAACCTTGGCTTTTCAGATTATTGCTAATGGAAAACCAAGAGGAGATTTATCTAGTTGTTATTCACGTCTTTTACAAAACACATTTTATCTTCAGACTAAATGAAAAAAACGTGTTATGAACACAACTCCTTACCTCCATAATTCAAAATCAACCTAATTATCATGGAGGAAGGTGATTATGAGTATTCATGGAAGATTTTCTTTAGAATCGCTTAAGGATGGTCGTCATGCTTCAAGCCTTAGAAGTCCTTGGTCTAATGTTTCTAAGATCTAGAAAAAGGTTGAATCTTATGTCTCTTTTAAACTTGGAAAAGGTAATCAAATCTTTTTTTGGCAGGACAAAGAGACTGGAGGTATTTCTTTGCAGCATTTTCTTCCTCAATTTTTTGTTCTCTCTTCACCTCCTGATGGTTCTGTTTTTTTAGTTTTGGGATTCTATTCATCTTTCTTGATCTTTAATCTTTCAAAGGTTACTGAAAGACGATGTGATATTGGCTTTTCAATCGTTATTGGGTCCCCTGGATACTTCAATGGTAGATTCTTCTGAGGATTCAAGATCATGGTTATTAGAGCTCTCTGGCTCTTTCTCGGTGAAGTATATCGAGAGAATTAGCTTATGCTACTCCTTTACAGCCAAACCTATTCAAAATAATATGGAAATCTCGTAGTCCAAAGCGTATTAATATTCTGTTGTGGACCATTCCTTTGTAGCAATCACTATTGATTGCCCAGGATGTCCTTGCTCTATATTCTGTTGTGGACCATTCCTTTGTAGCAATCACTATTGATTGCCTAGGATGTATCACCTTTATATTCTCTAAACTTCTCTGGGTATGTGTCACTTTCATACACAACGGTCATGCATATATCACTTTCATGCATATAGGACCCCAAAAAAGTATTGAAGAGGTGAGAAAATGTATAATGATCGCTCACAGGCATGAGATACCACATATATAAAACATTAGTTCTTAAACATGATGCAACATAGAGATTATTTCGTACATAAACGTTCTCATCAAATTTAGCTAGTATCCATAACATCAATCACATCATGACTTGTCATTTCAGGAAAATACATAGTTTAGCATACATCAATCAATCATCTGACCTAGCCTTAAAGCGGTTTAGCATACATCAATCAATCATCTGACCTAGCCTTAAAGCATTCCAAAAGGAAAATCACATTGGCTACACAACCAAAAACAAAAGAGAGGAATATTCCCTATTATGAAGATGTTTTGCCCAACCTAAAAGAAAATTTCGTAGGTAAATCAATAGCTCAAGGATATGAATATTAGACTTAAATCTTGAAAGCTACCATTTACCCAATTGGTTCTTTTCTCAAAGTCGCCCTTAAACCGTATTCAAACAGTCTAAATCTATTCCAACACTACAACAAAAGATGCTGATTGCCACGACTGGCTGTCTACGTGTGGGACGTGTGGGTTGATGCTGATGCCTATTCATTGGAGAAGAAAACGATCCCCCCGAGAGGGGGAAAAGAGATGGAGATAGAAAGAGCCTTTCTAATTTTTGTTTATGACGTCCTAAATGGCTCTTTTTCCTTTCCCATTATTTTTCCTTTTCTACAATCTAGTTAAAAAATTCTAATTAACCCCAAATGAACTTCAAATATAAGAAAAAATCTTTGGGAATACACTTGGGCTCTTAAAATTGAAATACAAAAATTCAATGGATGATTAAATCCTCAAAGTTATGAATTTGAGACGATTAAAAAAAATCTCTCATTCAGAAATTAAATAAATTGATTGTCTCAACTTAACCCACTAAATTTAACTTAACCAAATTAGATCCCAAGAATTCCAATATATCCAATCGAATTCAAACTAATAACCTAAAGATTAAAATAATCTTATCCAAAAATGTAGGCATTACGGATCATGTTTCTTATAATATCTTTATCTTCAATATTAAATCTTGATGTATTTTCTAAAATAATTATACGACCTTGGAGGTTAATGAGTTCTGTAAACGTTGCAGCTTATGGATCTGAGAGATAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAACAAGACTTTCCAATAGCATCCCTTAAGCATGCATTGGCTGTCCTTGAGAAAGAAAACCAGTGGCATAGAGTTGTACAGGTATGAGTTGTACCAGCCATTCTTTATCTACTCTCTTTTCCTTTTACCCACTCTTCCCACTTTCATGTTGAATGGAAATTAGTCTTCTCACTAGCCAAAAGGTGTGAAGGACATTAAATTTTTGGGGCAGGTATGTGAGCATGCAATTAGATGCAAATCCACAATATTACTGCTATATTCCCTTCTCGTGTAAACTGCAGAATTTTAGAGCATTGGCTGTGTGATGCAGTATATAATGGGCTATTTTACCAATTCTGTTTTGTGTAAAGTAAAATCTAAGAGGATGCACAAATAAACCCTAGTTTAAGGAAGAGTAAATAGCAAAGACAGTAAGTAGTGGGCTTTGAATGGCTAAGAATCTGTAAGTTACACTATTTAACACTAATATACAGCCTAAGAAGCACCAACACTCCCATTTAGGTAATGCATTTGCTTCGAACACTCCGATACTTGTTGGACACAGACAGTTACCTTGTAACATATTAAGTGATAATTTTTGTACAAAATGAATTAGTCCGATAATTTTTGTTTTTTGTTTTTTTGATAAGAAACAATTTCACTGGTTAATGAAATTTACAAAAAGAATAGATAATCCATGAGGATTACAAAAAGCTTTTCCAATTGGTCAATAGGGAATTGTAACTAGGGAGCCAGCAAAGTGATTATACACTTTACACCAAGACATAGAATAATAAATGAAGAGATCGAAGAACACTTCAAATGGATGAATGTTCTTTATTTAGAAAAATCCTTTGGTTTCTCTCCTTCCATAGAACCCCAAGGGAAGCCTTGTTTATGTGCATCCACAGTAGAGCTCTTGCTTTTTTAAAGGGGTCACGCAACATAACATTGGTGAGAATATCCTTTGCATCTTTAGGAATTGTCATATACCATCCAAAAGCCTTGAGAATTCTGTTCCAAAATTTCAAAGCAAAAGGGCAGTGTATGAATAAATGGTTCTGTGTTTCATCTTGGCCTTGCCTAAAGGATACCAGCTTGGGGAAGTTTCCAAGTATGGCATTCCCGTTTGGAGGTTGTCTTTGGTAGTGATGGTTCTGTGAGATAATTCCCATAGGAAAATTTTGGGTAACTATCGTCTAGATAATTCTAGCCAATGCCGTCTCCAAAAGGTGAGTTGATTATCCTATATTTAATAACAAAGATCTTGTGCATAAGGGACCATTTGGATTAGGTAACCATCTCTATGAATCCTCCCTCTTTGAGAGAACAATAGGTGCAAGGTCAGGACTGAGAGAAGACTGAGATCAGCCACTCTCAGGCTTCCTCGTCCTTCTAATTTCTGCCAAGTTGCAAGTCCCAGAAGCTGTTATCAGTGTTCCATACCCATAACAATCCAAGTCCACTGCTAGCAGATTTGTCCTCTTTGGGCTTTCCTTTTTGGGCTTACCCATACGATTTTTAAAAGGCGTCTGCTTGGGAGAGGATTCCACACCCTTATAAAGGATGTTTCGTTCTCCTCCCCAACCGACGTGGGATCTTACAATCCACCCCCCCTTCAGGGTCCAGCGTCCTCGCTAGCACTCCTTCCTTTCTCCAATCATGTGGGACTCCCACCAAATCCACCCCCCTTTCAGGGCCCAGCGTCCTTACTGGCACACCATCTCGTGTTTACTCCCCTTTGGGGAACAACCTCTGATTTGATAAGGAAATCACTACAAGGGGAGTAAGATTCGGATTACCTTGTTGATCGAATATCTCAAGACAAGAACATTGTTTGAGATTCAAATCACTCCACAAGCAAGATCGATCATGTCTAGCTTGAATGATTCTTGTTGATCAAATATCTCAAATACTTGTTTGAGATTCGAATCACTCCACAAGCAAGATTGAACATGTCGAGCTTGAATGATTCTACATGCAACCTAAACTACATAGAATTGCCAAGAAACTTAGCCATTGGCTAAAGAAAAGCACAAATGCTTCTTTTAATATATTTTCCAAGTCTACCTACAAATACAACATACAAGGCTTTATATAGCCTAAAAAATGAAACTATTAAAGGCGTGTAACATTCATACTTAATGACCATAATTAGCCATTGTGTAAATGTAACCTAAAGTAAATAAAATGTCTTAAAATACAATAACTCTAAATTGTAACCCACCCAAAATTTACAACAATCAAACTTTATTCTTCTTCAATGTAGCATGAATTGAAACATCTTTTGATAATTTTGACAACCTTTTCTTTACATCTTCATTGAAGTATATTGTATGATTGATGTCTCTTGGTTCATATCACTCCTCGCTAGCACATCACCTGGTGTCTGGTTCTGATACCATTTGTAATAGCCCAAACCTACTGCTAGCAGATATTGTCCTCTTTGGGCTTTCCCTTTCGGGCTTACCCTCAAGTTTTTTAAAACGTGTCTGTTAGGGAGAGGTTTCCACACCCTTATAAAGAGTGTTTCGTTCTCCTCCCCAACCGATGTGGGATCTTACAATACCTCTCTAATGGTAGCCTTTTTGCAATGAGGAATTCTGTAAAGTTGGGGGGTAATGAAGGCCATTGAAGTGTTTCCACCCAAGGATCAGTCCAGAAAAGTGTTGTGGCTCCACCACCCACTTGGTAGCATATTCCGATGGTGATAAGATCTTGGTGCTTGATATTGAACATCCATTGGTAGAGAAGATACATACCTACGTAATGTTGTTTACCAAGCTATATCTTGGTGTAAACTGTCTAATATTTTTACTTTCTATAGTTATACCACCCTCATTGTAAATAGGGAAGGTCTTTTGTTAACCCTACGGATTATACATCCCTTTTGTGAGTTTCAATCATCAATGAAATTATCTCTTAAAAAAAAAAAAACTTCCAAGGACTTCCTTGGTTATCTTGACTGGAAAGGAATGGGCGTCTCTACAAAAAAAGACCTATGTTTTTCAAATCTTTTTTTGATAATTTATTTTTTACTGCCCTCTCTTGGTGTAAACTCTCCCCACTCTTCAAAAATCATAGTCTCACATCGGTTGTTCAAAATGTTTTTTTGAATTTCCATGGCTCACTGCCCTTTTGTAATTTTATTTCACAGTCCATGAAATGATTACCTCACTGTTTCTCACAAAAAAAAAAAAAGCTTATTGATGTAAATGCATAAATTTATTGATTTTAAATTCTCTGAATTCGTGGTTTCGTGTTTGTATCTTTGCTTCTTAGTATCCAACTCATAACATTTCTAAAGAATATATGTGAATCATGTTGAGTTTCTTGTTAGAACAATAGTCAAGTCCAACGGTACCTAGCTTGTTACTATTGAATTTAGACAATGAAATGGGTATATGTATAGGTTATAGAAACAAATGCTGTTAGATTCATGCAGATGTCTTGCAGGTAATCAAATGGATGTTAAGCAAGGGGCAAGGAACCACAATGAATGTCTATGGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGCGGAAGAAGCACATAAGTTTTGGGTCATGAAAATTGGTTCGGATCTACATTCAGTTCCTTGGCAATTGTGCAGAAGCATGATATCAATATACTACCGAAACAAAATGCTAGAAGATCTTGTGAAGGTATAATTCTCTCTTTCATTTAGTTAATGTAATGCTAGTGCCTTCTGGAAATGGCTTTTTGTTTGGAATATTGTTTTTTTCTTTTCCCCATTTTCCCATTAAAGTATTTTGATGATTCTCTTCTTTTGGGTTTGTACCTAGAGTATACTTCTATACTTGATTTTGATATCTCCACATAGCCCACGTGATTAGAGCATATCAAGTGTAACGATTCAGGTTCACCGCTAGTAGATATTGTCTTCTTTGGGCTTTTCCTTTCGGGCTTCCCCCTAATGTTTTTAAAACGCGTCTGCTAGAGAGAGGTTCTACACCCTTATAAAGGGTGTTTCGTTCTCTTCCCCAACCGATGTGGGATCTCACAATCCACCCCCCTTCAGGGCCCAGCGTCCTCGTTGGCACTCGTTCCTTTCTCCAATCGAGGTGGGACCCCACCAAATCCATCCCCCTTCGGGGCCTAGTGTCCTTACTGCACACTGCCTCGTGTCTACCGGAGAGGTTTCCACACCTTATAAAGGGTGTTTTGTTCTCCCCAACATCAAGTTTATGTTTCTTTTTACTAGGCTCAGATATTTTGTTGGCCTTCTCAAACAAAACCTTAGTTTTGTTGTTAAGATCTGAACTAGAGTTCCCTTTCTCAAGTGGTGAATGCTAAATGATTATCATATATTCCTATGGCCAGCTGGATAAAAGCATTAAGAACATCGTAATGTAGATCAAATTTGGAAATAAACAAAGGGCCTTAATAGCATACGAGCATAACAATCTAATGGCTTGGCTACACTACATGGTTCCTCTGGTTGAGAGACATTTTGATCCCTTTTAATTATTTTTAGCTCCCCTTGTCCTCTTTTTGATGTTATTAAAGTTTTTGAATTGGCATCTGGTTTGAATATTAATTATGGCAAGAGTGAACTGTTGGGGATAAACATAAGTCATAACACCTGTGGCAAGAACAATGGATAGAATTCTTTAAGATTTTCTTTGGAAAGGCTCGGAAGGTGATGCTGGTTGGCCTAATGTCAATTGGGAGCAGATTCAACTTCCCAAACTCTTGGACGGCCTTTGTGTTGGAGATCCTTTTAGTAATCCTTTGTACTGGGTTCACTTATCAATGAAATTTGTTACACAGATTGATAATATTTATATATATTGATGCAAAATAGTCGATGTGTCGATGAATTTTAAATTTTTGCGGGTCGGATTGCGTTATCCCCGAGTCTCTGTTATGCTAACCTTGCCTTTTGCCTTGTGTAATGTAAAGCTTTTTAAGAATCTTGAAGCTTTTGGGCGTAAACCTCCAGAAAAATCAATAGTGCAAAGGGTAGCAGATGCTTGTGAGATGCTGGGCTTGGTTGAAGAGAAAGAGAGGGTGCTAGTGAAGTATAATTACCTTTTTACCGATGAGAAGAAAGGGTCCATCAAGAAATATAAGGGAAAAAGAAAATCGACCAAGGGCAATCAAGACAATAGCGACCTTATGAAGAAGGCTCAATGAGTGAAATTAAGAGTTTCTAAATTTATTTTTAATTTTATATCTAATAGTCATTGACAAGTTTGTATATAATTGACTGATC

mRNA sequence

TTGAGCGTTTTGCCTCGTTACCAGTCGGGACTGTGGCGGTACGACATTTGAAATCAGGGTTTTGCTCTACCATGCTCATCCGGAGGTTTCATCGAGCAGCGACATGGGCGACGCCTCTGTTGCGAGACACAACTGTAGGACAAATCATGGAGCTTGGAGTCAACAAGCTGCAAATTGGGAACTCTTGTTACTGCACAATGTTACAAAATCAAATGTCTAAACGTTTTGGTGATAAAGATATGACAGATAAGGATGTTAACAATAGTAAACCTTTGTACCAGACTTCAGAGCGAAATATTGGAGACATTAGAAAGCACCAAATTGGGGAAAACGTTTCACGGAAGGACAAAATTGACTTTCTTGTAAATACTCTTATGGATCTGAGAGATAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAACAAGACTTTCCAATAGCATCCCTTAAGCATGCATTGGCTGTCCTTGAGAAAGAAAACCAGTGGCATAGAGTTGTACAGGTAATCAAATGGATGTTAAGCAAGGGGCAAGGAACCACAATGAATGTCTATGGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGCGGAAGAAGCACATAAGTTTTGGGTCATGAAAATTGGTTCGGATCTACATTCAGTTCCTTGGCAATTGTGCAGAAGCATGATATCAATATACTACCGAAACAAAATGCTAGAAGATCTTGTGAAGCTTTTTAAGAATCTTGAAGCTTTTGGGCGTAAACCTCCAGAAAAATCAATAGTGCAAAGGGTAGCAGATGCTTGTGAGATGCTGGGCTTGGTTGAAGAGAAAGAGAGGGTGCTAGTGAAGTATAATTACCTTTTTACCGATGAGAAGAAAGGGTCCATCAAGAAATATAAGGGAAAAAGAAAATCGACCAAGGGCAATCAAGACAATAGCGACCTTATGAAGAAGGCTCAATGAGTGAAATTAAGAGTTTCTAAATTTATTTTTAATTTTATATCTAATAGTCATTGACAAGTTTGTATATAATTGACTGATC

Coding sequence (CDS)

ATGCTCATCCGGAGGTTTCATCGAGCAGCGACATGGGCGACGCCTCTGTTGCGAGACACAACTGTAGGACAAATCATGGAGCTTGGAGTCAACAAGCTGCAAATTGGGAACTCTTGTTACTGCACAATGTTACAAAATCAAATGTCTAAACGTTTTGGTGATAAAGATATGACAGATAAGGATGTTAACAATAGTAAACCTTTGTACCAGACTTCAGAGCGAAATATTGGAGACATTAGAAAGCACCAAATTGGGGAAAACGTTTCACGGAAGGACAAAATTGACTTTCTTGTAAATACTCTTATGGATCTGAGAGATAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAACAAGACTTTCCAATAGCATCCCTTAAGCATGCATTGGCTGTCCTTGAGAAAGAAAACCAGTGGCATAGAGTTGTACAGGTAATCAAATGGATGTTAAGCAAGGGGCAAGGAACCACAATGAATGTCTATGGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGCGGAAGAAGCACATAAGTTTTGGGTCATGAAAATTGGTTCGGATCTACATTCAGTTCCTTGGCAATTGTGCAGAAGCATGATATCAATATACTACCGAAACAAAATGCTAGAAGATCTTGTGAAGCTTTTTAAGAATCTTGAAGCTTTTGGGCGTAAACCTCCAGAAAAATCAATAGTGCAAAGGGTAGCAGATGCTTGTGAGATGCTGGGCTTGGTTGAAGAGAAAGAGAGGGTGCTAGTGAAGTATAATTACCTTTTTACCGATGAGAAGAAAGGGTCCATCAAGAAATATAAGGGAAAAAGAAAATCGACCAAGGGCAATCAAGACAATAGCGACCTTATGAAGAAGGCTCAATGA
BLAST of CmoCh19G003090 vs. Swiss-Prot
Match: PP322_ARATH (Pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Arabidopsis thaliana GN=At4g18975 PE=2 SV=2)

HSP 1 Score: 146.0 bits (367), Expect = 6.6e-34
Identity = 76/183 (41.53%), Postives = 112/183 (61.20%), Query Frame = 1

Query: 97  LVNTLMDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSK 156
           LV  L  L + KEAVYGAL+ WVAWE +FPI +   AL +L K +QWHRV+Q+ KWMLSK
Sbjct: 101 LVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSK 160

Query: 157 GQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLED 216
           GQG TM  Y  L+ A DMD RA+EA   W M + +   S+P +L   MI++Y  + + + 
Sbjct: 161 GQGATMGTYDILLLAFDMDERADEAESLWNMILHTHTRSIPRRLFARMIALYAHHDLHDK 220

Query: 217 LVKLFKNLEAFGRKPPEKSIVQRVADACEMLGLVEEKE----RVLVKYNYLFTDEKKGSI 276
           ++++F ++E     P E S  +RVA A   L   E ++    R L +Y Y++ + ++  +
Sbjct: 221 VIEVFADMEELKVSPDEDS-ARRVARAFRELNQEENRKLILRRYLSEYKYIYFNGERVRV 280

BLAST of CmoCh19G003090 vs. Swiss-Prot
Match: PP332_ARATH (Pentatricopeptide repeat-containing protein At4g21190 OS=Arabidopsis thaliana GN=EMB1417 PE=2 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 6.4e-29
Identity = 71/192 (36.98%), Postives = 107/192 (55.73%), Query Frame = 1

Query: 97  LVNTLMDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSK 156
           ++  +  L + KE VYGALD+++AWE +FP+  +K AL +LE E +W +++QV KWMLSK
Sbjct: 61  MIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDEKEWKKIIQVTKWMLSK 120

Query: 157 GQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLED 216
           GQG TM  Y  L+ AL  D+R +EA + W       L   P +    MISIYY+  M + 
Sbjct: 121 GQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKFFNKMISIYYKRDMHQK 180

Query: 217 LVKLFKNLEAFGRKPPEKSIVQRVADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKYK 276
           L ++F ++E  G K P  +IV  V      L + ++ E+++ KY        +   +  K
Sbjct: 181 LFEVFADMEELGVK-PNVAIVSMVGKVFVKLEMKDKYEKLMKKY-----PPPQWEFRYIK 240

Query: 277 GKRKSTKGNQDN 289
           G+R   K  Q N
Sbjct: 241 GRRVKVKAKQLN 246

BLAST of CmoCh19G003090 vs. TrEMBL
Match: A0A0A0K3W9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G057510 PE=4 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 2.3e-102
Identity = 180/220 (81.82%), Postives = 198/220 (90.00%), Query Frame = 1

Query: 1   MLIRRFHRAATWATPLLRDTTVGQIMELGVNKLQIGNSCYCTMLQNQMSKRFGDKDMTDK 60
           MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++  DKD  DK
Sbjct: 1   MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDK 60

Query: 61  DVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVA 120
           DVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI FLVNTL+DLRDSKEAVYGALDAWVA
Sbjct: 61  DVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EE
Sbjct: 121 WEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKL 221
           AHKFWVMKIGSDLHSVPWQ+CRSM++IYYRNK LEDLVK+
Sbjct: 181 AHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKV 220

BLAST of CmoCh19G003090 vs. TrEMBL
Match: B9SJZ4_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0577750 PE=4 SV=1)

HSP 1 Score: 332.4 bits (851), Expect = 5.6e-88
Identity = 172/274 (62.77%), Postives = 207/274 (75.55%), Query Frame = 1

Query: 12  WATPLLRDTTVGQIMELGVNKLQIGNSCYC-TMLQNQMSKRFGDKDMTDKDVNNSKPLYQ 71
           W +P     T G++ ++GV +LQ  N  Y  TM+Q Q+S R        +D ++ K    
Sbjct: 2   WRSPAFSSLT-GRLSQVGVARLQCSNGRYSSTMVQAQISNR-NTPSPRPEDQDDYKTTCH 61

Query: 72  TSERNIGDIRKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVAWEQDFPIASL 131
            S ++ G ++K+QIG+NVSRK+KIDFL+ TL+DL+DSKEAVYGALDAWVAWE +FPIASL
Sbjct: 62  NSNQSAGGVQKNQIGKNVSRKEKIDFLLKTLLDLKDSKEAVYGALDAWVAWEHNFPIASL 121

Query: 132 KHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIG 191
           K  L +LEKE QWH+VVQVIKWMLSKGQG TM  YGQLIRALDMDHRA EAH FW+ KIG
Sbjct: 122 KRVLILLEKEQQWHKVVQVIKWMLSKGQGNTMGTYGQLIRALDMDHRANEAHMFWLKKIG 181

Query: 192 SDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKNLEAFGRKPPEKSIVQRVADACEMLGLV 251
            DLHSVPWQLC  MIS+YYRN MLE LVKLFK LEAF RKPP+KSI+Q+VADA EMLG++
Sbjct: 182 LDLHSVPWQLCHRMISVYYRNNMLESLVKLFKGLEAFDRKPPDKSILQKVADAYEMLGML 241

Query: 252 EEKERVLVKYNYLFTDEKKGSIKKYK---GKRKS 282
           EEKERVL KY  LF + +KG  KK +    K+KS
Sbjct: 242 EEKERVLQKYKDLFKETEKGRPKKSRSTLAKKKS 273

BLAST of CmoCh19G003090 vs. TrEMBL
Match: F6I4B4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0062g01340 PE=4 SV=1)

HSP 1 Score: 329.7 bits (844), Expect = 3.6e-87
Identity = 161/235 (68.51%), Postives = 188/235 (80.00%), Query Frame = 1

Query: 57  MTDKDVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALD 116
           + D+   N++P+Y  S ++   + KHQIGENVSRKDKI+FLV TL+DL+DSKEAVYGALD
Sbjct: 32  LVDEGQCNNQPMYHDSGKDAASVHKHQIGENVSRKDKINFLVTTLLDLKDSKEAVYGALD 91

Query: 117 AWVAWEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDH 176
           AWVAWEQ+FPIASLK  L  LEKE QWHRV+QV+KWMLSKGQGTTM  YGQLIRALDMDH
Sbjct: 92  AWVAWEQNFPIASLKRVLITLEKEQQWHRVIQVVKWMLSKGQGTTMGTYGQLIRALDMDH 151

Query: 177 RAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKNLEAFGRKPPEKSI 236
           RAEEAH+FWV KIG+DLHSVPW LC  MIS+YYRN MLE+LVKLFK LEAF RKP +K +
Sbjct: 152 RAEEAHEFWVKKIGTDLHSVPWHLCHRMISVYYRNNMLENLVKLFKGLEAFDRKPQDKLV 211

Query: 237 VQRVADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKYK---------GKRKST 283
           V++VADA EMLGL+EEKER+  KY+YLFT+   G  KK K         G+RK T
Sbjct: 212 VKKVADAYEMLGLLEEKERIFEKYDYLFTETVAGKPKKSKKFLSEKKKSGRRKPT 266

BLAST of CmoCh19G003090 vs. TrEMBL
Match: A0A067LEC7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24025 PE=4 SV=1)

HSP 1 Score: 317.4 bits (812), Expect = 1.9e-83
Identity = 162/280 (57.86%), Postives = 209/280 (74.64%), Query Frame = 1

Query: 12  WATPLLRDTTVGQIMELGVNKLQIGNSCYCTMLQNQMSKRFGDKD--MTDKDVNNSKPLY 71
           W +P +   +  ++  +G  ++Q  N  Y TMLQ+ + K    +   + +  V++     
Sbjct: 2   WRSPAMSSLS-SRLTGVGAARVQSPNCGYNTMLQDHIYKTNVTRSTFLLENRVDHKAAAC 61

Query: 72  QTSERNIGDIRKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVAWEQDFPIAS 131
           + S +N+  ++K+QIGENVSRKDKI FL+N  ++L+DSKEAVYGALDAWVAWE+ FPI S
Sbjct: 62  ENSIQNVRGVQKYQIGENVSRKDKISFLMNMFLNLKDSKEAVYGALDAWVAWERQFPIGS 121

Query: 132 LKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKI 191
           ++ AL  LEKE QWHRVVQVIKWMLSKGQG TM  YGQLIRALDMDHRA+EAH FW  KI
Sbjct: 122 IRRALLTLEKEQQWHRVVQVIKWMLSKGQGNTMGTYGQLIRALDMDHRADEAHLFWSKKI 181

Query: 192 GSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKNLEAFGRKPPEKSIVQRVADACEMLGL 251
           G+DLHSVPW+LC+ M+SIYYRN MLE LVKLFK LEAF RKPPEKSIVQ+VA+A EMLG+
Sbjct: 182 GTDLHSVPWELCKLMLSIYYRNNMLECLVKLFKGLEAFDRKPPEKSIVQKVANAYEMLGM 241

Query: 252 VEEKERVLVKYNYLFTDEKKGSIKKYK-GKRKSTKGNQDN 289
           +EEK+RV  KY +LF++  KG  KK++   +K ++G + N
Sbjct: 242 LEEKDRVEQKYIHLFSETHKGDNKKFRTTSKKKSQGLRQN 280

BLAST of CmoCh19G003090 vs. TrEMBL
Match: V4VTA3_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10021498mg PE=4 SV=1)

HSP 1 Score: 316.6 bits (810), Expect = 3.2e-83
Identity = 167/253 (66.01%), Postives = 193/253 (76.28%), Query Frame = 1

Query: 37  NSCYCTMLQNQMSKRFGDKDMTDKDVNNSKP---LYQTSERNIGDIRKHQIGENVSRKDK 96
           NS Y +  + Q+S +   K M+   +   +    + Q  ERN    R  +IGENV RKDK
Sbjct: 16  NSIYKSAEKIQISNQIIGKAMSMSSLEGQRTNQSVDQYPERNAASTRNFRIGENVPRKDK 75

Query: 97  IDFLVNTLMDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAVLEKENQWHRVVQVIKWM 156
           I+FLVNTL+DL++SKE VYG LDAWVAWEQ+FP+ SLK AL  LEKE QWHRVVQVIKWM
Sbjct: 76  INFLVNTLLDLKNSKEDVYGTLDAWVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWM 135

Query: 157 LSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKM 216
           LSKGQG+TM   GQLIRALDMDHRAEEAHKFW  +IG DLHSVPWQLC+SMI+IYYRN M
Sbjct: 136 LSKGQGSTMGTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNM 195

Query: 217 LEDLVKLFKNLEAFGRKPPEKSIVQRVADACEMLGLVEEKERVLVKYNYLFTDEKKGSIK 276
           LE L+KLFK LEAF RKPPEKSIVQRVADA E+LGL+EEKERVL KY  LFT+++K S K
Sbjct: 196 LERLIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNK 255

Query: 277 KYKGKRKSTKGNQ 287
             K K  S KG +
Sbjct: 256 --KSKSSSMKGKK 266

BLAST of CmoCh19G003090 vs. TAIR10
Match: AT1G04590.2 (AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4))

HSP 1 Score: 282.7 bits (722), Expect = 2.6e-76
Identity = 148/264 (56.06%), Postives = 194/264 (73.48%), Query Frame = 1

Query: 28  LGVNK------LQIGNSCYCTMLQNQMSKRFGDKDMTDKDVNNSKPLYQTSERNIGDIRK 87
           LGVN+      L++ +  Y  +  +  S +   K+  ++D ++S     + + N  + RK
Sbjct: 79  LGVNQTNKPAFLRVQSMSYQFVADSHSSPKRIVKNEDEEDFSDS-----SKKGNAENPRK 138

Query: 88  HQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAVLEKEN 147
           HQIGEN+ +KDKI FLVNTL+D+ D+KEAVYGALDAWVAWE++FPIASLK  +A LEKE+
Sbjct: 139 HQIGENIPKKDKIKFLVNTLLDIEDNKEAVYGALDAWVAWERNFPIASLKIVIASLEKEH 198

Query: 148 QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLC 207
           QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQLC
Sbjct: 199 QWHRMVQVIKWILSKGQGNTMGTYGQLIRALDMDRRAEEAHVIWRKKVGNDLHSVPWQLC 258

Query: 208 RSMISIYYRNKMLEDLV---KLFKNLEAFGRKPPEKSIVQRVADACEMLGLVEEKERVLV 267
             M+ IY+RN ML++LV   KLFK+LE++ RKPP+K IVQ VADA E+LG+++EKERV+ 
Sbjct: 259 LQMMRIYFRNNMLQELVKVMKLFKDLESYDRKPPDKHIVQTVADAYELLGMLDEKERVVT 318

Query: 268 KYNYLF----TDEKKGSIKKYKGK 279
           KY++L     +D+K     + K K
Sbjct: 319 KYSHLLLGTPSDDKPSRSSRKKKK 337

BLAST of CmoCh19G003090 vs. TAIR10
Match: AT4G18975.1 (AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 146.0 bits (367), Expect = 3.7e-35
Identity = 76/183 (41.53%), Postives = 112/183 (61.20%), Query Frame = 1

Query: 97  LVNTLMDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSK 156
           LV  L  L + KEAVYGAL+ WVAWE +FPI +   AL +L K +QWHRV+Q+ KWMLSK
Sbjct: 101 LVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSK 160

Query: 157 GQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLED 216
           GQG TM  Y  L+ A DMD RA+EA   W M + +   S+P +L   MI++Y  + + + 
Sbjct: 161 GQGATMGTYDILLLAFDMDERADEAESLWNMILHTHTRSIPRRLFARMIALYAHHDLHDK 220

Query: 217 LVKLFKNLEAFGRKPPEKSIVQRVADACEMLGLVEEKE----RVLVKYNYLFTDEKKGSI 276
           ++++F ++E     P E S  +RVA A   L   E ++    R L +Y Y++ + ++  +
Sbjct: 221 VIEVFADMEELKVSPDEDS-ARRVARAFRELNQEENRKLILRRYLSEYKYIYFNGERVRV 280

BLAST of CmoCh19G003090 vs. TAIR10
Match: AT4G21190.1 (AT4G21190.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 129.4 bits (324), Expect = 3.6e-30
Identity = 71/192 (36.98%), Postives = 107/192 (55.73%), Query Frame = 1

Query: 97  LVNTLMDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSK 156
           ++  +  L + KE VYGALD+++AWE +FP+  +K AL +LE E +W +++QV KWMLSK
Sbjct: 61  MIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDEKEWKKIIQVTKWMLSK 120

Query: 157 GQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLED 216
           GQG TM  Y  L+ AL  D+R +EA + W       L   P +    MISIYY+  M + 
Sbjct: 121 GQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKFFNKMISIYYKRDMHQK 180

Query: 217 LVKLFKNLEAFGRKPPEKSIVQRVADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKYK 276
           L ++F ++E  G K P  +IV  V      L + ++ E+++ KY        +   +  K
Sbjct: 181 LFEVFADMEELGVK-PNVAIVSMVGKVFVKLEMKDKYEKLMKKY-----PPPQWEFRYIK 240

Query: 277 GKRKSTKGNQDN 289
           G+R   K  Q N
Sbjct: 241 GRRVKVKAKQLN 246

BLAST of CmoCh19G003090 vs. NCBI nr
Match: gi|659110487|ref|XP_008455250.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis melo])

HSP 1 Score: 487.6 bits (1254), Expect = 1.5e-134
Identity = 246/301 (81.73%), Postives = 270/301 (89.70%), Query Frame = 1

Query: 1   MLIRRFHRAATWATPLLRDTTVGQIMELGVNKLQIGNSCYCTMLQNQMSKRFGDKDMTDK 60
           MLIRRF+RAA WATPLLR  TVG+ MELGV++LQ+G S YCTM+Q+QM K+  DKD  +K
Sbjct: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60

Query: 61  DVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVA 120
           DV+NSK L   SE+NIGDIRKH+IGENVSRKDKI FLVNTL+DLRDSKEAVYGALDAWVA
Sbjct: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIASLKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE
Sbjct: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKNLEAFGRKPPEKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQLCRSMI+IYYRNKMLEDLVKLFK+LEAFGRKPP+KSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240

Query: 241 ADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKY--------KGKRKSTKGNQDNSDLM 294
           ADACEMLGL+EEKERVLVKY YLF DEK+ S+KKY        K KRKSTKG++DNS+L+
Sbjct: 241 ADACEMLGLLEEKERVLVKYKYLF-DEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLV 300

BLAST of CmoCh19G003090 vs. NCBI nr
Match: gi|778724238|ref|XP_011658763.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 485.3 bits (1248), Expect = 7.4e-134
Identity = 241/301 (80.07%), Postives = 266/301 (88.37%), Query Frame = 1

Query: 1   MLIRRFHRAATWATPLLRDTTVGQIMELGVNKLQIGNSCYCTMLQNQMSKRFGDKDMTDK 60
           MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++  DKD  DK
Sbjct: 1   MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDK 60

Query: 61  DVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVA 120
           DVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI FLVNTL+DLRDSKEAVYGALDAWVA
Sbjct: 61  DVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EE
Sbjct: 121 WEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKNLEAFGRKPPEKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQ+CRSM++IYYRNK LEDLVKLFK+LEAFGRKPP+KSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRV 240

Query: 241 ADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKY--------KGKRKSTKGNQDNSDLM 294
           ADACEMLGL+EEKERVLVKY YLF DEK+G +KKY        K KRKSTKG +DNS+L+
Sbjct: 241 ADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV 300

BLAST of CmoCh19G003090 vs. NCBI nr
Match: gi|700188441|gb|KGN43674.1| (hypothetical protein Csa_7G057510 [Cucumis sativus])

HSP 1 Score: 380.2 bits (975), Expect = 3.3e-102
Identity = 180/220 (81.82%), Postives = 198/220 (90.00%), Query Frame = 1

Query: 1   MLIRRFHRAATWATPLLRDTTVGQIMELGVNKLQIGNSCYCTMLQNQMSKRFGDKDMTDK 60
           MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++  DKD  DK
Sbjct: 1   MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDK 60

Query: 61  DVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVA 120
           DVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI FLVNTL+DLRDSKEAVYGALDAWVA
Sbjct: 61  DVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EE
Sbjct: 121 WEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKL 221
           AHKFWVMKIGSDLHSVPWQ+CRSM++IYYRNK LEDLVK+
Sbjct: 181 AHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKV 220

BLAST of CmoCh19G003090 vs. NCBI nr
Match: gi|225433730|ref|XP_002269673.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Vitis vinifera])

HSP 1 Score: 339.7 bits (870), Expect = 5.0e-90
Identity = 174/272 (63.97%), Postives = 202/272 (74.26%), Query Frame = 1

Query: 22  VGQIMELGVNKLQIGNSCYCTMLQNQMS--KRFGDKDMTDKDVNNSKPLYQTSERNIGDI 81
           V Q  +LG  ++Q   S Y T  Q QMS     G+        NN +P+Y  S ++   +
Sbjct: 15  VRQFTQLGATRVQTLASSYSTFTQTQMSDTSNVGEVAFLGGQCNN-QPMYHDSGKDAASV 74

Query: 82  RKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAVLEK 141
            KHQIGENVSRKDKI+FLV TL+DL+DSKEAVYGALDAWVAWEQ+FPIASLK  L  LEK
Sbjct: 75  HKHQIGENVSRKDKINFLVTTLLDLKDSKEAVYGALDAWVAWEQNFPIASLKRVLITLEK 134

Query: 142 ENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQ 201
           E QWHRV+QV+KWMLSKGQGTTM  YGQLIRALDMDHRAEEAH+FWV KIG+DLHSVPW 
Sbjct: 135 EQQWHRVIQVVKWMLSKGQGTTMGTYGQLIRALDMDHRAEEAHEFWVKKIGTDLHSVPWH 194

Query: 202 LCRSMISIYYRNKMLEDLVKLFKNLEAFGRKPPEKSIVQRVADACEMLGLVEEKERVLVK 261
           LC  MIS+YYRN MLE+LVKLFK LEAF RKP +K +V++VADA EMLGL+EEKER+  K
Sbjct: 195 LCHRMISVYYRNNMLENLVKLFKGLEAFDRKPQDKLVVKKVADAYEMLGLLEEKERIFEK 254

Query: 262 YNYLFTDEKKGSIKKYK---------GKRKST 283
           Y+YLFT+   G  KK K         G+RK T
Sbjct: 255 YDYLFTETVAGKPKKSKKFLSEKKKSGRRKPT 285

BLAST of CmoCh19G003090 vs. NCBI nr
Match: gi|296089642|emb|CBI39461.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 339.7 bits (870), Expect = 5.0e-90
Identity = 174/272 (63.97%), Postives = 202/272 (74.26%), Query Frame = 1

Query: 22  VGQIMELGVNKLQIGNSCYCTMLQNQMS--KRFGDKDMTDKDVNNSKPLYQTSERNIGDI 81
           V Q  +LG  ++Q   S Y T  Q QMS     G+        NN +P+Y  S ++   +
Sbjct: 11  VRQFTQLGATRVQTLASSYSTFTQTQMSDTSNVGEVAFLGGQCNN-QPMYHDSGKDAASV 70

Query: 82  RKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAVLEK 141
            KHQIGENVSRKDKI+FLV TL+DL+DSKEAVYGALDAWVAWEQ+FPIASLK  L  LEK
Sbjct: 71  HKHQIGENVSRKDKINFLVTTLLDLKDSKEAVYGALDAWVAWEQNFPIASLKRVLITLEK 130

Query: 142 ENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQ 201
           E QWHRV+QV+KWMLSKGQGTTM  YGQLIRALDMDHRAEEAH+FWV KIG+DLHSVPW 
Sbjct: 131 EQQWHRVIQVVKWMLSKGQGTTMGTYGQLIRALDMDHRAEEAHEFWVKKIGTDLHSVPWH 190

Query: 202 LCRSMISIYYRNKMLEDLVKLFKNLEAFGRKPPEKSIVQRVADACEMLGLVEEKERVLVK 261
           LC  MIS+YYRN MLE+LVKLFK LEAF RKP +K +V++VADA EMLGL+EEKER+  K
Sbjct: 191 LCHRMISVYYRNNMLENLVKLFKGLEAFDRKPQDKLVVKKVADAYEMLGLLEEKERIFEK 250

Query: 262 YNYLFTDEKKGSIKKYK---------GKRKST 283
           Y+YLFT+   G  KK K         G+RK T
Sbjct: 251 YDYLFTETVAGKPKKSKKFLSEKKKSGRRKPT 281

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP322_ARATH6.6e-3441.53Pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Arabidop... [more]
PP332_ARATH6.4e-2936.98Pentatricopeptide repeat-containing protein At4g21190 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0K3W9_CUCSA2.3e-10281.82Uncharacterized protein OS=Cucumis sativus GN=Csa_7G057510 PE=4 SV=1[more]
B9SJZ4_RICCO5.6e-8862.77Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0577750 PE=4 SV=1[more]
F6I4B4_VITVI3.6e-8768.51Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0062g01340 PE=4 SV=... [more]
A0A067LEC7_JATCU1.9e-8357.86Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24025 PE=4 SV=1[more]
V4VTA3_9ROSI3.2e-8366.01Uncharacterized protein OS=Citrus clementina GN=CICLE_v10021498mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G04590.22.6e-7656.06 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat... [more]
AT4G18975.13.7e-3541.53 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G21190.13.6e-3036.98 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659110487|ref|XP_008455250.1|1.5e-13481.73PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic ... [more]
gi|778724238|ref|XP_011658763.1|7.4e-13480.07PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic ... [more]
gi|700188441|gb|KGN43674.1|3.3e-10281.82hypothetical protein Csa_7G057510 [Cucumis sativus][more]
gi|225433730|ref|XP_002269673.1|5.0e-9063.97PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic ... [more]
gi|296089642|emb|CBI39461.3|5.0e-9063.97unnamed protein product [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh19G003090.1CmoCh19G003090.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 84..283
score: 7.6
NoneNo IPR availablePANTHERPTHR24015:SF716SUBFAMILY NOT NAMEDcoord: 84..283
score: 7.6