CsaV3_4G007030 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_4G007030
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein, chloroplastic
Location: chr4 : 4749540 .. 4756585 (-)
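
The location field can be read programmatically. The following is a minimal sketch, not part of the database record: `parse_location` is a hypothetical helper name, and the span calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which the page itself does not state.

```python
import re

# Hypothetical helper: splits a location string such as
# "chr4 : 4749540 .. 4756585 (-)" into its parts.
LOCATION_RE = re.compile(
    r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
)

def parse_location(text):
    """Return (seqid, start, end, strand) parsed from a location string."""
    m = LOCATION_RE.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("chr4 : 4749540 .. 4756585 (-)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 7,046 bp on the minus strand of chr4.
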
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATTTGACAAGATTTAAAATCAGTAAGACAACTCCTGTATTGTTTCCCTTCTCTCGTCGGCTGGTCTGTGTGTCTTCCACCCAACCGCATAAAGAACACCATCAGGATCCGCCCTGGCAGTCCCAGGATCAGTTGCATCTTTGGGTATCTTCTGTTCTTTCTCATTCATCTCTCGACTCTTCTAAATGTAGTGCTCTCTTACCCCATTTGTCTCCTTCTCAATTTGATCAGCTCTTCTTCTCCATTGGATTGAAAGCCAACCCCATGACTTGTCTTAATTTTTTTTACTTTGCGTCTAATTCTTTCAAATTTCGATTTACCATCCATTCTTATTGTACATTGATTCTTTTGCTTATTCGTTCTAAGTTTATACCCCCCGCAAGACTGCTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTTGGATTCAGAAAAGTTTCACATTGAGATAGCTAATGCATTGTTTGGTTTAACTTCAGTTGTTGGGCGGTTTGAATGGACACAGGCATTTGATTTATTGATACATGTATACAGCACACAATTCAGAAATCTTGGCTTTAGTTGCGCTGTGGATGTGTTTTATTTGCTTGCTCGTAAGGGTACCTTTCCATCGTTAAAGACTTGTAATTTTTTATTGAGCTCGTTGGTAAAGGCTAATGAATTTGAGAAGTGTTGTGAAGTATTTCGAGTGATGTCCGAAGGAGCTTGTCCAGATGTTTTCTCATTTACGAACGTGATAAATGCTTTGTGCAAGGGAGGGAAGATGGAAAATGCCATTGAGTTATTCATGAAAATGGAGAAGTTGGGGATTTCTCCCAATGTTGTTACTTATAATTGTATTATTAATGGTTTATGCCAGAATGGGAGATTAGACAATGCCTTTGAGCTCAAAGAGAAGATGACAGTGAAAGGGGTACAGCCAAATCTTAAAACTTATGGTGCGCTTATTAATGGTTTGATAAAACTAAACTTTTTTGACAAAGTGAATCATGTTTTAGATGAAATGATTGGTTCGGGTTTTAATCCAAATGTAGTTGTCTTCAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCGAAGGAGCACTTAAGATCAAAGATGTGATGATATCCAAAAATATAACTCCTACTTCAGTTACTTTATATAGTCTCATGCAAGGATTTTGCAAAAGTGATCAAATTGAGCATGCAGAGAATGCCCTTGAGGAGATATTATCAAGTGGGCTATCTATACACCCGGATAATTGTTATTCGGTTGTCCACTGGCTATGTAAAAAGTTCAGGTACCATTCTGCATTCCGATTTACTAAGATGATGTTATCTAGGAACTTCAGGCCTAGTGATCTACTCTTAACCATGTTGGTATGTGGATTGTGCAAGGATGGTAAACATTTAGAAGCAACTGAACTTTGGTTTAGGTTATTGGAGAAAGGGTCTCCAGCAAGTAAGGTGACCTCCAATGCTCTAATACATGGACTTTGTGGGGCTGGTAAATTGCCAGAGGCTTCTAGAATTGTCAAAGAGATGTTAGAGAGGGGTCTTCCAATGGATCGGATCACATACAATGCACTCATCTTAGGTTTTTGCAATGAGGGAAAAGTTGAGGGATGCTTTAGACTTAGAGAAGAGATGACCAAACGAGGAATTCAGCCAGATATCTATACTTACAATTTTCTATTGCGTGGACTGTGCAATGTAGGAAAATTGGATGATGCTATTAAACTTTGGGATGAATTCAAAGCTAGTGGGCTGATTTCTAACATTCACACTTACGGGATAATGATGGAAGGTTATTGTAAAGCTAACAGAATCGAAGATGTTGAAAATTTATTTAATGAATTGCTCTCTAAGAAAATGGAGCTGAATTCCATTGTCTACAATATAATTATCAAAGCACATTGCCAGAATGGAAATGTAGCTGCAGCTTTGCAACTTCTTGAAAATATGAAAAGCAAGGGAATTTTACCAAATTGTGCCACGTATTC
TTCTCTAATACACGGCGTGTGCAACATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAAAGGAAGGATTTGTGCCGAATGTTGTTTGCTATACTGCATTAATTGGCGGTTATTGTAAGCTGGGGCAAATGGATACTGCTGAATCTACTTGGCTTGAGATGATCTCTTTTAACATACATCCTAACAAATTTACCTACACTGTCATGATCGACGGCTACTGTAAATTAGGGAATATGGAAAAAGCAAATAACCTTCTGATAAAAATGAAAGAAAGTGGAATCGTCCCAGATGTTGTTACTTACAATGTCTTGACTAATGGATTTTGTAAGGCAAATGACATGGACAATGCTTTTAAAGTATGTGATCAAATGGCCACCGAAGGATTACCTGTAGATGAAATTACTTACACTACACTCGTACATGGTTGGAATCCACCTACAATCACTGGCCAAGACTGATCGAATTTCTGCAGAGGTTCTTTGTTACCTCAGTATCTGTTGATATTGTCATTATTGTTTCATGTTCAATTAAACTGAAAATTACTTTTGTTTTATCCTTGAACTTTTCATCAGCGACCAGATCCGTCAATATTGTTTCTTTGTGCATTGAGGCTATTCTCCTTCTCTTGTTCCAGACTTCAAATGAAAGAGCCAAAAAGGAAGAGTCGTCCTTAGTCTCTGGAGGTTGGTAGGCCTTTTCTTTATAGTGTATTCCATTTTCGCTTTGTGTATAGTATCATTAGTTTGTATGTGGTCTTACTACAAGTTAAGCATATATTTTGAAAACTTGTTATTATTCTTTAACAGATCAGGGAAGCAGAAGTGCTTAGCTCTTATTGATATAAAATGATGAGAATTTGAACGTAGGATGTTCTATCAAAGATCCTCAGAGAAGAAAGATGGCAATTGATTTGATAATTAGAATGACATTGGTTCTCCGACTCAAAGCCAAGGATGAAAGCTGACAAACATATTAGGTTGGGTATGTCATTTTTAGTGTAATATATTTGGGATGGAAAGCTGGGACAGTTAAGGTTTTTTCATTACTTACAATGAGAATGTATAGTGTTGATGCAGATTTTGCAGGCTTGGATCTGCATGTAGCCGTTCTCAGGGTGCAATGTTGTATTTGTGAGTTGTTTACTGGCATATTGGAATTGGAGCATAGAAAGCACTTTCACTTCCATGGATGGGTAAGTCCGTCTACTCCATCTTGTCATCAATCTTGTACTCAGAAATGACAACAATATGCTCTTCTGTTATTGTGTTTCTTCTATAACTGAAAATGCTGTAAAGCACTATATGCATTTTTTTCTGTAACTGCGGTGTTGGGTCATTACCGTACACATCACTGACCACTAAGCATTTGGAGCTTTAGGCCCGAGGTCTCTGAACTGCACATCTAAGTTCTTTCTTCTTCTTCTCTTCTTGTTCATGTTTTGTTGAAATTATTGTTATTAGGAGTTTGCAAATTGCACATTTAACTATTTGAGTTCCATTTTAAAAAACTGTTTAAGTTGAAGTGCTGAAAAGTAATATAATTCCGTTAAATACTTTTATTTTTGTTATAGAAAAGAAATCTGGCTTAAAGGAAAAATATATAATGAATGCCCTTAATTAACAGACTTGGTTGTATAGGTTTATTTATTTATTTCTAAAGTTGAATAATATATAGGGTGGGGATTCAAACTTTTGATCTTTAGATCTTTAGATCTTGACCAGTTGATGCTTATATTGGTTTTTGTTGTACAGATTGATTGTACTTTGAGTTCAAAAGATAACTTGTTAGATGATACTATTTTATTTATTTAAGACCCATTTAAGTCACTAGACTTGAGAAAATACTCATTTTGGTACTTGTCATTACTGTGCAGTTAAATATTGAGCAAAGGAATGGCGAACTTTGAATATGTATATTAACTCTTTGAGGTGAGTAATCATTGGAATATAACTCAGATAATTGCTCAACGTATGCTTGCGTCAATTTGAT
ATTAGGCATATTTAAAGTGATGTTATCCGTTTGTCAAATAAGTGATGGCAGTATGATACTAAAGACAAAACAATTATCAATTATTTTGTTAAGTTTTGGACCCAAGTAATCAATTTGAAAGTTAAAAGAACAGAATCGACTTAATTGGAAAATTATCAAACTGGCCTCAAAATCTATATATAAGATGTTTTGTTCACCATTTAGTCATTTAGCAACCTATGAACCTACCAACTTGAATGCAACTCAACTGGTTGAAGTATATACCCTTGACTTAAAGGCTCAAAGATTCAAATCCTCCTACAGATGCTGTTGAACTAAAACACATTACATGTGTGTATAGCAAGCATGGTCTGCTTAATTATATAGAAGTTTTTGTTTGATGCATATCTGATCTATTTTGCCGTTTGAAATGTTACTTGTTTTTTCTGCTTGTTCATTTCAAAATTCTTCAGAATATTACGTACCCAACCCATCTAATCTTCTTTCCAGTCTTCTTTATTGCCCATGTTACCCAGATTTTTTTTCCATTGGGGTTTTTATTTATTTATTTATTGCAATATATGTGTCTACCTCTCTTCCTGCCCCTTTTTGGTTCTTCCCCTTTTTTCCGGTCAGCGTGCTTTGCATGTGTGTATTTTGTTTTGGAAAATGGTAACAGAATAAATTACGAACAAGATTAAAAGATAACGGGGACAAAATAGACGAAGAGGAATGGGATAATCTTGGAAGTGTGGAAATCACATGCATTATCAATTTATAGGAAATAAACCAATGTTAATATCTTTTGCATTTGATTAAATCATAAGCAGTTAGGAAAAAATCAGCACTTGTTAGATAGAGGAGTTCAATATCACACAACTTGAACCTAATCGGTATTCTGTGAAAACAAATGCTTATGAAACACATTTATCTAAATTATATTTAAGTATTATGATATATAATATTAATGAATTGAAATATTAATTTTTAAGGAAATCAAATTATACCAAAAGTAATCCATGTATGATATCTGATAGACCCCCTATCATCCACCAATATACTAGACGAGGGAAGAAGAATTAGGGTAGGCACGTGTGAGTATTAGTGTAGGCACGTGTGAGTATTAGTGTAGGCACGTGTGAGTTTGTTAGGGTAATTAGTTAGGATATGTTTTATAAATATGTTTGAATTATTGGAGGGGAAGGGTAGTTAGGTTTTGAGATTTTAGTCAGGAATCTATGCACCTTTGAGAGAGAAGGTAACGAGAGATTCTTCTTTGATATTGTATTCAATTTGGTTTAATTATCAATAAGAGTTCCATTGCCTTAGTGTTCTATCAATATCCTCTTATGTGGAGAAAAAAAATCTAGAGAGAGTTTAATGGTTCAATAAAAACTTCTCCGTCTGGAAATTGACATGATTGACTCATGTTTTCTTTGATTTTGCTGTGGTATTCCAATCTACCCAGTTTAGAAATCAATTTCAGCCACATGGAATTCGAGTAAGCCTGCTCTTCATTCATTCTGTCACTGGCGTTAACAATCTCTATCCATATTTAGTCACTGGAAATGTATGGTATAAAACCCATTTGTTGATTATAGTGTCAATATATCTATTCCTTAATTTTCTTAAAACAGCTTGTCTCAGAGCAACACGGTTCTTGTTTGTAGATATCGTCAGCAAGCTTGTAAGTGTAGTTGATTCTAGTTAATTATATAATAAACACAACCACAAATTATTCGTAATATTTTTGTGCTTGTTGATTTGTGAATCTTAATATTTACCTCTAAATTGGACTTATTGAGTTTAATTGATACAATTGCTTTGAAACGGATATTGGTGACGTCATCAGTTTAACAACAACGCTTGGCTCTGTTTGCTTTTTGTGAATTCAGACGCAATAAGGTAATCTGGATGGAGCAACAGGCCAATGTTAAACACCCTTTTAGATGTTTTCATCGTTAAACCAGTTGACCTGGGTGTTAACTGCAGCCCCACCGCCATCGGGAATGCTGTTAAGG
ACCTTCTCGTGGCCTTTAGGGTAATAAATTCCAGTTCGCCCATCGAGCACCCAAGGAGTCACTTCACCCCCCACATGGCTCTGGTTGCTCTTCAGTTCCTTGCTTTCTGTTTCAACTCTCTCCCTCATCATATCTGGTGAAGAGCTTTTCCTGTGGAAGCCCTTTCGCCCAATCGGTCTGCCAAAGAATGTAACCGAATCAGAATCAGAAAAACAAGGTAATACTCTGTCTTGTTATTCCATGTGTTGTGTTTTACATCTCACGTTAATACAATTTTAAACAATGTACTAATTGAACCCAGCTAACGACGGTATTCTGAATGAAACCTAATTTGACTTTGGTTCCAAGACTGTCCTTTGTTTTGGTCTGCAGCCTTCTGATCATAGATCAATGTCAAGATAATGGTGAAGATAGAAGTAAATATTTCAAATCTTATTCTGATGTTCTTAAAAACTGCTTTCTTGTAGAAAAACAGAATCATAAACAAAAGAAAAGAATTAGTTGATGGGTGAGTTGAAGAAGGCTCTTACTTGAAGAACATGGAGTTCATTGTGTGAGCCCCTCTTACAAATGCTGTGGCCATACCTTCTTCCAATAAGGTAAGTGGATTGTGATGAACAATGGGATTTGGTTGGTTTTTATTTATAGGTGAAAAAAGAGGCTATCCGTTTCATCCATCCCAATGAAGATTCTGAAGAATGGGAAGAATTTGTCCTTTTACTCCTTGATACTGCCCCCCCATTCCTTGGAGATGGTTTGCGAAAATGAACCTTCAAAATTCCTCTTCCCATACTTCATTAACTGTATATGTACATGAACCTAATGGGTTGGCTTTCCCTGAGGAGCTATAGTATGGGCTTAGGCTTTGTGGGCTTTACAGATGAACTAAAGTGTAGATTTGGGCCTAGTGGTTCTTTGGCCCATTAATGAAACCATGAATTGTCGTTGTTAGGACTATTAATACTCAAGTGGTGAATTGATGATTTTGAAATCCTTAATGAAAGGAATGATATTTTCCCCTCTTTTTGAGACTCAACACGAAAGGG

mRNA sequence

ATGCATTTGACAAGATTTAAAATCAGTAAGACAACTCCTGTATTGTTTCCCTTCTCTCGTCGGCTGGTCTGTGTGTCTTCCACCCAACCGCATAAAGAACACCATCAGGATCCGCCCTGGCAGTCCCAGGATCAGTTGCATCTTTGGGTATCTTCTGTTCTTTCTCATTCATCTCTCGACTCTTCTAAATGTAGTGCTCTCTTACCCCATTTGTCTCCTTCTCAATTTGATCAGCTCTTCTTCTCCATTGGATTGAAAGCCAACCCCATGACTTGTCTTAATTTTTTTTACTTTGCGTCTAATTCTTTCAAATTTCGATTTACCATCCATTCTTATTGTACATTGATTCTTTTGCTTATTCGTTCTAAGTTTATACCCCCCGCAAGACTGCTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTTGGATTCAGAAAAGTTTCACATTGAGATAGCTAATGCATTGTTTGGTTTAACTTCAGTTGTTGGGCGGTTTGAATGGACACAGGCATTTGATTTATTGATACATGTATACAGCACACAATTCAGAAATCTTGGCTTTAGTTGCGCTGTGGATGTGTTTTATTTGCTTGCTCGTAAGGGTACCTTTCCATCGTTAAAGACTTGTAATTTTTTATTGAGCTCGTTGGTAAAGGCTAATGAATTTGAGAAGTGTTGTGAAGTATTTCGAGTGATGTCCGAAGGAGCTTGTCCAGATGTTTTCTCATTTACGAACGTGATAAATGCTTTGTGCAAGGGAGGGAAGATGGAAAATGCCATTGAGTTATTCATGAAAATGGAGAAGTTGGGGATTTCTCCCAATGTTGTTACTTATAATTGTATTATTAATGGTTTATGCCAGAATGGGAGATTAGACAATGCCTTTGAGCTCAAAGAGAAGATGACAGTGAAAGGGGTACAGCCAAATCTTAAAACTTATGGTGCGCTTATTAATGGTTTGATAAAACTAAACTTTTTTGACAAAGTGAATCATGTTTTAGATGAAATGATTGGTTCGGGTTTTAATCCAAATGTAGTTGTCTTCAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCGAAGGAGCACTTAAGATCAAAGATGTGATGATATCCAAAAATATAACTCCTACTTCAGTTACTTTATATAGTCTCATGCAAGGATTTTGCAAAAGTGATCAAATTGAGCATGCAGAGAATGCCCTTGAGGAGATATTATCAAGTGGGCTATCTATACACCCGGATAATTGTTATTCGGTTGTCCACTGGCTATGTAAAAAGTTCAGGTACCATTCTGCATTCCGATTTACTAAGATGATGTTATCTAGGAACTTCAGGCCTAGTGATCTACTCTTAACCATGTTGGTATGTGGATTGTGCAAGGATGGTAAACATTTAGAAGCAACTGAACTTTGGTTTAGGTTATTGGAGAAAGGGTCTCCAGCAAGTAAGGTGACCTCCAATGCTCTAATACATGGACTTTGTGGGGCTGGTAAATTGCCAGAGGCTTCTAGAATTGTCAAAGAGATGTTAGAGAGGGGTCTTCCAATGGATCGGATCACATACAATGCACTCATCTTAGGTTTTTGCAATGAGGGAAAAGTTGAGGGATGCTTTAGACTTAGAGAAGAGATGACCAAACGAGGAATTCAGCCAGATATCTATACTTACAATTTTCTATTGCGTGGACTGTGCAATGTAGGAAAATTGGATGATGCTATTAAACTTTGGGATGAATTCAAAGCTAGTGGGCTGATTTCTAACATTCACACTTACGGGATAATGATGGAAGGTTATTGTAAAGCTAACAGAATCGAAGATGTTGAAAATTTATTTAATGAATTGCTCTCTAAGAAAATGGAGCTGAATTCCATTGTCTACAATATAATTATCAAAGCACATTGCCAGAATGGAAATGTAGCTGCAGCTTTGCAACTTCTTGAAAATATGAAAAGCAAGGGAATTTTACCAAATTGTGCCACGTATTC
TTCTCTAATACACGGCGTGTGCAACATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAAAGGAAGGATTTGTGCCGAATGTTGTTTGCTATACTGCATTAATTGGCGGTTATTGTAAGCTGGGGCAAATGGATACTGCTGAATCTACTTGGCTTGAGATGATCTCTTTTAACATACATCCTAACAAATTTACCTACACTGTCATGATCGACGGCTACTGTAAATTAGGGAATATGGAAAAAGCAAATAACCTTCTGATAAAAATGAAAGAAAGTGGAATCGTCCCAGATGTTGTTACTTACAATGTCTTGACTAATGGATTTTGTAAGGCAAATGACATGGACAATGCTTTTAAAGTATGTGATCAAATGGCCACCGAAGGATTACCTGTAGATGAAATTACTTACACTACACTCGTACATGCGACCAGATCCGTCAATATTGTTTCTTTGTGCATTGAGGCTATTCTCCTTCTCTTGTTCCAGACTTCAAATGAAAGAGCCAAAAAGGAAGAGTCGTCCTTAGTCTCTGGAGATTTTGCAGGCTTGGATCTGCATGTAGCCGTTCTCAGGGTGCAATGTTGTATTTGTGAGTTGTTTACTGGCATATTGGAATTGGAGCATAGAAAGCACTTTCACTTCCATGGATGGTTAAATATTGAGCAAAGGAATGGCGAACTTTGA

Coding sequence (CDS)

ATGCATTTGACAAGATTTAAAATCAGTAAGACAACTCCTGTATTGTTTCCCTTCTCTCGTCGGCTGGTCTGTGTGTCTTCCACCCAACCGCATAAAGAACACCATCAGGATCCGCCCTGGCAGTCCCAGGATCAGTTGCATCTTTGGGTATCTTCTGTTCTTTCTCATTCATCTCTCGACTCTTCTAAATGTAGTGCTCTCTTACCCCATTTGTCTCCTTCTCAATTTGATCAGCTCTTCTTCTCCATTGGATTGAAAGCCAACCCCATGACTTGTCTTAATTTTTTTTACTTTGCGTCTAATTCTTTCAAATTTCGATTTACCATCCATTCTTATTGTACATTGATTCTTTTGCTTATTCGTTCTAAGTTTATACCCCCCGCAAGACTGCTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTTGGATTCAGAAAAGTTTCACATTGAGATAGCTAATGCATTGTTTGGTTTAACTTCAGTTGTTGGGCGGTTTGAATGGACACAGGCATTTGATTTATTGATACATGTATACAGCACACAATTCAGAAATCTTGGCTTTAGTTGCGCTGTGGATGTGTTTTATTTGCTTGCTCGTAAGGGTACCTTTCCATCGTTAAAGACTTGTAATTTTTTATTGAGCTCGTTGGTAAAGGCTAATGAATTTGAGAAGTGTTGTGAAGTATTTCGAGTGATGTCCGAAGGAGCTTGTCCAGATGTTTTCTCATTTACGAACGTGATAAATGCTTTGTGCAAGGGAGGGAAGATGGAAAATGCCATTGAGTTATTCATGAAAATGGAGAAGTTGGGGATTTCTCCCAATGTTGTTACTTATAATTGTATTATTAATGGTTTATGCCAGAATGGGAGATTAGACAATGCCTTTGAGCTCAAAGAGAAGATGACAGTGAAAGGGGTACAGCCAAATCTTAAAACTTATGGTGCGCTTATTAATGGTTTGATAAAACTAAACTTTTTTGACAAAGTGAATCATGTTTTAGATGAAATGATTGGTTCGGGTTTTAATCCAAATGTAGTTGTCTTCAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCGAAGGAGCACTTAAGATCAAAGATGTGATGATATCCAAAAATATAACTCCTACTTCAGTTACTTTATATAGTCTCATGCAAGGATTTTGCAAAAGTGATCAAATTGAGCATGCAGAGAATGCCCTTGAGGAGATATTATCAAGTGGGCTATCTATACACCCGGATAATTGTTATTCGGTTGTCCACTGGCTATGTAAAAAGTTCAGGTACCATTCTGCATTCCGATTTACTAAGATGATGTTATCTAGGAACTTCAGGCCTAGTGATCTACTCTTAACCATGTTGGTATGTGGATTGTGCAAGGATGGTAAACATTTAGAAGCAACTGAACTTTGGTTTAGGTTATTGGAGAAAGGGTCTCCAGCAAGTAAGGTGACCTCCAATGCTCTAATACATGGACTTTGTGGGGCTGGTAAATTGCCAGAGGCTTCTAGAATTGTCAAAGAGATGTTAGAGAGGGGTCTTCCAATGGATCGGATCACATACAATGCACTCATCTTAGGTTTTTGCAATGAGGGAAAAGTTGAGGGATGCTTTAGACTTAGAGAAGAGATGACCAAACGAGGAATTCAGCCAGATATCTATACTTACAATTTTCTATTGCGTGGACTGTGCAATGTAGGAAAATTGGATGATGCTATTAAACTTTGGGATGAATTCAAAGCTAGTGGGCTGATTTCTAACATTCACACTTACGGGATAATGATGGAAGGTTATTGTAAAGCTAACAGAATCGAAGATGTTGAAAATTTATTTAATGAATTGCTCTCTAAGAAAATGGAGCTGAATTCCATTGTCTACAATATAATTATCAAAGCACATTGCCAGAATGGAAATGTAGCTGCAGCTTTGCAACTTCTTGAAAATATGAAAAGCAAGGGAATTTTACCAAATTGTGCCACGTATTC
TTCTCTAATACACGGCGTGTGCAACATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAAAGGAAGGATTTGTGCCGAATGTTGTTTGCTATACTGCATTAATTGGCGGTTATTGTAAGCTGGGGCAAATGGATACTGCTGAATCTACTTGGCTTGAGATGATCTCTTTTAACATACATCCTAACAAATTTACCTACACTGTCATGATCGACGGCTACTGTAAATTAGGGAATATGGAAAAAGCAAATAACCTTCTGATAAAAATGAAAGAAAGTGGAATCGTCCCAGATGTTGTTACTTACAATGTCTTGACTAATGGATTTTGTAAGGCAAATGACATGGACAATGCTTTTAAAGTATGTGATCAAATGGCCACCGAAGGATTACCTGTAGATGAAATTACTTACACTACACTCGTACATGCGACCAGATCCGTCAATATTGTTTCTTTGTGCATTGAGGCTATTCTCCTTCTCTTGTTCCAGACTTCAAATGAAAGAGCCAAAAAGGAAGAGTCGTCCTTAGTCTCTGGAGATTTTGCAGGCTTGGATCTGCATGTAGCCGTTCTCAGGGTGCAATGTTGTATTTGTGAGTTGTTTACTGGCATATTGGAATTGGAGCATAGAAAGCACTTTCACTTCCATGGATGGTTAAATATTGAGCAAAGGAATGGCGAACTTTGA

Protein sequence

MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGSGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMMEGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYNVLTNGFCKANDMDNAFKVCDQMATEGLPVDEITYTTLVHATRSVNIVSLCIEAILLLLFQTSNERAKKEESSLVSGDFAGLDLHVAVLRVQCCICELFTGILELEHRKHFHFHGWLNIEQRNGEL
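
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under the assumption that the standard genetic code applies (reasonable for a nuclear gene, but not stated on the page); `translate` and `CODON_TABLE` are illustrative names, and only the first ten codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
print(translate("ATGCATTTGACAAGATTTAAAATCAGTAAG"))
```

The result can be compared with the first ten residues of the protein sequence (MHLTRFKISK). Passing the full CDS to the same function would be the way to check the complete protein.
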
BLAST of CsaV3_4G007030 vs. NCBI nr
Match: XP_004149000.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Cucumis sativus] >XP_011653231.1 PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Cucumis sativus] >XP_011653232.1 PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Cucumis sativus] >XP_011653233.1 PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Cucumis sativus] >XP_011653234.1 PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Cucumis sativus])

HSP 1 Score: 547.7 bits (1410), Expect = 7.1e-152
Identity = 271/271 (100.00%), Positives = 271/271 (100.00%), Query Frame = 0

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD
Sbjct: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120
           SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI
Sbjct: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC
Sbjct: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEK 272
           PDVFSFTNVINALCKGGKMENAIELFMKMEK
Sbjct: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEK 271

BLAST of CsaV3_4G007030 vs. NCBI nr
Match: KGN53456.1 (hypothetical protein Csa_4G055990 [Cucumis sativus])

HSP 1 Score: 547.7 bits (1410), Expect = 7.1e-152
Identity = 271/271 (100.00%), Positives = 271/271 (100.00%), Query Frame = 0

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD
Sbjct: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120
           SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI
Sbjct: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC
Sbjct: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEK 272
           PDVFSFTNVINALCKGGKMENAIELFMKMEK
Sbjct: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEK 271

BLAST of CsaV3_4G007030 vs. NCBI nr
Match: XP_016901180.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 [Cucumis melo] >XP_016901181.1 PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 [Cucumis melo] >XP_016901182.1 PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 [Cucumis melo] >XP_016901183.1 PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 515.0 bits (1325), Expect = 5.1e-142
Identity = 252/271 (92.99%), Positives = 260/271 (95.94%), Query Frame = 0

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKI+KT PVLFPFSRRL CVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLS+SSLD
Sbjct: 1   MHLTRFKINKTIPVLFPFSRRLACVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSNSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120
           SSKCSALLPHLSP QFDQLFFSIGLKANPMTCLNFFYFAS+SFKFRFTIHSYC LILLL+
Sbjct: 61  SSKCSALLPHLSPFQFDQLFFSIGLKANPMTCLNFFYFASDSFKFRFTIHSYCILILLLV 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
            SKF+PPARLLLIRLIDGNLPVLN D +KFHIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGNLPVLNSDFKKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGF CA+DVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVF+VMSEG C
Sbjct: 181 YSTQFRNLGFGCAIDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFQVMSEGVC 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEK 272
           PDVFSFTNVINALCKGGKME A ELFMKMEK
Sbjct: 241 PDVFSFTNVINALCKGGKMEKATELFMKMEK 271
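
The identity and positives percentages in each HSP summary are simply the matched count divided by the alignment length. The sketch below parses one summary line and recomputes both values; `parse_hsp_summary` and `percent` are hypothetical helper names, and the regular expression is written only for the line layout shown in these reports.

```python
import re

# Matches e.g. "Identity = 252/271 (92.99%), Positives = 260/271 (95.94%)".
# "Pos\w*ives" tolerates spelling variants of the second label.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*ives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Extract counts and reported percentages from an HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, aligned, ident_pct, pos, _, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned": int(aligned),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }

def percent(count, total):
    """Percentage rounded to two decimals, as printed in the report."""
    return round(100.0 * count / total, 2)

hsp = parse_hsp_summary(
    "Identity = 252/271 (92.99%), Positives = 260/271 (95.94%), Query Frame = 0"
)
print(percent(hsp["identical"], hsp["aligned"]), hsp["identity_pct"])
print(percent(hsp["positive"], hsp["aligned"]), hsp["positive_pct"])
```

For the Cucumis melo hit, the recomputed values agree with the reported 92.99% identity and 95.94% positives.
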

BLAST of CsaV3_4G007030 vs. NCBI nr
Match: XP_023542002.1 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023542009.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023542017.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 460.3 bits (1183), Expect = 1.5e-125
Identity = 227/270 (84.07%), Positives = 240/270 (88.89%), Query Frame = 0

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKI+KT PVLFPFSRRL CV STQPHKEHHQ+PPWQ QDQL   VSS+LS+SSLD
Sbjct: 1   MHLTRFKINKTVPVLFPFSRRLACVLSTQPHKEHHQEPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120
           SSKC ALLPHLSP +FD++FFS+GLKANP TCLNFFYFAS+SFKFRFTI SYC L+LLLI
Sbjct: 61  SSKCRALLPHLSPLEFDRMFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCILVLLLI 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
            SKF+PPARLLLIRLIDG LPVLN D  K HIEIANAL GLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 NSKFLPPARLLLIRLIDGKLPVLNFDLNKLHIEIANALLGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGF CAVD FYL A+KG FPSLKTCNFLLSSLVK NE EKCCEVF VMS G  
Sbjct: 181 YSTQFRNLGFGCAVDAFYLFAQKGIFPSLKTCNFLLSSLVKDNELEKCCEVFEVMSRGVR 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKME 271
           PDVF FTNVINALCKGGKMENAIELF+KME
Sbjct: 241 PDVFLFTNVINALCKGGKMENAIELFLKME 270

BLAST of CsaV3_4G007030 vs. NCBI nr
Match: XP_022942543.1 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita moschata] >XP_022942544.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita moschata] >XP_022942545.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 456.4 bits (1173), Expect = 2.2e-124
Identity = 225/270 (83.33%), Positives = 238/270 (88.15%), Query Frame = 0

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKI+KT PVLFPFSRRL CV STQPHKEHHQ+PPWQ QDQL   VSS+LS+SSLD
Sbjct: 1   MHLTRFKINKTIPVLFPFSRRLACVLSTQPHKEHHQEPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120
           SSKC AL PHLSP +FD++FFS+GLKANP TCLNFFYFAS+SFKFRFTI SYC LILLLI
Sbjct: 61  SSKCRALFPHLSPLEFDRMFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCILILLLI 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
            SKF+PPARLLLIRLIDG LP+LN D  K HIEIAN L GLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 NSKFLPPARLLLIRLIDGKLPLLNFDLNKLHIEIANTLLGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGF CA D FYL A+KG FPSLKTCNFLLSSLVKANE EKCCEVF VMS G  
Sbjct: 181 YSTQFRNLGFGCAFDAFYLFAQKGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSRGVR 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKME 271
           PDVF FTNVINALCKGGKMENAIELF+KME
Sbjct: 241 PDVFLFTNVINALCKGGKMENAIELFLKME 270

BLAST of CsaV3_4G007030 vs. TAIR10
Match: AT4G19440.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 211.8 bits (538), Expect = 1.7e-54
Identity = 116/230 (50.43%), Positives = 150/230 (65.22%), Query Frame = 0

Query: 42  SQDQLHLWVSSVLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASN 101
           S   LH  +SSVLS  SLD  +C  L+  LSP +FD+LF     K NP T L+FF  AS+
Sbjct: 59  SDRHLHERLSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASD 118

Query: 102 SFKFRFTIHSYCTLILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGL 161
           SF F F++ SYC LI LL+ +  +  AR++LIRLI+GN+PVL        + IA+A+  L
Sbjct: 119 SFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLRDSRVAIADAMASL 178

Query: 162 TSVVGRFEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVK 221
           +         +  DLLI VY TQF+  G   A+DVF +LA KG FPS  TCN LL+SLV+
Sbjct: 179 SLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLVR 238

Query: 222 ANEFEKCCEVFRVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEK 272
           ANEF+KCCE F V+ +G  PDV+ FT  INA CKGGK+E A++LF KME+
Sbjct: 239 ANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFCKGGKVEEAVKLFSKMEE 288

BLAST of CsaV3_4G007030 vs. TAIR10
Match: AT4G21170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 47.4 bits (111), Expect = 5.5e-05
Identity = 43/178 (24.16%), Positives = 75/178 (42.13%), Query Frame = 0

Query: 89  PMTCLNFFYFASNSFKFRFTIHSYCTLILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSE 148
           P T L+FF FA    +F   + S+C +I +   S  +  A +LL  L++ N   L +   
Sbjct: 83  PKTTLDFFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEM 142

Query: 149 KFHIEIANALFGLTSVVGRFEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPS 208
               E            G    + +  L++  Y+ +  +      ++VF  + R    PS
Sbjct: 143 HRWFE------------GEVSLSVSLSLVLEYYALKGSHHN---GLEVFGFMRRLRLSPS 202

Query: 209 LKTCNFLLSSLVKANEFE-KCCEVFRVMSEGACPDVFSFTNVINALCKGGKMENAIEL 266
               N LL SLVK N+F    C    ++  G   D  ++  +   LC+ G+ ++  +L
Sbjct: 203 QSAYNSLLGSLVKENQFRVALCLYSAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKL 245

BLAST of CsaV3_4G007030 vs. TAIR10
Match: AT2G26790.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 44.7 bits (104), Expect = 3.6e-04
Identity = 22/82 (26.83%), Positives = 42/82 (51.22%), Query Frame = 0

Query: 190 FSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSE-GACPDVFSFTN 249
           F  A DV +   R      +K CNFL++ + +  +      +F+ + + G C + +++  
Sbjct: 162 FDEATDVLFQSKRLDCVVDIKACNFLMNRMTEFGKIGMLMTLFKQLKQLGLCANEYTYAI 221

Query: 250 VINALCKGGKMENAIELFMKME 271
           V+ ALC+ G +E A  L ++ E
Sbjct: 222 VVKALCRKGNLEEAAMLLIENE 243

BLAST of CsaV3_4G007030 vs. Swiss-Prot
Match: sp|Q940A6|PP325_ARATH (Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g19440 PE=2 SV=2)

HSP 1 Score: 211.8 bits (538), Expect = 3.0e-53
Identity = 116/230 (50.43%), Positives = 150/230 (65.22%), Query Frame = 0

Query: 42  SQDQLHLWVSSVLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASN 101
           S   LH  +SSVLS  SLD  +C  L+  LSP +FD+LF     K NP T L+FF  AS+
Sbjct: 72  SDRHLHERLSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASD 131

Query: 102 SFKFRFTIHSYCTLILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGL 161
           SF F F++ SYC LI LL+ +  +  AR++LIRLI+GN+PVL        + IA+A+  L
Sbjct: 132 SFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLRDSRVAIADAMASL 191

Query: 162 TSVVGRFEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVK 221
           +         +  DLLI VY TQF+  G   A+DVF +LA KG FPS  TCN LL+SLV+
Sbjct: 192 SLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLVR 251

Query: 222 ANEFEKCCEVFRVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEK 272
           ANEF+KCCE F V+ +G  PDV+ FT  INA CKGGK+E A++LF KME+
Sbjct: 252 ANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFCKGGKVEEAVKLFSKMEE 301

BLAST of CsaV3_4G007030 vs. Swiss-Prot
Match: sp|O49558|PP331_ARATH (Pentatricopeptide repeat-containing protein At4g21170 OS=Arabidopsis thaliana OX=3702 GN=At4g21170 PE=3 SV=2)

HSP 1 Score: 47.4 bits (111), Expect = 9.9e-04
Identity = 43/178 (24.16%), Positives = 75/178 (42.13%), Query Frame = 0

Query: 89  PMTCLNFFYFASNSFKFRFTIHSYCTLILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSE 148
           P T L+FF FA    +F   + S+C +I +   S  +  A +LL  L++ N   L +   
Sbjct: 83  PKTTLDFFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEM 142

Query: 149 KFHIEIANALFGLTSVVGRFEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPS 208
               E            G    + +  L++  Y+ +  +      ++VF  + R    PS
Sbjct: 143 HRWFE------------GEVSLSVSLSLVLEYYALKGSHHN---GLEVFGFMRRLRLSPS 202

Query: 209 LKTCNFLLSSLVKANEFE-KCCEVFRVMSEGACPDVFSFTNVINALCKGGKMENAIEL 266
               N LL SLVK N+F    C    ++  G   D  ++  +   LC+ G+ ++  +L
Sbjct: 203 QSAYNSLLGSLVKENQFRVALCLYSAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKL 245

BLAST of CsaV3_4G007030 vs. TrEMBL
Match: tr|A0A0A0L008|A0A0A0L008_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G055990 PE=4 SV=1)

HSP 1 Score: 547.7 bits (1410), Expect = 4.7e-152
Identity = 271/271 (100.00%), Positives = 271/271 (100.00%), Query Frame = 0

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD
Sbjct: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120
           SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI
Sbjct: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC
Sbjct: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEK 272
           PDVFSFTNVINALCKGGKMENAIELFMKMEK
Sbjct: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEK 271

BLAST of CsaV3_4G007030 vs. TrEMBL
Match: tr|A0A1S4DYY2|A0A1S4DYY2_CUCME (pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493057 PE=4 SV=1)

HSP 1 Score: 515.0 bits (1325), Expect = 3.4e-142
Identity = 252/271 (92.99%), Positives = 260/271 (95.94%), Query Frame = 0

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKI+KT PVLFPFSRRL CVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLS+SSLD
Sbjct: 1   MHLTRFKINKTIPVLFPFSRRLACVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSNSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120
           SSKCSALLPHLSP QFDQLFFSIGLKANPMTCLNFFYFAS+SFKFRFTIHSYC LILLL+
Sbjct: 61  SSKCSALLPHLSPFQFDQLFFSIGLKANPMTCLNFFYFASDSFKFRFTIHSYCILILLLV 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
            SKF+PPARLLLIRLIDGNLPVLN D +KFHIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGNLPVLNSDFKKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGF CA+DVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVF+VMSEG C
Sbjct: 181 YSTQFRNLGFGCAIDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFQVMSEGVC 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEK 272
           PDVFSFTNVINALCKGGKME A ELFMKMEK
Sbjct: 241 PDVFSFTNVINALCKGGKMEKATELFMKMEK 271

BLAST of CsaV3_4G007030 vs. TrEMBL
Match: tr|A0A2I4EPW0|A0A2I4EPW0_9ROSI (pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Juglans regia OX=51240 GN=LOC108991576 PE=4 SV=1)

HSP 1 Score: 293.1 bits (749), Expect = 2.1e-75
Identity = 158/286 (55.24%), Positives = 192/286 (67.13%), Query Frame = 0

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSS------TQPHKEHHQDPPWQSQDQLHL------ 60
           M L RF I+K T + +  +RRL CV+S       QP +  H+ PP Q Q Q H       
Sbjct: 4   MDLRRFPITKPTRIFYSITRRLTCVTSIAHHLQEQPPQSQHR-PPLQLQRQSHSQPPNQS 63

Query: 61  ---WVSSVLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKF 120
              WVS++LS  SLDSS C +++PHLSPS+FDQ+F S+    NP T LNFFYFAS +F+F
Sbjct: 64  LLNWVSTILSKPSLDSSMCKSVIPHLSPSEFDQIFLSLKSNLNPKTTLNFFYFASEAFRF 123

Query: 121 RFTIHSYCTLILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVV 180
            FT+ SYC LI LLI S  + PARLLLIRLIDG +PVL   ++  H+EIA  +  L    
Sbjct: 124 PFTVRSYCLLIRLLIVSNLVSPARLLLIRLIDGKMPVLFASTKNRHVEIATMMADLNLPS 183

Query: 181 GRFEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEF 240
            R    QA D+L+HVY TQF+NLGF  A+DVF L A KG FPSLKTCNF LSSLVKANE 
Sbjct: 184 ERVLGVQALDMLVHVYCTQFKNLGFGFAIDVFRLSASKGMFPSLKTCNFFLSSLVKANEL 243

Query: 241 EKCCEVFRVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEK 272
           +K CEVF VM  G  PDV+  +  INALCKGGK+E+AI LF+KMEK
Sbjct: 244 QKSCEVFEVMCRGVSPDVYLLSTAINALCKGGKVEDAIGLFLKMEK 288

BLAST of CsaV3_4G007030 vs. TrEMBL
Match: tr|M5WX26|M5WX26_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa001463mg PE=4 SV=1)

HSP 1 Score: 282.7 bits (722), Expect = 2.8e-72
Identity = 156/278 (56.12%), Positives = 189/278 (67.99%), Query Frame = 0

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVS-STQPHKEHHQDPPWQ-------SQDQLHLWVSS 60
           M L R  ISK T +LF  +R L CV+ + Q  KE  Q PP Q           LH WVSS
Sbjct: 1   MDLRRLSISKPT-LLFRINRPLTCVTCNLQRPKEPPQPPPLQVXXXXXPPNQSLHNWVSS 60

Query: 61  VLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSY 120
           +LS  SLDSSKC AL+P LS  +FD++F SI    NP T L+FFYFAS SFKF+FT+ S+
Sbjct: 61  ILSKPSLDSSKCKALIPLLSSHEFDRVFCSISSNVNPKTALHFFYFASESFKFQFTVRSF 120

Query: 121 CTLILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQ 180
           C L+ LLI S  + PARLLLIRLIDGN+PVL  +  + H+EIA A+  L +V  +    Q
Sbjct: 121 CVLVRLLILSNLVSPARLLLIRLIDGNVPVLYANHNQRHMEIAIAMLDLNTVSTQGLGVQ 180

Query: 181 AFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVF 240
           A DLLIHVY TQF+N+GF  A+D F + ++KG FPSLKTCNFLLSSLVKANE  K  +VF
Sbjct: 181 ALDLLIHVYCTQFKNMGFGYAIDAFVIFSKKGVFPSLKTCNFLLSSLVKANELHKSYDVF 240

Query: 241 RVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKME 271
            VM  G  PDV+ FT  INA CKGGK+++AI LF KME
Sbjct: 241 EVMCRGVSPDVYLFTTAINAFCKGGKVDDAIGLFSKME 277

BLAST of CsaV3_4G007030 vs. TrEMBL
Match: tr|A0A251PV32|A0A251PV32_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G018200 PE=4 SV=1)

HSP 1 Score: 282.7 bits (722), Expect = 2.8e-72
Identity = 156/278 (56.12%), Positives = 189/278 (67.99%), Query Frame = 0

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVS-STQPHKEHHQDPPWQ-------SQDQLHLWVSS 60
           M L R  ISK T +LF  +R L CV+ + Q  KE  Q PP Q           LH WVSS
Sbjct: 5   MDLRRLSISKPT-LLFRINRPLTCVTCNLQRPKEPPQPPPLQVXXXXXPPNQSLHNWVSS 64

Query: 61  VLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSY 120
           +LS  SLDSSKC AL+P LS  +FD++F SI    NP T L+FFYFAS SFKF+FT+ S+
Sbjct: 65  ILSKPSLDSSKCKALIPLLSSHEFDRVFCSISSNVNPKTALHFFYFASESFKFQFTVRSF 124

Query: 121 CTLILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQ 180
           C L+ LLI S  + PARLLLIRLIDGN+PVL  +  + H+EIA A+  L +V  +    Q
Sbjct: 125 CVLVRLLILSNLVSPARLLLIRLIDGNVPVLYANHNQRHMEIAIAMLDLNTVSTQGLGVQ 184

Query: 181 AFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVF 240
           A DLLIHVY TQF+N+GF  A+D F + ++KG FPSLKTCNFLLSSLVKANE  K  +VF
Sbjct: 185 ALDLLIHVYCTQFKNMGFGYAIDAFVIFSKKGVFPSLKTCNFLLSSLVKANELHKSYDVF 244

Query: 241 RVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKME 271
            VM  G  PDV+ FT  INA CKGGK+++AI LF KME
Sbjct: 245 EVMCRGVSPDVYLFTTAINAFCKGGKVDDAIGLFSKME 281

The following BLAST results are available for this feature:
Match Name        E-value     Identity   Description
XP_004149000.1    7.1e-152    100.00     PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic ...
KGN53456.1        7.1e-152    100.00     hypothetical protein Csa_4G055990 [Cucumis sativus]
XP_016901180.1    5.1e-142    92.99      PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic ...
XP_023542002.1    1.5e-125    84.07      pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucur...
XP_022942543.1    2.2e-124    83.33      pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucur...

Match Name        E-value     Identity   Description
AT4G19440.1       1.7e-54     50.43      Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21170.1       5.5e-05     24.16      Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G26790.1       3.6e-04     26.83      Pentatricopeptide repeat (PPR) superfamily protein

Match Name               E-value    Identity   Description
sp|Q940A6|PP325_ARATH    3.0e-53    50.43      Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidop...
sp|O49558|PP331_ARATH    9.9e-04    24.16      Pentatricopeptide repeat-containing protein At4g21170 OS=Arabidopsis thaliana OX...

Match Name                      E-value     Identity   Description
tr|A0A0A0L008|A0A0A0L008_CUCSA  4.7e-152    100.00     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G055990 PE=4 SV=1
tr|A0A1S4DYY2|A0A1S4DYY2_CUCME  3.4e-142    92.99      pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 ...
tr|A0A2I4EPW0|A0A2I4EPW0_9ROSI  2.1e-75     55.24      pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Juglans ...
tr|M5WX26|M5WX26_PRUPE          2.8e-72     56.12      Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa001463mg PE=4 SV=1
tr|A0A251PV32|A0A251PV32_PRUPE  2.8e-72     56.12      Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G018200 PE=4 SV=1

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding
Vocabulary: INTERPRO
Term          Definition
IPR002885     Pentatricopeptide_repeat
IPR011990     TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CsaV3_4G007030.1    CsaV3_4G007030.1    mRNA

Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D    G3DSA:1.25.40.10
    coord: 549..622    e-value: 3.0E-20    score: 74.5
    coord: 691..760    e-value: 1.4E-21    score: 78.8
    coord: 761..819    e-value: 3.6E-15    score: 57.9
    coord: 623..690    e-value: 3.6E-19    score: 71.0
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D    G3DSA:1.25.40.10
    coord: 432..548    e-value: 5.9E-25    score: 90.4
    coord: 161..304    e-value: 6.7E-31    score: 110.0
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D    G3DSA:1.25.40.10
    coord: 305..431    e-value: 1.1E-30    score: 109.0
IPR011990    Tetratricopeptide-like helical domain superfamily    SUPERFAMILY    SSF48452    TPR-like
    coord: 414..426
    coord: 221..304
    coord: 459..520
    coord: 556..632
IPR002885    Pentatricopeptide repeat    PFAM    PF12854    PPR_1
    coord: 238..270    e-value: 9.0E-9    score: 34.8
IPR002885    Pentatricopeptide repeat    PFAM    PF01535    PPR
    coord: 211..237    e-value: 0.057    score: 13.6
    coord: 489..519    e-value: 1.9E-5    score: 24.5
    coord: 595..622    e-value: 3.4E-5    score: 23.7
    coord: 457..483    e-value: 0.023    score: 14.8
IPR002885    Pentatricopeptide repeat    PFAM    PF13041    PPR_2
    coord: 276..325    e-value: 1.3E-17    score: 63.6
    coord: 696..745    e-value: 2.7E-16    score: 59.4
    coord: 627..675    e-value: 2.6E-16    score: 59.4
    coord: 346..395    e-value: 3.8E-13    score: 49.3
    coord: 522..570    e-value: 3.3E-18    score: 65.5
    coord: 766..812    e-value: 3.4E-13    score: 49.5
IPR002885    Pentatricopeptide repeat    TIGRFAM    TIGR00756    TIGR00756
    coord: 769..802    e-value: 2.0E-6    score: 25.6
    coord: 245..278    e-value: 1.2E-7    score: 29.4
    coord: 559..590    e-value: 9.8E-7    score: 26.6
    coord: 349..382    e-value: 4.3E-7    score: 27.7
    coord: 665..698    e-value: 2.4E-8    score: 31.6
    coord: 492..522    e-value: 3.0E-5    score: 21.9
    coord: 524..558    e-value: 4.8E-8    score: 30.7
    coord: 211..237    e-value: 2.1E-4    score: 19.2
    coord: 734..768    e-value: 2.6E-10    score: 37.8
    coord: 629..663    e-value: 2.3E-9    score: 34.8
    coord: 699..732    e-value: 3.0E-8    score: 31.3
    coord: 279..312    e-value: 6.4E-9    score: 33.4
    coord: 315..348    e-value: 7.2E-6    score: 23.9
    coord: 594..627    e-value: 2.0E-7    score: 28.7
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 242..276    score: 12.891
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 557..591    score: 12.167
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 277..311    score: 13.482
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 487..521    score: 11.126
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 108..142    score: 5.229
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 732..766    score: 13.625
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 662..696    score: 12.551
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 382..416    score: 8.988
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 767..801    score: 11.674
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 592..626    score: 11.148
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 522..556    score: 12.803
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 417..451    score: 7.114
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 627..661    score: 12.781
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 452..486    score: 7.947
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 347..381    score: 11.959
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 697..731    score: 11.751
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 312..346    score: 10.654
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR    coord: 208..238    score: 7.53
None    No IPR available    PANTHER    PTHR24015:SF1067    SUBFAMILY NOT NAMED
    coord: 7..775
    coord: 761..813
None    No IPR available    PANTHER    PTHR24015    FAMILY NOT NAMED
    coord: 7..775
    coord: 761..813