Cp4.1LG02g07200 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g07200
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat superfamily protein
Location: Cp4.1LG02 : 777503 .. 784423 (+)
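
The location field packs the sequence identifier, start, end, and strand into one string. A minimal sketch of how it could be parsed is shown here, assuming the coordinates are 1-based and inclusive (the page does not state this); the function name `parse_location` and the returned dictionary keys are hypothetical, chosen only for illustration.

```python
import re

# Pattern for strings such as "Cp4.1LG02 : 777503 .. 784423 (+)".
# Assumption: coordinates are 1-based and inclusive.
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into sequence id, start, end, strand and span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive span, so both end points are counted.
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG02 : 777503 .. 784423 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 6921 bp on the plus strand of Cp4.1LG02.
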
The following sequences are available for this feature:

Gene sequence (with intron)

TCAATATATATATAGACGAGAATAACTCCTTATATATATATAGAACTTAATTTAATTTAATTAGAATAGTTAAAAAAAGAAAAAAGAAAAAAGAATATATATAAAATACGATTTTGTGGCGAGGGGTTGGGACCGATGAGCGAATAGCAGGGGAAGGCCAACGCCGGATCGCGCTGAAGTTGGAGAGCAAAGCAACTGCGGAAGAGAGCAACGATGACGGTGTTTCATTTCTTCAACTGCGCAATTCTCACTTTCGGTCCTCATGCTGTCTACTACTCTGCAACGCCCTTGTAAGGATCTCGCCTTTTCATCTTCCTTGCATCTCCTTCTACTTCTTTAGATCTCCTGTTCCTCGCACTGTCTCTTCAGGCCCACAGTGATTTGTTGTATTGAAATTCTCCTCACTTTGCTTTGTTTGGAATGATTTCTTGCTGTTTAATCTTGGACTGCTTCAATTTTAGTTCCTCTGCTGGTCCGGAGTGATTCGTTCGGTTGAATTTCCGTTCTCCTTGCTTTGTTTTTATATGATTTATTGATTGTTTGGACTGTTCAAGTTATTAGTTCCGATTTGTACGGAGCTGTGTGTCTGAACTGGATTAGAGAATCAGTTTAGAAGCAGTCTCTATTTTGCAAAGTACCGAAATGCTTAATGACTTGAGTCGGCTGACCAACTTCCTTGTTTTGATGAAATGGAATTGCTGTAGTTTATTAGAGTAGTGGAACGGTTTGTATTCCCAGTGCCACGCTCTGATTTCTACCATGTACGATTTGGTTGCCATAGCTACCAAAATAAGCAATATAAGTCAGTTAAGACAGCTTCATGCGCATCTTGTTCTCAATTCCCTCCAATCTCAGAACTACTGGGTTTCTCTGCTCCTCACTATCTGTACTCGTCTTCACGCTCATCCTTCCTATGCGGCTTCTATTTTTACCTCCTCGCCCTACCCCAATGCTTCTGTTTACAGTTGTATGCTCAAATATTACTCCCGCATGGGTGCCCACGACGAGGTGGTTTCTCTCTTCAGATGCATGCAGTGTCTAGACCTCAGGCCCCATCCCTCTGTTTATATATACTTGATTAAGTTAGCTGGGAAATCTGGCAATTTGTTCCATGCTTCTGTCCTGAAGTTGGGTCATATTGACGACCACTTCATCCGTAATGCTATATTGGATATGTATGCAAAATATGGCCAAGTGGATCTTGCCAGGAAGTTGTTTGAGCAAATGCCTAACAGAACTCTAGCGGACTGGAATTCAATGATTTCTGGCTGTTGGAATTCGGGAAATGAAGCTGATGCAGTCATGCTGTTTAATATGATGCCTGATAGGAATACTATTTCATGGACTGCCATGGTTACTGGGTATGCTAAGATGAGGGACTTGGAGAGTGCTAGAAGGTATTTTGATGAGATGCCAGAGAAAAGTGTAGTCTCGTGGAATGCCATGCTATCAGCTTATGCTCAAAATGAATGTGCAGAAGAGGCTTTGAAATTGTTCCATCGAATGCTGAAAGAGGGAATCACTCCTGATGATACAACATGGGTAGCTGCAATTTCATCGTGCTCTTCCATCGGCAATCCTAACCTTGCTGATTCACTTCTAGCAAAGATTAACCAAAAGCACGTCATTTTGAATAATTATGTCAAAACGGCTTTACTTGACATGCATGCAAAATTTGGTAACCTAGAAATTGCTAGAAGAATTTTTGATGAATTGGGAGGTCAGAGGAATGCTGTTACTTGGAATGTCATGATCTCAGCATATACAAGGGCAGGAAAACTGTCATTAGCTCGGGAGCTGTTTGACAATATGCCAAAAAGGGATGTTGTTTCATGGAATTCAATGATAGCTGGTTATGCACAAAATGGAGAGTCTGCCATGTCAATTCACCTCTTTAGAGAAATGATTGATTGTACGGACATACAACCAGATGAGGTCACCATAGCTAGTGTCTTATCGGCCTGTGGACATATTGGGGCTCTGAAATTTAGTTACTGGGT
TCTAAATATCGTTCAAGAGAAGAACATAAAGTTTGGCATTTCAGGATTCAATTCTTTGATATTCATGTACTCTAAATGTGGAAATGTGGTTGATGCCCATAGGATATTCCAAAATATGGGGACGAAAGATGTTGTTACTTTCAATACACTGATTTCAGGATTTGCAGCCAATGGTCATGGGAAGGATGCTATCAAGTTACTGTTAACAATGGAGGAAGAAGGCATTGAACCAGACCATGTCACATATATTGGTGTTTTGACTGCATGTAGTCATGCAGGAATGCTGAAAGAAGGTAAAAACATCTTTAAGTCAATTAAAGCACCGACAGTGGATCACTATGCTTGTATGGTTGATTTATTAGGAAGAGCAGGCGAATTAGATGAAGCCAAAATGTTAATTGAATCTATGCCAATGAAACCTCACGCTGGTGTTTATGGCTCTTTGTTAAATGGCAGTCGAATTCACAAGAGAGTTGAGCTGGGAGAACTTGCTGCTAACAAGCTCTTAGAGCTTGAACCTCAAAACCCAGGAAATTATATTTTACTTTCTAATATATATGCCTCTGCTGGAAGATGGGAAGATGTTAGACAGGTTAGAGAGAAGATGAGGAAGGGAGGTGTCAAGAAATCAGTTGGAATGAGTTGGGTGGAATATAAGGGTCAAATACATAATTTCACTGTGGGTGATAGATCGCATGAATGGTCAAAAGATATTTATAGATTATTGGCTGAACTTGAAAGGAAGATGAAGAGGGTTGGCTTTGTAACTGATAAAAGTTGTGCACTGCGAGATGTTGAGGAGGAAGAGAAGGAAGAAATGCTGGGGACTCACAGTGAGAAGTTGGCCATTTGTTTTGCGCTTCTTGTCAGTGAAGTGGGGACACCAATTAGAGTGGTGAAAAATTTGAGAATTTGTATGGATTGTCATACAGCTATTAAGATGATCTCAAAGTTAGAGGAAAGAGAGATTATTGTCCGTGATAATAATAGGTTCCATTGTTTTAGTGAAGGGATATGTTCTTGCCATGATTACTGGTAAATTGGCAAAGAGCAAATTTATGGATAAAATTCATTTGCATAAAGAGAGATTTTAAGCACAACCCGGCCTTCAATAGTGAATCACGCAGATTACCATACTTGTCATTTCTCCATATTCAGCTTTTAACTGAAGAACACTTTATGATTAGGTCCAAGGATCAATATCAAAGAGCATTGTATCTCTGTGATGTACCTTACAAGAGTTATGCATTAGGAGTTTGTTAGTTTTATTTTTCTACCAATTCATTCATAAATCAACAGAGTGAACCATGTTTATCAGGGAAAAAAAAAAAAGATAAAGAGAGATTGGCCAGTTGGAATGTTCGAAGTTTTTTTTTTTTTTTTTTTTGATGATGTAATCTAAGAATTGCATATTTTCTTCCTCACAGAATCGATGAATTCATGGTCTAATCTCCGGTTCTGTCATAATTTTATGTTTTTCTATAAGTCACGATTTCAGTTCACTGAAAACTCTAGCTATACAGATCTGAGTATGACACACTCGGAACGTCAGTCAAAGCGGCACTGGTTTACCTTGGAACGGCCTTAGTAAAGGTTTGTGATTTCTGAACACTTCAAGGTTGATTGAGACCTTGAATTTCTCTCCATTGTATATTATTGACTTTTCACTTCTCCATGAATCTAACTGATATTTTTCTATGCATTCTTGTCAGCTTGTATGCCTTGCAACCTTCCTTAACGTGTCAGAGAATGACTCCTTTGACCTATATCAGGTATTAATGTTCTTTTCATCTGTAATTGGTGAAAAACATTTTTTTTTATATATTTTAGTGTACATTTTAATTCCATCAGTTGTAATGCCTAGTATCTTTTGTACATTTTAATTCTCTGACTTGGAAAATTAAAGACTTCATTAACCTTGGATGTCAGGACAAAATTATTGGTTGCTTGGCTATAATCCTCATCTGGGAAACTTAAATTTTTATATTCCACACTGCATT
CAACCTTTCTGGTTGATTGGATTGGTGTTGGTGGGTGGGAAGGTTTAGTTAGTCTATTGTTAGAACTTGTTTTTTACTATCACATATTATGTAATTTTTGTTTAATAGAACATTGAGTTACATACCATTAAATGGTTTGGAACGAAATGAATAATGCAGCCTAGTATCCAGAATTGGACAAACATGAATCAGAATAGCATTACAGAACTGTTTTTAGCGTTGAAGTGAAAAATAGTTTCCAGTCTCTATTTCATGTACTAGCCTTGATTCCATCCTCTTTTTCCCTCTTTGTGGGATGTCTTCCTGTTTCATCCAATGAGTAGTGCTAAATGGTATGTTTCTTTTATTAGATCCTCATGTCAAGTGTACTAACTATGCTTTTATTAACAAAGCCTCCATATTTTTAGGAAATTTTGTTCGAGTCGTGCCAGTAGTTATAGCAAATGTTAACAAGGTATAATTCATGTACGACAGGAACTGTTGAAAGCGCTTATTGGTTTAATAGATGTTGCCGGACTTTACTTTGCTTTGACCCAGTTGACTTACCGGAACATTTCTCAGAACCATAAATTTCAGGCAGTTGGACTGGGTGAGTTTGTTGAGCAATCAATTGAGGTTTTTTAGTTTGATTGTAAATGATGACTGCATGCACGGTTTCTGTTTGTTTATTTCGATCTTTCAATGTTTTTCCCCATCTTCTAATCATTGGATGGCCTGGTTACTCTGTTCTTCTTGCCTGCAGGTTGGGCATTCGCTGATTCTGTTTTGCATAGACTTGCACCACTTTGGATTGGTGCCAGAGGACTAGAGTTTACTTGGGATTACATTTTGCAGGGCCTTGAAGCTAATGCAAATCTGGTTTCCATCTCTCTTCTTTATTTTAAGATACTCTTTATTTGGCTACTGTGAGAGATTCAATCATCGATTCGATTGATATTTCTTGGGTTCTTGACCTTGCATTCTGGGTGAAGTATTTGTCATGTCAGCCGCGCTGTTGCTTCTCTAATTTCTGCTCCCCACTTCAATTCAGGTGTTGAGCATATCCCTTGCTGCATTGGGATCTTTGATGTGGCTTCGGAAGAACAAACCCAAAGCGCTAATTCCCATAATTTATATCTGTGCACTGATTGTGGCTACTATGCCATCAATCACAAGGTATGTAACATCAAAATTAGAAAATTTGCCACTTCGAACTTGAATCACTTGCTTTGAACTATCTGAACATCAAAAGGTAATCTATTTGAACATCAAAATTGTGCTTGTTTTCCTAATATTTCTCTACAATGAGTTTCATTATTTTTAAGAAAAAGTTGGAAATTCCTAGCCAAACTCTAAAAACAAAACAGGTTTTTCAAAACCCGTTTTAATTTTAAAATTCAAATAAAGATGTACATAATTTTAACACAAGCTGCAAATTTTCTTTCCAGCTACTTAAGGCGGGGAATGGGTTGGCATTTCCCTAAGGTGGTGGGATTTGAACTCTTCACCTCTTTGGCGATGGCTTTTATTAGTTGGCAGCTTTTTTCTGCTTGTCAGAGACCCTCTGTTTAAGGTTATGGCACACTGGGGTTGACTATATTGATGCTGGACGTGAATTTTGGAGGTGTGGATTGCAGATCTCCTTTTACTTAACCTTCTCTTTTGCAGTGTTCCGCCTAATTTCGTAATCATTTGCAAAACCAAATTGTGGGGTTAAGTTTCAGATGTTGTAAGGTTGTAAGCTTTGTAATGTATGATGACGGGTAAAAGGGAGTTTCAGATGTTGTAACGTTTAGTTCTTTTTCTTTTGAGATTTTGAATGGAAGTAAAAGGTAGCTTGCCAGGACAATGTTATCTCATTTTCCTTTCCCTACCCTCAGATTTTAAACCTTTTTTTTCTCCTTCCATTCTTTTACTGAGGAACATGATTACGAAGATGGAAGAGGTTACCTCAGATCAAGGCTCTCGCGCCTTCTCATTTCATCACTATTTTATTTTCTTCTGCAAATTTTGGATGAGAA
AAGTTAGAACGACAAGTAAAGAATGCAATTTTTAATCTGCATTTACATTAGATGATTAGAGCGACAAGTTAGCAAAATAAGAGAACACATTCACTAGTTTCATTTCAAGAAAAAGCATAAGGGGCAGGGGGAAAAAGCATAAGGGGCAGGGGGAAAAAGTGTGTACAAGTGATGCAACTCCGATTGAACCAGTCATCCGTTGCCATGGGTTCAACGACAGCCTAACCTGGGACCGCTACAATGAGAATTTTGGGGGAAAAAGATGACCGGTGACCTTTATAGCAAAAGAAAATCTAGCCATTCCTAATATGAGCAAATAATTTTTTTGATTGAGAAGAAATGAGGCGGAGTACCAACCTTACAGAGAAGGATGATTGCTTGTTGTTGGTACAATTGCTAATTCACTACAAGTTGCGACTTCACTGCTGAGGTCAAACCCTTTCTTCCCAACTCCGGTTACATGTACAAGCAATTCACACTTGCCAACAAGTTCAAAAATGCAGATATCCCCCATCTGGATACCATTGTCACGAACAAACGACATCCAACCGCCGCAAAAAGTATGCATCATTCGCCCCATTGAGTCTGGTACTGAGTTCACAATCCAACATCCCCCGTTTGGACCGCGAAGTATAATCTCAGTTTTGCTGTTAGGGAAATGTACCGAAGAAAACTGATGCGGAATTTTCTGCAACATAGAATGCCAATCAGTCCTACTAAATTGAAGCAAAGAAAATGTAAAAAATAACTGCATTGACAAACAACACTGGATCTAAAACCAAAAACTTGAGGAAGTACATGAACGGTCTAAAATAAAAGACTTACTAAAGTATATGAACCACTAGTGTTGAACTTTTTCATGATTCTGACAAAATTTGGAAAACAAGAAGCAAATGACTTAGCTGCTCTTTCTTCATCAAG

mRNA sequence

TCAATATATATATAGACGAGAATAACTCCTTATATATATATAGAACTTAATTTAATTTAATTAGAATAGTTAAAAAAAGAAAAAAGAAAAAAGAATATATATAAAATACGATTTTGTGGCGAGGGGTTGGGACCGATGAGCGAATAGCAGGGGAAGGCCAACGCCGGATCGCGCTGAAGTTGGAGAGCAAAGCAACTGCGGAAGAGAGCAACGATGACGGTGTTTCATTTCTTCAACTGCGCAATTCTCACTTTCGGTCCTCATGCTGTCTACTACTCTGCAACGCCCTTGAATACTATTTCATGGACTGCCATGGTTACTGGGTATGCTAAGATGAGGGACTTGGAGAGTGCTAGAAGGTATTTTGATGAGATGCCAGAGAAAAGTGTAGTCTCGTGGAATGCCATGCTATCAGCTTATGCTCAAAATGAATGTGCAGAAGAGGCTTTGAAATTGTTCCATCGAATGCTGAAAGAGGGAATCACTCCTGATGATACAACATGGGTAGCTGCAATTTCATCGTGCTCTTCCATCGGCAATCCTAACCTTGCTGATTCACTTCTAGCAAAGATTAACCAAAAGCACGTCATTTTGAATAATTATGTCAAAACGGCTTTACTTGACATGCATGCAAAATTTGGTAACCTAGAAATTGCTAGAAGAATTTTTGATGAATTGGGAGGTCAGAGGAATGCTGTTACTTGGAATGTCATGATCTCAGCATATACAAGGGCAGGAAAACTGTCATTAGCTCGGGAGCTGTTTGACAATATGCCAAAAAGGGATGTTGTTTCATGGAATTCAATGATAGCTGGTTATGCACAAAATGGAGAGTCTGCCATGTCAATTCACCTCTTTAGAGAAATGATTGATTGTACGGACATACAACCAGATGAGGTCACCATAGCTAGTGTCTTATCGGCCTGTGGACATATTGGGGCTCTGAAATTTAGTTACTGGGTTCTAAATATCGTTCAAGAGAAGAACATAAAGTTTGGCATTTCAGGATTCAATTCTTTGATATTCATGTACTCTAAATGTGGAAATGTGGTTGATGCCCATAGGATATTCCAAAATATGGGGACGAAAGATGTTGTTACTTTCAATACACTGATTTCAGGATTTGCAGCCAATGGTCATGGGAAGGATGCTATCAAGTTACTGTTAACAATGGAGGAAGAAGGCATTGAACCAGACCATGTCACATATATTGGTGTTTTGACTGCATGTAGTCATGCAGGAATGCTGAAAGAAGGTAAAAACATCTTTAAGTCAATTAAAGCACCGACAGTGGATCACTATGCTTGTATGGTTGATTTATTAGGAAGAGCAGGCGAATTAGATGAAGCCAAAATGTTAATTGAATCTATGCCAATGAAACCTCACGCTGGTGTTTATGGCTCTTTGTTAAATGGCAGTCGAATTCACAAGAGAGTTGAGCTGGGAGAACTTGCTGCTAACAAGCTCTTAGAGCTTGAACCTCAAAACCCAGGAAATTATATTTTACTTTCTAATATATATGCCTCTGCTGGAAGATGGGAAGATGTTAGACAGGTTAGAGAGAAGATGAGGAAGGGAGGTGTCAAGAAATCAGTTGGAATGAGTTGGGTGGAATATAAGGGTCAAATACATAATTTCACTGTGGGTGATAGATCGCATGAATGGTCAAAAGATATTTATAGATTATTGGCTGAACTTGAAAGGAAGATGAAGAGGGTTGGCTTTGTAACTGATAAAAGTTGTGCACTGCGAGATGTTGAGGAGGAAGAGAAGGAAGAAATGCTGGGGACTCACAGTGAGAAGTTGGCCATTTGTTTTGCGCTTCTTGTCAGTGAAGTGGGGACACCAATTAGAGTGGTGAAAAATTTGAGAATTTGTATGGATTGTCATACAGCTATTAAGATGATCTCAAAGTTAGAGGAAAGAGAGATTATTGTCCGTGATAATAATAGATCTGAGTATGACACACTCGGAACGTCAGTCAAAGCGGCACTGGTTTA
CCTTGGAACGGCCTTAGTAAAGCTTGTATGCCTTGCAACCTTCCTTAACGTGTCAGAGAATGACTCCTTTGACCTATATCAGGAACTGTTGAAAGCGCTTATTGGTTTAATAGATGTTGCCGGACTTTACTTTGCTTTGACCCAGTTGACTTACCGGAACATTTCTCAGAACCATAAATTTCAGGCAGTTGGACTGGGTTGGGCATTCGCTGATTCTGTTTTGCATAGACTTGCACCACTTTGGATTGGTGCCAGAGGACTAGAGTTTACTTGGGATTACATTTTGCAGGGCCTTGAAGCTAATGCAAATCTGGTGTTGAGCATATCCCTTGCTGCATTGGGATCTTTGATGTGGCTTCGGAAGAACAAACCCAAAGCGCTAATTCCCATAATTTATATCTGTGCACTGATTGTGGCTACTATGCCATCAATCACAAGCTACTTAAGGCGGGGAATGGGTTGGCATTTCCCTAAGGTGGTGGGATTTGAACTCTTCACCTCTTTGGCGATGGCTTTTATTAGTTGGCAGCTTTTTTCTGCTTGTCAGAGACCCTCTGTTTAAGGTTATGGCACACTGGGGTTGACTATATTGATGCTGGACGTGAATTTTGGAGGTGTGGATTGCAGATCTCCTTTTACTTAACCTTCTCTTTTGCAGTGTTCCGCCTAATTTCGTAATCATTTGCAAAACCAAATTGTGGGGTTAAGTTTCAGATGTTGTAAGGTTGTAAGCTTTGTAATGTATGATGACGGGTAAAAGGGAGTTTCAGATGTTGTAACGTTTAGTTCTTTTTCTTTTGAGATTTTGAATGGAAGTAAAAGGTAGCTTGCCAGGACAATGTTATCTCATTTTCCTTTCCCTACCCTCAGATTTTAAACCTTTTTTTTCTCCTTCCATTCTTTTACTGAGGAACATGATTACGAAGATGGAAGAGGTTACCTCAGATCAAGGCTCTCGCGCCTTCTCATTTCATCACTATTTTATTTTCTTCTGCAAATTTTGGATGAGAAAAGTTAGAACGACAAGTAAAGAATGCAATTTTTAATCTGCATTTACATTAGATGATTAGAGCGACAAGTTAGCAAAATAAGAGAACACATTCACTAGTTTCATTTCAAGAAAAAGCATAAGGGGCAGGGGGAAAAAGCATAAGGGGCAGGGGGAAAAAGTGTGTACAAGTGATGCAACTCCGATTGAACCAGTCATCCGTTGCCATGGGTTCAACGACAGCCTAACCTGGGACCGCTACAATGAGAATTTTGGGGGAAAAAGATGACCGGTGACCTTTATAGCAAAAGAAAATCTAGCCATTCCTAATATGAGCAAATAATTTTTTTGATTGAGAAGAAATGAGGCGGAGTACCAACCTTACAGAGAAGGATGATTGCTTGTTGTTGGTACAATTGCTAATTCACTACAAGTTGCGACTTCACTGCTGAGGTCAAACCCTTTCTTCCCAACTCCGGTTACATGTACAAGCAATTCACACTTGCCAACAAGTTCAAAAATGCAGATATCCCCCATCTGGATACCATTGTCACGAACAAACGACATCCAACCGCCGCAAAAAGTATGCATCATTCGCCCCATTGAGTCTGGTACTGAGTTCACAATCCAACATCCCCCGTTTGGACCGCGAAGTATAATCTCAGTTTTGCTGTTAGGGAAATGTACCGAAGAAAACTGATGCGGAATTTTCTGCAACATAGAATGCCAATCAGTCCTACTAAATTGAAGCAAAGAAAATGTAAAAAATAACTGCATTGACAAACAACACTGGATCTAAAACCAAAAACTTGAGGAAGTACATGAACGGTCTAAAATAAAAGACTTACTAAAGTATATGAACCACTAGTGTTGAACTTTTTCATGATTCTGACAAAATTTGGAAAACAAGAAGCAAATGACTTAGCTGCTCTTTCTTCATCAAG

Coding sequence (CDS)

ATGACGGTGTTTCATTTCTTCAACTGCGCAATTCTCACTTTCGGTCCTCATGCTGTCTACTACTCTGCAACGCCCTTGAATACTATTTCATGGACTGCCATGGTTACTGGGTATGCTAAGATGAGGGACTTGGAGAGTGCTAGAAGGTATTTTGATGAGATGCCAGAGAAAAGTGTAGTCTCGTGGAATGCCATGCTATCAGCTTATGCTCAAAATGAATGTGCAGAAGAGGCTTTGAAATTGTTCCATCGAATGCTGAAAGAGGGAATCACTCCTGATGATACAACATGGGTAGCTGCAATTTCATCGTGCTCTTCCATCGGCAATCCTAACCTTGCTGATTCACTTCTAGCAAAGATTAACCAAAAGCACGTCATTTTGAATAATTATGTCAAAACGGCTTTACTTGACATGCATGCAAAATTTGGTAACCTAGAAATTGCTAGAAGAATTTTTGATGAATTGGGAGGTCAGAGGAATGCTGTTACTTGGAATGTCATGATCTCAGCATATACAAGGGCAGGAAAACTGTCATTAGCTCGGGAGCTGTTTGACAATATGCCAAAAAGGGATGTTGTTTCATGGAATTCAATGATAGCTGGTTATGCACAAAATGGAGAGTCTGCCATGTCAATTCACCTCTTTAGAGAAATGATTGATTGTACGGACATACAACCAGATGAGGTCACCATAGCTAGTGTCTTATCGGCCTGTGGACATATTGGGGCTCTGAAATTTAGTTACTGGGTTCTAAATATCGTTCAAGAGAAGAACATAAAGTTTGGCATTTCAGGATTCAATTCTTTGATATTCATGTACTCTAAATGTGGAAATGTGGTTGATGCCCATAGGATATTCCAAAATATGGGGACGAAAGATGTTGTTACTTTCAATACACTGATTTCAGGATTTGCAGCCAATGGTCATGGGAAGGATGCTATCAAGTTACTGTTAACAATGGAGGAAGAAGGCATTGAACCAGACCATGTCACATATATTGGTGTTTTGACTGCATGTAGTCATGCAGGAATGCTGAAAGAAGGTAAAAACATCTTTAAGTCAATTAAAGCACCGACAGTGGATCACTATGCTTGTATGGTTGATTTATTAGGAAGAGCAGGCGAATTAGATGAAGCCAAAATGTTAATTGAATCTATGCCAATGAAACCTCACGCTGGTGTTTATGGCTCTTTGTTAAATGGCAGTCGAATTCACAAGAGAGTTGAGCTGGGAGAACTTGCTGCTAACAAGCTCTTAGAGCTTGAACCTCAAAACCCAGGAAATTATATTTTACTTTCTAATATATATGCCTCTGCTGGAAGATGGGAAGATGTTAGACAGGTTAGAGAGAAGATGAGGAAGGGAGGTGTCAAGAAATCAGTTGGAATGAGTTGGGTGGAATATAAGGGTCAAATACATAATTTCACTGTGGGTGATAGATCGCATGAATGGTCAAAAGATATTTATAGATTATTGGCTGAACTTGAAAGGAAGATGAAGAGGGTTGGCTTTGTAACTGATAAAAGTTGTGCACTGCGAGATGTTGAGGAGGAAGAGAAGGAAGAAATGCTGGGGACTCACAGTGAGAAGTTGGCCATTTGTTTTGCGCTTCTTGTCAGTGAAGTGGGGACACCAATTAGAGTGGTGAAAAATTTGAGAATTTGTATGGATTGTCATACAGCTATTAAGATGATCTCAAAGTTAGAGGAAAGAGAGATTATTGTCCGTGATAATAATAGATCTGAGTATGACACACTCGGAACGTCAGTCAAAGCGGCACTGGTTTACCTTGGAACGGCCTTAGTAAAGCTTGTATGCCTTGCAACCTTCCTTAACGTGTCAGAGAATGACTCCTTTGACCTATATCAGGAACTGTTGAAAGCGCTTATTGGTTTAATAGATGTTGCCGGACTTTACTTTGCTTTGACCCAGTTGACTTACCGGAACATTTCTCAGAACCATAAATTTCAGGCAGTTGGACTGGGTTGGGCATTCGCTGA
TTCTGTTTTGCATAGACTTGCACCACTTTGGATTGGTGCCAGAGGACTAGAGTTTACTTGGGATTACATTTTGCAGGGCCTTGAAGCTAATGCAAATCTGGTGTTGAGCATATCCCTTGCTGCATTGGGATCTTTGATGTGGCTTCGGAAGAACAAACCCAAAGCGCTAATTCCCATAATTTATATCTGTGCACTGATTGTGGCTACTATGCCATCAATCACAAGCTACTTAAGGCGGGGAATGGGTTGGCATTTCCCTAAGGTGGTGGGATTTGAACTCTTCACCTCTTTGGCGATGGCTTTTATTAGTTGGCAGCTTTTTTCTGCTTGTCAGAGACCCTCTGTTTAA

Protein sequence

MTVFHFFNCAILTFGPHAVYYSATPLNTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRMLKEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLEIARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNGESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGFNSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIEPDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIESMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMISKLEEREIIVRDNNRSEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFDLYQELLKALIGLIDVAGLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLWIGARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPKALIPIIYICALIVATMPSITSYLRRGMGWHFPKVVGFELFTSLAMAFISWQLFSACQRPSV
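
The protein sequence above should correspond to a translation of the coding sequence. The sketch below shows one way to check this, assuming the standard nuclear genetic code; the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used so that the example stays short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
cds_start = "ATGACGGTGTTTCATTTC"
# Last seven codons of the CDS, including the TAA stop codon.
cds_end = "TGTCAGAGACCCTCTGTTTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MTVFHF` and `CQRPSV`, which match the beginning and end of the listed protein sequence. Passing the full CDS string to the same function would extend the check to the whole protein.
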
BLAST of Cp4.1LG02g07200 vs. Swiss-Prot
Match: PP301_ARATH (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 452.6 bits (1163), Expect = 8.8e-126
Identity = 228/557 (40.93%), Positives = 353/557 (63.38%), Query Frame = 1
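
The percentages reported for each HSP are the matched column counts divided by the alignment length. A small sketch of that arithmetic follows, using the counts from this first hit; the function name `percent` is hypothetical.

```python
def percent(count, alignment_length):
    """Return count as a percentage of the alignment length, rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)


# Counts taken from the first Swiss-Prot hit (PP301_ARATH).
alignment_length = 557
identical = 228   # columns with the same residue in query and subject
positive = 353    # identical columns plus conservative substitutions

identity_pct = percent(identical, alignment_length)
positive_pct = percent(positive, alignment_length)
print(identity_pct, positive_pct)
```

The results, 40.93 and 63.38, agree with the values on the summary line. The same calculation applies to the other hits listed in this record.
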

Query: 29  ISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRMLKE 88
           +SW  ++ G+ K + +  AR++FD M  + VVSWN +++ YAQ+   +EA +LF     E
Sbjct: 220 VSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFD----E 279

Query: 89  GITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLEIA 148
               D  TW A +S          A  L  K+ +++ +  N    A+L  + +   +E+A
Sbjct: 280 SPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPERNEVSWN----AMLAGYVQGERMEMA 339

Query: 149 RRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNGES 208
           + +FD +   RN  TWN MI+ Y + GK+S A+ LFD MPKRD VSW +MIAGY+Q+G S
Sbjct: 340 KELFDVMPC-RNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMIAGYSQSGHS 399

Query: 209 AMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGFNS 268
             ++ LF +M +    + +  + +S LS C  + AL+    +   + +   + G    N+
Sbjct: 400 FEALRLFVQM-EREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNA 459

Query: 269 LIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIEPD 328
           L+ MY KCG++ +A+ +F+ M  KD+V++NT+I+G++ +G G+ A++   +M+ EG++PD
Sbjct: 460 LLLMYCKCGSIEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPD 519

Query: 329 HVTYIGVLTACSHAGMLKEGKNIFKSIKA-----PTVDHYACMVDLLGRAGELDEAKMLI 388
             T + VL+ACSH G++ +G+  F ++       P   HYACMVDLLGRAG L++A  L+
Sbjct: 520 DATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLM 579

Query: 389 ESMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWE 448
           ++MP +P A ++G+LL  SR+H   EL E AA+K+  +EP+N G Y+LLSN+YAS+GRW 
Sbjct: 580 KNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWG 639

Query: 449 DVRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVG 508
           DV ++R +MR  GVKK  G SW+E + + H F+VGD  H    +I+  L EL+ +MK+ G
Sbjct: 640 DVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLRMKKAG 699

Query: 509 FVTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIK 568
           +V+  S  L DVEEEEKE M+  HSE+LA+ + ++    G PIRV+KNLR+C DCH AIK
Sbjct: 700 YVSKTSVVLHDVEEEEKERMVRYHSERLAVAYGIMRVSSGRPIRVIKNLRVCEDCHNAIK 759

Query: 569 MISKLEEREIIVRDNNR 581
            ++++  R II+RDNNR
Sbjct: 760 YMARITGRLIILRDNNR 766

BLAST of Cp4.1LG02g07200 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 443.7 bits (1140), Expect = 4.1e-123
Identity = 215/554 (38.81%), Positives = 349/554 (63.00%), Query Frame = 1

Query: 33  AMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRMLKEGITP 92
           +++  Y    DL+SA + F  + EK VVSWN+M++ + Q    ++AL+LF +M  E +  
Sbjct: 171 SLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKA 230

Query: 93  DDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLEIARRIF 152
              T V  +S+C+ I N      + + I +  V +N  +  A+LDM+ K G++E A+R+F
Sbjct: 231 SHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLF 290

Query: 153 DELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNGESAMSI 212
           D +  + N VTW  M+  Y  +     ARE+ ++MP++D+V+WN++I+ Y QNG+   ++
Sbjct: 291 DAMEEKDN-VTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEAL 350

Query: 213 HLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGFNSLIFM 272
            +F E+    +++ +++T+ S LSAC  +GAL+   W+ + +++  I+      ++LI M
Sbjct: 351 IVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHM 410

Query: 273 YSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIEPDHVTY 332
           YSKCG++  +  +F ++  +DV  ++ +I G A +G G +A+ +   M+E  ++P+ VT+
Sbjct: 411 YSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTF 470

Query: 333 IGVLTACSHAGMLKEGKNIFKSIKA-----PTVDHYACMVDLLGRAGELDEAKMLIESMP 392
             V  ACSH G++ E +++F  +++     P   HYAC+VD+LGR+G L++A   IE+MP
Sbjct: 471 TNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMP 530

Query: 393 MKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVRQ 452
           + P   V+G+LL   +IH  + L E+A  +LLELEP+N G ++LLSNIYA  G+WE+V +
Sbjct: 531 IPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSE 590

Query: 453 VREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVTD 512
           +R+ MR  G+KK  G S +E  G IH F  GD +H  S+ +Y  L E+  K+K  G+  +
Sbjct: 591 LRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPE 650

Query: 513 KSCALRDVEEEE-KEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMIS 572
            S  L+ +EEEE KE+ L  HSEKLAIC+ L+ +E    IRV+KNLR+C DCH+  K+IS
Sbjct: 651 ISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLIS 710

Query: 573 KLEEREIIVRDNNR 581
           +L +REIIVRD  R
Sbjct: 711 QLYDREIIVRDRYR 723

BLAST of Cp4.1LG02g07200 vs. Swiss-Prot
Match: PP249_ARATH (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN=PCMP-H56 PE=2 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 9.4e-120
Identity = 227/572 (39.69%), Positives = 341/572 (59.62%), Query Frame = 1

Query: 22  SATPLNTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKL 81
           S   +N +  +A+V  Y K   ++ A+R FDE    ++   NAM S Y +     EAL +
Sbjct: 265 SGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGV 324

Query: 82  FHRMLKEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNY-----VKTALL 141
           F+ M+  G+ PD  + ++AISSCS + N      L  K    +V+ N +     +  AL+
Sbjct: 325 FNLMMDSGVRPDRISMLSAISSCSQLRN-----ILWGKSCHGYVLRNGFESWDNICNALI 384

Query: 142 DMHAKFGNLEIARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWN 201
           DM+ K    + A RIFD +   +  VTWN +++ Y   G++  A E F+ MP++++VSWN
Sbjct: 385 DMYMKCHRQDTAFRIFDRMSN-KTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWN 444

Query: 202 SMIAGYAQNGESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQE 261
           ++I+G  Q      +I +F  M     +  D VT+ S+ SACGH+GAL  + W+   +++
Sbjct: 445 TIISGLVQGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEK 504

Query: 262 KNIKFGISGFNSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKL 321
             I+  +    +L+ M+S+CG+   A  IF ++  +DV  +   I   A  G+ + AI+L
Sbjct: 505 NGIQLDVRLGTTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIEL 564

Query: 322 LLTMEEEGIEPDHVTYIGVLTACSHAGMLKEGKNIFKSIK-----APTVDHYACMVDLLG 381
              M E+G++PD V ++G LTACSH G++++GK IF S+      +P   HY CMVDLLG
Sbjct: 565 FDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLG 624

Query: 382 RAGELDEAKMLIESMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYIL 441
           RAG L+EA  LIE MPM+P+  ++ SLL   R+   VE+   AA K+  L P+  G+Y+L
Sbjct: 625 RAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVL 684

Query: 442 LSNIYASAGRWEDVRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRL 501
           LSN+YASAGRW D+ +VR  M++ G++K  G S ++ +G+ H FT GD SH    +I  +
Sbjct: 685 LSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAM 744

Query: 502 LAELERKMKRVGFVTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKN 561
           L E+ ++   +G V D S  L DV+E+EK  ML  HSEKLA+ + L+ S  GT IR+VKN
Sbjct: 745 LDEVSQRASHLGHVPDLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKN 804

Query: 562 LRICMDCHTAIKMISKLEEREIIVRDNNRSEY 584
           LR+C DCH+  K  SK+  REII+RDNNR  Y
Sbjct: 805 LRVCSDCHSFAKFASKVYNREIILRDNNRFHY 830

BLAST of Cp4.1LG02g07200 vs. Swiss-Prot
Match: PP168_ARATH (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 431.8 bits (1109), Expect = 1.6e-119
Identity = 212/559 (37.92%), Positives = 350/559 (62.61%), Query Frame = 1

Query: 30  SWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRMLKEG 89
           SW AM+  + ++  ++ A   F++M E+ +V+WN+M+S + Q      AL +F +ML++ 
Sbjct: 214 SWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDS 273

Query: 90  I-TPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLEIA 149
           + +PD  T  + +S+C+++    +   + + I      ++  V  AL+ M+++ G +E A
Sbjct: 274 LLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETA 333

Query: 150 RRIFDELGGQRNAVT-WNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNGE 209
           RR+ ++ G +   +  +  ++  Y + G ++ A+ +F ++  RDVV+W +MI GY Q+G 
Sbjct: 334 RRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGS 393

Query: 210 SAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGFN 269
              +I+LFR M+     +P+  T+A++LS    + +L     +     +    + +S  N
Sbjct: 394 YGEAINLFRSMVGGGQ-RPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSN 453

Query: 270 SLIFMYSKCGNVVDAHRIFQNMGT-KDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIE 329
           +LI MY+K GN+  A R F  +   +D V++ ++I   A +GH ++A++L  TM  EG+ 
Sbjct: 454 ALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLR 513

Query: 330 PDHVTYIGVLTACSHAGMLKEGKNIFKSIK-----APTVDHYACMVDLLGRAGELDEAKM 389
           PDH+TY+GV +AC+HAG++ +G+  F  +K      PT+ HYACMVDL GRAG L EA+ 
Sbjct: 514 PDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQE 573

Query: 390 LIESMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGR 449
            IE MP++P    +GSLL+  R+HK ++LG++AA +LL LEP+N G Y  L+N+Y++ G+
Sbjct: 574 FIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGK 633

Query: 450 WEDVRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKR 509
           WE+  ++R+ M+ G VKK  G SW+E K ++H F V D +H    +IY  + ++  ++K+
Sbjct: 634 WEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKK 693

Query: 510 VGFVTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTA 569
           +G+V D +  L D+EEE KE++L  HSEKLAI F L+ +   T +R++KNLR+C DCHTA
Sbjct: 694 MGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRVCNDCHTA 753

Query: 570 IKMISKLEEREIIVRDNNR 581
           IK ISKL  REIIVRD  R
Sbjct: 754 IKFISKLVGREIIVRDTTR 771

BLAST of Cp4.1LG02g07200 vs. Swiss-Prot
Match: PP316_ARATH (Pentatricopeptide repeat-containing protein At4g16835, mitochondrial OS=Arabidopsis thaliana GN=DYW10 PE=2 SV=3)

HSP 1 Score: 430.6 bits (1106), Expect = 3.6e-119
Identity = 233/573 (40.66%), Positives = 341/573 (59.51%), Query Frame = 1

Query: 15  GPHAVYYSATPLNTISWTAMVTGYAKMRD-LESARRYFDEMPEKSVVSWNAMLSAYAQNE 74
           G   V++     NTI+W +++ G +K    +  A + FDE+PE    S+N MLS Y +N 
Sbjct: 79  GALRVFHGMRAKNTITWNSLLIGISKDPSRMMEAHQLFDEIPEPDTFSYNIMLSCYVRNV 138

Query: 75  CAEEALKLFHRMLKEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKT 134
             E+A   F RM  +    D  +W   I+  +  G    A  L   + +K+ +  N    
Sbjct: 139 NFEKAQSFFDRMPFK----DAASWNTMITGYARRGEMEKARELFYSMMEKNEVSWN---- 198

Query: 135 ALLDMHAKFGNLEIARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMP-KRDV 194
           A++  + + G+LE A   F ++   R  V W  MI+ Y +A K+ LA  +F +M   +++
Sbjct: 199 AMISGYIECGDLEKASHFF-KVAPVRGVVAWTAMITGYMKAKKVELAEAMFKDMTVNKNL 258

Query: 195 VSWNSMIAGYAQNGESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLN 254
           V+WN+MI+GY +N      + LFR M++   I+P+   ++S L  C  + AL+    +  
Sbjct: 259 VTWNAMISGYVENSRPEDGLKLFRAMLE-EGIRPNSSGLSSALLGCSELSALQLGRQIHQ 318

Query: 255 IVQEKNIKFGISGFNSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKD 314
           IV +  +   ++   SLI MY KCG + DA ++F+ M  KDVV +N +ISG+A +G+   
Sbjct: 319 IVSKSTLCNDVTALTSLISMYCKCGELGDAWKLFEVMKKKDVVAWNAMISGYAQHGNADK 378

Query: 315 AIKLLLTMEEEGIEPDHVTYIGVLTACSHAGMLKEGKNIFKSIKA-----PTVDHYACMV 374
           A+ L   M +  I PD +T++ VL AC+HAG++  G   F+S+       P  DHY CMV
Sbjct: 379 ALCLFREMIDNKIRPDWITFVAVLLACNHAGLVNIGMAYFESMVRDYKVEPQPDHYTCMV 438

Query: 375 DLLGRAGELDEAKMLIESMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPG 434
           DLLGRAG+L+EA  LI SMP +PHA V+G+LL   R+HK VEL E AA KLL+L  QN  
Sbjct: 439 DLLGRAGKLEEALKLIRSMPFRPHAAVFGTLLGACRVHKNVELAEFAAEKLLQLNSQNAA 498

Query: 435 NYILLSNIYASAGRWEDVRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKD 494
            Y+ L+NIYAS  RWEDV +VR++M++  V K  G SW+E + ++H+F   DR H     
Sbjct: 499 GYVQLANIYASKNRWEDVARVRKRMKESNVVKVPGYSWIEIRNKVHHFRSSDRIHPELDS 558

Query: 495 IYRLLAELERKMKRVGFVTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIR 554
           I++ L ELE+KMK  G+  +   AL +VEEE+KE++L  HSEKLA+ F  +    G+ I+
Sbjct: 559 IHKKLKELEKKMKLAGYKPELEFALHNVEEEQKEKLLLWHSEKLAVAFGCIKLPQGSQIQ 618

Query: 555 VVKNLRICMDCHTAIKMISKLEEREIIVRDNNR 581
           V KNLRIC DCH AIK IS++E+REIIVRD  R
Sbjct: 619 VFKNLRICGDCHKAIKFISEIEKREIIVRDTTR 641

BLAST of Cp4.1LG02g07200 vs. TrEMBL
Match: A5B4C7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_013866 PE=4 SV=1)

HSP 1 Score: 807.7 bits (2085), Expect = 1.2e-230
Identity = 386/554 (69.68%), Positives = 461/554 (83.21%), Query Frame = 1

Query: 27  NTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRML 86
           N I+WTAMVTGYAK++DLE+ARRYFD MPE+SVVSWNAMLS YAQN  AEE L+LF  M+
Sbjct: 193 NVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEEVLRLFDEMV 252

Query: 87  KEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLE 146
             GI PD+TTWV  IS+CSS G+P LA SL+  ++QK + LN +V+TALLDM+AK G++ 
Sbjct: 253 NAGIEPDETTWVTVISACSSRGDPCLAASLVRTLHQKQIQLNCFVRTALLDMYAKCGSIG 312

Query: 147 IARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNG 206
            ARRIFDELG  RN+VTWN MISAYTR G L  ARELF+ MP R+VV+WNSMIAGYAQNG
Sbjct: 313 AARRIFDELGAYRNSVTWNAMISAYTRVGNLDSARELFNTMPGRNVVTWNSMIAGYAQNG 372

Query: 207 ESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGF 266
           +SAM+I LF+EMI    + PDEVT+ SV+SACGH+GAL+   WV+  + E  IK  ISG 
Sbjct: 373 QSAMAIELFKEMITAKKLTPDEVTMVSVISACGHLGALELGNWVVRFLTENQIKLSISGH 432

Query: 267 NSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIE 326
           N++IFMYS+CG++ DA R+FQ M T+DVV++NTLISGFAA+GHG +AI L+ TM+E GIE
Sbjct: 433 NAMIFMYSRCGSMEDAKRVFQEMATRDVVSYNTLISGFAAHGHGVEAINLMSTMKEGGIE 492

Query: 327 PDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIESM 386
           PD VT+IGVLTACSHAG+L+EG+ +F+SIK P +DHYACMVDLLGR GEL++AK  +E M
Sbjct: 493 PDRVTFIGVLTACSHAGLLEEGRKVFESIKDPAIDHYACMVDLLGRVGELEDAKRTMERM 552

Query: 387 PMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVR 446
           PM+PHAGVYGSLLN SRIHK+VELGELAANKL ELEP N GN+ILLSNIYASAGRW+DV 
Sbjct: 553 PMEPHAGVYGSLLNASRIHKQVELGELAANKLFELEPDNSGNFILLSNIYASAGRWKDVE 612

Query: 447 QVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVT 506
           ++RE M+KGGVKK+ G SWVEY G++H F V DRSHE S DIY+LL EL +KM+  G++ 
Sbjct: 613 RIREAMKKGGVKKTTGWSWVEYGGKLHKFIVADRSHERSDDIYQLLIELRKKMREAGYIA 672

Query: 507 DKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMIS 566
           DKSC LRDVEEEEKEE++GTHSEKLAIC+ALLVSE G  IRVVKNLR+C DCHTAIKMIS
Sbjct: 673 DKSCVLRDVEEEEKEEIVGTHSEKLAICYALLVSEAGAVIRVVKNLRVCWDCHTAIKMIS 732

Query: 567 KLEEREIIVRDNNR 581
           KLE R IIVRDNNR
Sbjct: 733 KLEGRVIIVRDNNR 746

BLAST of Cp4.1LG02g07200 vs. TrEMBL
Match: F6HJZ0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0035g01970 PE=4 SV=1)

HSP 1 Score: 805.1 bits (2078), Expect = 7.8e-230
Identity = 385/554 (69.49%), Positives = 461/554 (83.21%), Query Frame = 1

Query: 27  NTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRML 86
           N I+WTAMVTGYAK++DLE+ARRYFD MPE+SVVSWNAMLS YAQN  AEEAL+LF  M+
Sbjct: 123 NVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEEALRLFDEMV 182

Query: 87  KEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLE 146
             GI PD+TTWV  IS+CSS G+P LA SL+  ++QK + LN +V+TALLDM+AK G++ 
Sbjct: 183 NAGIEPDETTWVTVISACSSRGDPCLAASLVRTLHQKRIQLNCFVRTALLDMYAKCGSIG 242

Query: 147 IARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNG 206
            ARRIFDELG  RN+VTWN MISAY R G L  AR+LF+ MP R+VV+WNSMIAGYAQNG
Sbjct: 243 AARRIFDELGAYRNSVTWNAMISAYMRVGDLDSARKLFNTMPGRNVVTWNSMIAGYAQNG 302

Query: 207 ESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGF 266
           +SAM+I LF+EMI    + PDEVT+ SV+SACGH+GAL+   WV+  + E  IK  ISG 
Sbjct: 303 QSAMAIELFKEMITAKKLTPDEVTMVSVISACGHLGALELGNWVVRFLTENQIKLSISGH 362

Query: 267 NSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIE 326
           N++IFMYS+CG++ DA R+FQ M T+DVV++NTLISGFAA+GHG +AI L+ TM+E GIE
Sbjct: 363 NAMIFMYSRCGSMEDAKRVFQEMATRDVVSYNTLISGFAAHGHGVEAINLMSTMKEGGIE 422

Query: 327 PDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIESM 386
           PD VT+IGVLTACSHAG+L+EG+ +F+SIK P +DHYACMVDLLGR GEL++AK  +E M
Sbjct: 423 PDRVTFIGVLTACSHAGLLEEGRKVFESIKDPAIDHYACMVDLLGRVGELEDAKRTMERM 482

Query: 387 PMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVR 446
           PM+PHAGVYGSLLN SRIHK+VELGELAANKL ELEP N GN+ILLSNIYASAGRW+DV 
Sbjct: 483 PMEPHAGVYGSLLNASRIHKQVELGELAANKLFELEPDNSGNFILLSNIYASAGRWKDVE 542

Query: 447 QVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVT 506
           ++RE M+KGGVKK+ G SWVEY G++H F V DRSHE S DIY+LL EL +KM+  G++ 
Sbjct: 543 RIREAMKKGGVKKTTGWSWVEYGGKLHKFIVADRSHERSDDIYQLLIELRKKMREAGYIA 602

Query: 507 DKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMIS 566
           DKSC LRDVEEEEKEE++GTHSEKLAIC+ALLVSE G  IRVVKNLR+C DCHTAIKMIS
Sbjct: 603 DKSCVLRDVEEEEKEEIVGTHSEKLAICYALLVSEAGAVIRVVKNLRVCWDCHTAIKMIS 662

Query: 567 KLEEREIIVRDNNR 581
           KLE R IIVRDNNR
Sbjct: 663 KLEGRVIIVRDNNR 676

BLAST of Cp4.1LG02g07200 vs. TrEMBL
Match: A0A067KKU3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13364 PE=4 SV=1)

HSP 1 Score: 796.6 bits (2056), Expect = 2.8e-227
Identity = 379/554 (68.41%), Positives = 461/554 (83.21%), Query Frame = 1

Query: 27  NTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRML 86
           N ++WTAMVTG+AK++DLE AR+YFD MP +SVVSWNAMLS YAQN  AEEALKLF  M+
Sbjct: 192 NVVTWTAMVTGFAKIKDLEKARKYFDYMPMRSVVSWNAMLSGYAQNGFAEEALKLFGDMV 251

Query: 87  KEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLE 146
             G+ P++TTW   +S CS  G+P +A+S++  ++ K + +N +VKTALLDM+AK GNLE
Sbjct: 252 NSGVQPNETTWATVVSLCSLFGDPCVAESIVKMLDGKRIKMNCFVKTALLDMNAKCGNLE 311

Query: 147 IARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNG 206
            AR IF+ELG  RN+VTWN MISAYT+ G L  AR+ FD MP+RDVVSWN+MI+GYAQNG
Sbjct: 312 AARNIFNELGVHRNSVTWNTMISAYTKVGDLDSARDHFDRMPERDVVSWNTMISGYAQNG 371

Query: 207 ESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGF 266
           +SA +I +F+EMI   D+QPDEVT+ASV+SACGH+GAL+   WV+N + E  I   I G+
Sbjct: 372 QSAKAIEIFKEMISSKDLQPDEVTMASVISACGHLGALELGTWVVNHITEYKINLSILGY 431

Query: 267 NSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIE 326
           NSLIFMYSKCGN+ +AHRIFQ M T+DVV++NTLI+GFAA+G G +AIKLL TM+EEGI 
Sbjct: 432 NSLIFMYSKCGNMKEAHRIFQEMETRDVVSYNTLIAGFAAHGKGIEAIKLLSTMKEEGIH 491

Query: 327 PDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIESM 386
           PD VTYIGVLTACSHAG+++EG  +F+SI++P VDHYACMVDLLGR G+LDEAK LI++M
Sbjct: 492 PDRVTYIGVLTACSHAGLMEEGHKVFESIESPDVDHYACMVDLLGRVGKLDEAKKLIDNM 551

Query: 387 PMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVR 446
           PM+PHAGVYGSLL+ S+IHKRV+ GELAA  L +LEPQN GNY+LLSNIYASAGRWE+V 
Sbjct: 552 PMEPHAGVYGSLLHASQIHKRVDFGELAAKMLFQLEPQNSGNYVLLSNIYASAGRWEEVN 611

Query: 447 QVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVT 506
           +VRE M KG VKK+ G SWVEY+G++H F VGDRSHE S DIYRLLAEL  KM+R G+  
Sbjct: 612 RVREMMSKGEVKKTAGWSWVEYQGKVHKFMVGDRSHERSDDIYRLLAELASKMRRHGYTA 671

Query: 507 DKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMIS 566
           D+SC LRDVEEEEKE M+GTHSEKLAICFALLVS+ G  IRVVKNLR+C+DCHTAIK+IS
Sbjct: 672 DRSCVLRDVEEEEKEHMVGTHSEKLAICFALLVSKSGAAIRVVKNLRVCLDCHTAIKLIS 731

Query: 567 KLEEREIIVRDNNR 581
           +LE REIIVRDNNR
Sbjct: 732 QLEGREIIVRDNNR 745

BLAST of Cp4.1LG02g07200 vs. TrEMBL
Match: A0A061GB75_THECC (Pentatricopeptide repeat superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_027991 PE=4 SV=1)

HSP 1 Score: 783.5 bits (2022), Expect = 2.4e-223
Identity = 376/556 (67.63%), Positives = 454/556 (81.65%), Query Frame = 1

Query: 27  NTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRML 86
           N ++WTAMVTG A M+DL +ARRYFD MP ++VVSWNAMLS YA+N  A+EAL LF  M+
Sbjct: 207 NVVTWTAMVTGSANMKDLITARRYFDRMPRRNVVSWNAMLSGYAKNGFAKEALHLFLHMI 266

Query: 87  K--EGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGN 146
           K  +GI P+  TWVA ISSCSS+ +P LADS++  +++K + LN+Y+KTALLDMHAK GN
Sbjct: 267 KAGDGIEPNQITWVAVISSCSSLADPCLADSVVKFLDKKKIQLNSYLKTALLDMHAKCGN 326

Query: 147 LEIARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQ 206
           LE A++IFDE G  R+  TWN MISAY R G L+LARELFD MP R+VVSWNSMIAG+AQ
Sbjct: 327 LETAQKIFDEFGEHRSCTTWNAMISAYMRFGNLALARELFDKMPVRNVVSWNSMIAGFAQ 386

Query: 207 NGESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGIS 266
           NG+ AM+I LF+EMI  T+++PDEVT+ SV+S CG +GAL+   WV+N + E  IK  IS
Sbjct: 387 NGQPAMAIQLFKEMIATTNLKPDEVTMVSVISVCGQLGALEMGNWVVNFIVENQIKLSIS 446

Query: 267 GFNSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEG 326
           G+N+LIFMYSKCG++ DA RIFQ M  +D +++N L+SGF A+G G +A++L+  M +EG
Sbjct: 447 GYNTLIFMYSKCGSMKDAERIFQEMKRRDTISYNALVSGFGAHGRGIEAVELMSRMRKEG 506

Query: 327 IEPDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIE 386
           IEPDH+TYIGVLTACSHA +LKEG+ +F+SIK P VDHYACMVDLLGR GELDEAK LI+
Sbjct: 507 IEPDHITYIGVLTACSHARLLKEGRRVFESIKFPAVDHYACMVDLLGRVGELDEAKRLID 566

Query: 387 SMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWED 446
            MPM+PHAG+YGSLLN S IHKRVELGE AANKL ELEP N GNY+LLSNIYASA RW D
Sbjct: 567 HMPMEPHAGIYGSLLNASTIHKRVELGEFAANKLFELEPSNSGNYVLLSNIYASAARWGD 626

Query: 447 VRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGF 506
           V  VRE MRK GVKK+ G SWVE+ G++H F VGDRSHE S DIYRLL EL RKM R+G+
Sbjct: 627 VDWVREAMRKLGVKKTTGWSWVEHDGKVHKFIVGDRSHERSDDIYRLLEELCRKMGRLGY 686

Query: 507 VTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKM 566
           + +KSC LRDVE+EEKEEM+GTHSEKLA+CFALLVSEVG  +RVVKNLR+C DCHTA+KM
Sbjct: 687 IANKSCVLRDVEDEEKEEMVGTHSEKLAVCFALLVSEVGAVVRVVKNLRVCQDCHTAMKM 746

Query: 567 ISKLEEREIIVRDNNR 581
           IS LE REII+RDNNR
Sbjct: 747 ISMLEGREIIMRDNNR 762

BLAST of Cp4.1LG02g07200 vs. TrEMBL
Match: A0A061G9Y9_THECC (Pentatricopeptide repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_027991 PE=4 SV=1)

HSP 1 Score: 783.5 bits (2022), Expect = 2.4e-223
Identity = 382/589 (64.86%), Positives = 464/589 (78.78%), Query Frame = 1

Query: 27  NTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRML 86
           N ++WTAMVTG A M+DL +ARRYFD MP ++VVSWNAMLS YA+N  A+EAL LF  M+
Sbjct: 207 NVVTWTAMVTGSANMKDLITARRYFDRMPRRNVVSWNAMLSGYAKNGFAKEALHLFLHMI 266

Query: 87  K--EGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGN 146
           K  +GI P+  TWVA ISSCSS+ +P LADS++  +++K + LN+Y+KTALLDMHAK GN
Sbjct: 267 KAGDGIEPNQITWVAVISSCSSLADPCLADSVVKFLDKKKIQLNSYLKTALLDMHAKCGN 326

Query: 147 LEIARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQ 206
           LE A++IFDE G  R+  TWN MISAY R G L+LARELFD MP R+VVSWNSMIAG+AQ
Sbjct: 327 LETAQKIFDEFGEHRSCTTWNAMISAYMRFGNLALARELFDKMPVRNVVSWNSMIAGFAQ 386

Query: 207 NGESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGIS 266
           NG+ AM+I LF+EMI  T+++PDEVT+ SV+S CG +GAL+   WV+N + E  IK  IS
Sbjct: 387 NGQPAMAIQLFKEMIATTNLKPDEVTMVSVISVCGQLGALEMGNWVVNFIVENQIKLSIS 446

Query: 267 GFNSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEG 326
           G+N+LIFMYSKCG++ DA RIFQ M  +D +++N L+SGF A+G G +A++L+  M +EG
Sbjct: 447 GYNTLIFMYSKCGSMKDAERIFQEMKRRDTISYNALVSGFGAHGRGIEAVELMSRMRKEG 506

Query: 327 IEPDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIE 386
           IEPDH+TYIGVLTACSHA +LKEG+ +F+SIK P VDHYACMVDLLGR GELDEAK LI+
Sbjct: 507 IEPDHITYIGVLTACSHARLLKEGRRVFESIKFPAVDHYACMVDLLGRVGELDEAKRLID 566

Query: 387 SMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWED 446
            MPM+PHAG+YGSLLN S IHKRVELGE AANKL ELEP N GNY+LLSNIYASA RW D
Sbjct: 567 HMPMEPHAGIYGSLLNASTIHKRVELGEFAANKLFELEPSNSGNYVLLSNIYASAARWGD 626

Query: 447 VRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGF 506
           V  VRE MRK GVKK+ G SWVE+ G++H F VGDRSHE S DIYRLL EL RKM R+G+
Sbjct: 627 VDWVREAMRKLGVKKTTGWSWVEHDGKVHKFIVGDRSHERSDDIYRLLEELCRKMGRLGY 686

Query: 507 VTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKM 566
           + +KSC LRDVE+EEKEEM+GTHSEKLA+CFALLVSEVG  +RVVKNLR+C DCHTA+KM
Sbjct: 687 IANKSCVLRDVEDEEKEEMVGTHSEKLAVCFALLVSEVGAVVRVVKNLRVCQDCHTAMKM 746

Query: 567 ISKLEEREIIVRDNNRSEYDTLGTSVK----------AALVYLGTALVK 604
           IS LE REII+RDNN   Y  +   +K             +YLG  +VK
Sbjct: 747 ISMLEGREIIMRDNNSKFYVIVLIDIKHRRNDFDMLLQPCLYLGEYMVK 795

BLAST of Cp4.1LG02g07200 vs. TAIR10
Match: AT4G02750.1 (AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 452.6 bits (1163), Expect = 4.9e-127
Identity = 228/557 (40.93%), Positives = 353/557 (63.38%), Query Frame = 1

Query: 29  ISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRMLKE 88
           +SW  ++ G+ K + +  AR++FD M  + VVSWN +++ YAQ+   +EA +LF     E
Sbjct: 220 VSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFD----E 279

Query: 89  GITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLEIA 148
               D  TW A +S          A  L  K+ +++ +  N    A+L  + +   +E+A
Sbjct: 280 SPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPERNEVSWN----AMLAGYVQGERMEMA 339

Query: 149 RRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNGES 208
           + +FD +   RN  TWN MI+ Y + GK+S A+ LFD MPKRD VSW +MIAGY+Q+G S
Sbjct: 340 KELFDVMPC-RNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMIAGYSQSGHS 399

Query: 209 AMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGFNS 268
             ++ LF +M +    + +  + +S LS C  + AL+    +   + +   + G    N+
Sbjct: 400 FEALRLFVQM-EREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNA 459

Query: 269 LIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIEPD 328
           L+ MY KCG++ +A+ +F+ M  KD+V++NT+I+G++ +G G+ A++   +M+ EG++PD
Sbjct: 460 LLLMYCKCGSIEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPD 519

Query: 329 HVTYIGVLTACSHAGMLKEGKNIFKSIKA-----PTVDHYACMVDLLGRAGELDEAKMLI 388
             T + VL+ACSH G++ +G+  F ++       P   HYACMVDLLGRAG L++A  L+
Sbjct: 520 DATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLM 579

Query: 389 ESMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWE 448
           ++MP +P A ++G+LL  SR+H   EL E AA+K+  +EP+N G Y+LLSN+YAS+GRW 
Sbjct: 580 KNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWG 639

Query: 449 DVRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVG 508
           DV ++R +MR  GVKK  G SW+E + + H F+VGD  H    +I+  L EL+ +MK+ G
Sbjct: 640 DVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLRMKKAG 699

Query: 509 FVTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIK 568
           +V+  S  L DVEEEEKE M+  HSE+LA+ + ++    G PIRV+KNLR+C DCH AIK
Sbjct: 700 YVSKTSVVLHDVEEEEKERMVRYHSERLAVAYGIMRVSSGRPIRVIKNLRVCEDCHNAIK 759

Query: 569 MISKLEEREIIVRDNNR 581
            ++++  R II+RDNNR
Sbjct: 760 YMARITGRLIILRDNNR 766

BLAST of Cp4.1LG02g07200 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 443.7 bits (1140), Expect = 2.3e-124
Identity = 215/554 (38.81%), Positives = 349/554 (63.00%), Query Frame = 1

Query: 33  AMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRMLKEGITP 92
           +++  Y    DL+SA + F  + EK VVSWN+M++ + Q    ++AL+LF +M  E +  
Sbjct: 171 SLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKA 230

Query: 93  DDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLEIARRIF 152
              T V  +S+C+ I N      + + I +  V +N  +  A+LDM+ K G++E A+R+F
Sbjct: 231 SHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLF 290

Query: 153 DELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNGESAMSI 212
           D +  + N VTW  M+  Y  +     ARE+ ++MP++D+V+WN++I+ Y QNG+   ++
Sbjct: 291 DAMEEKDN-VTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEAL 350

Query: 213 HLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGFNSLIFM 272
            +F E+    +++ +++T+ S LSAC  +GAL+   W+ + +++  I+      ++LI M
Sbjct: 351 IVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHM 410

Query: 273 YSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIEPDHVTY 332
           YSKCG++  +  +F ++  +DV  ++ +I G A +G G +A+ +   M+E  ++P+ VT+
Sbjct: 411 YSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTF 470

Query: 333 IGVLTACSHAGMLKEGKNIFKSIKA-----PTVDHYACMVDLLGRAGELDEAKMLIESMP 392
             V  ACSH G++ E +++F  +++     P   HYAC+VD+LGR+G L++A   IE+MP
Sbjct: 471 TNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMP 530

Query: 393 MKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVRQ 452
           + P   V+G+LL   +IH  + L E+A  +LLELEP+N G ++LLSNIYA  G+WE+V +
Sbjct: 531 IPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSE 590

Query: 453 VREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVTD 512
           +R+ MR  G+KK  G S +E  G IH F  GD +H  S+ +Y  L E+  K+K  G+  +
Sbjct: 591 LRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPE 650

Query: 513 KSCALRDVEEEE-KEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMIS 572
            S  L+ +EEEE KE+ L  HSEKLAIC+ L+ +E    IRV+KNLR+C DCH+  K+IS
Sbjct: 651 ISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLIS 710

Query: 573 KLEEREIIVRDNNR 581
           +L +REIIVRD  R
Sbjct: 711 QLYDREIIVRDRYR 723

BLAST of Cp4.1LG02g07200 vs. TAIR10
Match: AT3G22690.1 (AT3G22690.1 Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885))

HSP 1 Score: 432.6 bits (1111), Expect = 5.3e-121
Identity = 227/572 (39.69%), Positives = 341/572 (59.62%), Query Frame = 1

Query: 22  SATPLNTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKL 81
           S   +N +  +A+V  Y K   ++ A+R FDE    ++   NAM S Y +     EAL +
Sbjct: 265 SGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGV 324

Query: 82  FHRMLKEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNY-----VKTALL 141
           F+ M+  G+ PD  + ++AISSCS + N      L  K    +V+ N +     +  AL+
Sbjct: 325 FNLMMDSGVRPDRISMLSAISSCSQLRN-----ILWGKSCHGYVLRNGFESWDNICNALI 384

Query: 142 DMHAKFGNLEIARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWN 201
           DM+ K    + A RIFD +   +  VTWN +++ Y   G++  A E F+ MP++++VSWN
Sbjct: 385 DMYMKCHRQDTAFRIFDRMSN-KTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWN 444

Query: 202 SMIAGYAQNGESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQE 261
           ++I+G  Q      +I +F  M     +  D VT+ S+ SACGH+GAL  + W+   +++
Sbjct: 445 TIISGLVQGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEK 504

Query: 262 KNIKFGISGFNSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKL 321
             I+  +    +L+ M+S+CG+   A  IF ++  +DV  +   I   A  G+ + AI+L
Sbjct: 505 NGIQLDVRLGTTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIEL 564

Query: 322 LLTMEEEGIEPDHVTYIGVLTACSHAGMLKEGKNIFKSIK-----APTVDHYACMVDLLG 381
              M E+G++PD V ++G LTACSH G++++GK IF S+      +P   HY CMVDLLG
Sbjct: 565 FDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLG 624

Query: 382 RAGELDEAKMLIESMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYIL 441
           RAG L+EA  LIE MPM+P+  ++ SLL   R+   VE+   AA K+  L P+  G+Y+L
Sbjct: 625 RAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVL 684

Query: 442 LSNIYASAGRWEDVRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRL 501
           LSN+YASAGRW D+ +VR  M++ G++K  G S ++ +G+ H FT GD SH    +I  +
Sbjct: 685 LSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAM 744

Query: 502 LAELERKMKRVGFVTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKN 561
           L E+ ++   +G V D S  L DV+E+EK  ML  HSEKLA+ + L+ S  GT IR+VKN
Sbjct: 745 LDEVSQRASHLGHVPDLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKN 804

Query: 562 LRICMDCHTAIKMISKLEEREIIVRDNNRSEY 584
           LR+C DCH+  K  SK+  REII+RDNNR  Y
Sbjct: 805 LRVCSDCHSFAKFASKVYNREIILRDNNRFHY 830

BLAST of Cp4.1LG02g07200 vs. TAIR10
Match: AT2G22070.1 (AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 431.8 bits (1109), Expect = 9.0e-121
Identity = 212/559 (37.92%), Positives = 350/559 (62.61%), Query Frame = 1

Query: 30  SWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRMLKEG 89
           SW AM+  + ++  ++ A   F++M E+ +V+WN+M+S + Q      AL +F +ML++ 
Sbjct: 214 SWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDS 273

Query: 90  I-TPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLEIA 149
           + +PD  T  + +S+C+++    +   + + I      ++  V  AL+ M+++ G +E A
Sbjct: 274 LLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETA 333

Query: 150 RRIFDELGGQRNAVT-WNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNGE 209
           RR+ ++ G +   +  +  ++  Y + G ++ A+ +F ++  RDVV+W +MI GY Q+G 
Sbjct: 334 RRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGS 393

Query: 210 SAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGFN 269
              +I+LFR M+     +P+  T+A++LS    + +L     +     +    + +S  N
Sbjct: 394 YGEAINLFRSMVGGGQ-RPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSN 453

Query: 270 SLIFMYSKCGNVVDAHRIFQNMGT-KDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIE 329
           +LI MY+K GN+  A R F  +   +D V++ ++I   A +GH ++A++L  TM  EG+ 
Sbjct: 454 ALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLR 513

Query: 330 PDHVTYIGVLTACSHAGMLKEGKNIFKSIK-----APTVDHYACMVDLLGRAGELDEAKM 389
           PDH+TY+GV +AC+HAG++ +G+  F  +K      PT+ HYACMVDL GRAG L EA+ 
Sbjct: 514 PDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQE 573

Query: 390 LIESMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGR 449
            IE MP++P    +GSLL+  R+HK ++LG++AA +LL LEP+N G Y  L+N+Y++ G+
Sbjct: 574 FIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGK 633

Query: 450 WEDVRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKR 509
           WE+  ++R+ M+ G VKK  G SW+E K ++H F V D +H    +IY  + ++  ++K+
Sbjct: 634 WEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKK 693

Query: 510 VGFVTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTA 569
           +G+V D +  L D+EEE KE++L  HSEKLAI F L+ +   T +R++KNLR+C DCHTA
Sbjct: 694 MGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRVCNDCHTA 753

Query: 570 IKMISKLEEREIIVRDNNR 581
           IK ISKL  REIIVRD  R
Sbjct: 754 IKFISKLVGREIIVRDTTR 771

BLAST of Cp4.1LG02g07200 vs. TAIR10
Match: AT4G16835.1 (AT4G16835.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 430.6 bits (1106), Expect = 2.0e-120
Identity = 233/573 (40.66%), Positives = 341/573 (59.51%), Query Frame = 1

Query: 15  GPHAVYYSATPLNTISWTAMVTGYAKMRD-LESARRYFDEMPEKSVVSWNAMLSAYAQNE 74
           G   V++     NTI+W +++ G +K    +  A + FDE+PE    S+N MLS Y +N 
Sbjct: 79  GALRVFHGMRAKNTITWNSLLIGISKDPSRMMEAHQLFDEIPEPDTFSYNIMLSCYVRNV 138

Query: 75  CAEEALKLFHRMLKEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKT 134
             E+A   F RM  +    D  +W   I+  +  G    A  L   + +K+ +  N    
Sbjct: 139 NFEKAQSFFDRMPFK----DAASWNTMITGYARRGEMEKARELFYSMMEKNEVSWN---- 198

Query: 135 ALLDMHAKFGNLEIARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMP-KRDV 194
           A++  + + G+LE A   F ++   R  V W  MI+ Y +A K+ LA  +F +M   +++
Sbjct: 199 AMISGYIECGDLEKASHFF-KVAPVRGVVAWTAMITGYMKAKKVELAEAMFKDMTVNKNL 258

Query: 195 VSWNSMIAGYAQNGESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLN 254
           V+WN+MI+GY +N      + LFR M++   I+P+   ++S L  C  + AL+    +  
Sbjct: 259 VTWNAMISGYVENSRPEDGLKLFRAMLE-EGIRPNSSGLSSALLGCSELSALQLGRQIHQ 318

Query: 255 IVQEKNIKFGISGFNSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKD 314
           IV +  +   ++   SLI MY KCG + DA ++F+ M  KDVV +N +ISG+A +G+   
Sbjct: 319 IVSKSTLCNDVTALTSLISMYCKCGELGDAWKLFEVMKKKDVVAWNAMISGYAQHGNADK 378

Query: 315 AIKLLLTMEEEGIEPDHVTYIGVLTACSHAGMLKEGKNIFKSIKA-----PTVDHYACMV 374
           A+ L   M +  I PD +T++ VL AC+HAG++  G   F+S+       P  DHY CMV
Sbjct: 379 ALCLFREMIDNKIRPDWITFVAVLLACNHAGLVNIGMAYFESMVRDYKVEPQPDHYTCMV 438

Query: 375 DLLGRAGELDEAKMLIESMPMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPG 434
           DLLGRAG+L+EA  LI SMP +PHA V+G+LL   R+HK VEL E AA KLL+L  QN  
Sbjct: 439 DLLGRAGKLEEALKLIRSMPFRPHAAVFGTLLGACRVHKNVELAEFAAEKLLQLNSQNAA 498

Query: 435 NYILLSNIYASAGRWEDVRQVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKD 494
            Y+ L+NIYAS  RWEDV +VR++M++  V K  G SW+E + ++H+F   DR H     
Sbjct: 499 GYVQLANIYASKNRWEDVARVRKRMKESNVVKVPGYSWIEIRNKVHHFRSSDRIHPELDS 558

Query: 495 IYRLLAELERKMKRVGFVTDKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIR 554
           I++ L ELE+KMK  G+  +   AL +VEEE+KE++L  HSEKLA+ F  +    G+ I+
Sbjct: 559 IHKKLKELEKKMKLAGYKPELEFALHNVEEEQKEKLLLWHSEKLAVAFGCIKLPQGSQIQ 618

Query: 555 VVKNLRICMDCHTAIKMISKLEEREIIVRDNNR 581
           V KNLRIC DCH AIK IS++E+REIIVRD  R
Sbjct: 619 VFKNLRICGDCHKAIKFISEIEKREIIVRDTTR 641

BLAST of Cp4.1LG02g07200 vs. NCBI nr
Match: gi|449460189|ref|XP_004147828.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Cucumis sativus])

HSP 1 Score: 997.7 bits (2578), Expect = 1.2e-287
Identity = 484/554 (87.36%), Positives = 521/554 (94.04%), Query Frame = 1

Query: 27  NTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRML 86
           N I+WT+MVTGYAKM DLESARRYFDEMPE+SVVSWNAM SAYAQ EC +EAL LFH+ML
Sbjct: 191 NIITWTSMVTGYAKMGDLESARRYFDEMPERSVVSWNAMQSAYAQKECPKEALNLFHQML 250

Query: 87  KEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLE 146
           +EGITPDDTTWV  ISSCSSIG+P LADS+L  I+QKH++LN++VKTALLDMHAKFGNLE
Sbjct: 251 EEGITPDDTTWVVTISSCSSIGDPTLADSILRMIDQKHIVLNSFVKTALLDMHAKFGNLE 310

Query: 147 IARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNG 206
           IAR IFDELG QRNAVTWN+MISAYTR GKLSLARELFDNMPKRDVVSWNSMIAGYAQNG
Sbjct: 311 IARNIFDELGSQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWNSMIAGYAQNG 370

Query: 207 ESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGF 266
           ESAMSI LF+EMI C DIQPDEVTIASVLSACGHIGALK SYWVL+IV+EKNIK GISGF
Sbjct: 371 ESAMSIELFKEMISCMDIQPDEVTIASVLSACGHIGALKLSYWVLDIVREKNIKLGISGF 430

Query: 267 NSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIE 326
           NSLIFMYSKCG+V DAHRIFQ MGT+DVV+FNTLISGFAANGHGK+AIKL+LTMEEEGIE
Sbjct: 431 NSLIFMYSKCGSVADAHRIFQTMGTRDVVSFNTLISGFAANGHGKEAIKLVLTMEEEGIE 490

Query: 327 PDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIESM 386
           PDHVTYIGVLTACSHAG+L EGKN+FKSI+APTVDHYACMVDLLGRAGELDEAKMLI+SM
Sbjct: 491 PDHVTYIGVLTACSHAGLLNEGKNVFKSIQAPTVDHYACMVDLLGRAGELDEAKMLIQSM 550

Query: 387 PMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVR 446
           PMKPHAGVYGSLLN SRIHKRV LGELAA+KL ELEPQN GNY+LLSNIYAS GRWEDV+
Sbjct: 551 PMKPHAGVYGSLLNASRIHKRVGLGELAASKLFELEPQNLGNYVLLSNIYASFGRWEDVK 610

Query: 447 QVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVT 506
           +VRE M+KGG+KKSVGMSWVEYKGQ+H FTVGDRSHE SKDIY+LLAELERKMKRVGFV 
Sbjct: 611 RVREMMKKGGLKKSVGMSWVEYKGQVHKFTVGDRSHEQSKDIYKLLAELERKMKRVGFVA 670

Query: 507 DKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMIS 566
           DKSCALRDVEEEEKEEMLGTHSEKLAICFALL+SEVGT IRVVKNLRIC+DCHTAIKMIS
Sbjct: 671 DKSCALRDVEEEEKEEMLGTHSEKLAICFALLISEVGTTIRVVKNLRICLDCHTAIKMIS 730

Query: 567 KLEEREIIVRDNNR 581
           KLE REI+VRDNNR
Sbjct: 731 KLEGREIVVRDNNR 744

BLAST of Cp4.1LG02g07200 vs. NCBI nr
Match: gi|659109350|ref|XP_008454670.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Cucumis melo])

HSP 1 Score: 989.2 bits (2556), Expect = 4.2e-285
Identity = 479/554 (86.46%), Positives = 516/554 (93.14%), Query Frame = 1

Query: 27  NTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRML 86
           N I+WT+MVTGYAK+ DLESARRYFDEMPE+SVVSWNAM SAYAQ EC +EALKLFH+ML
Sbjct: 191 NIITWTSMVTGYAKVGDLESARRYFDEMPERSVVSWNAMQSAYAQKECPKEALKLFHQML 250

Query: 87  KEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLE 146
           KEGITPDDTTW   ISSCSSIG+P LADS+L  INQKH++LN++V+TALLDMHAKFGNLE
Sbjct: 251 KEGITPDDTTWAVTISSCSSIGDPTLADSILRMINQKHIVLNSFVQTALLDMHAKFGNLE 310

Query: 147 IARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNG 206
           IAR IFDELG QRN V WNVMISAYTR GKLSLARELFDNMPKRDVVSWNSMIAGYAQNG
Sbjct: 311 IARNIFDELGSQRNDVAWNVMISAYTRVGKLSLARELFDNMPKRDVVSWNSMIAGYAQNG 370

Query: 207 ESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGF 266
           E+AMSI LF+EMI C DIQPDEVTIASVLSACGHIGALK  YWVL+IV+EKNIK GISGF
Sbjct: 371 EAAMSIELFKEMISCADIQPDEVTIASVLSACGHIGALKLGYWVLDIVREKNIKLGISGF 430

Query: 267 NSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIE 326
           NSLIFMYSKCG+V DAHRIFQ M T+DVV+FNTLISGFAANGHGK+AIKL+LTMEEEGIE
Sbjct: 431 NSLIFMYSKCGSVADAHRIFQTMETRDVVSFNTLISGFAANGHGKEAIKLVLTMEEEGIE 490

Query: 327 PDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIESM 386
           PDHVTYIGVLTACSHAG+L EGKN+FKSIKAPTVDHYACMVDLLGRAGELDEAKMLI+SM
Sbjct: 491 PDHVTYIGVLTACSHAGLLNEGKNVFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIQSM 550

Query: 387 PMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVR 446
           PMKPH GVYGSLLN SRIHKRV LGELAA+KL ELEPQNPGNY+LLSNIYAS+GRWEDV+
Sbjct: 551 PMKPHGGVYGSLLNASRIHKRVGLGELAASKLFELEPQNPGNYVLLSNIYASSGRWEDVK 610

Query: 447 QVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVT 506
           +VRE MRK G++K VGMSWVEYKGQ+H F VGDRSHE SKDIY+LLAELERKMKRVGFV 
Sbjct: 611 RVREMMRKRGLQKLVGMSWVEYKGQVHKFIVGDRSHEQSKDIYKLLAELERKMKRVGFVA 670

Query: 507 DKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMIS 566
           DKSCALRDVEEEEKEEMLGTHSEKLAICFALL+SEVGTPIRVVKNLRIC+DCHTAIKMIS
Sbjct: 671 DKSCALRDVEEEEKEEMLGTHSEKLAICFALLISEVGTPIRVVKNLRICLDCHTAIKMIS 730

Query: 567 KLEEREIIVRDNNR 581
           KLE REI+VRDNNR
Sbjct: 731 KLEGREIVVRDNNR 744

BLAST of Cp4.1LG02g07200 vs. NCBI nr
Match: gi|147856457|emb|CAN80769.1| (hypothetical protein VITISV_013866 [Vitis vinifera])

HSP 1 Score: 807.7 bits (2085), Expect = 1.7e-230
Identity = 386/554 (69.68%), Positives = 461/554 (83.21%), Query Frame = 1

Query: 27  NTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRML 86
           N I+WTAMVTGYAK++DLE+ARRYFD MPE+SVVSWNAMLS YAQN  AEE L+LF  M+
Sbjct: 193 NVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEEVLRLFDEMV 252

Query: 87  KEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLE 146
             GI PD+TTWV  IS+CSS G+P LA SL+  ++QK + LN +V+TALLDM+AK G++ 
Sbjct: 253 NAGIEPDETTWVTVISACSSRGDPCLAASLVRTLHQKQIQLNCFVRTALLDMYAKCGSIG 312

Query: 147 IARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNG 206
            ARRIFDELG  RN+VTWN MISAYTR G L  ARELF+ MP R+VV+WNSMIAGYAQNG
Sbjct: 313 AARRIFDELGAYRNSVTWNAMISAYTRVGNLDSARELFNTMPGRNVVTWNSMIAGYAQNG 372

Query: 207 ESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGF 266
           +SAM+I LF+EMI    + PDEVT+ SV+SACGH+GAL+   WV+  + E  IK  ISG 
Sbjct: 373 QSAMAIELFKEMITAKKLTPDEVTMVSVISACGHLGALELGNWVVRFLTENQIKLSISGH 432

Query: 267 NSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIE 326
           N++IFMYS+CG++ DA R+FQ M T+DVV++NTLISGFAA+GHG +AI L+ TM+E GIE
Sbjct: 433 NAMIFMYSRCGSMEDAKRVFQEMATRDVVSYNTLISGFAAHGHGVEAINLMSTMKEGGIE 492

Query: 327 PDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIESM 386
           PD VT+IGVLTACSHAG+L+EG+ +F+SIK P +DHYACMVDLLGR GEL++AK  +E M
Sbjct: 493 PDRVTFIGVLTACSHAGLLEEGRKVFESIKDPAIDHYACMVDLLGRVGELEDAKRTMERM 552

Query: 387 PMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVR 446
           PM+PHAGVYGSLLN SRIHK+VELGELAANKL ELEP N GN+ILLSNIYASAGRW+DV 
Sbjct: 553 PMEPHAGVYGSLLNASRIHKQVELGELAANKLFELEPDNSGNFILLSNIYASAGRWKDVE 612

Query: 447 QVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVT 506
           ++RE M+KGGVKK+ G SWVEY G++H F V DRSHE S DIY+LL EL +KM+  G++ 
Sbjct: 613 RIREAMKKGGVKKTTGWSWVEYGGKLHKFIVADRSHERSDDIYQLLIELRKKMREAGYIA 672

Query: 507 DKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMIS 566
           DKSC LRDVEEEEKEE++GTHSEKLAIC+ALLVSE G  IRVVKNLR+C DCHTAIKMIS
Sbjct: 673 DKSCVLRDVEEEEKEEIVGTHSEKLAICYALLVSEAGAVIRVVKNLRVCWDCHTAIKMIS 732

Query: 567 KLEEREIIVRDNNR 581
           KLE R IIVRDNNR
Sbjct: 733 KLEGRVIIVRDNNR 746

BLAST of Cp4.1LG02g07200 vs. NCBI nr
Match: gi|731411247|ref|XP_010657905.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Vitis vinifera])

HSP 1 Score: 805.1 bits (2078), Expect = 1.1e-229
Identity = 385/554 (69.49%), Positives = 461/554 (83.21%), Query Frame = 1

Query: 27  NTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRML 86
           N I+WTAMVTGYAK++DLE+ARRYFD MPE+SVVSWNAMLS YAQN  AEEAL+LF  M+
Sbjct: 193 NVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEEALRLFDEMV 252

Query: 87  KEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLE 146
             GI PD+TTWV  IS+CSS G+P LA SL+  ++QK + LN +V+TALLDM+AK G++ 
Sbjct: 253 NAGIEPDETTWVTVISACSSRGDPCLAASLVRTLHQKRIQLNCFVRTALLDMYAKCGSIG 312

Query: 147 IARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNG 206
            ARRIFDELG  RN+VTWN MISAY R G L  AR+LF+ MP R+VV+WNSMIAGYAQNG
Sbjct: 313 AARRIFDELGAYRNSVTWNAMISAYMRVGDLDSARKLFNTMPGRNVVTWNSMIAGYAQNG 372

Query: 207 ESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGF 266
           +SAM+I LF+EMI    + PDEVT+ SV+SACGH+GAL+   WV+  + E  IK  ISG 
Sbjct: 373 QSAMAIELFKEMITAKKLTPDEVTMVSVISACGHLGALELGNWVVRFLTENQIKLSISGH 432

Query: 267 NSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIE 326
           N++IFMYS+CG++ DA R+FQ M T+DVV++NTLISGFAA+GHG +AI L+ TM+E GIE
Sbjct: 433 NAMIFMYSRCGSMEDAKRVFQEMATRDVVSYNTLISGFAAHGHGVEAINLMSTMKEGGIE 492

Query: 327 PDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIESM 386
           PD VT+IGVLTACSHAG+L+EG+ +F+SIK P +DHYACMVDLLGR GEL++AK  +E M
Sbjct: 493 PDRVTFIGVLTACSHAGLLEEGRKVFESIKDPAIDHYACMVDLLGRVGELEDAKRTMERM 552

Query: 387 PMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVR 446
           PM+PHAGVYGSLLN SRIHK+VELGELAANKL ELEP N GN+ILLSNIYASAGRW+DV 
Sbjct: 553 PMEPHAGVYGSLLNASRIHKQVELGELAANKLFELEPDNSGNFILLSNIYASAGRWKDVE 612

Query: 447 QVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVT 506
           ++RE M+KGGVKK+ G SWVEY G++H F V DRSHE S DIY+LL EL +KM+  G++ 
Sbjct: 613 RIREAMKKGGVKKTTGWSWVEYGGKLHKFIVADRSHERSDDIYQLLIELRKKMREAGYIA 672

Query: 507 DKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMIS 566
           DKSC LRDVEEEEKEE++GTHSEKLAIC+ALLVSE G  IRVVKNLR+C DCHTAIKMIS
Sbjct: 673 DKSCVLRDVEEEEKEEIVGTHSEKLAICYALLVSEAGAVIRVVKNLRVCWDCHTAIKMIS 732

Query: 567 KLEEREIIVRDNNR 581
           KLE R IIVRDNNR
Sbjct: 733 KLEGRVIIVRDNNR 746

BLAST of Cp4.1LG02g07200 vs. NCBI nr
Match: gi|802640482|ref|XP_012078834.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Jatropha curcas])

HSP 1 Score: 796.6 bits (2056), Expect = 4.0e-227
Identity = 379/554 (68.41%), Positives = 461/554 (83.21%), Query Frame = 1

Query: 27  NTISWTAMVTGYAKMRDLESARRYFDEMPEKSVVSWNAMLSAYAQNECAEEALKLFHRML 86
           N ++WTAMVTG+AK++DLE AR+YFD MP +SVVSWNAMLS YAQN  AEEALKLF  M+
Sbjct: 192 NVVTWTAMVTGFAKIKDLEKARKYFDYMPMRSVVSWNAMLSGYAQNGFAEEALKLFGDMV 251

Query: 87  KEGITPDDTTWVAAISSCSSIGNPNLADSLLAKINQKHVILNNYVKTALLDMHAKFGNLE 146
             G+ P++TTW   +S CS  G+P +A+S++  ++ K + +N +VKTALLDM+AK GNLE
Sbjct: 252 NSGVQPNETTWATVVSLCSLFGDPCVAESIVKMLDGKRIKMNCFVKTALLDMNAKCGNLE 311

Query: 147 IARRIFDELGGQRNAVTWNVMISAYTRAGKLSLARELFDNMPKRDVVSWNSMIAGYAQNG 206
            AR IF+ELG  RN+VTWN MISAYT+ G L  AR+ FD MP+RDVVSWN+MI+GYAQNG
Sbjct: 312 AARNIFNELGVHRNSVTWNTMISAYTKVGDLDSARDHFDRMPERDVVSWNTMISGYAQNG 371

Query: 207 ESAMSIHLFREMIDCTDIQPDEVTIASVLSACGHIGALKFSYWVLNIVQEKNIKFGISGF 266
           +SA +I +F+EMI   D+QPDEVT+ASV+SACGH+GAL+   WV+N + E  I   I G+
Sbjct: 372 QSAKAIEIFKEMISSKDLQPDEVTMASVISACGHLGALELGTWVVNHITEYKINLSILGY 431

Query: 267 NSLIFMYSKCGNVVDAHRIFQNMGTKDVVTFNTLISGFAANGHGKDAIKLLLTMEEEGIE 326
           NSLIFMYSKCGN+ +AHRIFQ M T+DVV++NTLI+GFAA+G G +AIKLL TM+EEGI 
Sbjct: 432 NSLIFMYSKCGNMKEAHRIFQEMETRDVVSYNTLIAGFAAHGKGIEAIKLLSTMKEEGIH 491

Query: 327 PDHVTYIGVLTACSHAGMLKEGKNIFKSIKAPTVDHYACMVDLLGRAGELDEAKMLIESM 386
           PD VTYIGVLTACSHAG+++EG  +F+SI++P VDHYACMVDLLGR G+LDEAK LI++M
Sbjct: 492 PDRVTYIGVLTACSHAGLMEEGHKVFESIESPDVDHYACMVDLLGRVGKLDEAKKLIDNM 551

Query: 387 PMKPHAGVYGSLLNGSRIHKRVELGELAANKLLELEPQNPGNYILLSNIYASAGRWEDVR 446
           PM+PHAGVYGSLL+ S+IHKRV+ GELAA  L +LEPQN GNY+LLSNIYASAGRWE+V 
Sbjct: 552 PMEPHAGVYGSLLHASQIHKRVDFGELAAKMLFQLEPQNSGNYVLLSNIYASAGRWEEVN 611

Query: 447 QVREKMRKGGVKKSVGMSWVEYKGQIHNFTVGDRSHEWSKDIYRLLAELERKMKRVGFVT 506
           +VRE M KG VKK+ G SWVEY+G++H F VGDRSHE S DIYRLLAEL  KM+R G+  
Sbjct: 612 RVREMMSKGEVKKTAGWSWVEYQGKVHKFMVGDRSHERSDDIYRLLAELASKMRRHGYTA 671

Query: 507 DKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICMDCHTAIKMIS 566
           D+SC LRDVEEEEKE M+GTHSEKLAICFALLVS+ G  IRVVKNLR+C+DCHTAIK+IS
Sbjct: 672 DRSCVLRDVEEEEKEHMVGTHSEKLAICFALLVSKSGAAIRVVKNLRVCLDCHTAIKLIS 731

Query: 567 KLEEREIIVRDNNR 581
           +LE REIIVRDNNR
Sbjct: 732 QLEGREIIVRDNNR 745
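
Each HSP summary line in the reports above gives identity and positive counts as a fraction of the alignment length, followed by a rounded percentage. The following is a minimal sketch, not part of the database output, of how such a line could be parsed and the percentages recomputed from the counts. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical, and the pattern assumes the exact line layout used on this page.

```python
import re

# Pattern for the HSP summary line as laid out on this page (an assumption).
# "Posi?tives" accepts both "Positives" and the "Postives" spelling that
# some report generators emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \((\d+\.\d+)%\), "
    r"Posi?tives = (\d+)/(\d+) \((\d+\.\d+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and reported/recomputed percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        # Recompute from the raw counts as a consistency check.
        "computed_identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "computed_positive_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    example = "Identity = 379/554 (68.41%), Positives = 461/554 (83.21%), Query Frame = 1"
    print(parse_hsp_stats(example))
```

Comparing the reported and recomputed values is a quick way to catch lines damaged during copy and paste, since both percentages should agree with the counts to two decimal places.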

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP301_ARATH8.8e-12640.93Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana GN... [more]
PP175_ARATH4.1e-12338.81Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
PP249_ARATH9.4e-12039.69Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN... [more]
PP168_ARATH1.6e-11937.92Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana GN... [more]
PP316_ARATH3.6e-11940.66Pentatricopeptide repeat-containing protein At4g16835, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A5B4C7_VITVI1.2e-23069.68Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_013866 PE=4 SV=1[more]
F6HJZ0_VITVI7.8e-23069.49Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0035g01970 PE=4 SV=... [more]
A0A067KKU3_JATCU2.8e-22768.41Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13364 PE=4 SV=1[more]
A0A061GB75_THECC2.4e-22367.63Pentatricopeptide repeat superfamily protein isoform 2 OS=Theobroma cacao GN=TCM... [more]
A0A061G9Y9_THECC2.4e-22364.86Pentatricopeptide repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM... [more]
Match Name   E-value   Identity  Description
AT4G02750.1  4.9e-127  40.93     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1  2.3e-124  38.81     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.1  5.3e-121  39.69     Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatrico...
AT2G22070.1  9.0e-121  37.92     pentatricopeptide (PPR) repeat-containing protein
AT4G16835.1  2.0e-120  40.66     Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                        E-value   Identity  Description
gi|449460189|ref|XP_004147828.1|  1.2e-287  87.36     PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Cucumis sativu...
gi|659109350|ref|XP_008454670.1|  4.2e-285  86.46     PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Cucumis melo]
gi|147856457|emb|CAN80769.1|      1.7e-230  69.68     hypothetical protein VITISV_013866 [Vitis vinifera]
gi|731411247|ref|XP_010657905.1|  1.1e-229  69.49     PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Vitis vinifera...
gi|802640482|ref|XP_012078834.1|  4.0e-227  68.41     PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Jatropha curca...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO:0008270  zinc ion binding
Vocabulary: INTERPRO
Term       Definition
IPR019164  TMEM147
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0005739 mitochondrion
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g07200.1  Cp4.1LG02g07200.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                Source    Source Term       Source Description  Alignment
IPR002885  Pentatricopeptide repeat                       PFAM      PF01535           PPR
    coord: 363..387  score: 0.046
    coord: 432..457  score:
IPR002885  Pentatricopeptide repeat                       PFAM      PF12854           PPR_1
    coord: 156..187  score: 1.
IPR002885  Pentatricopeptide repeat                       PFAM      PF13041           PPR_2
    coord: 292..340  score: 4.3E-13
    coord: 57..104   score: 1.1E-11
    coord: 191..238  score: 4.
IPR002885  Pentatricopeptide repeat                       TIGRFAMs  TIGR00756         TIGR00756
    coord: 193..228  score: 7.2E-6
    coord: 29..60    score: 5.0E-6
    coord: 60..93    score: 1.6E-9
    coord: 162..193  score: 3.1E-7
    coord: 295..328  score: 1.
IPR002885  Pentatricopeptide repeat                       PROFILE   PS51375           PPR
    coord: 93..127   score: 6.884
    coord: 262..292  score: 7.552
    coord: 27..57    score: 10.128
    coord: 328..358  score: 7.059
    coord: 425..459  score: 8.977
    coord: 58..92    score: 13.011
    coord: 195..221  score: 6.303
    coord: 359..389  score: 7.618
    coord: 160..194  score: 12.2
    coord: 293..327  score: 12.342
    coord: 128..158  score: 6.358
    coord: 227..261  score: 6
IPR011990  Tetratricopeptide-like helical domain          GENE3D    G3DSA:1.25.40.10
    coord: 264..289  score: 4.5E-10
    coord: 163..225  score: 4.5E-10
    coord: 419..450  score: 4.5E-10
    coord: 23..117   score: 4.5
IPR011990  Tetratricopeptide-like helical domain          unknown   SSF48452          TPR-like
    coord: 336..451  score: 1.9E-6
    coord: 135..161  score: 1.9E-6
    coord: 42..100   score: 1.
IPR019164  Protein of unknown function DUF2053, membrane  PFAM      PF09767           DUF2053
    coord: 581..715  score: 1.2E-42
    coord: 2..30     score: 4.
None       No IPR available                               PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 25..466   score: 1.5E
None       No IPR available                               PANTHER   PTHR24015:SF911   SUBFAMILY NOT NAMED
    coord: 25..466   score: 1.5E

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG02g07200  Cp4.1LG06g01330  Cucurbita pepo (Zucchini)  cpecpeB459