CmUC02G032040 (gene) Watermelon (USVL531) v1

Overview
NameCmUC02G032040
Typegene
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCmU531Chr02: 6058959 .. 6066893 (+)
RNA-Seq ExpressionCmUC02G032040
SyntenyCmUC02G032040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTTTTTTTTTTTGTTTTTTTTTGTTCTTGAACATTGTTACCTTTTTGGCCTTGTTGCTTTTGTCCATATTTTATGTTGATTATATCTTTCGGTAAAAAATCACTTTATATAAGCATGTTTTGGAGCAATTTTAAACCGTGATTTTAACAATTTCAAAATCACTCCAAAACATACAATATATTTCCTAAATCAATAATATTTTTTTTTTTGTCAAAAACTTAAATTTGTTCAAGATCTATTTGACACAAAATTGAAAGTTTCGAAACTAAATAGACACAAAATACATTAAATATTATTGACAATTTTGACAATATTAGATACTTTTGGTCATCTATTAAAAAAAAATATTGAATTCTTGTTTGATAATCATTTAAGCCCTGTTTGATAACCATTTGATTTTTAAAATTTGTGCTTGTTTCACACCAAATTTTAGAATGATTTTTTTCCTTCCTAAACAAACATGTGAATTCTTAATCAAATTTAAAAAACAAAGACAAGTTTTGAAAAAGTACTTTTTTTTAGTTTTCCAAATTAAGTTTGAATTTTTAGAACACTTATAAAAAAAAGTAGATTACATAACAAATAAACTCTAAGATACTAGTAGAGTTTATAAACTTAACCTTTTAAAATAAAAACTAAAAACTTGGTTATCAAATGTGGTCTTACTTTTTTTACTTTGGATTTTTAAAATTAAGCTTATAAATGCTACTTTTACCTATCGATTTCCTTATTCGAAATTATAAAAAAAAAATACTTTTAAAATATTCTTTTTATTTTTGAAATTTTACTAAAAGTTTCCAAATGTTTAGTTAGAAAAACAAAAACTATAATAAAGAGATCGTAACCATATAAACATAATTTTCAAAAACCAAAAACTAAAAACGAAATATTTATTAAATTGTGTCTTTATTTTTCCATCCAATCAAGCCCATGGCCTTCCAAATAAGTTCAGAAGGCCTTTGGTAGTAAATGGGTTTCAAGCCCAGGTCCAGGCCCAATCCACTTCTTCGTCTTCACTGTTCACTTGTTAGTGTCATGTAGCTGAGCCTCGGCCTTTACCAGCCCCGTTTGAATTCAAATCAAACTACGGAGCTCTCTTTATTGGCGCTCGAAATTCTCTCTCAAGAGCCCATCACTGGAAAATCGAAGGGAACGAGCTTCAGTTCAGCGACGGAATCCTACAACAGTCAGATTCTTGCTTCACTTTCTTGCTCTTTTTGCTCTCTTTGCCGATTGAGGGATTGACTGTTTGGTTTCCAGGAAAATGGTGGAATCGAACCTGACTTACGAGGAATGCCGACGCCAGAGGTTGGAAGAGAATAAGAAGAGGATGGAAGAGCTTAATCTGAACAAACTTGCCGATGCTCTGAAATTTTCCAGCCCTAAATCCTCTCCGGTAGGTTTCCGTTGTTTAAAATACTTCATAATCCAATGTCTTCCACCATGATAATGGATTTAAAGCTTGATCTCTCTCTCCGCTTGGGTTATGGAAGACCAAACAGCTAAAGCGTCCTCGTCAGCCACTCGATAAGTCGTCTTTCAGTGTGAGAAGGTCTAGCCGTTTTGCCGATAAGCCCCCTCCGAACTATAAGGAGGTAGGATTTGAACTATTGGTTTTCTTGGAAATGTTCAACAGGACGGAAAGAAAAATGTTGCAGTGTTTCAACAATGGCCATTTGAATGTTATTTACTTTGTTCTTACTGAAATGACTCGTTTCTTTGAGCTTTTACAATGCAGACATGTGTAGTTTAATTGTTGGCCAGTTTCTAATTGAAGCATTTAGATGAACTATTTTGATGTTAGTGGATTGTCGATTTTCTTAGCTTAATGTTCATATCATATCGGTATTTGCAGGTGCCCATTGAACCACTTCCAGGTATAAGAAGGTATGCAAGAAAATTCTGTTTCGAATTACTTTGACTTTAGACTATTTGGTGTTTTTTCACTTAAACAAAGCGTCGATAATTGGACTAACAGGACTTATCAAAGGAGAGATTTGCTGAATCGGATTTATGCTTCACAAGAAGAAAGGCAATATGCTATTGACAGAGCAAGAGACCTTCAATCTAGCCTGGAATCTAGGTACCCCAGTTTTGTGAAGCCCATGCTTCAATCACATGTCACAGGGGGATTTTGGCTGGTCAGTTTATGATATCATTTTACTGAACATTTTCTGCTTTGATTGGTTATATAAATCTTACAATTGAATAGTATATTTGACGGTTAGTGTATGTACTGATGTTTTCCCTATACATATCTTATAGGGTCTACCAGTTCACTTTTGCAAGACACACCTTCCCCTTGAGGATGAAATGCTAACTCTGGTTGACGAGGATGATAATGAGTTCCAAACAAAATACCTTGCCGATAAAACAGGTCTCAGTGGTGGTTGGAGAGGGTTTTCCATTGATCATCAGTTAGTAGATGGGGATGCTTTGGTGTTTCAGTTAACTAAGCCAACTGAATTCAAGGTGATTACATGGAAAGAAAAAAAAAAAAAGTCCATTGCCTGATTGAATGATTCTGCATTGGCAAAGCATTTTTTTCATTTGATTCCAGTCTAATGGGTCACACAATTAGTGTGGCACATTTAGTTTGAGGTTTAGGGTACGTGCACACATATATAATAGGATTTGGACTTCCAACCTCAAGGGAAGGAACATATGCCAACTAGAATGAGCTATGCTTGTGTTGGAAAGACACGTTTATATTGTATCTTAGAATTCATATCCAAGAAGTTGAACGACTTGAACAGCAGTAGCTTCTTTGAAACATAAAGGAGACTGATTTAAAGCTTTTTGTTGCTACGAGCATCACCTAATGTGGGGACTGGGGAGGCAGGACTGGGATCTTCAAGACGGTTCATAAAGATTCTTGAGCATTTCTATATTTCTAGTTCAACTCACTTTTTCCTTCCTCAGGAAAGCATGATGCAATTGACTTAGAACTATTGATACTAGATCGACATAAAACTTCCCCCCTCTTAATTTCACTAATCTTGGAAGTTGGAATCTCACAGAAATGACAAAATTTTTATTCCCAACTTGTAAACATCCCTGCTCAAATGCTGCCTGAAAATATCTTTACTAATGTCCTTTGCCTTGTATTCATAGGTATATATCATCAGAGCATACAATTTAGAAGACAGAGAAGATACCAATGAGGATTCTGATGTCACCCAATTGGAAAAAAGTAGCAAAAGAAATACCAAATCATCAGGGCATAAATCCAGGGCAAATAATTCTGAGGATAAAGGAGATAATGGTGAGGATTCAGCAGATGTCTCTCAGTTGGAAAAAAGTGGCAAAAAAACTACTAAATCATCAGGGCGTAAAAGCAGGGCAAATAAATCCAAAGATAAAGCAGATAACGGCGCGGATTTGGATGTCCCCGAGCTGGAAAAAAGTGGCAAAAGAATTACTAGATCATCAAGTAAAGGTTTCATTTTTCATATAGGCCTTCCATTATTACTATTGTCACAATAATCTCTCACATGCTTTGAATTGGTAATGGATTTTTGAAGAGAGACTCAAAACAATTACTCAATTCGAATTTATAATTAGCTTCTACTTTTCATTTCTCATATTCACTTGTGTTTTCTTTTTTTTATGCCAGGGCAAAAGTAACTCAAGGTTTTCCTTTGGCACAAGGATTATTATAAGTATCCCTATCCAAGGCTTTGTGAGGAAAAAAAAAAAAATTGGGAGCTTTCAGGAATTTTGTCAGTCTTAGCTCTCTCTGGACTGTTGTAATTGTAAGCTGTAACATCAAAGATATGTTAATGAAATGACCCCCTTGTTCTGATGATCTGGAGCAACATCTCTCTCTCTCTCTTGAGTTGTGGATGGCCAAGAGTTTCTGGAAATCATGCTAAAAGTTTGCAGGTTAGCCAAACAAAAAGGTATTTCTTTTCACCTCTCTTGAGCGTTCATTATAAATGTCTATTATGGAGAGTAAAATATGGCTTAATCAATCTTTGGCCAGCTATTTGAGTTAAGATATATACCACTGACGTAGACTTAGAGATTAGAGATTTAGATCCTTACACCTAAGATTTAAAGTCCACCTAAATGTTGAGGTCTCAATTTTATTGAAATGTCAATGGATATTTCTAGAAAAATCATGAAAAAAAATTCAATTTTTTTAAAAAAATTAGTAAATAAACATTTTATGGATTTTAAACAAGTTAAGGGGGTATTTGGAGCGTTGAGTTGAGTTATTAAGTCTGGAGTTGATATGTTAGAGTTAAAAAGTTTTGTGTTTGGGGTGTAAAGTAACATGAAGTTCCTTAATAAATGTGCAAAATAGAGAAAGATGGAATGATAGAATTTCTAGATAAAGGTGTAAACTTCAATTGTAAACACTATTGAAATTGAGTTATTTAACACCAACTTATGAAATTGGCGAGCCAAACACACATTAGTGAAACCTCGATTCGCTTCTCATATTGATGAATCTATGGAAATGTGGAAATATAAGTAGAAATATCGAGATGTTGATGGAAATTTAATACTATACTTATACCTAGTGGTTGAACACAAGGAGAAAAAAAAAACATCTCTCTACACTGATGTCACTAGGTAGTAGGGTGCCATATGAGAGATTTGGGGTAAACTTACTAAGTAGTGTGTCATTTAAAGCCATGATTTTAAATCATCAGTTTAACTTAAACTTGAACCGAAAGAGAGAAAATAACAATAATAATATTCAACTAGTATGCAACATGTCTAAGTGTGTATAGCTTAGTTGTAATTAATATGCATTTCTTGTTTTGAAGTTTGGAAGTTTAAATCTCTATATCTCTATTATTACTTAAAATAATTAGTTTGGAACATAACACAAACCTTCCAAAATATTGAAATTTATTAAGAGTGTATAGATATAAGCTTTATTTATTTATTTTTTTACAATATATACGATATATTTACATAATGATATGTTCTCAATATTTTGGATAAATACCAATATAACATAAAGATATAACAATAATATAAATATGAACATTGGTTTGGAATACCTTCTACTCTAAGCACTTCTTAAAAAAATATTAAATAAAAATAAATTTATAAAACCGGCCTTAAAAAAATATTTTTTTCCACTATGTTCAGTAAGTGGTTCTATTGAGTAAAAATACTTTTTTTTATTACATGTTTGGAAGATTTTATATATGAAAATCTTTTTTTTTTAAATTAATTGATCTTTGTCTTATAGTCCCCATTGTACCCTAAACTCCTCACTGTATTTTTATTTTTAATGGAAGAAGACTCTTTGCCACCGTCAAAACGCTCTAAAAGTGTTAATTATCTATAATGCCTCATTGTAATTTTAGTTATGAAAAATTAAAGTCATTCGTACAATAGAGTTCTTATACATTAGGGCTTTTAATACCTAACTCTCTTGCACTTGCATTCAATTAATCCAATATAGTAATATTCTCACACGCTTTGTGCTCTTGCTTCTCGCTTTTTCACTTTTACTACTTCATCACTCTTGCATGTATTGTGTTTGTGTCTGTGCTCCTCGTGCTTGTGCTTACTGCTTTGAACTTTAATTAATGGTGCACAATAGGTAATATGGACAATTAACATGCTATTACGACGAATGTGAGTGATTCTAAAATAGTTAAAATCAATTTTGTCATTTTCAAAATATTGCAAAAATGCTTTTAATCATTTAAAATCAATTTTAATACTATTGAATTTGATTTTGAGTGATTAAAGATATATTTTAGAGTGATTTTAAATTTAATAAAAATAATTTTAACAATTTTAAAATATTTATCAAACATACGATAGGGTTATCATTCGTTATTCTTATAAACTTGGGATCATAAATTAGTGGTGCGCAAAGGTAGTATGGATGGATATTTATTTCTTATTTCTTATAAAATATTAAACGTATTTATTTAAAATTGGGCTGCTTTAGAAAATGGGCCGAAGTCTAAGCCCACATCAAGCCCAATGTAAGTTTGTGAAAAATAATTTGTCTAATTTTTCGTCTTCCGCTCTCAAGCTTTCAGTAACTGAGGTTTCTCTGAACCTTTCCTTAGGGTTTAGGCATTTTCCGATCAGAAATGGCGTCTCTCATGGCGGTTCGGCGTGCTCGAACCCCTATTCTTATCTCCTCCTTCTTCAAGGTACGGTCTCCTCTCCCTTCTCGTTTCACCTTCACTTGTGGAAATCAGACAGATACCCTAATCAAAGCCCTAAGCACCTCGGCAATCCGTAACGATTTCTCAAATTTTCCTCCTCCGCCGCAACAACCTTCTTCGTCTGACCCTCGACATCGTCAAGCCCAGTGGGGCTCGCCGAGCCAGGTTCATCCTCCGAGTGGAAATTTTAATAATCAGTCGTTCTCGGAGTTTCAGAATCGCGATTATGTTCAACAGGGAAGCCCCAGTAATCAATTGAAGTATCGGAGTCAGAATCAGAGCCCTCAACCCAATCCTGGATTTTCCCGGCAGGGTCAGAGCTATAGTCAACCCGGTAACCCTAATTCGTGGAATCCTCCAAATCAAAGCTACCCGCAGTATCAAAATCCTTCGCAGGCGAACCCTCAAAATTTCAATTATCAGCAACAAAGAGGCCCTAACCAATGGAACAATCAAAAACAGGGATATCCACAATTTGGAAGGCCTGAACAGCGTAACCCACAAGTAGAGAATTCTAATCAGTTGAATAATCAGGCTGGGATTCAAAGGCAAGGTGCTCAAAATCAAGCACTAAATGCCCTTGTATCTCCTATTGACGAACTGCGGCGCCTTTGTGGAGAGGGGAAGATGAAAGAAGCTGTTGAATTATTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATTTGTTGTTTGAACTATGTGGGAAATCCAAGTCATTTGACAATGCTAAAGTAGTTCATGATTACTTTTTACAGTCAACTTGTAGAAGTGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGGGAGATGTGGAAGCATGAGTGATGCACGGAGAGTGTTCGACTATATGCCTGATAGAAATATTGATTCTTGGCATTTTATGATGAAAGGATATGCTGATAATGGATTGGGTGATGAGGGTCTGGAGTTATTTGAGAATATGAAGCAGCTAGGGTTGCAACCCGATTCACAAACTTTCCTTTTTGTTATGTCAGCTTGTGCTAGTGCGAATGCTGTGGAAGAAGGATTTCTGTACTTTGAGTCAATGAAAAATGATTATCATATTACCCCAGACATGGATCATTATTTGGGGCTTTTAGGTATTCTTGGAGAACCAGGACACATTAATGAGGCTTTCGAGTATGTTAAGAAACTGCCAATGGAGCCCACAGTTGAGGTATGGGAGACTTTAAAGAACTATGCTAGAATTCATGGAGATGTTGATCTTGAGGACTATGCAGAGGAGCTAATTGTTGATCTGGACCCGACAAAAGCTGTTTCTAATAAGATATCGACACCACCTCCCAAAAAACGGTCTGCAATTAGCATGCTTGATGGGAAGAACAGGATTAGTGAATTCAGAAATCCAACTCTCTACAAAGATGATGAAAAATTGAAGGCTTTGAAGGCAATGAAAGAACAAGGGTATGTGCCAGATACTAGATATGTTCTTCACGATATCGATCAAGAGGCCAAAGAGCAGGCATTGCTGTATCACAGTGAACGATTGGCAATCGCATATGGATTGATCAGTACTCCGGCACGAACGCCTCTTAGGATCATTAAGAACCTAAGGATCTGCGGTGATTGTCATAATGCCATCAAAATCATGTCTAGAATTGTTGGGAGAGAGTTAATTGTAAGGGACAACAAACGGTTCCATCATTTTAAGGATGGTAAATGTTCTTGTGGGGATTACTGGTAAACCATTCATTCTACAACACCCCCACAACATGTACCTGATGAGTTTATCTTCTCAATAGCAGCTTGA

mRNA sequence

ATGTGTCATGTAGCTGAGCCTCGGCCTTTACCAGCCCCGTTTGAATTCAAATCAAACTACGGAGCTCTCTTTATTGGCGCTCGAAATTCTCTCTCAAGAGCCCATCACTGGAAAATCGAAGGGAACGAGCTTCAGTTCAGCGACGGAATCCTACAACAGAAAATGGTGGAATCGAACCTGACTTACGAGGAATGCCGACGCCAGAGGTTGGAAGAGAATAAGAAGAGGATGGAAGAGCTTAATCTGAACAAACTTGCCGATGCTCTGAAATTTTCCAGCCCTAAATCCTCTCCGACCAAACAGCTAAAGCGTCCTCGTCAGCCACTCGATAAGTCGTCTTTCAGTGTGAGAAGGTCTAGCCGTTTTGCCGATAAGCCCCCTCCGAACTATAAGGAGGTGCCCATTGAACCACTTCCAGGTATAAGAAGGACTTATCAAAGGAGAGATTTGCTGAATCGGATTTATGCTTCACAAGAAGAAAGGCAATATGCTATTGACAGAGCAAGAGACCTTCAATCTAGCCTGGAATCTAGGTACCCCAGTTTTGTGAAGCCCATGCTTCAATCACATGTCACAGGGGGATTTTGGCTGGGTCTACCAGTTCACTTTTGCAAGACACACCTTCCCCTTGAGGATGAAATGCTAACTCTGGTTGACGAGGATGATAATGAGTTCCAAACAAAATACCTTGCCGATAAAACAGGTCTCAGTGGTGGTTGGAGAGGGTTTTCCATTGATCATCAGTTAGTAGATGGGGATGCTTTGGTGTTTCAGTTAACTAAGCCAACTGAATTCAAGGTATATATCATCAGAGCATACAATTTAGAAGACAGAGAAGATACCAATGAGGATTCTGATGTCACCCAATTGGAAAAAAGTAGCAAAAGAAATACCAAATCATCAGGGCATAAATCCAGGGCAAATAATTCTGAGGATAAAGGAGATAATGGTGAGGATTCAGCAGATGTCTCTCAGTTGGAAAAAAGTGGCAAAAAAACTACTAAATCATCAGGGCGTAAAAGCAGGGCAAATAAATCCAAAGATAAAGCAGATAACGGCGCGGATTTGGATGTCCCCGAGCTGGAAAAAAGTGGCAAAAGAATTACTAGATCATCAAGGCAAAAGAATTTTGTCAGTCTTAGCTCTCTCTGGACTGTTGTAATTCCAAACAAAAAGGGTTTAGGCATTTTCCGATCAGAAATGGCGTCTCTCATGGCGGTTCGGCGTGCTCGAACCCCTATTCTTATCTCCTCCTTCTTCAAGGTACGGTCTCCTCTCCCTTCTCGTTTCACCTTCACTTGTGGAAATCAGACAGATACCCTAATCAAAGCCCTAAGCACCTCGGCAATCCGTAACGATTTCTCAAATTTTCCTCCTCCGCCGCAACAACCTTCTTCGTCTGACCCTCGACATCGTCAAGCCCAGTGGGGCTCGCCGAGCCAGGTTCATCCTCCGAGTGGAAATTTTAATAATCAGTCGTTCTCGGAGTTTCAGAATCGCGATTATGTTCAACAGGGAAGCCCCAGTAATCAATTGAAGTATCGGAGTCAGAATCAGAGCCCTCAACCCAATCCTGGATTTTCCCGGCAGGGTCAGAGCTATAGTCAACCCGGTAACCCTAATTCGTGGAATCCTCCAAATCAAAGCTACCCGCAGTATCAAAATCCTTCGCAGGCGAACCCTCAAAATTTCAATTATCAGCAACAAAGAGGCCCTAACCAATGGAACAATCAAAAACAGGGATATCCACAATTTGGAAGGCCTGAACAGCGTAACCCACAAGTAGAGAATTCTAATCAGTTGAATAATCAGGCTGGGATTCAAAGGCAAGGTGCTCAAAATCAAGCACTAAATGCCCTTGTATCTCCTATTGACGAACTGCGGCGCCTTTGTGGAGAGGGGAAGATGAAAGAAGCTGTTGAATTATTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATTTGTTGTTTGAACTATGTGGGAAATCCAAGTCATTTGACAATGCTAAAGTAGTTCATGATTACTTTTTACAGTCAACTTGTAGAAGTGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGGGAGATGTGGAAGCATGAGTGATGCACGGAGAGTGTTCGACTATATGCCTGATAGAAATATTGATTCTTGGCATTTTATGATGAAAGGATATGCTGATAATGGATTGGGTGATGAGGGTCTGGAGTTATTTGAGAATATGAAGCAGCTAGGGTTGCAACCCGATTCACAAACTTTCCTTTTTGTTATGTCAGCTTGTGCTAGTGCGAATGCTGTGGAAGAAGGATTTCTGTACTTTGAGTCAATGAAAAATGATTATCATATTACCCCAGACATGGATCATTATTTGGGGCTTTTAGGTATTCTTGGAGAACCAGGACACATTAATGAGGCTTTCGAGTATGTTAAGAAACTGCCAATGGAGCCCACAGTTGAGGTATGGGAGACTTTAAAGAACTATGCTAGAATTCATGGAGATGTTGATCTTGAGGACTATGCAGAGGAGCTAATTGTTGATCTGGACCCGACAAAAGCTGTTTCTAATAAGATATCGACACCACCTCCCAAAAAACGGTCTGCAATTAGCATGCTTGATGGGAAGAACAGGATTAGTGAATTCAGAAATCCAACTCTCTACAAAGATGATGAAAAATTGAAGGCTTTGAAGGCAATGAAAGAACAAGGGTATGTGCCAGATACTAGATATGTTCTTCACGATATCGATCAAGAGGCCAAAGAGCAGGCATTGCTGTATCACAGTGAACGATTGGCAATCGCATATGGATTGATCAGTACTCCGGCACGAACGCCTCTTAGGATCATTAAGAACCTAAGGATCTGCGGTGATTGTCATAATGCCATCAAAATCATGTCTAGAATTGTTGGGAGAGAGTTAATTGTAAGGGACAACAAACGGTTCCATCATTTTAAGGATGCTTGA

Coding sequence (CDS)

ATGTGTCATGTAGCTGAGCCTCGGCCTTTACCAGCCCCGTTTGAATTCAAATCAAACTACGGAGCTCTCTTTATTGGCGCTCGAAATTCTCTCTCAAGAGCCCATCACTGGAAAATCGAAGGGAACGAGCTTCAGTTCAGCGACGGAATCCTACAACAGAAAATGGTGGAATCGAACCTGACTTACGAGGAATGCCGACGCCAGAGGTTGGAAGAGAATAAGAAGAGGATGGAAGAGCTTAATCTGAACAAACTTGCCGATGCTCTGAAATTTTCCAGCCCTAAATCCTCTCCGACCAAACAGCTAAAGCGTCCTCGTCAGCCACTCGATAAGTCGTCTTTCAGTGTGAGAAGGTCTAGCCGTTTTGCCGATAAGCCCCCTCCGAACTATAAGGAGGTGCCCATTGAACCACTTCCAGGTATAAGAAGGACTTATCAAAGGAGAGATTTGCTGAATCGGATTTATGCTTCACAAGAAGAAAGGCAATATGCTATTGACAGAGCAAGAGACCTTCAATCTAGCCTGGAATCTAGGTACCCCAGTTTTGTGAAGCCCATGCTTCAATCACATGTCACAGGGGGATTTTGGCTGGGTCTACCAGTTCACTTTTGCAAGACACACCTTCCCCTTGAGGATGAAATGCTAACTCTGGTTGACGAGGATGATAATGAGTTCCAAACAAAATACCTTGCCGATAAAACAGGTCTCAGTGGTGGTTGGAGAGGGTTTTCCATTGATCATCAGTTAGTAGATGGGGATGCTTTGGTGTTTCAGTTAACTAAGCCAACTGAATTCAAGGTATATATCATCAGAGCATACAATTTAGAAGACAGAGAAGATACCAATGAGGATTCTGATGTCACCCAATTGGAAAAAAGTAGCAAAAGAAATACCAAATCATCAGGGCATAAATCCAGGGCAAATAATTCTGAGGATAAAGGAGATAATGGTGAGGATTCAGCAGATGTCTCTCAGTTGGAAAAAAGTGGCAAAAAAACTACTAAATCATCAGGGCGTAAAAGCAGGGCAAATAAATCCAAAGATAAAGCAGATAACGGCGCGGATTTGGATGTCCCCGAGCTGGAAAAAAGTGGCAAAAGAATTACTAGATCATCAAGGCAAAAGAATTTTGTCAGTCTTAGCTCTCTCTGGACTGTTGTAATTCCAAACAAAAAGGGTTTAGGCATTTTCCGATCAGAAATGGCGTCTCTCATGGCGGTTCGGCGTGCTCGAACCCCTATTCTTATCTCCTCCTTCTTCAAGGTACGGTCTCCTCTCCCTTCTCGTTTCACCTTCACTTGTGGAAATCAGACAGATACCCTAATCAAAGCCCTAAGCACCTCGGCAATCCGTAACGATTTCTCAAATTTTCCTCCTCCGCCGCAACAACCTTCTTCGTCTGACCCTCGACATCGTCAAGCCCAGTGGGGCTCGCCGAGCCAGGTTCATCCTCCGAGTGGAAATTTTAATAATCAGTCGTTCTCGGAGTTTCAGAATCGCGATTATGTTCAACAGGGAAGCCCCAGTAATCAATTGAAGTATCGGAGTCAGAATCAGAGCCCTCAACCCAATCCTGGATTTTCCCGGCAGGGTCAGAGCTATAGTCAACCCGGTAACCCTAATTCGTGGAATCCTCCAAATCAAAGCTACCCGCAGTATCAAAATCCTTCGCAGGCGAACCCTCAAAATTTCAATTATCAGCAACAAAGAGGCCCTAACCAATGGAACAATCAAAAACAGGGATATCCACAATTTGGAAGGCCTGAACAGCGTAACCCACAAGTAGAGAATTCTAATCAGTTGAATAATCAGGCTGGGATTCAAAGGCAAGGTGCTCAAAATCAAGCACTAAATGCCCTTGTATCTCCTATTGACGAACTGCGGCGCCTTTGTGGAGAGGGGAAGATGAAAGAAGCTGTTGAATTATTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATTTGTTGTTTGAACTATGTGGGAAATCCAAGTCATTTGACAATGCTAAAGTAGTTCATGATTACTTTTTACAGTCAACTTGTAGAAGTGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGGGAGATGTGGAAGCATGAGTGATGCACGGAGAGTGTTCGACTATATGCCTGATAGAAATATTGATTCTTGGCATTTTATGATGAAAGGATATGCTGATAATGGATTGGGTGATGAGGGTCTGGAGTTATTTGAGAATATGAAGCAGCTAGGGTTGCAACCCGATTCACAAACTTTCCTTTTTGTTATGTCAGCTTGTGCTAGTGCGAATGCTGTGGAAGAAGGATTTCTGTACTTTGAGTCAATGAAAAATGATTATCATATTACCCCAGACATGGATCATTATTTGGGGCTTTTAGGTATTCTTGGAGAACCAGGACACATTAATGAGGCTTTCGAGTATGTTAAGAAACTGCCAATGGAGCCCACAGTTGAGGTATGGGAGACTTTAAAGAACTATGCTAGAATTCATGGAGATGTTGATCTTGAGGACTATGCAGAGGAGCTAATTGTTGATCTGGACCCGACAAAAGCTGTTTCTAATAAGATATCGACACCACCTCCCAAAAAACGGTCTGCAATTAGCATGCTTGATGGGAAGAACAGGATTAGTGAATTCAGAAATCCAACTCTCTACAAAGATGATGAAAAATTGAAGGCTTTGAAGGCAATGAAAGAACAAGGGTATGTGCCAGATACTAGATATGTTCTTCACGATATCGATCAAGAGGCCAAAGAGCAGGCATTGCTGTATCACAGTGAACGATTGGCAATCGCATATGGATTGATCAGTACTCCGGCACGAACGCCTCTTAGGATCATTAAGAACCTAAGGATCTGCGGTGATTGTCATAATGCCATCAAAATCATGTCTAGAATTGTTGGGAGAGAGTTAATTGTAAGGGACAACAAACGGTTCCATCATTTTAAGGATGCTTGA

Protein sequence

MCHVAEPRPLPAPFEFKSNYGALFIGARNSLSRAHHWKIEGNELQFSDGILQQKMVESNLTYEECRRQRLEENKKRMEELNLNKLADALKFSSPKSSPTKQLKRPRQPLDKSSFSVRRSSRFADKPPPNYKEVPIEPLPGIRRTYQRRDLLNRIYASQEERQYAIDRARDLQSSLESRYPSFVKPMLQSHVTGGFWLGLPVHFCKTHLPLEDEMLTLVDEDDNEFQTKYLADKTGLSGGWRGFSIDHQLVDGDALVFQLTKPTEFKVYIIRAYNLEDREDTNEDSDVTQLEKSSKRNTKSSGHKSRANNSEDKGDNGEDSADVSQLEKSGKKTTKSSGRKSRANKSKDKADNGADLDVPELEKSGKRITRSSRQKNFVSLSSLWTVVIPNKKGLGIFRSEMASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDA
Homology
BLAST of CmUC02G032040 vs. NCBI nr
Match: KAG6571981.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1213.4 bits (3138), Expect = 0.0e+00
Identity = 661/966 (68.43%), Postives = 703/966 (72.77%), Query Frame = 0

Query: 23  LFIGARNSLSRAH-HWKIEGNELQFSDGILQQKMVESNLTYEECRRQRLEENKKRMEELN 82
           LF+ A   LSR     KIEGN+LQF D  LQ+KMVESNLTYEECRRQRLEENKKRMEELN
Sbjct: 15  LFVVALEILSREPITRKIEGNKLQFRDRNLQRKMVESNLTYEECRRQRLEENKKRMEELN 74

Query: 83  LNKLADALKFSSPKSSPTKQLKRPRQPLDKSSFSVRRSSRFADKPPPNYKEVPIEPLPGI 142
           LNKLADALK SSPKSSPTKQLKRPRQPLD SS SVRRSSRFADKPPP+YKE PIEPL G+
Sbjct: 75  LNKLADALKSSSPKSSPTKQLKRPRQPLDISSLSVRRSSRFADKPPPSYKEEPIEPLAGL 134

Query: 143 RRTYQRRDLLNRIYASQEERQYAIDRARDLQSSLESRYPSFVKPMLQSHVTGGFWLGLPV 202
           RRTYQRRDLLNR+YAS  ERQYAIDRARDLQSSLESRYPSFVKPMLQSHVTGGFWLGLPV
Sbjct: 135 RRTYQRRDLLNRVYASDVERQYAIDRARDLQSSLESRYPSFVKPMLQSHVTGGFWLGLPV 194

Query: 203 HFCKTHLPLEDEMLTLVDEDDNEFQTKYLADKTGLSGGWRGFSIDHQLVDGDALVFQLTK 262
           HFCK HLPLEDEMLTLVDED+NEFQTKYLA+KTGLSGGWRGFSIDHQLVDGD LVFQLTK
Sbjct: 195 HFCKAHLPLEDEMLTLVDEDENEFQTKYLAEKTGLSGGWRGFSIDHQLVDGDTLVFQLTK 254

Query: 263 PTEFKVYIIRAYNLEDREDTNEDSDVTQLEKSSKRNTKSSGHKSRANNSEDKGDNGEDSA 322
           PTEFKVYIIRAYNLEDRE+T+EDSDVTQLE + KRNT S                G+ + 
Sbjct: 255 PTEFKVYIIRAYNLEDRENTHEDSDVTQLESNGKRNTGS----------------GQINL 314

Query: 323 DVSQLEKSGKKTTKSSGRKSRANKSKDKADNGADLDVPELEKSGKRITRSSRQKNFVSLS 382
            + QL                               +P +  S K++ +  + +     S
Sbjct: 315 KIKQLMAR----------------------------IP-VSPSWKQVAKELQDQQVEGKS 374

Query: 383 SLWTVVIPNKKGLGIFRSEMASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTL 442
           S               RS   S + ++     +L  S + +                   
Sbjct: 375 S--------------SRSAFFSPIRIKYGTINVLADSLYSI------------------- 434

Query: 443 IKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRD 502
                                                                       
Sbjct: 435 ------------------------------------------------------------ 494

Query: 503 YVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANP 562
                                                          SYPQY NPSQ NP
Sbjct: 495 -----------------------------------------------SYPQYPNPSQPNP 554

Query: 563 QNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALV 622
           QNFNYQQQR PNQW+NQ QG PQFG+P QRN Q ENS QLNNQAGIQR  AQN A NALV
Sbjct: 555 QNFNYQQQRAPNQWSNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQRHAAQNHAPNALV 614

Query: 623 SPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQST 682
           SPIDELRR CGEGK+KEAVELLKEGVKADADCFH LFELCGKSKSF+NAKVVHDYFLQST
Sbjct: 615 SPIDELRRFCGEGKLKEAVELLKEGVKADADCFHELFELCGKSKSFENAKVVHDYFLQST 674

Query: 683 CRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFE 742
           CRSDLQLNNKVLEMYG+CGSMSDA+RVFD+MPDR+I+SWH M+KGYADNGLGDEGLELFE
Sbjct: 675 CRSDLQLNNKVLEMYGKCGSMSDAQRVFDHMPDRSIESWHLMIKGYADNGLGDEGLELFE 734

Query: 743 NMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEP 802
           NMK+LGLQPDSQTFLFVMSACASA+AVEEGF+YFESMKNDYHI P+MDHYLGLLGILGEP
Sbjct: 735 NMKKLGLQPDSQTFLFVMSACASASAVEEGFMYFESMKNDYHINPNMDHYLGLLGILGEP 794

Query: 803 GHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKIST 862
           GHINEAFEYV+KLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKA SNKI T
Sbjct: 795 GHINEAFEYVEKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAASNKIPT 795

Query: 863 PPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAK 922
           PPPKKRSAISMLDGKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAK
Sbjct: 855 PPPKKRSAISMLDGKNRIVEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAK 795

Query: 923 EQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKR 982
           EQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKR
Sbjct: 915 EQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKR 795

Query: 983 FHHFKD 988
           FHHFKD
Sbjct: 975 FHHFKD 795

BLAST of CmUC02G032040 vs. NCBI nr
Match: XP_038887834.1 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial [Benincasa hispida])

HSP 1 Score: 1096.6 bits (2835), Expect = 0.0e+00
Identity = 551/587 (93.87%), Postives = 564/587 (96.08%), Query Frame = 0

Query: 401 MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPP 460
           MASLMA RRARTPIL+SSFFKVRSPLPSRFTFTCG+QTDTLIKALSTSAI NDFSNFPPP
Sbjct: 1   MASLMAARRARTPILLSSFFKVRSPLPSRFTFTCGDQTDTLIKALSTSAIPNDFSNFPPP 60

Query: 461 PQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQS 520
           PQQPSSSDPR+RQAQ G  SQVHPP+GNFNNQSFSEFQNRDYVQQGSPSNQ  YRSQNQS
Sbjct: 61  PQQPSSSDPRYRQAQRG--SQVHPPNGNFNNQSFSEFQNRDYVQQGSPSNQFNYRSQNQS 120

Query: 521 PQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQ 580
           PQPNPGFS QGQ Y+Q GNPNSWNPPNQSYPQYQNPSQ NPQNF YQQ+RGPNQWNNQ Q
Sbjct: 121 PQPNPGFSLQGQRYTQAGNPNSWNPPNQSYPQYQNPSQPNPQNFKYQQERGPNQWNNQNQ 180

Query: 581 GYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAV 640
           GYPQ GRPEQRNPQVE SNQLNNQAGIQR GAQNQ  NA VS IDELRR+CGEGKMKEAV
Sbjct: 181 GYPQSGRPEQRNPQVEYSNQLNNQAGIQRLGAQNQEPNAFVSRIDELRRVCGEGKMKEAV 240

Query: 641 ELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG 700
           ELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG
Sbjct: 241 ELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG 300

Query: 701 SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMS 760
           SMSDA+RVFD+MPDRNIDSWHFMMKGYADNGLGDEGLELFENMK+LGLQP+SQTFLFVMS
Sbjct: 301 SMSDAQRVFDHMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKKLGLQPNSQTFLFVMS 360

Query: 761 ACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTV 820
           ACASANAVEEGFLYFESMKNDYHITPDMD YLGLLGILGEPGHINEAFEYV+KLPMEPTV
Sbjct: 361 ACASANAVEEGFLYFESMKNDYHITPDMDSYLGLLGILGEPGHINEAFEYVEKLPMEPTV 420

Query: 821 EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRIS 880
           EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAV NKI TPPPKKRSAI+MLDGKNRIS
Sbjct: 421 EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVYNKIPTPPPKKRSAINMLDGKNRIS 480

Query: 881 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 940
           EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS
Sbjct: 481 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 540

Query: 941 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 988
           TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Sbjct: 541 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 585

BLAST of CmUC02G032040 vs. NCBI nr
Match: KAG7011659.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1054.3 bits (2725), Expect = 6.6e-304
Identity = 528/587 (89.95%), Postives = 549/587 (93.53%), Query Frame = 0

Query: 401 MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPP 460
           MASLMAVRR RTPI ISSF KVRSPLPS FTF CGN+T+TLIKALSTSA  +DFSNFP P
Sbjct: 1   MASLMAVRRVRTPIHISSFIKVRSPLPSTFTFFCGNRTETLIKALSTSAFPDDFSNFPTP 60

Query: 461 PQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQS 520
           PQQPS SDPR+ Q QWGSPSQVH PSGNFNNQSFSEFQNRDYVQQGSPSN + YRSQNQS
Sbjct: 61  PQQPSLSDPRYLQGQWGSPSQVHCPSGNFNNQSFSEFQNRDYVQQGSPSNHMIYRSQNQS 120

Query: 521 PQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQ 580
             PNPGF RQGQSY+Q GNPNSWNPPNQSYPQY NPSQ NPQNFNYQQQR PNQW+NQ Q
Sbjct: 121 SYPNPGFPRQGQSYTQGGNPNSWNPPNQSYPQYPNPSQPNPQNFNYQQQRAPNQWSNQNQ 180

Query: 581 GYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAV 640
           G PQFG+P QRN Q ENS QLNNQAGIQR  AQN A NALVSPIDELRR CGEGK+KEAV
Sbjct: 181 GLPQFGKPGQRNLQAENSYQLNNQAGIQRHAAQNHAPNALVSPIDELRRFCGEGKLKEAV 240

Query: 641 ELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG 700
           ELLKEGVKADADCFH LFELCGKSKSF+NAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CG
Sbjct: 241 ELLKEGVKADADCFHELFELCGKSKSFENAKVVHDYFLQSTCRSDLQLNNKVLEMYGKCG 300

Query: 701 SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMS 760
           SMSDA+RVFD+MPDR+I+SWH M+KGYADNGLGDEGLELFENMK+LGLQPDSQTFLFVMS
Sbjct: 301 SMSDAQRVFDHMPDRSIESWHLMIKGYADNGLGDEGLELFENMKKLGLQPDSQTFLFVMS 360

Query: 761 ACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTV 820
           ACASA+AVEEGF+YFESMKNDYHI P+MDHYLGLLGILGEPGHINEAFEYV+KLPMEPTV
Sbjct: 361 ACASASAVEEGFMYFESMKNDYHINPNMDHYLGLLGILGEPGHINEAFEYVEKLPMEPTV 420

Query: 821 EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRIS 880
           EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKRSAISMLDGKNRI 
Sbjct: 421 EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAASNKIPTPPPKKRSAISMLDGKNRIV 480

Query: 881 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 940
           EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS
Sbjct: 481 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 540

Query: 941 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 988
           TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Sbjct: 541 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 587

BLAST of CmUC02G032040 vs. NCBI nr
Match: XP_022952757.1 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 1052.0 bits (2719), Expect = 3.3e-303
Identity = 526/587 (89.61%), Postives = 548/587 (93.36%), Query Frame = 0

Query: 401 MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPP 460
           MASLMAVRR RTPI ISSF KVRSPLPS FTF+CGN+T+TLIKALSTSA  +DFSNFP P
Sbjct: 1   MASLMAVRRVRTPIHISSFIKVRSPLPSTFTFSCGNRTETLIKALSTSAFPDDFSNFPTP 60

Query: 461 PQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQS 520
           PQQPSSSDPR+ Q QWGSPSQVH PSGNFNNQSFSEFQNRDYVQQGSPSNQ+ YRSQNQS
Sbjct: 61  PQQPSSSDPRYLQGQWGSPSQVHRPSGNFNNQSFSEFQNRDYVQQGSPSNQMNYRSQNQS 120

Query: 521 PQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQ 580
             PNPGF RQGQSY+Q GNPNSWNPPNQSYPQYQNPSQ NPQNFNYQQQR PNQW+NQ Q
Sbjct: 121 SYPNPGFPRQGQSYTQGGNPNSWNPPNQSYPQYQNPSQPNPQNFNYQQQRAPNQWSNQNQ 180

Query: 581 GYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAV 640
           G PQFG+P QRN Q ENS QLNNQAGIQ  GAQN   NALVSPIDELRR CGEGK+KEAV
Sbjct: 181 GLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHTPNALVSPIDELRRFCGEGKLKEAV 240

Query: 641 ELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG 700
           ELLKEGVKADADCFH  FELCGKSKSF+NAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CG
Sbjct: 241 ELLKEGVKADADCFHEFFELCGKSKSFENAKVVHDYFLQSTCRSDLQLNNKVLEMYGKCG 300

Query: 701 SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMS 760
           SMSDA+RVFD+M DR+I+SWH M+KGYADNGLGDEGLELFENMK+LGL P+SQTFLFVMS
Sbjct: 301 SMSDAQRVFDHMLDRSIESWHLMIKGYADNGLGDEGLELFENMKKLGLHPNSQTFLFVMS 360

Query: 761 ACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTV 820
           ACASA+AVEEGF+YFESMKNDYHI PDMDHYLGLLGILGEPGHINEAFEYV+KLPMEPTV
Sbjct: 361 ACASASAVEEGFMYFESMKNDYHINPDMDHYLGLLGILGEPGHINEAFEYVEKLPMEPTV 420

Query: 821 EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRIS 880
           EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKR AISMLDGKNRI 
Sbjct: 421 EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAASNKIPTPPPKKRFAISMLDGKNRIV 480

Query: 881 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 940
           EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS
Sbjct: 481 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 540

Query: 941 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 988
           TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Sbjct: 541 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 587

BLAST of CmUC02G032040 vs. NCBI nr
Match: XP_022972422.1 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 1049.3 bits (2712), Expect = 2.1e-302
Identity = 527/587 (89.78%), Postives = 549/587 (93.53%), Query Frame = 0

Query: 401 MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPP 460
           MASLMAVRR RTPI ISSF KVRSPLPS FTF+CGNQT+TLIKALSTSA  +DFSNFP P
Sbjct: 1   MASLMAVRRVRTPIHISSFIKVRSPLPSTFTFSCGNQTETLIKALSTSAFPDDFSNFPTP 60

Query: 461 PQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQS 520
           PQQPSSS PR+ Q Q GSPSQVH PSGNFNNQSFSEFQNRDYVQ GSPSNQ+  RSQNQS
Sbjct: 61  PQQPSSSHPRYLQGQRGSPSQVHRPSGNFNNQSFSEFQNRDYVQLGSPSNQMNNRSQNQS 120

Query: 521 PQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQ 580
             PNPGF RQGQSY+Q GNPNSWNPPNQSYPQYQNPSQ NPQNFNYQQQR PNQW+NQ Q
Sbjct: 121 SYPNPGFPRQGQSYTQGGNPNSWNPPNQSYPQYQNPSQPNPQNFNYQQQRAPNQWSNQNQ 180

Query: 581 GYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAV 640
           G PQFG+P QRN Q ENS QLNNQAGIQ  GAQN A NALVSPIDELRR CGEGK+KEAV
Sbjct: 181 GLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHAPNALVSPIDELRRFCGEGKLKEAV 240

Query: 641 ELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG 700
           ELLKEGVKADADCFH LFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CG
Sbjct: 241 ELLKEGVKADADCFHELFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGKCG 300

Query: 701 SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMS 760
           SMSDA+RVFD+MPDR+I+SWH M+KGYADNGLGDEGLELFENMK+LGLQP+SQTFLFVMS
Sbjct: 301 SMSDAQRVFDHMPDRSIESWHLMIKGYADNGLGDEGLELFENMKKLGLQPNSQTFLFVMS 360

Query: 761 ACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTV 820
           ACASA+AVEEGF+YFESMKNDYHI PDMDHYLGLLGILGEPGHINEAFEYV+KLP+EPTV
Sbjct: 361 ACASASAVEEGFMYFESMKNDYHINPDMDHYLGLLGILGEPGHINEAFEYVEKLPIEPTV 420

Query: 821 EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRIS 880
           EVWETLKNYA+IHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKRSAISMLDGKNRI 
Sbjct: 421 EVWETLKNYAKIHGDVDLEDYAEELIVDLDPTKAASNKIPTPPPKKRSAISMLDGKNRIV 480

Query: 881 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 940
           EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS
Sbjct: 481 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 540

Query: 941 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 988
           TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Sbjct: 541 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 587

BLAST of CmUC02G032040 vs. ExPASy Swiss-Prot
Match: Q9ZQE5 (Pentatricopeptide repeat-containing protein At2g15690, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H66 PE=1 SV=2)

HSP 1 Score: 493.8 bits (1270), Expect = 4.5e-138
Identity = 303/612 (49.51%), Postives = 390/612 (63.73%), Query Frame = 0

Query: 401 MASLMAVRRARTP--ILISSFFKVRSPLP---SRFTFTCGNQTDTLIKALSTSAIRNDFS 460
           M+SLMA+R ART   + I S  ++RS  P   S+F F+ G      IK LSTSA  ND+ 
Sbjct: 1   MSSLMAIRCARTQNIVTIGSLLQLRSSFPRLSSQFHFS-GTLNSIPIKHLSTSAAANDY- 60

Query: 461 NFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYR 520
                          H+  Q GSPSQ   P   +  QSF + QN+    Q  P +  ++ 
Sbjct: 61  ---------------HQNPQSGSPSQHQRP---YPPQSF-DSQNQTNTNQRVPQSPNQWS 120

Query: 521 SQN--QSPQ-----PNPGFSRQ---GQSYSQPGNPNSWNPPNQSY----PQY--QNPSQA 580
           +Q+  Q PQ     P  G  R    GQ+  Q G  + +   N  +    PQY  Q P   
Sbjct: 121 TQHGGQIPQYGGQNPQHGGQRPPYGGQNPQQGGQMSQYGGHNPQHGGHRPQYGGQRPQYG 180

Query: 581 NPQNFNYQQQRGPNQWNNQKQGY-PQFGRPEQRNPQ-VENSNQLNNQAGIQRQGAQNQAL 640
            P N NYQ Q    Q +NQ Q Y PQ    +Q+ PQ   +SNQ  NQ            +
Sbjct: 181 GPGN-NYQNQN--VQQSNQSQYYTPQ----QQQQPQPPRSSNQSPNQ------------M 240

Query: 641 NALVSP--IDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHD 700
           N +  P  ++E+ RLC     K+A+ELL +G   D +CF LLFE C   KS +++K VHD
Sbjct: 241 NEVAPPPSVEEVMRLCQRRLYKDAIELLDKGAMPDRECFVLLFESCANLKSLEHSKKVHD 300

Query: 701 YFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDE 760
           +FLQS  R D +LNN V+ M+G C S++DA+RVFD+M D+++DSWH MM  Y+DNG+GD+
Sbjct: 301 HFLQSKFRGDPKLNNMVISMFGECSSITDAKRVFDHMVDKDMDSWHLMMCAYSDNGMGDD 360

Query: 761 GLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLL 820
            L LFE M + GL+P+ +TFL V  ACA+   +EE FL+F+SMKN++ I+P  +HYLG+L
Sbjct: 361 ALHLFEEMTKHGLKPNEETFLTVFLACATVGGIEEAFLHFDSMKNEHGISPKTEHYLGVL 420

Query: 821 GILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAV 880
           G+LG+ GH+ EA +Y++ LP EPT + WE ++NYAR+HGD+DLEDY EEL+VD+DP+KAV
Sbjct: 421 GVLGKCGHLVEAEQYIRDLPFEPTADFWEAMRNYARLHGDIDLEDYMEELMVDVDPSKAV 480

Query: 881 SNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHD 940
            NKI TPPPK     +M+  K+RI EFRN T YKD+ K  A K  K   YVPDTR+VLHD
Sbjct: 481 INKIPTPPPKSFKETNMVTSKSRILEFRNLTFYKDEAKEMAAK--KGVVYVPDTRFVLHD 540

Query: 941 IDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELI 988
           IDQEAKEQALLYHSERLAIAYG+I TP R  L IIKNLR+CGDCHN IKIMS+I+GR LI
Sbjct: 541 IDQEAKEQALLYHSERLAIAYGIICTPPRKTLTIIKNLRVCGDCHNFIKIMSKIIGRVLI 570

BLAST of CmUC02G032040 vs. ExPASy Swiss-Prot
Match: Q9SUU7 (Pentatricopeptide repeat-containing protein At4g32450, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H63 PE=2 SV=1)

HSP 1 Score: 294.7 bits (753), Expect = 4.0e-78
Identity = 201/576 (34.90%), Postives = 299/576 (51.91%), Query Frame = 0

Query: 433 TCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSG-NFNN 492
           TC  +  +L   LST+A+R  F N       P++ +P        S   +   +G N   
Sbjct: 14  TCKLRYSSLFSYLSTAALRLGFEN-------PTNGNPMD-----NSSHHIGYVNGFNGGE 73

Query: 493 QSFSEFQNRDYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYP 552
           QS   FQ   Y Q  +P +                    GQ+ +     N +N  NQSY 
Sbjct: 74  QSLGGFQQNSYEQSLNPVS--------------------GQNPTNRFYQNGYN-RNQSYG 133

Query: 553 QYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQG 612
           ++      N +N N+Q   G + +     G PQ       + Q ++S             
Sbjct: 134 EHS--EIINQRNQNWQSSDGCSSYGTTGNGVPQENNTGGNHFQQDHSGH----------- 193

Query: 613 AQNQALNALVSPIDELRRLCGEGKMKEAVELLK----EGVKADADCFHLLFELCGKSKSF 672
                     S +DEL  +C EGK+K+AVE++K    EG   D      + +LCG +++ 
Sbjct: 194 ----------SSLDELDSICREGKVKKAVEIIKSWRNEGYVVDLPRLFWIAQLCGDAQAL 253

Query: 673 DNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGY 732
             AKVVH++   S   SD+   N ++EMY  CGS+ DA  VF+ MP+RN+++W  +++ +
Sbjct: 254 QEAKVVHEFITSSVGISDISAYNSIIEMYSGCGSVEDALTVFNSMPERNLETWCGVIRCF 313

Query: 733 ADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPD 792
           A NG G++ ++ F   KQ G +PD + F  +  AC     + EG L+FESM  +Y I P 
Sbjct: 314 AKNGQGEDAIDTFSRFKQEGNKPDGEMFKEIFFACGVLGDMNEGLLHFESMYKEYGIIPC 373

Query: 793 MDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIV 852
           M+HY+ L+ +L EPG+++EA  +V+   MEP V++WETL N +R+HGD+ L D  ++++ 
Sbjct: 374 MEHYVSLVKMLAEPGYLDEALRFVES--MEPNVDLWETLMNLSRVHGDLILGDRCQDMVE 433

Query: 853 DLDPTKAVSNKISTPPPKKRSAI------SMLDGKN---------RISEFRNPTLYKDDE 912
            LD ++      +   P K S +       M  G N          IS   N  LY    
Sbjct: 434 QLDASRLNKESKAGLVPVKSSDLVKEKLQRMAKGPNYGIRYMAAGDISRPENRELYM--- 493

Query: 913 KLKALKA-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIK 972
            LK+LK  M E GYVP ++  LHD+DQE+K++ L  H+ER A     + TPAR+ +R++K
Sbjct: 494 ALKSLKEHMIEIGYVPLSKLALHDVDQESKDENLFNHNERFAFISTFLDTPARSLIRVMK 528

Query: 973 NLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 988
           NLR+C DCHNA+K+MS+IVGRELI RD KRFHH KD
Sbjct: 554 NLRVCADCHNALKLMSKIVGRELISRDAKRFHHMKD 528

BLAST of CmUC02G032040 vs. ExPASy Swiss-Prot
Match: Q680H3 (Pentatricopeptide repeat-containing protein At2g25580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H75 PE=2 SV=2)

HSP 1 Score: 292.0 bits (746), Expect = 2.6e-77
Identity = 156/386 (40.41%), Postives = 231/386 (59.84%), Query Frame = 0

Query: 624 IDELRRLCGEGKMKEAVE----LLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQ 683
           I+E    C  GK+K+A+     L       D      L ++CG+++    AK VH     
Sbjct: 223 IEEYDAFCKHGKVKKALYTIDILASMNYVVDLSRLLRLAKICGEAEGLQEAKTVHGKISA 282

Query: 684 STCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLEL 743
           S    DL  N+ +LEMY  CG  ++A  VF+ M ++N+++W  +++ +A NG G++ +++
Sbjct: 283 SVSHLDLSSNHVLLEMYSNCGLANEAASVFEKMSEKNLETWCIIIRCFAKNGFGEDAIDM 342

Query: 744 FENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILG 803
           F   K+ G  PD Q F  +  AC     V+EG L+FESM  DY I P ++ Y+ L+ +  
Sbjct: 343 FSRFKEEGNIPDGQLFRGIFYACGMLGDVDEGLLHFESMSRDYGIAPSIEDYVSLVEMYA 402

Query: 804 EPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTK------ 863
            PG ++EA E+V+++PMEP V+VWETL N +R+HG+++L DY  E++  LDPT+      
Sbjct: 403 LPGFLDEALEFVERMPMEPNVDVWETLMNLSRVHGNLELGDYCAEVVEFLDPTRLNKQSR 462

Query: 864 -----AVSNKISTPPPKKRSAISMLDG-KNRISEFR--NPTLYKDDEKLKALKAMK---- 923
                  ++ +     KKRS I  L G K+ + EFR  +  L ++DE  + L+ +K    
Sbjct: 463 EGFIPVKASDVEKESLKKRSGI--LHGVKSSMQEFRAGDTNLPENDELFQLLRNLKMHMV 522

Query: 924 EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHN 983
           E GYV +TR  LHDIDQE+KE  LL HSER+A A  ++++  R P  +IKNLR+C DCHN
Sbjct: 523 EVGYVAETRMALHDIDQESKETLLLGHSERIAFARAVLNSAPRKPFTVIKNLRVCVDCHN 582

Query: 984 AIKIMSRIVGRELIVRDNKRFHHFKD 988
           A+KIMS IVGRE+I RD KRFH  K+
Sbjct: 583 ALKIMSDIVGREVITRDIKRFHQMKN 606

BLAST of CmUC02G032040 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 263.5 bits (672), Expect = 9.9e-69
Identity = 139/397 (35.01%), Postives = 229/397 (57.68%), Query Frame = 0

Query: 618 NALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYF 677
           NAL++     RR   E  ++    +L++G +     +  LF  C  +   +  K VH Y 
Sbjct: 231 NALIA--GHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYM 290

Query: 678 LQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGL 737
           ++S  +      N +L+MY + GS+ DAR++FD +  R++ SW+ ++  YA +G G E +
Sbjct: 291 IKSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAV 350

Query: 738 ELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGI 797
             FE M+++G++P+  +FL V++AC+ +  ++EG+ Y+E MK D  I P+  HY+ ++ +
Sbjct: 351 WWFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKD-GIVPEAWHYVTVVDL 410

Query: 798 LGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDP------ 857
           LG  G +N A  +++++P+EPT  +W+ L N  R+H + +L  YA E + +LDP      
Sbjct: 411 LGRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPH 470

Query: 858 ---------------TKAVSNKISTPPPKKRSAISMLDGKNRISEF-RNPTLYKDDEKL- 917
                             V  K+     KK  A S ++ +N I  F  N   +   E++ 
Sbjct: 471 VILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIA 530

Query: 918 ----KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRII 977
               + L  +KE GYVPDT +V+  +DQ+ +E  L YHSE++A+A+ L++TP  + + I 
Sbjct: 531 RKWEEVLAKIKELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIK 590

Query: 978 KNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 988
           KN+R+CGDCH AIK+ S++VGRE+IVRD  RFHHFKD
Sbjct: 591 KNIRVCGDCHTAIKLASKVVGREIIVRDTNRFHHFKD 624

BLAST of CmUC02G032040 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 261.9 bits (668), Expect = 2.9e-68
Identity = 139/386 (36.01%), Postives = 218/386 (56.48%), Query Frame = 0

Query: 634 GKMKEAVELLKE----GVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLN 693
           GK +EA+ L  E    G+K D      L   C K  +    K VH Y ++     +L  +
Sbjct: 201 GKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSS 260

Query: 694 NKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQL-GL 753
           N +L++Y RCG + +A+ +FD M D+N  SW  ++ G A NG G E +ELF+ M+   GL
Sbjct: 261 NVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGL 320

Query: 754 QPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAF 813
            P   TF+ ++ AC+    V+EGF YF  M+ +Y I P ++H+  ++ +L   G + +A+
Sbjct: 321 LPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAY 380

Query: 814 EYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDP----------------- 873
           EY+K +PM+P V +W TL     +HGD DL ++A   I+ L+P                 
Sbjct: 381 EYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQ 440

Query: 874 ----TKAVSNKISTPPPKKRSAISMLDGKNRISEF-----RNPTLYKDDEKLKALKA-MK 933
                + +  ++     KK    S+++  NR+ EF      +P       KLK +   ++
Sbjct: 441 RWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLR 500

Query: 934 EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHN 988
            +GYVP    V  D+++E KE A++YHSE++AIA+ LISTP R+P+ ++KNLR+C DCH 
Sbjct: 501 SEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHL 560

BLAST of CmUC02G032040 vs. ExPASy TrEMBL
Match: A0A6J1GL94 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111455360 PE=3 SV=1)

HSP 1 Score: 1052.0 bits (2719), Expect = 1.6e-303
Identity = 526/587 (89.61%), Postives = 548/587 (93.36%), Query Frame = 0

Query: 401 MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPP 460
           MASLMAVRR RTPI ISSF KVRSPLPS FTF+CGN+T+TLIKALSTSA  +DFSNFP P
Sbjct: 1   MASLMAVRRVRTPIHISSFIKVRSPLPSTFTFSCGNRTETLIKALSTSAFPDDFSNFPTP 60

Query: 461 PQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQS 520
           PQQPSSSDPR+ Q QWGSPSQVH PSGNFNNQSFSEFQNRDYVQQGSPSNQ+ YRSQNQS
Sbjct: 61  PQQPSSSDPRYLQGQWGSPSQVHRPSGNFNNQSFSEFQNRDYVQQGSPSNQMNYRSQNQS 120

Query: 521 PQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQ 580
             PNPGF RQGQSY+Q GNPNSWNPPNQSYPQYQNPSQ NPQNFNYQQQR PNQW+NQ Q
Sbjct: 121 SYPNPGFPRQGQSYTQGGNPNSWNPPNQSYPQYQNPSQPNPQNFNYQQQRAPNQWSNQNQ 180

Query: 581 GYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAV 640
           G PQFG+P QRN Q ENS QLNNQAGIQ  GAQN   NALVSPIDELRR CGEGK+KEAV
Sbjct: 181 GLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHTPNALVSPIDELRRFCGEGKLKEAV 240

Query: 641 ELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG 700
           ELLKEGVKADADCFH  FELCGKSKSF+NAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CG
Sbjct: 241 ELLKEGVKADADCFHEFFELCGKSKSFENAKVVHDYFLQSTCRSDLQLNNKVLEMYGKCG 300

Query: 701 SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMS 760
           SMSDA+RVFD+M DR+I+SWH M+KGYADNGLGDEGLELFENMK+LGL P+SQTFLFVMS
Sbjct: 301 SMSDAQRVFDHMLDRSIESWHLMIKGYADNGLGDEGLELFENMKKLGLHPNSQTFLFVMS 360

Query: 761 ACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTV 820
           ACASA+AVEEGF+YFESMKNDYHI PDMDHYLGLLGILGEPGHINEAFEYV+KLPMEPTV
Sbjct: 361 ACASASAVEEGFMYFESMKNDYHINPDMDHYLGLLGILGEPGHINEAFEYVEKLPMEPTV 420

Query: 821 EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRIS 880
           EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKR AISMLDGKNRI 
Sbjct: 421 EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAASNKIPTPPPKKRFAISMLDGKNRIV 480

Query: 881 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 940
           EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS
Sbjct: 481 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 540

Query: 941 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 988
           TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Sbjct: 541 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 587

BLAST of CmUC02G032040 vs. ExPASy TrEMBL
Match: A0A6J1I4R8 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111470986 PE=3 SV=1)

HSP 1 Score: 1049.3 bits (2712), Expect = 1.0e-302
Identity = 527/587 (89.78%), Postives = 549/587 (93.53%), Query Frame = 0

Query: 401 MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPP 460
           MASLMAVRR RTPI ISSF KVRSPLPS FTF+CGNQT+TLIKALSTSA  +DFSNFP P
Sbjct: 1   MASLMAVRRVRTPIHISSFIKVRSPLPSTFTFSCGNQTETLIKALSTSAFPDDFSNFPTP 60

Query: 461 PQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQS 520
           PQQPSSS PR+ Q Q GSPSQVH PSGNFNNQSFSEFQNRDYVQ GSPSNQ+  RSQNQS
Sbjct: 61  PQQPSSSHPRYLQGQRGSPSQVHRPSGNFNNQSFSEFQNRDYVQLGSPSNQMNNRSQNQS 120

Query: 521 PQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQ 580
             PNPGF RQGQSY+Q GNPNSWNPPNQSYPQYQNPSQ NPQNFNYQQQR PNQW+NQ Q
Sbjct: 121 SYPNPGFPRQGQSYTQGGNPNSWNPPNQSYPQYQNPSQPNPQNFNYQQQRAPNQWSNQNQ 180

Query: 581 GYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAV 640
           G PQFG+P QRN Q ENS QLNNQAGIQ  GAQN A NALVSPIDELRR CGEGK+KEAV
Sbjct: 181 GLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHAPNALVSPIDELRRFCGEGKLKEAV 240

Query: 641 ELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG 700
           ELLKEGVKADADCFH LFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CG
Sbjct: 241 ELLKEGVKADADCFHELFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGKCG 300

Query: 701 SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMS 760
           SMSDA+RVFD+MPDR+I+SWH M+KGYADNGLGDEGLELFENMK+LGLQP+SQTFLFVMS
Sbjct: 301 SMSDAQRVFDHMPDRSIESWHLMIKGYADNGLGDEGLELFENMKKLGLQPNSQTFLFVMS 360

Query: 761 ACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTV 820
           ACASA+AVEEGF+YFESMKNDYHI PDMDHYLGLLGILGEPGHINEAFEYV+KLP+EPTV
Sbjct: 361 ACASASAVEEGFMYFESMKNDYHINPDMDHYLGLLGILGEPGHINEAFEYVEKLPIEPTV 420

Query: 821 EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRIS 880
           EVWETLKNYA+IHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKRSAISMLDGKNRI 
Sbjct: 421 EVWETLKNYAKIHGDVDLEDYAEELIVDLDPTKAASNKIPTPPPKKRSAISMLDGKNRIV 480

Query: 881 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 940
           EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS
Sbjct: 481 EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS 540

Query: 941 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 988
           TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Sbjct: 541 TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 587

BLAST of CmUC02G032040 vs. ExPASy TrEMBL
Match: A0A1S3BZP9 (pentatricopeptide repeat-containing protein At2g15690 OS=Cucumis melo OX=3656 GN=LOC103495231 PE=3 SV=1)

HSP 1 Score: 1032.7 bits (2669), Expect = 9.9e-298
Identity = 526/619 (84.98%), Postives = 547/619 (88.37%), Query Frame = 0

Query: 401 MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPP 460
           MASLMAVRR RTPI +SSFFKVR PL S FTFT  NQT+TLIK LSTSAI +DFSNFP  
Sbjct: 1   MASLMAVRRVRTPITVSSFFKVRYPLSSSFTFTFRNQTETLIKTLSTSAIPSDFSNFPSS 60

Query: 461 PQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQS 520
           PQQPSSS P +RQ QWGSPSQV+PPS NFN QSFSEFQN DY QQG+PSNQL YRSQ+QS
Sbjct: 61  PQQPSSSSPPYRQPQWGSPSQVNPPSENFNRQSFSEFQNHDYAQQGTPSNQLNYRSQHQS 120

Query: 521 PQPNPGFSRQGQSYSQPGNPNSW--------------------------------NPPNQ 580
           PQPNPGFSR+GQSY+Q G  NSW                                NPPNQ
Sbjct: 121 PQPNPGFSREGQSYTQVGKTNSWNPPNQSYPQYQNPSQPNPPNQSYPQYQNPSQPNPPNQ 180

Query: 581 SYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQ 640
           SYPQYQNPSQ NP NFNYQQQRGPNQWNNQ QG+PQFGR E RNPQ ENSNQLNNQA IQ
Sbjct: 181 SYPQYQNPSQPNPPNFNYQQQRGPNQWNNQNQGHPQFGRSEHRNPQPENSNQLNNQAEIQ 240

Query: 641 RQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFD 700
           R G QNQA NALVSPIDELRR CGEGK+KEAVELLK+GVKAD DCFHLLFELCGKSKS D
Sbjct: 241 RHGTQNQAPNALVSPIDELRRFCGEGKLKEAVELLKQGVKADVDCFHLLFELCGKSKSLD 300

Query: 701 NAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYA 760
           NAKVVHDYFLQSTCRSDLQLNN+VLEMYGRCGSMSDARRVFD+MPDRNIDSWH MMKGYA
Sbjct: 301 NAKVVHDYFLQSTCRSDLQLNNEVLEMYGRCGSMSDARRVFDHMPDRNIDSWHLMMKGYA 360

Query: 761 DNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDM 820
           DNGLGDEGLELFENMK LGLQP+SQTFL+VMSACASA+AVEEGFLYFESMKNDYHITPD 
Sbjct: 361 DNGLGDEGLELFENMKNLGLQPNSQTFLYVMSACASADAVEEGFLYFESMKNDYHITPDT 420

Query: 821 DHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVD 880
           +HYLGLLGILGEPGHI+EAFEYV+KLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVD
Sbjct: 421 NHYLGLLGILGEPGHIHEAFEYVEKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVD 480

Query: 881 LDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPD 940
           LDPTKAVSNKI TPPPKKRSAISML+GKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPD
Sbjct: 481 LDPTKAVSNKIPTPPPKKRSAISMLEGKNRIVEFRNPTLYKDDEKLKALKAMKEQGYVPD 540

Query: 941 TRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSR 988
           TRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSR
Sbjct: 541 TRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSR 600

BLAST of CmUC02G032040 vs. ExPASy TrEMBL
Match: A0A6J1C4C3 (pentatricopeptide repeat-containing protein At2g15690 OS=Momordica charantia OX=3673 GN=LOC111007809 PE=3 SV=1)

HSP 1 Score: 1026.9 bits (2654), Expect = 5.4e-296
Identity = 515/595 (86.55%), Postives = 543/595 (91.26%), Query Frame = 0

Query: 401 MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPP 460
           MASLMAVRRAR PIL SSFFKVR PLPS F+F+CGNQT+T IKALSTSAI ND+SNF P 
Sbjct: 1   MASLMAVRRARIPILASSFFKVRPPLPSHFSFSCGNQTETPIKALSTSAIPNDYSNFSPS 60

Query: 461 PQQPSSSDPRHRQ-----AQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYR 520
           PQQ  +SDPR  Q      QWG+PSQVHPPSGNFNNQSFSEFQNRDYVQQGS  NQ+ Y+
Sbjct: 61  PQQNPASDPRFLQGRRTPGQWGTPSQVHPPSGNFNNQSFSEFQNRDYVQQGSAGNQMNYQ 120

Query: 521 SQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQN---PSQANPQNFNYQQQRGP 580
           SQN+   PNPGFS+QGQ Y+Q GNPNSWNPPNQSYPQ QN   PS  NPQNFNYQQQRGP
Sbjct: 121 SQNRRSHPNPGFSQQGQGYTQAGNPNSWNPPNQSYPQNQNPSLPSLPNPQNFNYQQQRGP 180

Query: 581 NQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCG 640
           NQWNNQ QGYPQ G P QRNPQVEN NQLNNQ G+Q  GAQ QA NALV PIDELRRLCG
Sbjct: 181 NQWNNQNQGYPQVGNPAQRNPQVENYNQLNNQGGVQGHGAQTQAPNALVPPIDELRRLCG 240

Query: 641 EGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKV 700
           +GK+KEAVELLKEGVKADADCFH++FELCGKSKSFDNAK+VHDYFLQSTCR DLQLNNKV
Sbjct: 241 DGKIKEAVELLKEGVKADADCFHVMFELCGKSKSFDNAKIVHDYFLQSTCRGDLQLNNKV 300

Query: 701 LEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDS 760
           LEMYG+CGSMSDARRVFD+MPDRNIDSWH M+KGYADNGLGDEGLELFENMK+LGLQP+S
Sbjct: 301 LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGLGDEGLELFENMKKLGLQPNS 360

Query: 761 QTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVK 820
           QTFL+VMSACAS +AVEEGF+YFESMKNDYHI P+MDHYLGLLGILGEPGHINEAFEYV+
Sbjct: 361 QTFLYVMSACASVSAVEEGFMYFESMKNDYHIVPEMDHYLGLLGILGEPGHINEAFEYVE 420

Query: 821 KLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISM 880
           KLPMEPTVEVWETLKNYARIHG+VDLEDYAEELIV LDPTKA  NKI TPPPKKRSAISM
Sbjct: 421 KLPMEPTVEVWETLKNYARIHGNVDLEDYAEELIVALDPTKAPPNKIPTPPPKKRSAISM 480

Query: 881 LDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERL 940
           LDGKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERL
Sbjct: 481 LDGKNRIVEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERL 540

Query: 941 AIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 988
           AIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Sbjct: 541 AIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 595

BLAST of CmUC02G032040 vs. ExPASy TrEMBL
Match: A0A5D3C491 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G00440 PE=3 SV=1)

HSP 1 Score: 1014.2 bits (2621), Expect = 3.6e-292
Identity = 526/667 (78.86%), Postives = 547/667 (82.01%), Query Frame = 0

Query: 401 MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPP 460
           MASLMAVRR RTPI +SSFFKVR PL S FTFT  NQT+TLIK LSTSAI +DFSNFP  
Sbjct: 1   MASLMAVRRVRTPITVSSFFKVRYPLSSSFTFTFRNQTETLIKTLSTSAIPSDFSNFPSS 60

Query: 461 PQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQS 520
           PQQPSSS P +RQ QWGSPSQV+PPS NFN QSFSEFQN DY QQG+PSNQL YRSQ+QS
Sbjct: 61  PQQPSSSSPPYRQPQWGSPSQVNPPSENFNRQSFSEFQNHDYAQQGTPSNQLNYRSQHQS 120

Query: 521 PQPNPGFSRQGQSYSQPGNPNSW------------------------------------- 580
           PQPNPGFSR+GQSY+Q G  NSW                                     
Sbjct: 121 PQPNPGFSREGQSYTQVGKTNSWNPPNQSYPQYQNPSQPNPPNQSYPQYQKPSQPSPPNQ 180

Query: 581 -------------------------------------------NPPNQSYPQYQNPSQAN 640
                                                      NPPNQSYPQYQNPSQ N
Sbjct: 181 SYPQYQNPSQPNPPNQSYPQYQNPSHPNPPNQSYPQYQNPSQPNPPNQSYPQYQNPSQPN 240

Query: 641 PQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNAL 700
           P NFNYQQQRGPNQWNNQ QG+PQFGR E RNPQ ENSNQLNNQA IQR G QNQA NAL
Sbjct: 241 PPNFNYQQQRGPNQWNNQNQGHPQFGRSEHRNPQPENSNQLNNQAEIQRHGTQNQAPNAL 300

Query: 701 VSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQS 760
           VSPIDELRR CGEGK+KEAVELLK+GVKAD DCFHLLFELCGKSKS DNAKVVHDYFLQS
Sbjct: 301 VSPIDELRRFCGEGKLKEAVELLKQGVKADVDCFHLLFELCGKSKSLDNAKVVHDYFLQS 360

Query: 761 TCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELF 820
           TCRSDLQLNN+VLEMYGRCGSMSDARRVFD+MPDRNIDSWH MMKGYADNGLGDEGLELF
Sbjct: 361 TCRSDLQLNNEVLEMYGRCGSMSDARRVFDHMPDRNIDSWHLMMKGYADNGLGDEGLELF 420

Query: 821 ENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE 880
           ENMK LGLQP+SQTFL+VMSACASA+AVEEGFLYFESMKNDYHITPD +HYLGLLGILGE
Sbjct: 421 ENMKNLGLQPNSQTFLYVMSACASADAVEEGFLYFESMKNDYHITPDTNHYLGLLGILGE 480

Query: 881 PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKIS 940
           PGHI+EAFEYV+KLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKI 
Sbjct: 481 PGHIHEAFEYVEKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKIP 540

Query: 941 TPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEA 988
           TPPPKKRSAISML+GKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEA
Sbjct: 541 TPPPKKRSAISMLEGKNRIVEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEA 600

BLAST of CmUC02G032040 vs. TAIR 10
Match: AT2G15690.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 493.8 bits (1270), Expect = 3.2e-139
Identity = 303/612 (49.51%), Postives = 390/612 (63.73%), Query Frame = 0

Query: 401 MASLMAVRRARTP--ILISSFFKVRSPLP---SRFTFTCGNQTDTLIKALSTSAIRNDFS 460
           M+SLMA+R ART   + I S  ++RS  P   S+F F+ G      IK LSTSA  ND+ 
Sbjct: 1   MSSLMAIRCARTQNIVTIGSLLQLRSSFPRLSSQFHFS-GTLNSIPIKHLSTSAAANDY- 60

Query: 461 NFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYR 520
                          H+  Q GSPSQ   P   +  QSF + QN+    Q  P +  ++ 
Sbjct: 61  ---------------HQNPQSGSPSQHQRP---YPPQSF-DSQNQTNTNQRVPQSPNQWS 120

Query: 521 SQN--QSPQ-----PNPGFSRQ---GQSYSQPGNPNSWNPPNQSY----PQY--QNPSQA 580
           +Q+  Q PQ     P  G  R    GQ+  Q G  + +   N  +    PQY  Q P   
Sbjct: 121 TQHGGQIPQYGGQNPQHGGQRPPYGGQNPQQGGQMSQYGGHNPQHGGHRPQYGGQRPQYG 180

Query: 581 NPQNFNYQQQRGPNQWNNQKQGY-PQFGRPEQRNPQ-VENSNQLNNQAGIQRQGAQNQAL 640
            P N NYQ Q    Q +NQ Q Y PQ    +Q+ PQ   +SNQ  NQ            +
Sbjct: 181 GPGN-NYQNQN--VQQSNQSQYYTPQ----QQQQPQPPRSSNQSPNQ------------M 240

Query: 641 NALVSP--IDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHD 700
           N +  P  ++E+ RLC     K+A+ELL +G   D +CF LLFE C   KS +++K VHD
Sbjct: 241 NEVAPPPSVEEVMRLCQRRLYKDAIELLDKGAMPDRECFVLLFESCANLKSLEHSKKVHD 300

Query: 701 YFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDE 760
           +FLQS  R D +LNN V+ M+G C S++DA+RVFD+M D+++DSWH MM  Y+DNG+GD+
Sbjct: 301 HFLQSKFRGDPKLNNMVISMFGECSSITDAKRVFDHMVDKDMDSWHLMMCAYSDNGMGDD 360

Query: 761 GLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLL 820
            L LFE M + GL+P+ +TFL V  ACA+   +EE FL+F+SMKN++ I+P  +HYLG+L
Sbjct: 361 ALHLFEEMTKHGLKPNEETFLTVFLACATVGGIEEAFLHFDSMKNEHGISPKTEHYLGVL 420

Query: 821 GILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAV 880
           G+LG+ GH+ EA +Y++ LP EPT + WE ++NYAR+HGD+DLEDY EEL+VD+DP+KAV
Sbjct: 421 GVLGKCGHLVEAEQYIRDLPFEPTADFWEAMRNYARLHGDIDLEDYMEELMVDVDPSKAV 480

Query: 881 SNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHD 940
            NKI TPPPK     +M+  K+RI EFRN T YKD+ K  A K  K   YVPDTR+VLHD
Sbjct: 481 INKIPTPPPKSFKETNMVTSKSRILEFRNLTFYKDEAKEMAAK--KGVVYVPDTRFVLHD 540

Query: 941 IDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELI 988
           IDQEAKEQALLYHSERLAIAYG+I TP R  L IIKNLR+CGDCHN IKIMS+I+GR LI
Sbjct: 541 IDQEAKEQALLYHSERLAIAYGIICTPPRKTLTIIKNLRVCGDCHNFIKIMSKIIGRVLI 570

BLAST of CmUC02G032040 vs. TAIR 10
Match: AT4G32450.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 294.7 bits (753), Expect = 2.8e-79
Identity = 201/576 (34.90%), Postives = 299/576 (51.91%), Query Frame = 0

Query: 433 TCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSG-NFNN 492
           TC  +  +L   LST+A+R  F N       P++ +P        S   +   +G N   
Sbjct: 14  TCKLRYSSLFSYLSTAALRLGFEN-------PTNGNPMD-----NSSHHIGYVNGFNGGE 73

Query: 493 QSFSEFQNRDYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYP 552
           QS   FQ   Y Q  +P +                    GQ+ +     N +N  NQSY 
Sbjct: 74  QSLGGFQQNSYEQSLNPVS--------------------GQNPTNRFYQNGYN-RNQSYG 133

Query: 553 QYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQG 612
           ++      N +N N+Q   G + +     G PQ       + Q ++S             
Sbjct: 134 EHS--EIINQRNQNWQSSDGCSSYGTTGNGVPQENNTGGNHFQQDHSGH----------- 193

Query: 613 AQNQALNALVSPIDELRRLCGEGKMKEAVELLK----EGVKADADCFHLLFELCGKSKSF 672
                     S +DEL  +C EGK+K+AVE++K    EG   D      + +LCG +++ 
Sbjct: 194 ----------SSLDELDSICREGKVKKAVEIIKSWRNEGYVVDLPRLFWIAQLCGDAQAL 253

Query: 673 DNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGY 732
             AKVVH++   S   SD+   N ++EMY  CGS+ DA  VF+ MP+RN+++W  +++ +
Sbjct: 254 QEAKVVHEFITSSVGISDISAYNSIIEMYSGCGSVEDALTVFNSMPERNLETWCGVIRCF 313

Query: 733 ADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPD 792
           A NG G++ ++ F   KQ G +PD + F  +  AC     + EG L+FESM  +Y I P 
Sbjct: 314 AKNGQGEDAIDTFSRFKQEGNKPDGEMFKEIFFACGVLGDMNEGLLHFESMYKEYGIIPC 373

Query: 793 MDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIV 852
           M+HY+ L+ +L EPG+++EA  +V+   MEP V++WETL N +R+HGD+ L D  ++++ 
Sbjct: 374 MEHYVSLVKMLAEPGYLDEALRFVES--MEPNVDLWETLMNLSRVHGDLILGDRCQDMVE 433

Query: 853 DLDPTKAVSNKISTPPPKKRSAI------SMLDGKN---------RISEFRNPTLYKDDE 912
            LD ++      +   P K S +       M  G N          IS   N  LY    
Sbjct: 434 QLDASRLNKESKAGLVPVKSSDLVKEKLQRMAKGPNYGIRYMAAGDISRPENRELYM--- 493

Query: 913 KLKALKA-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIK 972
            LK+LK  M E GYVP ++  LHD+DQE+K++ L  H+ER A     + TPAR+ +R++K
Sbjct: 494 ALKSLKEHMIEIGYVPLSKLALHDVDQESKDENLFNHNERFAFISTFLDTPARSLIRVMK 528

Query: 973 NLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD 988
           NLR+C DCHNA+K+MS+IVGRELI RD KRFHH KD
Sbjct: 554 NLRVCADCHNALKLMSKIVGRELISRDAKRFHHMKD 528

BLAST of CmUC02G032040 vs. TAIR 10
Match: AT2G25580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 292.0 bits (746), Expect = 1.8e-78
Identity = 156/386 (40.41%), Postives = 231/386 (59.84%), Query Frame = 0

Query: 624 IDELRRLCGEGKMKEAVE----LLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQ 683
           I+E    C  GK+K+A+     L       D      L ++CG+++    AK VH     
Sbjct: 223 IEEYDAFCKHGKVKKALYTIDILASMNYVVDLSRLLRLAKICGEAEGLQEAKTVHGKISA 282

Query: 684 STCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLEL 743
           S    DL  N+ +LEMY  CG  ++A  VF+ M ++N+++W  +++ +A NG G++ +++
Sbjct: 283 SVSHLDLSSNHVLLEMYSNCGLANEAASVFEKMSEKNLETWCIIIRCFAKNGFGEDAIDM 342

Query: 744 FENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILG 803
           F   K+ G  PD Q F  +  AC     V+EG L+FESM  DY I P ++ Y+ L+ +  
Sbjct: 343 FSRFKEEGNIPDGQLFRGIFYACGMLGDVDEGLLHFESMSRDYGIAPSIEDYVSLVEMYA 402

Query: 804 EPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTK------ 863
            PG ++EA E+V+++PMEP V+VWETL N +R+HG+++L DY  E++  LDPT+      
Sbjct: 403 LPGFLDEALEFVERMPMEPNVDVWETLMNLSRVHGNLELGDYCAEVVEFLDPTRLNKQSR 462

Query: 864 -----AVSNKISTPPPKKRSAISMLDG-KNRISEFR--NPTLYKDDEKLKALKAMK---- 923
                  ++ +     KKRS I  L G K+ + EFR  +  L ++DE  + L+ +K    
Sbjct: 463 EGFIPVKASDVEKESLKKRSGI--LHGVKSSMQEFRAGDTNLPENDELFQLLRNLKMHMV 522

Query: 924 EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHN 983
           E GYV +TR  LHDIDQE+KE  LL HSER+A A  ++++  R P  +IKNLR+C DCHN
Sbjct: 523 EVGYVAETRMALHDIDQESKETLLLGHSERIAFARAVLNSAPRKPFTVIKNLRVCVDCHN 582

Query: 984 AIKIMSRIVGRELIVRDNKRFHHFKD 988
           A+KIMS IVGRE+I RD KRFH  K+
Sbjct: 583 ALKIMSDIVGREVITRDIKRFHQMKN 606

BLAST of CmUC02G032040 vs. TAIR 10
Match: AT3G24000.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 265.4 bits (677), Expect = 1.8e-70
Identity = 140/398 (35.18%), Postives = 230/398 (57.79%), Query Frame = 0

Query: 618 NALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYF 677
           NAL++     RR   E  ++    +L++G +     +  LF  C  +   +  K VH Y 
Sbjct: 231 NALIA--GHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYM 290

Query: 678 LQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGL 737
           ++S  +      N +L+MY + GS+ DAR++FD +  R++ SW+ ++  YA +G G E +
Sbjct: 291 IKSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAV 350

Query: 738 ELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGI 797
             FE M+++G++P+  +FL V++AC+ +  ++EG+ Y+E MK D  I P+  HY+ ++ +
Sbjct: 351 WWFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKD-GIVPEAWHYVTVVDL 410

Query: 798 LGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDP------ 857
           LG  G +N A  +++++P+EPT  +W+ L N  R+H + +L  YA E + +LDP      
Sbjct: 411 LGRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPH 470

Query: 858 ---------------TKAVSNKISTPPPKKRSAISMLDGKNRISEF-RNPTLYKDDEKL- 917
                             V  K+     KK  A S ++ +N I  F  N   +   E++ 
Sbjct: 471 VILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIA 530

Query: 918 ----KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRII 977
               + L  +KE GYVPDT +V+  +DQ+ +E  L YHSE++A+A+ L++TP  + + I 
Sbjct: 531 RKWEEVLAKIKELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIK 590

Query: 978 KNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDA 989
           KN+R+CGDCH AIK+ S++VGRE+IVRD  RFHHFKDA
Sbjct: 591 KNIRVCGDCHTAIKLASKVVGREIIVRDTNRFHHFKDA 625

BLAST of CmUC02G032040 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 261.9 bits (668), Expect = 2.0e-69
Identity = 139/386 (36.01%), Postives = 218/386 (56.48%), Query Frame = 0

Query: 634 GKMKEAVELLKE----GVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLN 693
           GK +EA+ L  E    G+K D      L   C K  +    K VH Y ++     +L  +
Sbjct: 201 GKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSS 260

Query: 694 NKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQL-GL 753
           N +L++Y RCG + +A+ +FD M D+N  SW  ++ G A NG G E +ELF+ M+   GL
Sbjct: 261 NVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGL 320

Query: 754 QPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAF 813
            P   TF+ ++ AC+    V+EGF YF  M+ +Y I P ++H+  ++ +L   G + +A+
Sbjct: 321 LPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAY 380

Query: 814 EYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDP----------------- 873
           EY+K +PM+P V +W TL     +HGD DL ++A   I+ L+P                 
Sbjct: 381 EYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQ 440

Query: 874 ----TKAVSNKISTPPPKKRSAISMLDGKNRISEF-----RNPTLYKDDEKLKALKA-MK 933
                + +  ++     KK    S+++  NR+ EF      +P       KLK +   ++
Sbjct: 441 RWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLR 500

Query: 934 EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHN 988
            +GYVP    V  D+++E KE A++YHSE++AIA+ LISTP R+P+ ++KNLR+C DCH 
Sbjct: 501 SEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHL 560

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG6571981.10.0e+0068.43Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_038887834.10.0e+0093.87pentatricopeptide repeat-containing protein At2g15690, mitochondrial [Benincasa ... [more]
KAG7011659.16.6e-30489.95Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022952757.13.3e-30389.61pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucur... [more]
XP_022972422.12.1e-30289.78pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucur... [more]
Match NameE-valueIdentityDescription
Q9ZQE54.5e-13849.51Pentatricopeptide repeat-containing protein At2g15690, mitochondrial OS=Arabidop... [more]
Q9SUU74.0e-7834.90Pentatricopeptide repeat-containing protein At4g32450, mitochondrial OS=Arabidop... [more]
Q680H32.6e-7740.41Pentatricopeptide repeat-containing protein At2g25580 OS=Arabidopsis thaliana OX... [more]
Q9LIQ79.9e-6935.01Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop... [more]
A8MQA32.9e-6836.01Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1GL941.6e-30389.61pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like OS=Cuc... [more]
A0A6J1I4R81.0e-30289.78pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like OS=Cuc... [more]
A0A1S3BZP99.9e-29884.98pentatricopeptide repeat-containing protein At2g15690 OS=Cucumis melo OX=3656 GN... [more]
A0A6J1C4C35.4e-29686.55pentatricopeptide repeat-containing protein At2g15690 OS=Momordica charantia OX=... [more]
A0A5D3C4913.6e-29278.86Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT2G15690.13.2e-13949.51Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G32450.12.8e-7934.90Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G25580.11.8e-7840.41Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G24000.11.8e-7035.18Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.12.0e-6936.01Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 60..87
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 279..372
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 505..597
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 341..369
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 88..136
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 467..493
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 305..334
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 452..493
NoneNo IPR availablePANTHERPTHR24015:SF1958PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 402..987
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 402..987
IPR003340B3 DNA binding domainSMARTSM01019B3_2coord: 182..273
e-value: 8.6E-20
score: 81.8
IPR003340B3 DNA binding domainPFAMPF02362B3coord: 182..272
e-value: 2.5E-14
score: 53.0
IPR003340B3 DNA binding domainPROSITEPS50863B3coord: 182..273
score: 11.843089
IPR003340B3 DNA binding domainCDDcd10017B3_DNAcoord: 180..271
e-value: 5.08712E-26
score: 100.865
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 892..986
e-value: 7.0E-33
score: 113.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 625..781
e-value: 5.5E-29
score: 103.5
IPR015300DNA-binding pseudobarrel domain superfamilyGENE3D2.40.330.10coord: 156..274
e-value: 2.2E-25
score: 90.8
IPR015300DNA-binding pseudobarrel domain superfamilySUPERFAMILY101936DNA-binding pseudobarrel domaincoord: 175..272
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 719..752
e-value: 4.2E-5
score: 21.5
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 716..763
e-value: 2.3E-8
score: 34.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 716..750
score: 11.761533

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmUC02G032040.1CmUC02G032040.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003677 DNA binding
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding