CmoCh20G007310 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh20G007310
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Pentatricopeptide (PPR) repeat protein-like
Location: Cmo_Chr20: 3623481 .. 3630422 (-)
RNA-Seq Expression: CmoCh20G007310
Synteny: CmoCh20G007310
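
The Location field packs chromosome, start, end, and strand into one string. A minimal sketch of how it could be unpacked is shown here; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its four parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location("Cmo_Chr20: 3623481 .. 3630422 (-)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, strand, span)
```

The `(-)` strand flag indicates that the feature is annotated on the reverse strand of Cmo_Chr20.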
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGAACTTCTTATAATAACAAGAATGGAGCTGTTGCTTTGCTTTTTCTACTAAGAGCAACAGAGATGGAGAAGATGATGAGTTTAAACCATCTGTTCGTGACGACGTTCATCGGAAGCTTGTCGATGTTCATGGTCATTCCGACCATTGTTGACTTAACAATGGAGTTTGTGTGTCCTCACCAGGATCACTGTTCCATCGCCATTTATCTCTCTGGTGTCCAGCAGGCGGTATGTGCCTAATACCATCTACTTCCATTTGCTTTAACAACACCATGAACTGTTTAAGAACGTTTCTTTTTTTCAACTTTTTGATACTTTTCTTCAAGCGTTTTGATCTTTAAACCGACCCCGCATCGGGATTTTTATGATTTAAAAGATAGTCGAACTAGCTCGATCAAAAAGTTGGAAATTTTAACCTTCCCCTGCATGTTCTACTCGAGAAAAAGGGAAGATGATGGTGTTTTGTCTATAACAAAGGACTAAAATAGTCAAGGAGTTGTAAAACTGCAGATTGTAGGGCTTGGAGCAGTGGTGATAACACCAGTAATTGGGAATCTATCAGACAGATACGGAAGGAAAGCAATGCTGACTATCCCAATGACGTTTTCAATCATACCGCTCGGTCGGTTTAGTGTTAAATCTCAGTTCGTTTCATCATAAAGTTAAATCATTAGCTATGATACAAGATTTGAATGATTTTGCAGCCATAATGGGTTATAGAAGAACTACCAACTTCTTCTATGCATTTTACATCATGAAAACTCTCACAGACATGGTTTCAGAAGGCACTACAGTTTCACTTGCTCTTGCTTATGTGGTATTTAGCTTCACTTAAACTATACTATACCAAAAACTACTTCTTTTGATTCTCCTTTGTTTTATTCTCAGGCAGACAAAACTTCAGAGGATCAGAGGATCTCGGCGTTCGGAATCCTATCTGGGGTCAGATCTGTAGGTTATGTGTGTGGAACCTTTTTGGCTCGGCTCCTTTCAACTGCTACAGTGTTTCAGGTTCTCCATGGCTTCTTTCTGCCAACATAACAAAGTATAAGTATGCCCAAATTGTGCTTTCTTTGTATCCTTGTGCAGGTGGCTGCTTTCATGTCAGTGCTTGCGGTAGTGCACATGAGGACTTTTCTCAAGGAAAGTATTCCAGATCAGAATGAGTTGACTCAACCGATCTTCGACGAAAACTTAAGTGGTGGTGATGATGAAAATGGACCAGAATTGCCTACAAGAACTCAGTTATCGATAGGGATGTCTTCTATAAGAGACGTTATCTCCTTAATCACGAGTAGGTAATTTCGATGCTCGTTTCCAATGGTTTCTGCCAGTTCTTAAGTGATTTTCTCTTCACTGCTAGTTTTCTTCCATTTTGTTCTTTTTGATGTTTTTCAATGACCAACTCCAAATGAAACAATACTCTATAGCACAACATTTTCACAAGCAGCAAGAGTTTCCTTCTTCAATAGTCTAGCAGAGAAGGGGATGCAAGCATCACTAGCGGTACAATTCTCATCAAATAGAAAAATTTTAGCTCGTTTTCGAAACCATCTGTTACCATAAACACATCACCTTCAGGTTTCAACTTCAAACTTAGATCTATTCCTTTGCTACAGTATTTCTTAAAGGCCCGTTTTCACTTCGACAAAAACCAGTTTGCTGACTTGATGATAATTGAGGGGGTTGCCGGGGCCGTTTCACTGGTATCCTTTACTATTATGAGCTACTACAACCCAGGGTATCTTTTGCAAGATTGATTATAGGTAGGTACTCTTTTGTTTTGAACTTATACACTCTTGCAGTTTCTTTTGATGCCCGCTTTGGCACTAGCTATAAGACAGGAGAGGTTGCTATCAATAGGGCTGTGGGCGAGCATTATAAATGTAGGCATCTAACTCCCATCTCTTATATGTTTTATAATGCCATAAGCATGCCCCATTTGCCTCTCCCTAACCCCATGATGCTGATGATTACCCACTTCAATGCAGATACTACTT
AACAGCATAGCTTGGTCAGTTTGGGTAATTGATCTCCACTTAAGTAGCTAGATTCTTATCCTGTTCAAAAAATAAATAAAAAAATAAATAAATAGCCCTTGTTCTGCTCGTAATCGTACAGGCACTCGTTTGATTCTGTTCGGTCTCCTTGACCTGATAGCTATTCTCTATCCAACCTATCAACTTTTGTTTCTTTAATAACCACAATGATCTTAACATTTGTAGGTTCCTTATGCTTTAAGAGCATTTACAATTTTTACAATTCTGGTCAGTCCAATAGTAAGTTTTCCTTGCTTTCTTCAAAGCTTGTCTTTGCTCATGAAAAAGTGCCGTGGCATTCACACGAATCTATGGCTTCAACAGATATTCAACATTGCATCGAGTCAAGTTGGACCGAGTGAGCAGGTCAGTACACTAAACAGTAAAAAAGAAACTTACTAATGTTTTGTTGGTTGAACCTCTGAAAGTAAAATAAATTTCATCTCAAGGGGAAGGCCCAAGGATACATCTCAGGCATTAATTCCCTTGCGAACATTGCTTCTCCATTACTTTTCAGTCCCTTGATAGGTATATCAATTGTCAAGATAAGTGTAACTTTTTTTCACCACAAATGGAACTCCATTTGTTTGACTTCATCCATCAATCTATTCAGCTCTTTTCCTCTCCAAGGATGCACCATTTGACTTCCCCGGCTTCGGTATTTTGTGCATTGGGCTCGCTTCGGTAAGGATACAAAATACCTCCTATTGCCCTCTTCCTATGTTTAGATTGCAACTCTATAAAGCCAGACAATATTCCAGCAAAGAACATTGAGAAACCAAGATAGAAACAACAGTGTACATCATTCTTTCTTGGAAATACTCAAAATTGAACTGTTCATAGATTCCTATACTCCTAAGGAAAAAAAAACTGCATCGGTTGAACACATTCTAGAACTGACTGTAACAACCCAAGCCCACCGCTAGCAGATATTATTTGCTTTGGTCCGTTACGTATCGCCATCAGCCTCACGGTTTTAAAATGCGTCTACTAGAGAGAGGTTTCCACACCCTTATAAGAAATGCTTCGTTCCCCTCTCCAACTGACAATTGGGAGGTCTCACATTGATTGGAGAAGGGAATGAGTGTCAACGAGGATGCTAGACCCTGGGTGGATTGTGAGATCCCACATCGGTTGGGGAGGAGAACGAAACATTCTTCCACCCCTAGTAGACGCGTTCTAAAACCTTGAGGGGAAGCCTAGAAGGAAAAGCTCAAAGAGAACAATATCTGCTGGCTTTGGGCTTGGGCTGTTACAATAACACACCATTTTGATTTGAATTTGAAGCCTTAGATATCAATTTCTCAACACCCCCATCCCCAACTCCTTGATCCGATTGAAAACTGCTGTTTATTTCTAATATGGCTATGAACTAACCCCAAATACTGGCGAAGTTCTAAAAGGATTGATAGCTATTGAGTTAGTTCTTTAAGATTGTTCTCATGTGTTCTGTTCCCTTAACAACATCTTTCTATTTTGCAGTTGATTGGCTTCACTCTAAGCCTGATGATCCGTGTAGACCCGTTCATTTTCATTCAGAAAATCAAAAACTTAGTGTAGAGCTTGTTCATAACAGCCAGACAGGGGGATTTGGAAATCCCTAAGTAAAATCCTTATTGGTAGATACTGTGAATTTGAAGCAACACAGAGACAACAGATACTCAAACTCAAAGCAACTACAGCAGAAAGTGCGTCGTAAACTAAACATTCAGACGTACCTGTAGCGTGTGTATATGTATTGGAGCCCTTCGATGGTTTTCTTTCAGAACCCTAAACAGAACTCAGTTCAGTTTTTGTACATATTTTGATCTTGTAAATCCTCAGAGCCCTCAAGATTGTTGTGCGTTCTTCACCATTGATGAATGCAATAAACAAAATGCTTCGATATAATGCACAAATCAATGTTGTACAGGTCCCGATTGTGTCGAAATGGAGACCATGTTTTCTTCTGCAAATCAGAT
ATAAAAGCTCGAAACAGAGAGAGAAATACAGACCTTAGAATAGAACTTACATTGCCCAAGAATTTGGATGAGACATTGGAGTTTATGGCTGTGTGAACTGTGAAGGCTTTGCTTCTTTTAGTTTTTATATTCTATTACTTAAGAAATGAGTACAATGATAATTGAGGGACTTTGAGACATTCGCTTCCGCCACACCGGACTACGTACATGTTAGGGTGTTAGTGGATATTGTCCTCTTTGGGCTTTCCCTGGCTTCTCATCAAGGCTTTCAAATGCGTCTGCTAGGGAAGATTTCCACACTATTATAAATGGTGGTTTATTCTCCTCCTCAACTAATGTGAGTTTTCCCTTTTGTGCTTCCCCTCAAGACTTTAAAACGCGTCTGCTAAGTGAAGGTTTCCACACCCTTATAAATGGGCCGTTACAAACCACTATTGATAAGGGGGTGGAAACCTTCCCCTAGCATACACGTTTTAAAGTCTTGAGGGGAAGCTCGAAAAGGAAAGCGCAAAGAGAACAATATCTGCTAGCGGTGGATCTGGCCGTTACAAAGGGACTTCGGAACATTGGCTTCCGCCAACAGTGGACTGTTAAGGATTGCATCCCGCACATGTTTACATTGTAGCCTGCAACTGATTCAGCTTCCGCTTCAGGGAGATCAAATGACTTTGTTAGTTGTTGTTCTGATCGAGCACTCTCTAAAGAAGACTGGCTTTTGGCCCATTCGTGAAAGCTTGTAAAGTTAGACGGTGAAACTTCTGTTAGATGAACATGACTCTCCCACTGGTATGATATTGTCCACTTTGAGCATCTCATGACTTAGTTTTGAAATTCCCAAAAAGCCTCGACAATGAGGAGAATACTCTTTTATCATAAACTCATGATCATTACTCTTTTATGATAAACTCAGGATCATTCGGAAATTAGGCCGTGGAACTTTCATCAACCATTAGTTAACTACTTGAACTGACCTAGTGGAGGGGTATCTGATTTTGCAGATGCGATTTACTTTTGGCGCCTAATGGAGCTCCGCTTTTGCCCGCCGCCGTACGTGATTGGGGATAGCGTTCGACTCTTCTCAAAGGCACCTAAACGCTACGACGGCTTCTGCAGTTACCATTTCCGGCCAAATCTGCAGGTCAAATGTGCTACACTCACCAAACAAAGTCACCGATTCCTCTCTACTTTGGCCACAACCGCCGCCGCCGGCGACCATTCAGCTACCAATCGTTTGATTCGGAAGTTTGTTGCGAGTTCTCCGAAATCTATTACTCTCAATGTCCTCTCCGATATCCTTTCCTCTCGCACGGCTCAACCTGGACTCTGCTCTGTTGCTCTCACCGTAAGTAGCGTTTTCTTTTTCTTTTCCCCCTCATTATCGCATCATTCTGCGAGAAGATCGAGATGAAAATTATATTTCCAGGTCTTAGATTTAGGTTTCCTTCTCGTCTACGTAATGGTAAAGTTGAGAACTATTGCGAAATTTGGTCCTTGATCTCTTCTGATTCTCTCGTTAAACTGTTCTATCAATTTCCTTATGTTCCTACAGTTATATTCCAGAATTACTGAGACGTCCTGGTTCACATGGAATTCCAAGCTAGTTGCTGACCTTGTTGCCTTCCTCGATAAAAATGGACAGATTGTTGACTCGGAAACCCTAATTTCCGAGGCAATTTCGAAATTAGGGATTCAAGAAAGAAAGCTTGTAAACTTCTACTGTCAGCTGGTTGAATCTCAATCCAAACACGGTTCAGAAAGAGGATTTGGTATCGCATATGCTTGTCTTCTTGAGCTTCTTTATAAGTCGTCCTCGATTTATGTGAAACGTCGAGCTTATGAATCAATGGTTACTGGTTTGTGCTCCATGAAAAGGCCTCAGGAAGCTGAGAGTTTGGTAAAAGAAATGAAAGCCAAAGGATTTGCTCCTGCTGCATTTGAATACAGGTCCATTATTTACGCATATGGAACATTGGGGTTGTTTGAAGATATGAAGAGGAGTT
TGGAAGAGATGAAGAACGATGATATTGCTTTAGACACAGTTTGTTCTAACATGGTGCTTTCATCATATGGAGTTCATAATAAGCTTGCAGATATGGTTCTATGGCTTCAAATAATGAAAACTTCTGCTCTTCCTTTCTCGGTTCGAACGTACAATTCTGTCTTGAATTCATGTCCGAAGATTACGTCGATGCTACAAGACAAGAGCGACGATCTTCCAGTTTTGATTGAAGACTTGATCACGGTTCTGGACGGGGACGAGGCTTTGTTGGTTGAAGAGTTGGTTGGTTCATCTGTTTTGAAAGAAGTAATGGTGTGGGATGCAATGGAGATGAAGTTGGATTTGCATGGAGCACATGTTGGTGCAGCTTATGTGATCATTTTGGAGTGGATGAAGGAGATGAGACTGAAGTTTGAGGATGAGAGCTGTGTGATTCCAGCACAAGTTACAGTGATTTGTGGATCTGGAAACCATAGTATTGTTAGAGGAGAGTCTCCTGTAAAAGCTCTAATTAGAGAGATTATGTTTCGGACACAAAGTCCGCTGAGAATTGATCGCAAGAACACTGGTTGCTTTGTCGCCAAAGGAAAAGCGGTAAAGAATTGGGTATGTTTGAGGTGAATATAGAGAGATGTTGTCTTCTTTGGACTTTTTCTTTTGGGTTTTCCATCAAAGTTTCTAAAACGCGCTAGGAAGAGGTTTCACACTCTCATATAGAAGGTAGGAAGAGGTTGTTGAACACAACTCACTGTGGGAAACACTCGCTCTCTTTATTAAGACCAATCGAGAAGAGAATACAAGACACTCTGTAGAATACTTCTGCTTTTTATTGTTTTTAGATTTCTTGGATGAATAACCTAGGTAGGGTGGGGGTATTTATACTAAGAGTAAACAATCTATATTTAACCAATCTAAATCTAGACCTAACCGATTTGAATCTAGA

mRNA sequence

TGAACTTCTTATAATAACAAGAATGGAGCTGTTGCTTTGCTTTTTCTACTAAGAGCAACAGAGATGGAGAAGATGATGAGTTTAAACCATCTGTTCGTGACGACGTTCATCGGAAGCTTGTCGATGTTCATGGTCATTCCGACCATTGTTGACTTAACAATGGAGTTTGTGTGTCCTCACCAGGATCACTGTTCCATCGCCATTTATCTCTCTGGTGTCCAGCAGGCGATTGTAGGGCTTGGAGCAGTGGTGATAACACCAGTAATTGGGAATCTATCAGACAGATACGGAAGGAAAGCAATGCTGACTATCCCAATGACGTTTTCAATCATACCGCTCGCCATAATGGGTTATAGAAGAACTACCAACTTCTTCTATGCATTTTACATCATGAAAACTCTCACAGACATGGTTTCAGAAGGCACTACAGTTTCACTTGCTCTTGCTTATGTGGCAGACAAAACTTCAGAGGATCAGAGGATCTCGGCGTTCGGAATCCTATCTGGGGTCAGATCTGTAGGTTATGTGTGTGGAACCTTTTTGGCTCGGCTCCTTTCAACTGCTACAGTGTTTCAGGTGGCTGCTTTCATGTCAGTGCTTGCGGTAGTGCACATGAGGACTTTTCTCAAGGAAAGTATTCCAGATCAGAATGAGTTGACTCAACCGATCTTCGACGAAAACTTAAGTGGTGGTGATGATGAAAATGGACCAGAATTGCCTACAAGAACTCAGTTATCGATAGGGATGTCTTCTATAAGAGACGTTATCTCCTTAATCACGAGTAGCACAACATTTTCACAAGCAGCAAGAGTTTCCTTCTTCAATAGTCTAGCAGAGAAGGGGATGCAAGCATCACTAGCGTATTTCTTAAAGGCCCGTTTTCACTTCGACAAAAACCAGTTTGCTGACTTGATGATAATTGAGGGGGTTGCCGGGGCCGTTTCACTGTTTCTTTTGATGCCCGCTTTGGCACTAGCTATAAGACAGGAGAGGTTGCTATCAATAGGGCTGTGGGCGAGCATTATAAATGTTCCTTATGCTTTAAGAGCATTTACAATTTTTACAATTCTGGTCAGTCCAATAATATTCAACATTGCATCGAGTCAAGTTGGACCGAGTGAGCAGGGGAAGGCCCAAGGATACATCTCAGGCATTAATTCCCTTGCGAACATTGCTTCTCCATTACTTTTCAGTCCCTTGATAGCTCTTTTCCTCTCCAAGGATGCACCATTTGACTTCCCCGGCTTCGGTATTTTGTGCATTGGGCTCGCTTCGTTGATTGGCTTCACTCTAAGCCTGATGATCCGTGTAGACCCGTTCATTTTCATTCAGAAAATCAAAAACTTAGTATACTGTGAATTTGAAGCAACACAGAGACAACAGATACTCAAACTCAAAGCAACTACAGCAGAAAATGCGATTTACTTTTGGCGCCTAATGGAGCTCCGCTTTTGCCCGCCGCCGTACGTGATTGGGGATAGCGTTCGACTCTTCTCAAAGGCACCTAAACGCTACGACGGCTTCTGCAGTTACCATTTCCGGCCAAATCTGCAGGTCAAATGTGCTACACTCACCAAACAAAGTCACCGATTCCTCTCTACTTTGGCCACAACCGCCGCCGCCGGCGACCATTCAGCTACCAATCGTTTGATTCGGAAGTTTGTTGCGAGTTCTCCGAAATCTATTACTCTCAATGTCCTCTCCGATATCCTTTCCTCTCGCACGGCTCAACCTGGACTCTGCTCTGTTGCTCTCACCTTATATTCCAGAATTACTGAGACGTCCTGGTTCACATGGAATTCCAAGCTAGTTGCTGACCTTGTTGCCTTCCTCGATAAAAATGGACAGATTGTTGACTCGGAAACCCTAATTTCCGAGGCAATTTCGAAATTAGGGATTCAAGAAAGAAAGCTTGTAAACTTCTACTGTCAGCTGGTTGAATCTCAATCCAAACACGGTTCAGAAAGAGGATTTGGTATCGCATATGCTTGTCTTCTTGA
GCTTCTTTATAAGTCGTCCTCGATTTATGTGAAACGTCGAGCTTATGAATCAATGGTTACTGGTTTGTGCTCCATGAAAAGGCCTCAGGAAGCTGAGAGTTTGGTAAAAGAAATGAAAGCCAAAGGATTTGCTCCTGCTGCATTTGAATACAGGTCCATTATTTACGCATATGGAACATTGGGGTTGTTTGAAGATATGAAGAGGAGTTTGGAAGAGATGAAGAACGATGATATTGCTTTAGACACAGTTTGTTCTAACATGGTGCTTTCATCATATGGAGTTCATAATAAGCTTGCAGATATGGTTCTATGGCTTCAAATAATGAAAACTTCTGCTCTTCCTTTCTCGGTTCGAACGTACAATTCTGTCTTGAATTCATGTCCGAAGATTACGTCGATGCTACAAGACAAGAGCGACGATCTTCCAGTTTTGATTGAAGACTTGATCACGGTTCTGGACGGGGACGAGGCTTTGTTGGTTGAAGAGTTGGTTGGTTCATCTGTTTTGAAAGAAGTAATGGTGTGGGATGCAATGGAGATGAAGTTGGATTTGCATGGAGCACATGTTGGTGCAGCTTATGTGATCATTTTGGAGTGGATGAAGGAGATGAGACTGAAGTTTGAGGATGAGAGCTGTGTGATTCCAGCACAAGTTACAGTGATTTGTGGATCTGGAAACCATAGTATTGTTAGAGGAGAGTCTCCTGTAAAAGCTCTAATTAGAGAGATTATGTTTCGGACACAAAGTCCGCTGAGAATTGATCGCAAGAACACTGGTTGCTTTGTCGCCAAAGGAAAAGCGGTAAAGAATTGGGTATGTTTGAGGTGAATATAGAGAGATGTTGTCTTCTTTGGACTTTTTCTTTTGGGTTTTCCATCAAAGTTTCTAAAACGCGCTAGGAAGAGGTTTCACACTCTCATATAGAAGGTAGGAAGAGGTTGTTGAACACAACTCACTGTGGGAAACACTCGCTCTCTTTATTAAGACCAATCGAGAAGAGAATACAAGACACTCTGTAGAATACTTCTGCTTTTTATTGTTTTTAGATTTCTTGGATGAATAACCTAGGTAGGGTGGGGGTATTTATACTAAGAGTAAACAATCTATATTTAACCAATCTAAATCTAGACCTAACCGATTTGAATCTAGA

Coding sequence (CDS)

ATGGAGAAGATGATGAGTTTAAACCATCTGTTCGTGACGACGTTCATCGGAAGCTTGTCGATGTTCATGGTCATTCCGACCATTGTTGACTTAACAATGGAGTTTGTGTGTCCTCACCAGGATCACTGTTCCATCGCCATTTATCTCTCTGGTGTCCAGCAGGCGATTGTAGGGCTTGGAGCAGTGGTGATAACACCAGTAATTGGGAATCTATCAGACAGATACGGAAGGAAAGCAATGCTGACTATCCCAATGACGTTTTCAATCATACCGCTCGCCATAATGGGTTATAGAAGAACTACCAACTTCTTCTATGCATTTTACATCATGAAAACTCTCACAGACATGGTTTCAGAAGGCACTACAGTTTCACTTGCTCTTGCTTATGTGGCAGACAAAACTTCAGAGGATCAGAGGATCTCGGCGTTCGGAATCCTATCTGGGGTCAGATCTGTAGGTTATGTGTGTGGAACCTTTTTGGCTCGGCTCCTTTCAACTGCTACAGTGTTTCAGGTGGCTGCTTTCATGTCAGTGCTTGCGGTAGTGCACATGAGGACTTTTCTCAAGGAAAGTATTCCAGATCAGAATGAGTTGACTCAACCGATCTTCGACGAAAACTTAAGTGGTGGTGATGATGAAAATGGACCAGAATTGCCTACAAGAACTCAGTTATCGATAGGGATGTCTTCTATAAGAGACGTTATCTCCTTAATCACGAGTAGCACAACATTTTCACAAGCAGCAAGAGTTTCCTTCTTCAATAGTCTAGCAGAGAAGGGGATGCAAGCATCACTAGCGTATTTCTTAAAGGCCCGTTTTCACTTCGACAAAAACCAGTTTGCTGACTTGATGATAATTGAGGGGGTTGCCGGGGCCGTTTCACTGTTTCTTTTGATGCCCGCTTTGGCACTAGCTATAAGACAGGAGAGGTTGCTATCAATAGGGCTGTGGGCGAGCATTATAAATGTTCCTTATGCTTTAAGAGCATTTACAATTTTTACAATTCTGGTCAGTCCAATAATATTCAACATTGCATCGAGTCAAGTTGGACCGAGTGAGCAGGGGAAGGCCCAAGGATACATCTCAGGCATTAATTCCCTTGCGAACATTGCTTCTCCATTACTTTTCAGTCCCTTGATAGCTCTTTTCCTCTCCAAGGATGCACCATTTGACTTCCCCGGCTTCGGTATTTTGTGCATTGGGCTCGCTTCGTTGATTGGCTTCACTCTAAGCCTGATGATCCGTGTAGACCCGTTCATTTTCATTCAGAAAATCAAAAACTTAGTATACTGTGAATTTGAAGCAACACAGAGACAACAGATACTCAAACTCAAAGCAACTACAGCAGAAAATGCGATTTACTTTTGGCGCCTAATGGAGCTCCGCTTTTGCCCGCCGCCGTACGTGATTGGGGATAGCGTTCGACTCTTCTCAAAGGCACCTAAACGCTACGACGGCTTCTGCAGTTACCATTTCCGGCCAAATCTGCAGGTCAAATGTGCTACACTCACCAAACAAAGTCACCGATTCCTCTCTACTTTGGCCACAACCGCCGCCGCCGGCGACCATTCAGCTACCAATCGTTTGATTCGGAAGTTTGTTGCGAGTTCTCCGAAATCTATTACTCTCAATGTCCTCTCCGATATCCTTTCCTCTCGCACGGCTCAACCTGGACTCTGCTCTGTTGCTCTCACCTTATATTCCAGAATTACTGAGACGTCCTGGTTCACATGGAATTCCAAGCTAGTTGCTGACCTTGTTGCCTTCCTCGATAAAAATGGACAGATTGTTGACTCGGAAACCCTAATTTCCGAGGCAATTTCGAAATTAGGGATTCAAGAAAGAAAGCTTGTAAACTTCTACTGTCAGCTGGTTGAATCTCAATCCAAACACGGTTCAGAAAGAGGATTTGGTATCGCATATGCTTGTCTTCTTGAGCTTCTTTATAAGTCGTCCTCGATTTATGTGAAACGTCGAGCTTATGAATCAATGGTTACTGG
TTTGTGCTCCATGAAAAGGCCTCAGGAAGCTGAGAGTTTGGTAAAAGAAATGAAAGCCAAAGGATTTGCTCCTGCTGCATTTGAATACAGGTCCATTATTTACGCATATGGAACATTGGGGTTGTTTGAAGATATGAAGAGGAGTTTGGAAGAGATGAAGAACGATGATATTGCTTTAGACACAGTTTGTTCTAACATGGTGCTTTCATCATATGGAGTTCATAATAAGCTTGCAGATATGGTTCTATGGCTTCAAATAATGAAAACTTCTGCTCTTCCTTTCTCGGTTCGAACGTACAATTCTGTCTTGAATTCATGTCCGAAGATTACGTCGATGCTACAAGACAAGAGCGACGATCTTCCAGTTTTGATTGAAGACTTGATCACGGTTCTGGACGGGGACGAGGCTTTGTTGGTTGAAGAGTTGGTTGGTTCATCTGTTTTGAAAGAAGTAATGGTGTGGGATGCAATGGAGATGAAGTTGGATTTGCATGGAGCACATGTTGGTGCAGCTTATGTGATCATTTTGGAGTGGATGAAGGAGATGAGACTGAAGTTTGAGGATGAGAGCTGTGTGATTCCAGCACAAGTTACAGTGATTTGTGGATCTGGAAACCATAGTATTGTTAGAGGAGAGTCTCCTGTAAAAGCTCTAATTAGAGAGATTATGTTTCGGACACAAAGTCCGCTGAGAATTGATCGCAAGAACACTGGTTGCTTTGTCGCCAAAGGAAAAGCGGTAAAGAATTGGGTATGTTTGAGGTGA

Protein sequence

MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMPALALAIRQERLLSIGLWASIINVPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLASLIGFTLSLMIRVDPFIFIQKIKNLVYCEFEATQRQQILKLKATTAENAIYFWRLMELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSALPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNWVCLR
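
The protein sequence corresponds to the coding sequence read three bases at a time. The sketch below illustrates that relationship on the first ten codons and the last four codons of the CDS listed above; it assumes the standard genetic code, and `translate` is a hypothetical helper rather than anything provided by the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the CDS shown in this record.
cds_start = "ATGGAGAAGATGATGAGTTTAAACCATCTG"
cds_end = "TGTTTGAGGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments give the first ten residues and the last three residues of the protein; the final `TGA` codon is a stop and contributes no residue.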
Homology
BLAST of CmoCh20G007310 vs. ExPASy Swiss-Prot
Match: Q8GWA9 (Pentatricopeptide repeat-containing protein At2g17033 OS=Arabidopsis thaliana OX=3702 GN=At2g17033 PE=2 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 2.5e-130
Identity = 236/417 (56.59%), Positives = 306/417 (73.38%), Query Frame = 0

Query: 503 LTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSV 562
           L K   RFLS+L++ A AGD SA NR I+KFVA+SPKS+ LNVLS +LS +T+ P L   
Sbjct: 89  LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 148

Query: 563 ALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYC 622
           AL+LYS ITE SWF WN KL+A+L+A L+K  +  +SETL+S A+S+L   ER    F C
Sbjct: 149 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 208

Query: 623 QLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVK 682
            LVES SK GS +GF  A   L E++ +SSS+YVK +AY+SMV+GLC+M +P +AE +++
Sbjct: 209 NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 268

Query: 683 EMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHN 742
           EM+ +   P  FEY+S++Y YG LGLF+DM R +  M  +   +DTVCSNMVLSSYG H+
Sbjct: 269 EMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHD 328

Query: 743 KLADMVLWLQIMKTSALPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDE 802
            L  M  WLQ +K   +PFS+RTYNSVLNSCP I SML+D  D  PV + +L T L+ DE
Sbjct: 329 ALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKD-LDSCPVSLSELRTFLNEDE 388

Query: 803 ALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPA 862
           ALLV EL  SSVL E + W+A+E KLDLHG H+ ++Y+I+L+WM E RL+F +E CVIPA
Sbjct: 389 ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPA 448

Query: 863 QVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNWVC 920
           ++ V+ GSG HS VRGESPVKAL+++IM RT SP+RIDRKN G F+AKGK VK W+C
Sbjct: 449 EIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504
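
The percentages in an HSP header are the listed counts divided by the alignment length. The sketch below recomputes them from the header of the hit above; `hsp_fractions` is a hypothetical helper, and the regular expression assumes only that each statistic is written as `label = matches/length`.

```python
import re

# Matches fields such as "Identity = 236/417"; fields without a slash
# (for example "Query Frame = 0") are ignored.
FRACTION = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def hsp_fractions(header):
    """Return {label: (matches, length, percent)} for an HSP header line."""
    stats = {}
    for label, matches, length in FRACTION.findall(header):
        matches, length = int(matches), int(length)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats


header = "Identity = 236/417 (56.59%), Positives = 306/417 (73.38%), Query Frame = 0"
for label, (matches, length, percent) in hsp_fractions(header).items():
    print(f"{label}: {matches}/{length} = {percent}%")
```

Identity counts exact residue matches, while the positives count also includes substitutions that score favorably, so it can never be lower than the identity count.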

BLAST of CmoCh20G007310 vs. ExPASy Swiss-Prot
Match: P02982 (Tetracycline resistance protein, class A OS=Escherichia coli OX=562 GN=tetA PE=3 SV=2)

HSP 1 Score: 76.6 bits (187), Expect = 1.6e-12
Identity = 94/383 (24.54%), Positives = 159/383 (41.51%), Query Frame = 0

Query: 13  TTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLS 72
           T  + ++ + +++P +  L  + V     H +      G+  A+  L      PV+G LS
Sbjct: 13  TVALDAVGIGLIMPVLPGLLRDLV-----HSNDVTAHYGILLALYALMQFACAPVLGALS 72

Query: 73  DRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEGTTVSLALAYVAD 132
           DR+GR+ +L + +  + +  AIM    T  F +  YI + +  +   G T ++A AY+AD
Sbjct: 73  DRFGRRPVLLVSLAGAAVDYAIMA---TAPFLWVLYIGRIVAGIT--GATGAVAGAYIAD 132

Query: 133 KTSEDQRISAFGILSGVRSVGYVCGTFLARLL---STATVFQVAAFMSVLAVVHMRTFLK 192
            T  D+R   FG +S     G V G  L  L+   S    F  AA ++ L  +     L 
Sbjct: 133 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 192

Query: 193 ESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAAR 252
           ES                       G   P R +    ++S R         T  +    
Sbjct: 193 ES---------------------HKGERRPLRREALNPLASFR----WARGMTVVAALMA 252

Query: 253 VSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMPALALAIRQE 312
           V F   L  +   A    F + RFH+D       +   G+  +++  ++   +A  + + 
Sbjct: 253 VFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGER 312

Query: 313 RLLSIGLWA---SIINVPYALRAFTIFTILV--------SPIIFNIASSQVGPSEQGKAQ 372
           R L +G+ A     I + +A R +  F I+V         P +  + S QV    QG+ Q
Sbjct: 313 RALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQ 360

Query: 373 GYISGINSLANIASPLLFSPLIA 382
           G ++ + SL +I  PLLF+ + A
Sbjct: 373 GSLAALTSLTSIVGPLLFTAIYA 360

BLAST of CmoCh20G007310 vs. ExPASy Swiss-Prot
Match: Q96MC6 (Hippocampus abundant transcript 1 protein OS=Homo sapiens OX=9606 GN=MFSD14A PE=1 SV=2)

HSP 1 Score: 74.3 bits (181), Expect = 7.9e-12
Identity = 89/397 (22.42%), Positives = 166/397 (41.81%), Query Frame = 0

Query: 6   SLNHLFVTTFIGSLSM-FMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVI 65
           S+ H  +  F+   +   +  PT+V L       H+        ++G+ Q + GL + + 
Sbjct: 36  SVYHAVIVIFLEFFAWGLLTAPTLVVL-------HETFPKHTFLMNGLIQGVKGLLSFLS 95

Query: 66  TPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEGTTVS 125
            P+IG LSD +GRK+ L + + F+  P+ +M   + + ++Y   I  +    V    T S
Sbjct: 96  APLIGALSDVWGRKSFLLLTVFFTCAPIPLM---KISPWWYFAVISVSGVFAV----TFS 155

Query: 126 LALAYVADKTSEDQRISAFGILSGVRSVGYV----CGTFLARLLSTATVFQVAAFMSVLA 185
           +  AYVAD T E +R  A+G++S   +   V     G +L R+   + V  +A  +++L 
Sbjct: 156 VVFAYVADITQEHERSMAYGLVSATFAASLVTSPAIGAYLGRVYGDSLVVVLATAIALLD 215

Query: 186 VVHMRTFLKESIPDQNELTQ---PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISL 245
           +  +   + ES+P++        PI  E             P  +   +G  SI  +I +
Sbjct: 216 ICFILVAVPESLPEKMRPASWGAPISWEQAD----------PFASLKKVGQDSIVLLICI 275

Query: 246 ITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFL 305
                         F + L E G  +S   +L+    F     A  + + G+   ++  +
Sbjct: 276 TV------------FLSYLPEAGQYSSFFLYLRQIMKFSPESVAAFIAVLGILSIIAQTI 335

Query: 306 LMPALALAIRQERLLSIGLWASIINVP-----------YALRAFTIFTILVSPIIFNIAS 365
           ++  L  +I  +  + +GL   I+ +            +A  A    + +  P +  + S
Sbjct: 336 VLSLLMRSIGNKNTILLGLGFQILQLAWYGFGSEPWMMWAAGAVAAMSSITFPAVSALVS 395

Query: 366 SQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALF 384
                 +QG  QG I+GI  L N   P L+  +  +F
Sbjct: 396 RTADADQQGVVQGMITGIRGLCNGLGPALYGFIFYIF 396

BLAST of CmoCh20G007310 vs. ExPASy Swiss-Prot
Match: P70187 (Hippocampus abundant transcript 1 protein OS=Mus musculus OX=10090 GN=Mfsd14a PE=2 SV=3)

HSP 1 Score: 74.3 bits (181), Expect = 7.9e-12
Identity = 89/397 (22.42%), Positives = 166/397 (41.81%), Query Frame = 0

Query: 6   SLNHLFVTTFIGSLSM-FMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVI 65
           S+ H  +  F+   +   +  PT+V L       H+        ++G+ Q + GL + + 
Sbjct: 36  SVYHAVIVIFLEFFAWGLLTAPTLVVL-------HETFPKHTFLMNGLIQGVKGLLSFLS 95

Query: 66  TPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEGTTVS 125
            P+IG LSD +GRK+ L + + F+  P+ +M   + + ++Y   I  +    V    T S
Sbjct: 96  APLIGALSDVWGRKSFLLLTVFFTCAPIPLM---KISPWWYFAVISVSGVFAV----TFS 155

Query: 126 LALAYVADKTSEDQRISAFGILSGVRSVGYV----CGTFLARLLSTATVFQVAAFMSVLA 185
           +  AYVAD T E +R  A+G++S   +   V     G +L R+   + V  +A  +++L 
Sbjct: 156 VVFAYVADITQEHERSMAYGLVSATFAASLVTSPAIGAYLGRVYGDSLVVVLATAIALLD 215

Query: 186 VVHMRTFLKESIPDQNELTQ---PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISL 245
           +  +   + ES+P++        PI  E             P  +   +G  SI  +I +
Sbjct: 216 ICFILVAVPESLPEKMRPASWGAPISWEQAD----------PFASLKKVGQDSIVLLICI 275

Query: 246 ITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFL 305
                         F + L E G  +S   +L+    F     A  + + G+   ++  +
Sbjct: 276 TV------------FLSYLPEAGQYSSFFLYLRQIMKFSPESVAAFIAVLGILSIIAQTI 335

Query: 306 LMPALALAIRQERLLSIGLWASIINVP-----------YALRAFTIFTILVSPIIFNIAS 365
           ++  L  +I  +  + +GL   I+ +            +A  A    + +  P +  + S
Sbjct: 336 VLSLLMRSIGNKNTILLGLGFQILQLAWYGFGSEPWMMWAAGAVAAMSSITFPAVSALVS 395

Query: 366 SQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALF 384
                 +QG  QG I+GI  L N   P L+  +  +F
Sbjct: 396 RTADADQQGVVQGMITGIRGLCNGLGPALYGFIFYIF 396

BLAST of CmoCh20G007310 vs. ExPASy Swiss-Prot
Match: A4IF94 (Hippocampus abundant transcript-like protein 1 OS=Bos taurus OX=9913 GN=MFSD14B PE=2 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 7.4e-10
Identity = 82/360 (22.78%), Positives = 156/360 (43.33%), Query Frame = 0

Query: 39  HQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYR 98
           H+        ++G+ Q + GL + +  P+IG LSD +GRK  L   + F+  P+ +M   
Sbjct: 68  HETFPQHTFLMNGLIQGVKGLLSFLSAPLIGALSDVWGRKPFLLGTVFFTCFPIPLM--- 127

Query: 99  RTTNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYV--- 158
           R + ++Y  + M +++ + S   T S+  AYVAD T E +R +A+G +S   +   V   
Sbjct: 128 RISPWWY--FAMISISGVFS--VTFSVIFAYVADVTQEHERSTAYGWVSATFAASLVSSP 187

Query: 159 -CGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDENLSGGDDEN 218
             G +L+     + V  VA  +++L +  +   + ES+P++           LS G    
Sbjct: 188 AIGAYLSASYGDSLVVLVATVVALLDICFILLAVPESLPEKM--------RPLSWG---- 247

Query: 219 GPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFH 278
                   ++S   +     +  +   +T        F + L E G  +S   +L+    
Sbjct: 248 -------ARISWKQADPFASLKKVGKDSTILLICITVFLSYLPEAGQYSSFFLYLRQVIG 307

Query: 279 FDKNQFADLMIIEGVAGAVSLFLLMPALALAIRQERLLSIGL--------WASIINVPYA 338
           F   + A  + + G+   V+  + + +L  ++  +  + +GL        W    +  + 
Sbjct: 308 FGSIKIAAFIAMVGILSIVAQTVFLTSLMRSLGNKNTVLLGLGFQMFQLAWYGFGSQAWM 367

Query: 339 LRAFTIFTILVS---PIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALF 384
           + A  I   + S   P +  + S     ++QG AQG I+GI  L N   P L+  +  +F
Sbjct: 368 MWAAGIVAAVSSITFPAVSTLVSQNADSNQQGVAQGIITGIRGLCNGLGPALYGFIFYMF 401

BLAST of CmoCh20G007310 vs. ExPASy TrEMBL
Match: A0A6J1FXE0 (pentatricopeptide repeat-containing protein At2g17033 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448442 PE=3 SV=1)

HSP 1 Score: 906.0 bits (2340), Expect = 1.3e-259
Identity = 459/459 (100.00%), Positives = 459/459 (100.00%), Query Frame = 0

Query: 459 MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 518
           MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA
Sbjct: 1   MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 60

Query: 519 AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 578
           AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW
Sbjct: 61  AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 120

Query: 579 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 638
           NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG
Sbjct: 121 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 180

Query: 639 IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 698
           IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS
Sbjct: 181 IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 240

Query: 699 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 758
           IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
Sbjct: 241 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 300

Query: 759 LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 818
           LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV
Sbjct: 301 LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 360

Query: 819 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 878
           MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG
Sbjct: 361 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 420

Query: 879 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 918
           ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
Sbjct: 421 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 459

BLAST of CmoCh20G007310 vs. ExPASy TrEMBL
Match: A0A6J1JE75 (pentatricopeptide repeat-containing protein At2g17033 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483639 PE=3 SV=1)

HSP 1 Score: 873.6 bits (2256), Expect = 7.2e-250
Identity = 445/459 (96.95%), Positives = 448/459 (97.60%), Query Frame = 0

Query: 459 MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 518
           MELR CPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA
Sbjct: 1   MELRLCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 60

Query: 519 AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 578
           AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVA  LYSRITETSWF W
Sbjct: 61  AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVARILYSRITETSWFAW 120

Query: 579 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 638
           NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGF 
Sbjct: 121 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFR 180

Query: 639 IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 698
            AYACL ELLY SSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS
Sbjct: 181 NAYACLHELLYNSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 240

Query: 699 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 758
           IIYAYGTLGLFEDMKRSLEEMKND IALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
Sbjct: 241 IIYAYGTLGLFEDMKRSLEEMKNDHIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 300

Query: 759 LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 818
           LPFSVRTYNSVLNSCPKITSMLQDKS DLPVLIEDLI+VLDGDEALLVEELVGSSVL+EV
Sbjct: 301 LPFSVRTYNSVLNSCPKITSMLQDKSGDLPVLIEDLISVLDGDEALLVEELVGSSVLREV 360

Query: 819 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 878
           MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFE+ESCVIPAQVTVICGSGNHSIVR 
Sbjct: 361 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFENESCVIPAQVTVICGSGNHSIVRR 420

Query: 879 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 918
           ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
Sbjct: 421 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 459

BLAST of CmoCh20G007310 vs. ExPASy TrEMBL
Match: A0A6J1FTY6 (uncharacterized protein LOC111448122 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111448122 PE=4 SV=1)

HSP 1 Score: 778.5 bits (2009), Expect = 3.1e-221
Identity = 429/468 (91.67%), Positives = 429/468 (91.67%), Query Frame = 0

Query: 1   MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLG 60
           MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLG
Sbjct: 1   MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLG 60

Query: 61  AVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEG 120
           AVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEG
Sbjct: 61  AVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEG 120

Query: 121 TTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLA 180
           TTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLA
Sbjct: 121 TTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLA 180

Query: 181 VVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITS 240
           VVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITS
Sbjct: 181 VVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITS 240

Query: 241 STTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP 300
           STTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP
Sbjct: 241 STTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP 300

Query: 301 ALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPI--------- 360
           ALALAIRQERLLSIGLWASIIN           VPYALRAFTIFTILVSPI         
Sbjct: 301 ALALAIRQERLLSIGLWASIINILLNSIAWSVWVPYALRAFTIFTILVSPIVSFPCFLQS 360

Query: 361 -------------------IFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIA 420
                              IFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIA
Sbjct: 361 LSLLMKKCRGIHTNLWLQQIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIA 420

Query: 421 LFLSKDAPFDFPGFGILCIGLASLIGFTLSLMIRVDPFIFIQKIKNLV 430
           LFLSKDAPFDFPGFGILCIGLASLIGFTLSLMIRVDPFIFIQKIKNLV
Sbjct: 421 LFLSKDAPFDFPGFGILCIGLASLIGFTLSLMIRVDPFIFIQKIKNLV 468

BLAST of CmoCh20G007310 vs. ExPASy TrEMBL
Match: A0A6J1JBV8 (hippocampus abundant transcript 1 protein-like isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111483637 PE=4 SV=1)

HSP 1 Score: 761.9 bits (1966), Expect = 3.0e-216
Identity = 411/440 (93.41%), Positives = 419/440 (95.23%), Query Frame = 0

Query: 1   MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLG 60
           MEKMMSLNHLFVTTFIGSLSMFMVIP+IVD+TMEFVCPHQDHCSIAIYLSG+QQAIVGLG
Sbjct: 1   MEKMMSLNHLFVTTFIGSLSMFMVIPSIVDITMEFVCPHQDHCSIAIYLSGLQQAIVGLG 60

Query: 61  AVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEG 120
           A+VITPVIGNLSDRYGRK MLTIP+TFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEG
Sbjct: 61  ALVITPVIGNLSDRYGRKTMLTIPLTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEG 120

Query: 121 TTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLA 180
           TTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTF ARLLSTATVFQVAAFMSVLA
Sbjct: 121 TTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFSARLLSTATVFQVAAFMSVLA 180

Query: 181 VVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITS 240
           VVHMR FLKESIPDQNELTQPI DENLSGGDDENGPELPT TQLSIGMSSIRDVISLITS
Sbjct: 181 VVHMRIFLKESIPDQNELTQPILDENLSGGDDENGPELPTITQLSIGMSSIRDVISLITS 240

Query: 241 STTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP 300
           STTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMII+G+AGA+SLFLLMP
Sbjct: 241 STTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIDGIAGAISLFLLMP 300

Query: 301 ALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQV 360
           ALALAIRQERLLSIGLWASIIN           VPYALRA TIFTILVSPIIFNIASSQV
Sbjct: 301 ALALAIRQERLLSIGLWASIINILLNSIAWSVWVPYALRALTIFTILVSPIIFNIASSQV 360

Query: 361 GPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLASLIGFT 420
           GPSEQGKAQG ISGINSLANI SPLLFSPLIALFLSKDAPFDFPGFGILCIGLASLIGF 
Sbjct: 361 GPSEQGKAQGCISGINSLANIVSPLLFSPLIALFLSKDAPFDFPGFGILCIGLASLIGFA 420

Query: 421 LSLMIRVDPFIFIQKIKNLV 430
           LSLMIRVDPFI IQKIKNLV
Sbjct: 421 LSLMIRVDPFICIQKIKNLV 440

BLAST of CmoCh20G007310 vs. ExPASy TrEMBL
Match: A0A6J1FXN8 (uncharacterized protein LOC111448122 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111448122 PE=4 SV=1)

HSP 1 Score: 748.8 bits (1932), Expect = 2.7e-212
Identity = 404/416 (97.12%), Positives = 405/416 (97.36%), Query Frame = 0

Query: 1   MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLG 60
           MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLG
Sbjct: 1   MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLG 60

Query: 61  AVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEG 120
           AVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEG
Sbjct: 61  AVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEG 120

Query: 121 TTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLA 180
           TTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLA
Sbjct: 121 TTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLA 180

Query: 181 VVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITS 240
           VVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITS
Sbjct: 181 VVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITS 240

Query: 241 STTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP 300
           STTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP
Sbjct: 241 STTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP 300

Query: 301 ALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQV 360
           ALALAIRQERLLSIGLWASIIN           VPYALRAFTIFTILVSPIIFNIASSQV
Sbjct: 301 ALALAIRQERLLSIGLWASIINILLNSIAWSVWVPYALRAFTIFTILVSPIIFNIASSQV 360

Query: 361 GPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLASL 406
           GPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLAS+
Sbjct: 361 GPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLASV 416
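The percentages on each HSP summary line follow directly from the counts shown beside them (for example, 404 identical positions out of 416 aligned gives 97.12%). A minimal Python sketch of that arithmetic is given here; the function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Matches the summary line printed under each "HSP n Score" header.
# "Posi?tives" tolerates the misspelled label "Postives" as well.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages exactly as printed in the record, for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 404/416 (97.12%), Positives = 405/416 (97.36%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed and reported values is a quick way to catch lines that were damaged during copy and paste.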

BLAST of CmoCh20G007310 vs. NCBI nr
Match: XP_022943803.1 (pentatricopeptide repeat-containing protein At2g17033 isoform X1 [Cucurbita moschata])

HSP 1 Score: 906.0 bits (2340), Expect = 2.7e-259
Identity = 459/459 (100.00%), Positives = 459/459 (100.00%), Query Frame = 0

Query: 459 MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 518
           MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA
Sbjct: 1   MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 60

Query: 519 AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 578
           AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW
Sbjct: 61  AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 120

Query: 579 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 638
           NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG
Sbjct: 121 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 180

Query: 639 IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 698
           IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS
Sbjct: 181 IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 240

Query: 699 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 758
           IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
Sbjct: 241 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 300

Query: 759 LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 818
           LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV
Sbjct: 301 LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 360

Query: 819 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 878
           MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG
Sbjct: 361 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 420

Query: 879 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 918
           ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
Sbjct: 421 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 459

BLAST of CmoCh20G007310 vs. NCBI nr
Match: XP_023512520.1 (pentatricopeptide repeat-containing protein At2g17033 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 890.6 bits (2300), Expect = 1.2e-254
Identity = 451/459 (98.26%), Positives = 453/459 (98.69%), Query Frame = 0

Query: 459 MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 518
           MELR CPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA
Sbjct: 1   MELRLCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 60

Query: 519 AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 578
           AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW
Sbjct: 61  AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 120

Query: 579 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 638
           NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG
Sbjct: 121 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 180

Query: 639 IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 698
            AYACLLELLY SSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS
Sbjct: 181 NAYACLLELLYNSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 240

Query: 699 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 758
           IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYG HNKLADMVLWLQIMKTSA
Sbjct: 241 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGAHNKLADMVLWLQIMKTSA 300

Query: 759 LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 818
           LPFSVRTYNSVLNSCPKITS+LQDKS DLPVLIEDLITVLDGDEALLVEELVGSSVLKEV
Sbjct: 301 LPFSVRTYNSVLNSCPKITSILQDKSGDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 360

Query: 819 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 878
           MVWDAMEMKLDLHG HVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG
Sbjct: 361 MVWDAMEMKLDLHGVHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 420

Query: 879 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 918
           ESPVKALIREIMFRTQSPLRIDRKNTGCF+AKGKAVKNW
Sbjct: 421 ESPVKALIREIMFRTQSPLRIDRKNTGCFLAKGKAVKNW 459

BLAST of CmoCh20G007310 vs. NCBI nr
Match: KAG7010834.1 (Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 886.3 bits (2289), Expect = 2.2e-253
Identity = 452/459 (98.47%), Positives = 453/459 (98.69%), Query Frame = 0

Query: 459 MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 518
           MELRFCPPPYVIGDSVRLFSKAPKRYD FCSYHFRPNLQVKCATLTKQSHRFLSTLATTA
Sbjct: 1   MELRFCPPPYVIGDSVRLFSKAPKRYDRFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 60

Query: 519 AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 578
           AAGDHSATNRLIRKFVASSPKS+TLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW
Sbjct: 61  AAGDHSATNRLIRKFVASSPKSLTLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 120

Query: 579 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 638
           NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG
Sbjct: 121 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 180

Query: 639 IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 698
            AYA LLELLY SSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS
Sbjct: 181 NAYARLLELLYNSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 240

Query: 699 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 758
           IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
Sbjct: 241 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 300

Query: 759 LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 818
           LPFSVRTYNSVLNSC KITSMLQDKS DLPVLIEDLITVLDGDEALLVEELVGSSVLKEV
Sbjct: 301 LPFSVRTYNSVLNSCLKITSMLQDKSGDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 360

Query: 819 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 878
           MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG
Sbjct: 361 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 420

Query: 879 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 918
           ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
Sbjct: 421 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 459

BLAST of CmoCh20G007310 vs. NCBI nr
Match: XP_022985638.1 (pentatricopeptide repeat-containing protein At2g17033 isoform X1 [Cucurbita maxima])

HSP 1 Score: 873.6 bits (2256), Expect = 1.5e-249
Identity = 445/459 (96.95%), Positives = 448/459 (97.60%), Query Frame = 0

Query: 459 MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 518
           MELR CPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA
Sbjct: 1   MELRLCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 60

Query: 519 AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 578
           AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVA  LYSRITETSWF W
Sbjct: 61  AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVARILYSRITETSWFAW 120

Query: 579 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 638
           NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGF 
Sbjct: 121 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFR 180

Query: 639 IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 698
            AYACL ELLY SSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS
Sbjct: 181 NAYACLHELLYNSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 240

Query: 699 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 758
           IIYAYGTLGLFEDMKRSLEEMKND IALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
Sbjct: 241 IIYAYGTLGLFEDMKRSLEEMKNDHIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 300

Query: 759 LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 818
           LPFSVRTYNSVLNSCPKITSMLQDKS DLPVLIEDLI+VLDGDEALLVEELVGSSVL+EV
Sbjct: 301 LPFSVRTYNSVLNSCPKITSMLQDKSGDLPVLIEDLISVLDGDEALLVEELVGSSVLREV 360

Query: 819 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 878
           MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFE+ESCVIPAQVTVICGSGNHSIVR 
Sbjct: 361 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFENESCVIPAQVTVICGSGNHSIVRR 420

Query: 879 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 918
           ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
Sbjct: 421 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW 459

BLAST of CmoCh20G007310 vs. NCBI nr
Match: KAG6571003.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 825.9 bits (2132), Expect = 3.5e-235
Identity = 428/463 (92.44%), Positives = 429/463 (92.66%), Query Frame = 0

Query: 459 MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 518
           MELRFCPPPYVIGDSVRLFSKAPKRYD FCSYHFRPNLQVKCATLTKQSHRFLSTLATTA
Sbjct: 1   MELRFCPPPYVIGDSVRLFSKAPKRYDRFCSYHFRPNLQVKCATLTKQSHRFLSTLATTA 60

Query: 519 AAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 578
           AAGDHSATNRLIRKFVASSPKS+TLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW
Sbjct: 61  AAGDHSATNRLIRKFVASSPKSLTLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTW 120

Query: 579 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 638
           NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG
Sbjct: 121 NSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG 180

Query: 639 IAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 698
            AYA LLELLY SSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS
Sbjct: 181 NAYARLLELLYNSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRS 240

Query: 699 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA 758
           IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADM           
Sbjct: 241 IIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADM----------- 300

Query: 759 LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 818
                            ITSMLQDKS DLPVLIEDLITVLDGDEALLVEELVGSSVLKEV
Sbjct: 301 -----------------ITSMLQDKSGDLPVLIEDLITVLDGDEALLVEELVGSSVLKEV 360

Query: 819 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 878
           MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG
Sbjct: 361 MVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRG 420

Query: 879 ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNWVCLR 922
           ESPVKALIREIMFRTQSPLRIDRKNT CFVAKGKAVKNWVCLR
Sbjct: 421 ESPVKALIREIMFRTQSPLRIDRKNTACFVAKGKAVKNWVCLR 435

BLAST of CmoCh20G007310 vs. TAIR 10
Match: AT2G17033.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 468.0 bits (1203), Expect = 1.7e-131
Identity = 236/417 (56.59%), Positives = 306/417 (73.38%), Query Frame = 0

Query: 503 LTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSV 562
           L K   RFLS+L++ A AGD SA NR I+KFVA+SPKS+ LNVLS +LS +T+ P L   
Sbjct: 88  LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 147

Query: 563 ALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYC 622
           AL+LYS ITE SWF WN KL+A+L+A L+K  +  +SETL+S A+S+L   ER    F C
Sbjct: 148 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 207

Query: 623 QLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVK 682
            LVES SK GS +GF  A   L E++ +SSS+YVK +AY+SMV+GLC+M +P +AE +++
Sbjct: 208 NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 267

Query: 683 EMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHN 742
           EM+ +   P  FEY+S++Y YG LGLF+DM R +  M  +   +DTVCSNMVLSSYG H+
Sbjct: 268 EMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHD 327

Query: 743 KLADMVLWLQIMKTSALPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDE 802
            L  M  WLQ +K   +PFS+RTYNSVLNSCP I SML+D  D  PV + +L T L+ DE
Sbjct: 328 ALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKD-LDSCPVSLSELRTFLNEDE 387

Query: 803 ALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPA 862
           ALLV EL  SSVL E + W+A+E KLDLHG H+ ++Y+I+L+WM E RL+F +E CVIPA
Sbjct: 388 ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPA 447

Query: 863 QVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNWVC 920
           ++ V+ GSG HS VRGESPVKAL+++IM RT SP+RIDRKN G F+AKGK VK W+C
Sbjct: 448 EIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503

BLAST of CmoCh20G007310 vs. TAIR 10
Match: AT2G17033.2 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 468.0 bits (1203), Expect = 1.7e-131
Identity = 236/417 (56.59%), Positives = 306/417 (73.38%), Query Frame = 0

Query: 503 LTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSV 562
           L K   RFLS+L++ A AGD SA NR I+KFVA+SPKS+ LNVLS +LS +T+ P L   
Sbjct: 89  LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 148

Query: 563 ALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYC 622
           AL+LYS ITE SWF WN KL+A+L+A L+K  +  +SETL+S A+S+L   ER    F C
Sbjct: 149 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 208

Query: 623 QLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVK 682
            LVES SK GS +GF  A   L E++ +SSS+YVK +AY+SMV+GLC+M +P +AE +++
Sbjct: 209 NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 268

Query: 683 EMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHN 742
           EM+ +   P  FEY+S++Y YG LGLF+DM R +  M  +   +DTVCSNMVLSSYG H+
Sbjct: 269 EMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHD 328

Query: 743 KLADMVLWLQIMKTSALPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDE 802
            L  M  WLQ +K   +PFS+RTYNSVLNSCP I SML+D  D  PV + +L T L+ DE
Sbjct: 329 ALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKD-LDSCPVSLSELRTFLNEDE 388

Query: 803 ALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPA 862
           ALLV EL  SSVL E + W+A+E KLDLHG H+ ++Y+I+L+WM E RL+F +E CVIPA
Sbjct: 389 ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPA 448

Query: 863 QVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNWVC 920
           ++ V+ GSG HS VRGESPVKAL+++IM RT SP+RIDRKN G F+AKGK VK W+C
Sbjct: 449 EIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504

BLAST of CmoCh20G007310 vs. TAIR 10
Match: AT2G16980.2 (Major facilitator superfamily protein )

HSP 1 Score: 344.0 bits (881), Expect = 3.8e-94
Identity = 204/445 (45.84%), Positives = 281/445 (63.15%), Query Frame = 0

Query: 3   KMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPH-QDHCSIAIYLSGVQQAIVGLGA 62
           ++  L HL VT F+  L+ +++ P + D+T+  VC    D CS+A+YL+GVQQ  VG+G 
Sbjct: 5   RLGELRHLLVTVFLSGLAEYLIRPVMTDVTVAAVCSGLDDSCSLAVYLTGVQQVTVGMGT 64

Query: 63  VVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEGT 122
           +V+ PVIGNLSDRYG KAMLT+PM  S++P AI+GYRR TNFFYAFY++KTL DMV +GT
Sbjct: 65  MVMMPVIGNLSDRYGIKAMLTLPMCLSVLPPAILGYRRDTNFFYAFYVIKTLFDMVCQGT 124

Query: 123 TVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAV 182
              LA AYVA      +RIS FGIL+GV S+  VC +  AR LS A+ FQVAA    + +
Sbjct: 125 IDCLANAYVAKNVHGTKRISMFGILAGVSSISGVCASLSARFLSIASTFQVAAISLFIGL 184

Query: 183 VHMRTFLKESIPDQNELTQPIFDENLSGG-----DDENGPEL-----------PTRTQL- 242
           V+MR FLKE + D ++      DE  SGG     +  NG +L           PT+T + 
Sbjct: 185 VYMRVFLKERLQDADD-----DDEADSGGCRSHQEVHNGGDLKMLTEPILRDAPTKTHVF 244

Query: 243 SIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLM 302
           +   SS +D++SLI +ST   QA  V+FF + +E G  ++L YFLKARF F+KN FA+L 
Sbjct: 245 NSKYSSWKDMVSLINNSTILIQALVVTFFATFSESGRGSALMYFLKARFGFNKNDFAELF 304

Query: 303 IIEGVAGAVSLFLLMPALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIF 362
           ++  + G++S   ++P L+  I + ++LS GL     N           VPYA+      
Sbjct: 305 LLVTIIGSISQLFILPTLSSTIGERKVLSTGLLMEFFNATCLSVAWSPWVPYAMTMLVPG 364

Query: 363 TILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAPFDFP 419
            + V P +  IAS QVG SEQGK QG ISG+ + A + +P ++SPL ALFLS++APF FP
Sbjct: 365 AMFVMPSVCGIASRQVGSSEQGKVQGCISGVRAFAQVVAPFVYSPLTALFLSENAPFYFP 424

BLAST of CmoCh20G007310 vs. TAIR 10
Match: AT2G16990.1 (Major facilitator superfamily protein )

HSP 1 Score: 342.0 bits (876), Expect = 1.4e-93
Identity = 200/433 (46.19%), Positives = 269/433 (62.12%), Query Frame = 0

Query: 7   LNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCP-HQDHCSIAIYLSGVQQAIVGLGAVVIT 66
           L H+  T F+ + + FMV+P I D+T+  VC    D CS+A+YL+G QQ  +G+G +++ 
Sbjct: 8   LRHMLATVFLSAFAGFMVVPVITDVTVAAVCSGPDDSCSLAVYLTGFQQVAIGMGTMIMM 67

Query: 67  PVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEGTTVSL 126
           PVIGNLSDRYG K +LT+PM  SI+P  I+GYRR   FFY FYI K LT MV EGT   L
Sbjct: 68  PVIGNLSDRYGIKTILTLPMCLSIVPPVILGYRRDIKFFYVFYISKILTSMVCEGTVDCL 127

Query: 127 ALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMR 186
           A AYVA       RISAFGIL+G++++  + GT +AR L  A  FQV+A    + +V+MR
Sbjct: 128 AYAYVAVNIHGSTRISAFGILAGIKTIAGLFGTLVARFLPIALTFQVSAISFFVGLVYMR 187

Query: 187 TFLKESIPDQNELTQPIFDENLSGGDDENGPEL--------PTRTQL-SIGMSSIRDVIS 246
            FLKE + D  +        +    D  N   L        P +TQ+     SS++D+IS
Sbjct: 188 VFLKEKLNDDEDDDLHHGTYHQEDHDSINTTMLAEPILNDRPIKTQVFHKKYSSLKDMIS 247

Query: 247 LITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLF 306
           L+ +ST F QA  V+FF+S ++ GM+++  YFLKARF FDK QFADL+++  + G++S  
Sbjct: 248 LMKTSTIFFQALVVTFFSSFSDSGMESAFLYFLKARFGFDKKQFADLLLLITIVGSISQL 307

Query: 307 LLMPALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIA 366
            ++P  A AI + +LLS GL+   IN           VPY    F    + V P +  IA
Sbjct: 308 FVLPRFASAIGECKLLSTGLFMEFINMAIVSISWAPWVPYLTTVFVPGALFVMPSVCGIA 367

Query: 367 SSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLASL 419
           S QVGP EQGK QG ISG+ S   + +P +FSPL ALFLSK+APF FPGF +LCI L+SL
Sbjct: 368 SRQVGPGEQGKVQGCISGVRSFGKVVAPFVFSPLTALFLSKNAPFYFPGFSLLCISLSSL 427

BLAST of CmoCh20G007310 vs. TAIR 10
Match: AT2G16990.2 (Major facilitator superfamily protein )

HSP 1 Score: 342.0 bits (876), Expect = 1.4e-93
Identity = 200/433 (46.19%), Positives = 269/433 (62.12%), Query Frame = 0

Query: 7   LNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCP-HQDHCSIAIYLSGVQQAIVGLGAVVIT 66
           L H+  T F+ + + FMV+P I D+T+  VC    D CS+A+YL+G QQ  +G+G +++ 
Sbjct: 8   LRHMLATVFLSAFAGFMVVPVITDVTVAAVCSGPDDSCSLAVYLTGFQQVAIGMGTMIMM 67

Query: 67  PVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEGTTVSL 126
           PVIGNLSDRYG K +LT+PM  SI+P  I+GYRR   FFY FYI K LT MV EGT   L
Sbjct: 68  PVIGNLSDRYGIKTILTLPMCLSIVPPVILGYRRDIKFFYVFYISKILTSMVCEGTVDCL 127

Query: 127 ALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMR 186
           A AYVA       RISAFGIL+G++++  + GT +AR L  A  FQV+A    + +V+MR
Sbjct: 128 AYAYVAVNIHGSTRISAFGILAGIKTIAGLFGTLVARFLPIALTFQVSAISFFVGLVYMR 187

Query: 187 TFLKESIPDQNELTQPIFDENLSGGDDENGPEL--------PTRTQL-SIGMSSIRDVIS 246
            FLKE + D  +        +    D  N   L        P +TQ+     SS++D+IS
Sbjct: 188 VFLKEKLNDDEDDDLHHGTYHQEDHDSINTTMLAEPILNDRPIKTQVFHKKYSSLKDMIS 247

Query: 247 LITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLF 306
           L+ +ST F QA  V+FF+S ++ GM+++  YFLKARF FDK QFADL+++  + G++S  
Sbjct: 248 LMKTSTIFFQALVVTFFSSFSDSGMESAFLYFLKARFGFDKKQFADLLLLITIVGSISQL 307

Query: 307 LLMPALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIA 366
            ++P  A AI + +LLS GL+   IN           VPY    F    + V P +  IA
Sbjct: 308 FVLPRFASAIGECKLLSTGLFMEFINMAIVSISWAPWVPYLTTVFVPGALFVMPSVCGIA 367

Query: 367 SSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLASL 419
           S QVGP EQGK QG ISG+ S   + +P +FSPL ALFLSK+APF FPGF +LCI L+SL
Sbjct: 368 SRQVGPGEQGKVQGCISGVRSFGKVVAPFVFSPLTALFLSKNAPFYFPGFSLLCISLSSL 427

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8GWA9	2.5e-130	56.59	Pentatricopeptide repeat-containing protein At2g17033 OS=Arabidopsis thaliana OX...
P02982	1.6e-12	24.54	Tetracycline resistance protein, class A OS=Escherichia coli OX=562 GN=tetA PE=3...
Q96MC6	7.9e-12	22.42	Hippocampus abundant transcript 1 protein OS=Homo sapiens OX=9606 GN=MFSD14A PE=...
P70187	7.9e-12	22.42	Hippocampus abundant transcript 1 protein OS=Mus musculus OX=10090 GN=Mfsd14a PE...
A4IF94	7.4e-10	22.78	Hippocampus abundant transcript-like protein 1 OS=Bos taurus OX=9913 GN=MFSD14B ...

Match Name	E-value	Identity	Description
A0A6J1FXE0	1.3e-259	100.00	pentatricopeptide repeat-containing protein At2g17033 isoform X1 OS=Cucurbita mo...
A0A6J1JE75	7.2e-250	96.95	pentatricopeptide repeat-containing protein At2g17033 isoform X1 OS=Cucurbita ma...
A0A6J1FTY6	3.1e-221	91.67	uncharacterized protein LOC111448122 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JBV8	3.0e-216	93.41	hippocampus abundant transcript 1 protein-like isoform X3 OS=Cucurbita maxima OX...
A0A6J1FXN8	2.7e-212	97.12	uncharacterized protein LOC111448122 isoform X3 OS=Cucurbita moschata OX=3662 GN...

Match Name	E-value	Identity	Description
XP_022943803.1	2.7e-259	100.00	pentatricopeptide repeat-containing protein At2g17033 isoform X1 [Cucurbita mosc...
XP_023512520.1	1.2e-254	98.26	pentatricopeptide repeat-containing protein At2g17033 isoform X1 [Cucurbita pepo...
KAG7010834.1	2.2e-253	98.47	Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyr...
XP_022985638.1	1.5e-249	96.95	pentatricopeptide repeat-containing protein At2g17033 isoform X1 [Cucurbita maxi...
KAG6571003.1	3.5e-235	92.44	Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...

Match Name	E-value	Identity	Description
AT2G17033.1	1.7e-131	56.59	pentatricopeptide (PPR) repeat-containing protein
AT2G17033.2	1.7e-131	56.59	pentatricopeptide (PPR) repeat-containing protein
AT2G16980.2	3.8e-94	45.84	Major facilitator superfamily protein
AT2G16990.1	1.4e-93	46.19	Major facilitator superfamily protein
AT2G16990.2	1.4e-93	46.19	Major facilitator superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001958	Tetracycline resistance protein TetA/multidrug resistance protein MdtG	PRINTS	PR01035	TCRTETA	coord: 357..380, score: 32.08; coord: 143..165, score: 29.57; coord: 50..69, score: 30.0
IPR002625	Smr domain	SMART	SM00463	SMR_2	coord: 825..912, e-value: 3.6E-20, score: 83.0
IPR002625	Smr domain	PROSITE	PS50828	SMR	coord: 828..912, score: 15.652205
IPR036259	MFS transporter superfamily	GENE3D	1.20.1250.20	MFS general substrate transporter like domains	coord: 5..411, e-value: 1.6E-32, score: 114.7
IPR036259	MFS transporter superfamily	SUPERFAMILY	103473	MFS general substrate transporter	coord: 10..413
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 660..691, e-value: 7.0E-6, score: 23.9
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 660..689, e-value: 6.6E-5, score: 22.9; coord: 696..723, e-value: 1.4, score: 9.3
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 657..691, score: 11.169622
None	No IPR available	GENE3D	3.30.1370.110		coord: 827..910, e-value: 1.3E-7, score: 33.7
None	No IPR available	PANTHER	PTHR47932:SF53	OS02G0120000 PROTEIN	coord: 497..919
None	No IPR available	PANTHER	PTHR47932	ATPASE EXPRESSION PROTEIN 3	coord: 497..919
None	No IPR available	CDD	cd17330	MFS_SLC46_TetA_like	coord: 10..413, e-value: 1.83896E-53, score: 188.171
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 552..786, e-value: 3.1E-19, score: 71.5
IPR011701	Major facilitator superfamily	PFAM	PF07690	MFS_1	coord: 10..373, e-value: 2.7E-24, score: 85.8
IPR020846	Major facilitator superfamily domain	PROSITE	PS50850	MFS	coord: 1..192, score: 10.080136
IPR036063	Smr domain superfamily	SUPERFAMILY	160443	SMR domain-like	coord: 825..910
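
The `coord: start..end` entries in the InterPro table give the residue range each signature covers on the predicted protein. A small Python sketch for extracting those ranges and testing whether two of them overlap is given here; the helper names `parse_coords` and `overlaps` are hypothetical and introduced only for illustration.

```python
import re

# One "coord: start..end" entry, as printed in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) residue range found in the text."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def overlaps(a, b):
    """True if two inclusive residue ranges share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


# Ranges copied from the table above: PFAM MFS_1, PFAM PPR, SMART SMR_2.
mfs = parse_coords("coord: 10..373")[0]
ppr = parse_coords("coord: 660..689\ncoord: 696..723")
smr = parse_coords("coord: 825..912")[0]

print(mfs, ppr, smr)
print(any(overlaps(mfs, hit) for hit in ppr))  # MFS vs. PPR ranges
print(overlaps(mfs, smr))                      # MFS vs. Smr range
```

Using the ranges listed above, the MFS signature lies entirely upstream of the PPR and Smr signatures, which mirrors the two separate query ranges seen in the BLAST alignments.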

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmoCh20G007310.1	CmoCh20G007310.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0055085	transmembrane transport
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0005515	protein binding
molecular_function	GO:0022857	transmembrane transporter activity