CcUC04G080720 (gene) Watermelon (PI 537277) v1

Overview
NameCcUC04G080720
Typegene
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCicolChr04: 34879013 .. 34883236 (-)
RNA-Seq ExpressionCcUC04G080720
SyntenyCcUC04G080720
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTCGTTAATCTCTCCCATCAATTGCTTAAACCTGCTTGCCGTCAATTTGCTTTCTCCTCCAAAGGCTTCAACTGTTGGGCTTTACGTATTCGAAATGCTCCCTCCCTCCATAGAGCACTTGCTATCTACTCCCAGATGCACCGCCAATCCGTTCCCCACGACAGCTTCTCAATTCTATTTATGCTCAAAGCCTGCGCTCCTTCCAACAATCTCTCAACTCTTCACCATCTCCATGCCCATATTACTAAACTTGGTTTCACTACTCATGTCTTTGTCGCTACATCCCTACTCTATGCATACGTCCTTAACTCCTTTCAACTTGCTTGCTTGGTGTTCGATGAAATGCCCCACAAAAACACTGTTACCTGGAACACTATGATTTTGGGGTATTCCAAGACGGGGGATGTAGATAGAGCTCGCCAGCTGTTTGATCAAATGCCCTCAAGAGACTTGGTATCTTGGTCCACCATGATTGCTGCCTACATTAACAATCGCAATTACAGGGGTGGTTTGCTTCTTTTCCAAGATATGATAGCTAACGGAATAAATCCTGACCAGATGGCGGCAGGATCAATCTTAAATGGGTGTGCTCATATGGGCTCTTTAGGATTGTTGGCTGGAAAATCGGTTCATGGTTTTGTGGTCAAGAATAGGTGGGAACTCAACCTGGAGCTTGGTACAGTTTTGGTTGATATGTACGCCAAGTGTGGATTTTTGAAGTATGCTTGCCAGATATTTCTTTTGATGTCTGAAAAGAATGTCAAGACTTGGACTGCTCTGATATGTGGTTTGGCCCAGCATGGCTACTGCAAGGATGCATTAGTTTTATTTGAGACGATGTCACATGAAGGTGTGGAACCGAATGAATTGACTTTTACTGGGGTTTTAAGTGCATGTGCCCATGCAGGGTTTGTTCAAGAAGGTCGCAAATACTTTAACATGATTGAAGAATATGGATTAGAAATAAGGATTCAACATTATGGTTGCATGGTTGATTTGTTGGGCAAGTCGGGATTGTTGGAGGAGGCCTATGGGGTTATTAAGACTATGAGACTTGAACCTAATGTCACTGTGTGGAGCTCTCTTTTGTCGGCCTGTAAGCAACATAAAAGCTTTGACATGGCCGAGAGAGTCATTGAGCAGATATTGGAAACGACAGAACCCAAGAATCATGGTGGAATTTACTCTCTTATATGTGATTTGTATGTTCTAGAGGAGAAGTGGGATGATGCAGAAAAGATAAGGAATTTACTGAGCCAAAATGTCCCGAAGGTTAGGGCCTATAGCCTTATCAGAAGTGGATTATAATTGGCTACAATTGCATGATTCATTCTTTATGGATTAGAATGGTAATTGATGATGGTCAACTTTATAAATTATTTTCACTTGTGTGTATCTTATTTTTCCAATGTGAATTTCTCGACATGGGACAATGTCTATTTGAATTTACCATTTCTCGATCAAATCCCACCTGTTTAGGCTTCATAGACTGATACCATAATAAATGAGCATAAAATTCATTTCAAAACCAATTAATAATGAGTAGCCATATATCTTATTTTATTGTGGGATCCTTCATAAAAGCAAATCTTTAGTTTGCAGCTTCTCAAATGCCAACTCTAGCCAGGTTTATCTGATCAAAAGAGCGTGTCCTTATTTTCTGCGCGGACATCAAATACTACAACATTGGGGAATGATTGCTGACTGATGTTAGTGCTATGTAGGCGCGCTTACTTTTCTGGCCATTGCTTGATCTACATATGCTGCTCTCTAGCCCATTCGAGATTGTCATGCGAATGCGAGTTCGACAAAGATTTTTTGGCCAAATGATAGATATTAGTCGACATTCGTAGTTTTGCTACTCAACAAAACGAGAACTACAAGTTGTAATTGGGCTGGGGTCTAAACAAAACTACCACAAATGGGCTCAAAATGGCCCAAAACTTGGCGTAACAAAACAGGCCCACTTTCTAGTACGTTGCCCAGTGTCATGCTGCCCACAGTTAATGCCATTAGCTTCTATCCATCGCAGCGCAACAAGGCAAATCAGCGGTGTGCGGGTGGAAGCTAATCCTGTTTTGTTTTATCTTCTTCTGAAACTTGAAACTAGCGTTTGTTGGATATGAGGTCGCTGACAATTTCTGTTGGGGCTGCCACTCCCAAAACGCTCAGCTCCAACAACACATCACCCAATAATTGTCAATCAACGAAACAAGTCTCCAAATTTAACTACATTCGTACCCCCAAAAAGCTCATTTTCTCATCAAGAACTGGAAGTGAAGGTGAACCCAGCTTCATTTGGCAGAGGAGAAGCAATTGGGTGTGGATTCCTGCTCCTTGCCAAAGTTCTTCTACAACCACTGCCTGCAGCTACTGAAGCCACACCATGTGAATTTACAACAGCTCCGTCAGGCCTAGCATTCTGTGATAAAGTTGTCGGGACAGCCCCGAGGCTGAGAAAGGACAGCTAATCAAGGCATGCCTTCAATCCCTTTTGCCTCATTGTTCTCTACCATAGATTGAAGGATCTGAAATAAGGTGAAGAGTGCGTGCATATATAATTGATGTTGTATTGTGGTATTGTATACCCATGTGAAGTGACAAATTGGTTATTGGTAGGCACATTATGTTGGAAAATTAGAGAGCGGAAAGGTGTTTGACAGCAGCCATAATCGGGGGAAACCGTTAACCTTTCGAATTGGGGTTGGTGAGGTGTGTATATATATATATAGACACATGAGGAGTTATTAATCGATGGAATTAAAGTAAAAATGTACAGGTTATAAAAGGTTGGGATGAAGGTATTCTTGGGGGCAATGGAGTTCCAGCGATACTTCCGGGTACTTTTTCCATGTAGATTGAAAAAATGGAAATTGGAATGGCGTTAATTAACATAAATTTCCTTGTGTGATTTGTGAAGGAGGTAAGTGGGTGCTAAAGCTTCCTCCAGACCTTGGGTATGGCGCAAGAGGAGCTGGATGTAGAGGAGGAGGTACGACAATGAAATTGCTGGGGAAGGGGTTTATGAATTCAATTACTTCATAATCTTAATTAATGTTGTAGGTTGGAAGTGGGAAAAGAGTTTTGAGTTGACAGTGGTTTTTGTTTGATTGCTGATGCAGGGTCATGTGTAATTCCTCCTAACTCAGTTCTCTTATTTGATGTGGAGTTCATTGGGAAGGCATGACATCAGAACAGCTTCAACCCTCGTTCTCATCCATTTCCTATTCGTTTCATATCTTCAAATAACCTTCTTTTTATTTATTTACAATTACAATCACACACTTCTCATACTTCCTGCCTTTCTTCGCCTCTTATTCTCTCTCCACTTGTGAATGCTAAGGGTTGTTTAGTTGTCATTATTTTTTTATGGAATTTTTCTTCTCCACGCCATCCAATTGAGAAACATTGTCTCGTAAATGGCATACACATTGTCTGTATTCTCTTTGAACTCCTGTATTACCTTTGTTCAGGGTCATCAGATGAGAGATCTTCCTCTGATTCCGAGAAATTGGTGAGATATTTCCTCAGCATCGAGAGGGGAAAGAGGAGACGGTAGTCAATCAAATGGGTTGGTGGACTGTCATTTCTCCACTTTGCTGAGAGAGAGAGTGTAGCAATAAGGTAGGCGTGTCAAGGAAGAAGGGGCAGTCCAAAGAGGTTGAAGGCCAACCTTAATATTAATAGAATTCGAAATGACTTCATGGAGGGCCATAGCTTCGAAGAGCTCAACAAATCTAATAAAAAGCTGAAACTTCAGTATTGTTTGCCATCGGAGTTTCAAAAAATAGCTTCAAGATTTATCTCGTAGCTGCCAACAACTACATCTGAATTAACCTTGAAGGATTGCTTAGGGGAGGGGATCCAATAAGAAACAACAGCGTGAATTTAATCAATAGAATAAATTACATTAGATGGATATATTGATTTAGAAAATAAAAAAAAAAAAACCAATCAATTTTATCTATAAACTTGACAAGTTTGTATCAATCTTCCAACAAATTCTAAAAATCACCAATTTAATAAAAATTAAATTTTGCATAAAATTAAATTTGGTTAATGCATAAAATCAACCATTAAATAGGAAATTAAAACTTACAAATTTCAATTACAACAACTATATCAACAACAATAAACTAGCTCTATGCCGTTTTTGTTTGAAATCTATCAATTTACATAAAAATTAGTCTTTTTGTAATTGAACTCTGTTAGTCTTTTGAG

mRNA sequence

ATGCTCGTTAATCTCTCCCATCAATTGCTTAAACCTGCTTGCCGTCAATTTGCTTTCTCCTCCAAAGGCTTCAACTGTTGGGCTTTACGTATTCGAAATGCTCCCTCCCTCCATAGAGCACTTGCTATCTACTCCCAGATGCACCGCCAATCCGTTCCCCACGACAGCTTCTCAATTCTATTTATGCTCAAAGCCTGCGCTCCTTCCAACAATCTCTCAACTCTTCACCATCTCCATGCCCATATTACTAAACTTGGTTTCACTACTCATGTCTTTGTCGCTACATCCCTACTCTATGCATACGTCCTTAACTCCTTTCAACTTGCTTGCTTGGTGTTCGATGAAATGCCCCACAAAAACACTGTTACCTGGAACACTATGATTTTGGGGTATTCCAAGACGGGGGATGTAGATAGAGCTCGCCAGCTGTTTGATCAAATGCCCTCAAGAGACTTGGTATCTTGGTCCACCATGATTGCTGCCTACATTAACAATCGCAATTACAGGGGTGGTTTGCTTCTTTTCCAAGATATGATAGCTAACGGAATAAATCCTGACCAGATGGCGGCAGGATCAATCTTAAATGGGTGTGCTCATATGGGCTCTTTAGGATTGTTGGCTGGAAAATCGGTTCATGGTTTTGTGGTCAAGAATAGGTGGGAACTCAACCTGGAGCTTGGTACAGTTTTGGTTGATATGTACGCCAAGTGTGGATTTTTGAAGTATGCTTGCCAGATATTTCTTTTGATGTCTGAAAAGAATGTCAAGACTTGGACTGCTCTGATATGTGGTTTGGCCCAGCATGGCTACTGCAAGGATGCATTAGTTTTATTTGAGACGATGTCACATGAAGGTGTGGAACCGAATGAATTGACTTTTACTGGGGTTTTAAGTGCATGTGCCCATGCAGGGTTTGTTCAAGAAGGTCGCAAATACTTTAACATGATTGAAGAATATGGATTAGAAATAAGGATTCAACATTATGGTTGCATGGTTGATTTGTTGGGCAAGTCGGGATTGTTGGAGGAGGCCTATGGGGTTATTAAGACTATGAGACTTGAACCTAATGTCACTGTGTGGAGCTCTCTTTTGTCGGCCTGTAAGCAACATAAAAGCTTTGACATGGCCGAGAGAGTCATTGAGCAGATATTGGAAACGACAGAACCCAAGAATCATGGTGGAATTTACTCTCTTATATGTGATTTGTATGTTCTAGAGGAGAAGTGGGATGATGCAGAAAAGATAAGGAATTTACTGAGCCAAAATGTCCCGAAGGTTAGGGCCTATAGCCTTATCAGAAGTTTATCTGATCAAAAGAGCCCCATTCGAGATTGTCATGCGAATCGAGAACTACAAGTTGTAATTGGGCTGGGGTCTAAACAAAACTACCACAAATGGGCTCAAAATGGCCCAAAACTTGGCGTAACAAAACAGGCCCACTTTCTAGTACGTTGCCCAGTGTCATGCTGCCCACAGTTAATGCCATTAGCTTCTATCCATCGCAGCGCAACAAGGCAAATCAGCGGTGTGCGGGTGGAAGCTAATCCTTTGGGGCTGCCACTCCCAAAACGCTCAGCTCCAACAACACATCACCCAATAATTGTCAATCAACGAAACAAGTCTCCAAATTTAACTACATTCGTACCCCCAAAAAGCTCATTTTCTCATCAAGAACTGGAAGTGAAGGTGAACCCAGCTTCATTTGGCAGAGGAGAAGCAATTGGGTGTGGATTCCTGCTCCTTGCCAAAGTTCTTCTACAACCACTGCCTGCAGCTACTGAAGCCACACCATGTGAATTTACAACAGCTCCGTCAGGCCTAGCATTCTGTGATAAAGCACATTATGTTGGAAAATTAGAGAGCGGAAAGGTGTTTGACAGCAGCCATAATCGGGGGAAACCGTTAACCTTTCGAATTGGGGTTGGTGAGGTTATAAAAGGTTGGGATGAAGGTATTCTTGGGGGCAATGGAGTTCCAGCGATACTTCCGGGAGGTAAGTGGGTGCTAAAGCTTCCTCCAGACCTTGGGTATGGCGCAAGAGGAGCTGGATGTAGAGGAGGAGGGTCATGTGTAATTCCTCCTAACTCAGTTCTCTTATTTGATGTGGAGTTCATTGGGAAGGCATGACATCAGAACAGCTTCAACCCTCGTTCTCATCCATTTCCTATTCGTTTCATATCTTCAAATAACCTTCTTTTTATTTATTTACAATTACAATCACACACTTCTCATACTTCCTGCCTTTCTTCGCCTCTTATTCTCTCTCCACTTGTGAATGCTAAGGGTTGTTTAGTTGTCATTATTTTTTTATGGAATTTTTCTTCTCCACGCCATCCAATTGAGAAACATTGTCTCGTAAATGGCATACACATTGTCTGTATTCTCTTTGAACTCCTGTATTACCTTTGTTCAGGGTCATCAGATGAGAGATCTTCCTCTGATTCCGAGAAATTGGTGAGATATTTCCTCAGCATCGAGAGGGGAAAGAGGAGACGGTAGTCAATCAAATGGGTTGGTGGACTGTCATTTCTCCACTTTGCTGAGAGAGAGAGTGTAGCAATAAGGTAGGCGTGTCAAGGAAGAAGGGGCAGTCCAAAGAGGTTGAAGGCCAACCTTAATATTAATAGAATTCGAAATGACTTCATGGAGGGCCATAGCTTCGAAGAGCTCAACAAATCTAATAAAAAGCTGAAACTTCAGTATTGTTTGCCATCGGAGTTTCAAAAAATAGCTTCAAGATTTATCTCGTAGCTGCCAACAACTACATCTGAATTAACCTTGAAGGATTGCTTAGGGGAGGGGATCCAATAAGAAACAACAGCGTGAATTTAATCAATAGAATAAATTACATTAGATGGATATATTGATTTAGAAAATAAAAAAAAAAAAACCAATCAATTTTATCTATAAACTTGACAAGTTTGTATCAATCTTCCAACAAATTCTAAAAATCACCAATTTAATAAAAATTAAATTTTGCATAAAATTAAATTTGGTTAATGCATAAAATCAACCATTAAATAGGAAATTAAAACTTACAAATTTCAATTACAACAACTATATCAACAACAATAAACTAGCTCTATGCCGTTTTTGTTTGAAATCTATCAATTTACATAAAAATTAGTCTTTTTGTAATTGAACTCTGTTAGTCTTTTGAG

Coding sequence (CDS)

ATGCTCGTTAATCTCTCCCATCAATTGCTTAAACCTGCTTGCCGTCAATTTGCTTTCTCCTCCAAAGGCTTCAACTGTTGGGCTTTACGTATTCGAAATGCTCCCTCCCTCCATAGAGCACTTGCTATCTACTCCCAGATGCACCGCCAATCCGTTCCCCACGACAGCTTCTCAATTCTATTTATGCTCAAAGCCTGCGCTCCTTCCAACAATCTCTCAACTCTTCACCATCTCCATGCCCATATTACTAAACTTGGTTTCACTACTCATGTCTTTGTCGCTACATCCCTACTCTATGCATACGTCCTTAACTCCTTTCAACTTGCTTGCTTGGTGTTCGATGAAATGCCCCACAAAAACACTGTTACCTGGAACACTATGATTTTGGGGTATTCCAAGACGGGGGATGTAGATAGAGCTCGCCAGCTGTTTGATCAAATGCCCTCAAGAGACTTGGTATCTTGGTCCACCATGATTGCTGCCTACATTAACAATCGCAATTACAGGGGTGGTTTGCTTCTTTTCCAAGATATGATAGCTAACGGAATAAATCCTGACCAGATGGCGGCAGGATCAATCTTAAATGGGTGTGCTCATATGGGCTCTTTAGGATTGTTGGCTGGAAAATCGGTTCATGGTTTTGTGGTCAAGAATAGGTGGGAACTCAACCTGGAGCTTGGTACAGTTTTGGTTGATATGTACGCCAAGTGTGGATTTTTGAAGTATGCTTGCCAGATATTTCTTTTGATGTCTGAAAAGAATGTCAAGACTTGGACTGCTCTGATATGTGGTTTGGCCCAGCATGGCTACTGCAAGGATGCATTAGTTTTATTTGAGACGATGTCACATGAAGGTGTGGAACCGAATGAATTGACTTTTACTGGGGTTTTAAGTGCATGTGCCCATGCAGGGTTTGTTCAAGAAGGTCGCAAATACTTTAACATGATTGAAGAATATGGATTAGAAATAAGGATTCAACATTATGGTTGCATGGTTGATTTGTTGGGCAAGTCGGGATTGTTGGAGGAGGCCTATGGGGTTATTAAGACTATGAGACTTGAACCTAATGTCACTGTGTGGAGCTCTCTTTTGTCGGCCTGTAAGCAACATAAAAGCTTTGACATGGCCGAGAGAGTCATTGAGCAGATATTGGAAACGACAGAACCCAAGAATCATGGTGGAATTTACTCTCTTATATGTGATTTGTATGTTCTAGAGGAGAAGTGGGATGATGCAGAAAAGATAAGGAATTTACTGAGCCAAAATGTCCCGAAGGTTAGGGCCTATAGCCTTATCAGAAGTTTATCTGATCAAAAGAGCCCCATTCGAGATTGTCATGCGAATCGAGAACTACAAGTTGTAATTGGGCTGGGGTCTAAACAAAACTACCACAAATGGGCTCAAAATGGCCCAAAACTTGGCGTAACAAAACAGGCCCACTTTCTAGTACGTTGCCCAGTGTCATGCTGCCCACAGTTAATGCCATTAGCTTCTATCCATCGCAGCGCAACAAGGCAAATCAGCGGTGTGCGGGTGGAAGCTAATCCTTTGGGGCTGCCACTCCCAAAACGCTCAGCTCCAACAACACATCACCCAATAATTGTCAATCAACGAAACAAGTCTCCAAATTTAACTACATTCGTACCCCCAAAAAGCTCATTTTCTCATCAAGAACTGGAAGTGAAGGTGAACCCAGCTTCATTTGGCAGAGGAGAAGCAATTGGGTGTGGATTCCTGCTCCTTGCCAAAGTTCTTCTACAACCACTGCCTGCAGCTACTGAAGCCACACCATGTGAATTTACAACAGCTCCGTCAGGCCTAGCATTCTGTGATAAAGCACATTATGTTGGAAAATTAGAGAGCGGAAAGGTGTTTGACAGCAGCCATAATCGGGGGAAACCGTTAACCTTTCGAATTGGGGTTGGTGAGGTTATAAAAGGTTGGGATGAAGGTATTCTTGGGGGCAATGGAGTTCCAGCGATACTTCCGGGAGGTAAGTGGGTGCTAAAGCTTCCTCCAGACCTTGGGTATGGCGCAAGAGGAGCTGGATGTAGAGGAGGAGGGTCATGTGTAATTCCTCCTAACTCAGTTCTCTTATTTGATGTGGAGTTCATTGGGAAGGCATGA

Protein sequence

MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKNTVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIANGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSACAHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLSQNVPKVRAYSLIRSLSDQKSPIRDCHANRELQVVIGLGSKQNYHKWAQNGPKLGVTKQAHFLVRCPVSCCPQLMPLASIHRSATRQISGVRVEANPLGLPLPKRSAPTTHHPIIVNQRNKSPNLTTFVPPKSSFSHQELEVKVNPASFGRGEAIGCGFLLLAKVLLQPLPAATEATPCEFTTAPSGLAFCDKAHYVGKLESGKVFDSSHNRGKPLTFRIGVGEVIKGWDEGILGGNGVPAILPGGKWVLKLPPDLGYGARGAGCRGGGSCVIPPNSVLLFDVEFIGKA
Homology
BLAST of CcUC04G080720 vs. NCBI nr
Match: XP_038877521.1 (pentatricopeptide repeat-containing protein At5g66520-like [Benincasa hispida])

HSP 1 Score: 796.6 bits (2056), Expect = 1.8e-226
Identity = 387/434 (89.17%), Postives = 408/434 (94.01%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLVNLS  LLKPACRQFA S+KGFN WALRIRNAPSLH+ALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLVNLSCHLLKPACRQFALSAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKN 120
           FMLKACA SNN+S LHHLHAHITKLGFT HVFVATSLLYAYVL+S QLACLVFDEMPHK+
Sbjct: 61  FMLKACARSNNVSVLHHLHAHITKLGFTAHVFVATSLLYAYVLHSIQLACLVFDEMPHKS 120

Query: 121 TVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIA 180
           TVTWNTMIL YSKTGDVD ARQLFDQMPSRDL SWS+MI AY+NNRNYR GLL+FQDMI 
Sbjct: 121 TVTWNTMILRYSKTGDVDAARQLFDQMPSRDLASWSSMITAYVNNRNYRAGLLIFQDMIV 180

Query: 181 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
           NGI+PDQ+A GSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNL+LGTVLVDMYAKCGFL
Sbjct: 181 NGISPDQIAIGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLQLGTVLVDMYAKCGFL 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSAC 300
           KYACQIF  MSEKNV+TWTALICG+AQHGY K+AL+LFETM  EGVEPNELTFTGVLSAC
Sbjct: 241 KYACQIFHFMSEKNVRTWTALICGMAQHGYGKEALLLFETMRREGVEPNELTFTGVLSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAGFV+EGRKYFNMIEEYGLEIRIQHYGCMVDLLG+SGLLEEAYGVIK+MRLEPNV VW
Sbjct: 301 VHAGFVEEGRKYFNMIEEYGLEIRIQHYGCMVDLLGRSGLLEEAYGVIKSMRLEPNVIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQHK FDMAERVIEQIL+ TEP+NHGGIYSLI DLYVLEEKWDDAEKIRNLL+
Sbjct: 361 SSLLSACKQHKRFDMAERVIEQILKKTEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVPKVRAYSLIRS 435
           QNV KVRAYSLIRS
Sbjct: 421 QNVRKVRAYSLIRS 434

BLAST of CcUC04G080720 vs. NCBI nr
Match: XP_023544798.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 780.8 bits (2015), Expect = 1.0e-221
Identity = 380/434 (87.56%), Postives = 400/434 (92.17%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           ML++ S  LL PA RQFA  +KGFN WALRIRNAPSLH+ALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKN 120
           F+LKACA SNNLS LHHLHAHITKLGFTTHVFVATSLLYAYVLNSF+LACL+FDEMPHKN
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIA 180
           TVTWNTMI GYSKTGDVDRARQLFD MPS+DL SWS  IAAY+NNRNYRGGLLLFQDMI 
Sbjct: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180

Query: 181 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
            GI PDQMA GSIL GCA+MGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGF 
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSAC 300
           KYACQ+F LMSEKNV+TWTALICGLAQHGYCK+AL LFE M +E VEPNELTFTG+LSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAGFVQEGRKYFNMIEEYGLE RIQHYGCMVDLLG+SGLLEEAYGVIK MRLEPN+ VW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQHKSFDMAERVIEQIL+  EP+NHGGIYSLI DLYVLEEKWDDAEKIRNLL+
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVPKVRAYSLIRS 435
           QNV KVRAYSLIRS
Sbjct: 421 QNVRKVRAYSLIRS 434

BLAST of CcUC04G080720 vs. NCBI nr
Match: XP_022978962.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita maxima])

HSP 1 Score: 778.5 bits (2009), Expect = 5.0e-221
Identity = 378/434 (87.10%), Postives = 401/434 (92.40%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           ML++ S  LL PA RQFA S+KGFN WALRIRNAPSL++ALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALSAKGFNSWALRIRNAPSLNKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKN 120
           F+LKACA SNNLS LHHLHAHITKLGFTTHVFVATSLLYAYVLNSF+LACL+FDEMPHKN
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIA 180
           TVTWNTMI GYSKTGDVDRARQLFD MPSRDL SWS  IAAY+NNRNYRGGLLLFQDMI 
Sbjct: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSRDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180

Query: 181 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
            GI PDQMA GSIL GCA+MGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGF 
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSAC 300
           KYACQ+F LMSEKNV+TWTALICGLA+HGYCK+AL LFE M +E VEPNELTFTG+LSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAKHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAGFVQEGRKYFNMIEEYGLE RIQHYGCMVDLLG+SGLLEEAYGVI  MRLEPN+ VW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVINNMRLEPNIIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQHKSFDMAERVIEQIL+ +EP+NHGG+YSLI DLYVLEEKWDDAEKIRNLL+
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKSEPENHGGVYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVPKVRAYSLIRS 435
           QNV KVRAYSLIRS
Sbjct: 421 QNVRKVRAYSLIRS 434

BLAST of CcUC04G080720 vs. NCBI nr
Match: KAA0052417.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 776.9 bits (2005), Expect = 1.5e-220
Identity = 380/434 (87.56%), Postives = 400/434 (92.17%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLVNL   LL P+CRQFAFS+KGFN WALRIRNAPSLH+ALAI+SQMHRQSVPHDSFSIL
Sbjct: 1   MLVNL---LLNPSCRQFAFSAKGFNSWALRIRNAPSLHKALAIFSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKN 120
           FMLKACA SNNLS LHHLHAHITKLGFTTHVFVATSLL++YVL+SFQLA LVFDEMPHKN
Sbjct: 61  FMLKACASSNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIA 180
           +VTWNTMI GYSKTGDV  ARQLFD+MPSRDL SWS MIAAYINNRNYRG LLLFQDMI 
Sbjct: 121 SVTWNTMISGYSKTGDVHTARQLFDRMPSRDLASWSAMIAAYINNRNYRGALLLFQDMII 180

Query: 181 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
           NGINPDQMAAGSILNGCAHMGSLG LAGKSVHGFVVKNRWELNLELGTVLV MYA+CG L
Sbjct: 181 NGINPDQMAAGSILNGCAHMGSLGSLAGKSVHGFVVKNRWELNLELGTVLVHMYARCGLL 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSAC 300
           KYACQIF LMSE+NV+TWTALICGLA HG CK+AL LFETM HEGVEPNELTFTGVLSAC
Sbjct: 241 KYACQIFHLMSERNVRTWTALICGLAHHGCCKEALALFETMRHEGVEPNELTFTGVLSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAG VQEGRKYFNMIEEYGLEIRIQHYGC VDLLG+SGLLEEAYGVIK+MR EPNV VW
Sbjct: 301 VHAGLVQEGRKYFNMIEEYGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRFEPNVIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQHKSFD+AERVIEQILE TEP NHGG+YSL+ DLYVL+EKWDDAE IRNLL+
Sbjct: 361 SSLLSACKQHKSFDLAERVIEQILEKTEPNNHGGVYSLVSDLYVLQEKWDDAENIRNLLN 420

Query: 421 QNVPKVRAYSLIRS 435
           Q V KVRAYSLIRS
Sbjct: 421 QKVRKVRAYSLIRS 431

BLAST of CcUC04G080720 vs. NCBI nr
Match: XP_022925591.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita moschata] >KAG6581530.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 776.2 bits (2003), Expect = 2.5e-220
Identity = 378/434 (87.10%), Postives = 399/434 (91.94%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           ML++ S  LL PA RQFA  +KGFN WALRIRNAPSL +ALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLQKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKN 120
           F+LKACA SNNLS LHHLHAHITKLGFTTHVFVATSLLYAYVLNSF+LACL+FDEMPHKN
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIA 180
           TVTWNTMI GYSKTGDVDRARQLFD MPS+DL SWS  IAAY+NNRNYRGGLLLFQDMI 
Sbjct: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180

Query: 181 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
            GI PDQMA GSIL GCA+MGSLGLLAGKSVHGFVVKNRW+LNLELGTVLVDMYAKCGF 
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWKLNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSAC 300
           KYACQ+F LMSEKNV+TWTALICGLAQHGYCK+AL LFE M +E VEPNELTFTG+LSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAGFVQEGRKYFNMIEEYGLE RIQHYGCMVDLLG+SGLLEEAYGVIK MRLEPN+ VW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQHKSFDMAERVIEQIL+  EP+NHGGIYSLI DLYVLEEKWDDAEKIRNLL+
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVPKVRAYSLIRS 435
           QNV KVRAYSLIRS
Sbjct: 421 QNVRKVRAYSLIRS 434

BLAST of CcUC04G080720 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 310.1 bits (793), Expect = 6.6e-83
Identity = 161/412 (39.08%), Postives = 257/412 (62.38%), Query Frame = 0

Query: 27  WALRIRN---APSLHRALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHIT 86
           W L IR    +    R+L +Y +M   S PH++++   +LKAC+  +       +HA IT
Sbjct: 83  WNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQIT 142

Query: 87  KLGFTTHVFVATSLLYAY-VLNSFQLACLVFDEMPHKNTVTWNTMILGYSKTGDVDRARQ 146
           KLG+   V+   SL+ +Y V  +F+LA L+FD +P  + V+WN++I GY K G +D A  
Sbjct: 143 KLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALT 202

Query: 147 LFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIANGINPDQMAAGSILNGCAHMGS 206
           LF +M  ++ +SW+TMI+ Y+     +  L LF +M  + + PD ++  + L+ CA +G+
Sbjct: 203 LFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQLGA 262

Query: 207 LGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALI 266
           L    GK +H ++ K R  ++  LG VL+DMYAKCG ++ A ++F  + +K+V+ WTALI
Sbjct: 263 LE--QGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 322

Query: 267 CGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSACAHAGFVQEGRK-YFNMIEEYGL 326
            G A HG+ ++A+  F  M   G++PN +TFT VL+AC++ G V+EG+  +++M  +Y L
Sbjct: 323 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 382

Query: 327 EIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQHKSFDMAERVIE 386
           +  I+HYGC+VDLLG++GLL+EA   I+ M L+PN  +W +LL AC+ HK+ ++ E  I 
Sbjct: 383 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEE-IG 442

Query: 387 QILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLL-SQNVPKVRAYSLI 433
           +IL   +P  HGG Y    +++ +++KWD A + R L+  Q V KV   S I
Sbjct: 443 EILIAIDP-YHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI 490

BLAST of CcUC04G080720 vs. ExPASy Swiss-Prot
Match: Q56X05 (Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX=3702 GN=EMB1444 PE=2 SV=2)

HSP 1 Score: 280.0 bits (715), Expect = 7.3e-74
Identity = 149/416 (35.82%), Postives = 235/416 (56.49%), Query Frame = 0

Query: 39  RALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLL 98
           R+L +Y +M R SV   S++   ++KA + ++       L AHI K GF  HV + T+L+
Sbjct: 109 RSLELYVRMLRDSVSPSSYTYSSLVKASSFASRFG--ESLQAHIWKFGFGFHVKIQTTLI 168

Query: 99  YAY-VLNSFQLACLVFDEMPHKNTVTWNTM------------------------------ 158
             Y      + A  VFDEMP ++ + W TM                              
Sbjct: 169 DFYSATGRIREARKVFDEMPERDDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNC 228

Query: 159 -ILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIANGINPD 218
            I GY   G++++A  LF+QMP +D++SW+TMI  Y  N+ YR  + +F  M+  GI PD
Sbjct: 229 LINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPD 288

Query: 219 QMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQI 278
           ++   ++++ CAH+G L +  GK VH + ++N + L++ +G+ LVDMY+KCG L+ A  +
Sbjct: 289 EVTMSTVISACAHLGVLEI--GKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLV 348

Query: 279 FLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSACAHAGFV 338
           F  + +KN+  W ++I GLA HG+ ++AL +F  M  E V+PN +TF  V +AC HAG V
Sbjct: 349 FFNLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLV 408

Query: 339 QEGRK-YFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLS 398
            EGR+ Y +MI++Y +   ++HYG MV L  K+GL+ EA  +I  M  EPN  +W +LL 
Sbjct: 409 DEGRRIYRSMIDDYSIVSNVEHYGGMVHLFSKAGLIYEALELIGNMEFEPNAVIWGALLD 468

Query: 399 ACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLSQ 422
            C+ HK+  +AE    +++   EP N  G Y L+  +Y  + +W D  +IR  + +
Sbjct: 469 GCRIHKNLVIAEIAFNKLM-VLEPMN-SGYYFLLVSMYAEQNRWRDVAEIRGRMRE 518

BLAST of CcUC04G080720 vs. ExPASy Swiss-Prot
Match: Q1PEU4 (Pentatricopeptide repeat-containing protein At2g44880 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E9 PE=2 SV=2)

HSP 1 Score: 273.9 bits (699), Expect = 5.2e-72
Identity = 139/315 (44.13%), Postives = 207/315 (65.71%), Query Frame = 0

Query: 112 VFDEMPHKNTVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGG 171
           +FDEM HK  +TW TMI GY    D+D AR+LFD MP R+LVSW+TMI  Y  N+  + G
Sbjct: 198 LFDEMTHKTVITWTTMIHGYCNIKDIDAARKLFDAMPERNLVSWNTMIGGYCQNKQPQEG 257

Query: 172 LLLFQDMIA-NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVL 231
           + LFQ+M A   ++PD +   S+L   +  G+L L  G+  H FV + + +  +++ T +
Sbjct: 258 IRLFQEMQATTSLDPDDVTILSVLPAISDTGALSL--GEWCHCFVQRKKLDKKVKVCTAI 317

Query: 232 VDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNE 291
           +DMY+KCG ++ A +IF  M EK V +W A+I G A +G  + AL LF TM  E  +P+E
Sbjct: 318 LDMYSKCGEIEKAKRIFDEMPEKQVASWNAMIHGYALNGNARAALDLFVTMMIE-EKPDE 377

Query: 292 LTFTGVLSACAHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKT 351
           +T   V++AC H G V+EGRK+F+++ E GL  +I+HYGCMVDLLG++G L+EA  +I  
Sbjct: 378 ITMLAVITACNHGGLVEEGRKWFHVMREMGLNAKIEHYGCMVDLLGRAGSLKEAEDLITN 437

Query: 352 MRLEPNVTVWSSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWD 411
           M  EPN  + SS LSAC Q+K  + AER++++ +E  EP+N G  Y L+ +LY  +++WD
Sbjct: 438 MPFEPNGIILSSFLSACGQYKDIERAERILKKAVE-LEPQNDGN-YVLLRNLYAADKRWD 497

Query: 412 DAEKIRNLLSQNVPK 426
           D   ++N++ +N  K
Sbjct: 498 DFGMVKNVMRKNQAK 507

BLAST of CcUC04G080720 vs. ExPASy Swiss-Prot
Match: Q9FG16 (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 272.7 bits (696), Expect = 1.2e-71
Identity = 141/397 (35.52%), Postives = 235/397 (59.19%), Query Frame = 0

Query: 39  RALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLL 98
           +A   Y+QM +  +  D+ +  F++KA +    +      H+ I + GF   V+V  SL+
Sbjct: 100 KAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLVGEQTHSQIVRFGFQNDVYVENSLV 159

Query: 99  YAYVLNSFQLAC-LVFDEMPHKNTVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWST 158
           + Y    F  A   +F +M  ++ V+W +M+ GY K G V+ AR++FD+MP R+L +WS 
Sbjct: 160 HMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCKCGMVENAREMFDEMPHRNLFTWSI 219

Query: 159 MIAAYINNRNYRGGLLLFQDMIANGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVK 218
           MI  Y  N  +   + LF+ M   G+  ++    S+++ CAH+G+L    G+  + +VVK
Sbjct: 220 MINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSVISSCAHLGALEF--GERAYEYVVK 279

Query: 219 NRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVL 278
           +   +NL LGT LVDM+ +CG ++ A  +F  + E +  +W+++I GLA HG+   A+  
Sbjct: 280 SHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPETDSLSWSSIIKGLAVHGHAHKAMHY 339

Query: 279 FETMSHEGVEPNELTFTGVLSACAHAGFVQEGRK-YFNMIEEYGLEIRIQHYGCMVDLLG 338
           F  M   G  P ++TFT VLSAC+H G V++G + Y NM +++G+E R++HYGC+VD+LG
Sbjct: 340 FSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKKDHGIEPRLEHYGCIVDMLG 399

Query: 339 KSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIY 398
           ++G L EA   I  M ++PN  +  +LL ACK +K+ ++AERV   +++      H G Y
Sbjct: 400 RAGKLAEAENFILKMHVKPNAPILGALLGACKIYKNTEVAERVGNMLIKV--KPEHSGYY 459

Query: 399 SLICDLYVLEEKWDDAEKIRNLLSQN-VPKVRAYSLI 433
            L+ ++Y    +WD  E +R+++ +  V K   +SLI
Sbjct: 460 VLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLI 492

BLAST of CcUC04G080720 vs. ExPASy Swiss-Prot
Match: Q9CA54 (Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H71 PE=2 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 2.6e-71
Identity = 156/461 (33.84%), Postives = 252/461 (54.66%), Query Frame = 0

Query: 38  HRALAIYSQMHRQS-VPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATS 97
           H ++A++ +M R+  V  DSFS  F++KA     +L T   +H    K G  +H+FV T+
Sbjct: 87  HNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGFQMHCQALKHGLESHLFVGTT 146

Query: 98  LLYAY----------------------VLNSFQLACL----------VFDEMPHKNTVTW 157
           L+  Y                        N+   AC           +FD+M  +N  +W
Sbjct: 147 LIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDVAGAREIFDKMLVRNHTSW 206

Query: 158 NTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIANGIN 217
           N M+ GY K G+++ A+++F +MP RD VSWSTMI    +N ++    L F+++   G++
Sbjct: 207 NVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAHNGSFNESFLYFRELQRAGMS 266

Query: 218 PDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYAC 277
           P++++   +L+ C+  GS     GK +HGFV K  +   + +   L+DMY++CG +  A 
Sbjct: 267 PNEVSLTGVLSACSQSGSFEF--GKILHGFVEKAGYSWIVSVNNALIDMYSRCGNVPMAR 326

Query: 278 QIFLLMSEKN-VKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSACAHA 337
            +F  M EK  + +WT++I GLA HG  ++A+ LF  M+  GV P+ ++F  +L AC+HA
Sbjct: 327 LVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYGVTPDGISFISLLHACSHA 386

Query: 338 GFVQEGRKYFN-MIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSS 397
           G ++EG  YF+ M   Y +E  I+HYGCMVDL G+SG L++AY  I  M + P   VW +
Sbjct: 387 GLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKAYDFICQMPIPPTAIVWRT 446

Query: 398 LLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIR-NLLSQ 456
           LL AC  H + ++AE+V +Q L   +P N G +  L+ + Y    KW D   IR +++ Q
Sbjct: 447 LLGACSSHGNIELAEQV-KQRLNELDPNNSGDLV-LLSNAYATAGKWKDVASIRKSMIVQ 506

BLAST of CcUC04G080720 vs. ExPASy TrEMBL
Match: A0A6J1IMI1 (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita maxima OX=3661 GN=LOC111478755 PE=4 SV=1)

HSP 1 Score: 778.5 bits (2009), Expect = 2.4e-221
Identity = 378/434 (87.10%), Postives = 401/434 (92.40%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           ML++ S  LL PA RQFA S+KGFN WALRIRNAPSL++ALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALSAKGFNSWALRIRNAPSLNKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKN 120
           F+LKACA SNNLS LHHLHAHITKLGFTTHVFVATSLLYAYVLNSF+LACL+FDEMPHKN
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIA 180
           TVTWNTMI GYSKTGDVDRARQLFD MPSRDL SWS  IAAY+NNRNYRGGLLLFQDMI 
Sbjct: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSRDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180

Query: 181 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
            GI PDQMA GSIL GCA+MGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGF 
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSAC 300
           KYACQ+F LMSEKNV+TWTALICGLA+HGYCK+AL LFE M +E VEPNELTFTG+LSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAKHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAGFVQEGRKYFNMIEEYGLE RIQHYGCMVDLLG+SGLLEEAYGVI  MRLEPN+ VW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVINNMRLEPNIIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQHKSFDMAERVIEQIL+ +EP+NHGG+YSLI DLYVLEEKWDDAEKIRNLL+
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKSEPENHGGVYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVPKVRAYSLIRS 435
           QNV KVRAYSLIRS
Sbjct: 421 QNVRKVRAYSLIRS 434

BLAST of CcUC04G080720 vs. ExPASy TrEMBL
Match: A0A5A7UD49 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold120G00120 PE=4 SV=1)

HSP 1 Score: 776.9 bits (2005), Expect = 7.0e-221
Identity = 380/434 (87.56%), Postives = 400/434 (92.17%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLVNL   LL P+CRQFAFS+KGFN WALRIRNAPSLH+ALAI+SQMHRQSVPHDSFSIL
Sbjct: 1   MLVNL---LLNPSCRQFAFSAKGFNSWALRIRNAPSLHKALAIFSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKN 120
           FMLKACA SNNLS LHHLHAHITKLGFTTHVFVATSLL++YVL+SFQLA LVFDEMPHKN
Sbjct: 61  FMLKACASSNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIA 180
           +VTWNTMI GYSKTGDV  ARQLFD+MPSRDL SWS MIAAYINNRNYRG LLLFQDMI 
Sbjct: 121 SVTWNTMISGYSKTGDVHTARQLFDRMPSRDLASWSAMIAAYINNRNYRGALLLFQDMII 180

Query: 181 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
           NGINPDQMAAGSILNGCAHMGSLG LAGKSVHGFVVKNRWELNLELGTVLV MYA+CG L
Sbjct: 181 NGINPDQMAAGSILNGCAHMGSLGSLAGKSVHGFVVKNRWELNLELGTVLVHMYARCGLL 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSAC 300
           KYACQIF LMSE+NV+TWTALICGLA HG CK+AL LFETM HEGVEPNELTFTGVLSAC
Sbjct: 241 KYACQIFHLMSERNVRTWTALICGLAHHGCCKEALALFETMRHEGVEPNELTFTGVLSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAG VQEGRKYFNMIEEYGLEIRIQHYGC VDLLG+SGLLEEAYGVIK+MR EPNV VW
Sbjct: 301 VHAGLVQEGRKYFNMIEEYGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRFEPNVIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQHKSFD+AERVIEQILE TEP NHGG+YSL+ DLYVL+EKWDDAE IRNLL+
Sbjct: 361 SSLLSACKQHKSFDLAERVIEQILEKTEPNNHGGVYSLVSDLYVLQEKWDDAENIRNLLN 420

Query: 421 QNVPKVRAYSLIRS 435
           Q V KVRAYSLIRS
Sbjct: 421 QKVRKVRAYSLIRS 431

BLAST of CcUC04G080720 vs. ExPASy TrEMBL
Match: A0A6J1ECM3 (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita moschata OX=3662 GN=LOC111432978 PE=4 SV=1)

HSP 1 Score: 776.2 bits (2003), Expect = 1.2e-220
Identity = 378/434 (87.10%), Postives = 399/434 (91.94%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           ML++ S  LL PA RQFA  +KGFN WALRIRNAPSL +ALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLQKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKN 120
           F+LKACA SNNLS LHHLHAHITKLGFTTHVFVATSLLYAYVLNSF+LACL+FDEMPHKN
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIA 180
           TVTWNTMI GYSKTGDVDRARQLFD MPS+DL SWS  IAAY+NNRNYRGGLLLFQDMI 
Sbjct: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180

Query: 181 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
            GI PDQMA GSIL GCA+MGSLGLLAGKSVHGFVVKNRW+LNLELGTVLVDMYAKCGF 
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWKLNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSAC 300
           KYACQ+F LMSEKNV+TWTALICGLAQHGYCK+AL LFE M +E VEPNELTFTG+LSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAGFVQEGRKYFNMIEEYGLE RIQHYGCMVDLLG+SGLLEEAYGVIK MRLEPN+ VW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQHKSFDMAERVIEQIL+  EP+NHGGIYSLI DLYVLEEKWDDAEKIRNLL+
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVPKVRAYSLIRS 435
           QNV KVRAYSLIRS
Sbjct: 421 QNVRKVRAYSLIRS 434

BLAST of CcUC04G080720 vs. ExPASy TrEMBL
Match: A0A1S3AYC4 (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucumis melo OX=3656 GN=LOC103484239 PE=4 SV=1)

HSP 1 Score: 772.7 bits (1994), Expect = 1.3e-219
Identity = 379/434 (87.33%), Postives = 398/434 (91.71%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLVNL   LL P+CRQFAFS+KGFN WALRIRNAPSLH+ALAI+SQMHRQSVPHDSFSIL
Sbjct: 1   MLVNL---LLNPSCRQFAFSAKGFNSWALRIRNAPSLHKALAIFSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKN 120
           FMLKACA SNNLS LHHLHAHITKLGFTTHVFVATSLL++YVL+SFQLA LVFDEMPHKN
Sbjct: 61  FMLKACASSNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIA 180
           +VTWNTMI GYSKTGDV  ARQLFD+MPSRDL SWS MIAAYINNRNYR  LLLFQDMI 
Sbjct: 121 SVTWNTMISGYSKTGDVHTARQLFDRMPSRDLASWSAMIAAYINNRNYRVALLLFQDMII 180

Query: 181 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
           NGINPDQMAAGSILNGCAHMGSLG LAGKSVHGFVVKNRWELNLELGTVLV MYAKCG L
Sbjct: 181 NGINPDQMAAGSILNGCAHMGSLGSLAGKSVHGFVVKNRWELNLELGTVLVHMYAKCGLL 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSAC 300
           KYACQIF LMSE+NV+TWTALICGLA HG CK+AL LFETM HEGVEPNELTFTGVLSAC
Sbjct: 241 KYACQIFHLMSERNVRTWTALICGLAHHGCCKEALALFETMRHEGVEPNELTFTGVLSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAG VQEGRKYFNMIEEYGLEIRIQHYGC VDLLG+SGLLEEAYGVIK+MR EPNV VW
Sbjct: 301 VHAGLVQEGRKYFNMIEEYGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRFEPNVIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQHKSFD+AERVIE ILE TEP NHGG+YSL+ DLYVL+EKWDDAE IRNLL+
Sbjct: 361 SSLLSACKQHKSFDLAERVIEHILEKTEPNNHGGVYSLVSDLYVLQEKWDDAENIRNLLN 420

Query: 421 QNVPKVRAYSLIRS 435
           Q V KVRAYSLIRS
Sbjct: 421 QKVRKVRAYSLIRS 431

BLAST of CcUC04G080720 vs. ExPASy TrEMBL
Match: A0A0A0KIK9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G538690 PE=4 SV=1)

HSP 1 Score: 768.5 bits (1983), Expect = 2.5e-218
Identity = 377/434 (86.87%), Postives = 397/434 (91.47%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRQFAFSSKGFNCWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLVNL   LL P+CR FAFS+KG N WALRIRNAPSLH+ALA YSQMHRQSVPHDSFSIL
Sbjct: 4   MLVNL---LLNPSCRHFAFSAKGVNSWALRIRNAPSLHKALAFYSQMHRQSVPHDSFSIL 63

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHKN 120
           FMLKACA SNNLS LHHLHAHITKLGFTTHVFVATSLL++YVL+SFQLA LVFDEMPHKN
Sbjct: 64  FMLKACASSNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKN 123

Query: 121 TVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIA 180
           +VTWNTMI GYSK GDV  ARQLFD+MPSRDL SWS MIAAYINNRNYRG LLLFQDMI 
Sbjct: 124 SVTWNTMISGYSKAGDVHTARQLFDRMPSRDLASWSAMIAAYINNRNYRGALLLFQDMII 183

Query: 181 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
           NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL
Sbjct: 184 NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 243

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSAC 300
           KYACQIF LMSE+NV+TWTALICGLA HG CK+ALVLFETM HEGVEPNE TFTGVLSAC
Sbjct: 244 KYACQIFNLMSERNVRTWTALICGLAHHGCCKEALVLFETMRHEGVEPNEFTFTGVLSAC 303

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAG VQEGRKYFNMIEE GLEIRIQHYGC VDLLG+SGLLEEAYGVIK+MRLEPNV VW
Sbjct: 304 VHAGLVQEGRKYFNMIEECGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRLEPNVIVW 363

Query: 361 SSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQHKSFD+AERVIEQILE  EP NH G+YSL+ DLYVL++KWDDAE IRNLL+
Sbjct: 364 SSLLSACKQHKSFDLAERVIEQILEKIEPDNHAGVYSLVSDLYVLQDKWDDAENIRNLLN 423

Query: 421 QNVPKVRAYSLIRS 435
           Q+V K RAYSLIRS
Sbjct: 424 QHVRKGRAYSLIRS 434

BLAST of CcUC04G080720 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 310.1 bits (793), Expect = 4.7e-84
Identity = 161/412 (39.08%), Postives = 257/412 (62.38%), Query Frame = 0

Query: 27  WALRIRN---APSLHRALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHIT 86
           W L IR    +    R+L +Y +M   S PH++++   +LKAC+  +       +HA IT
Sbjct: 83  WNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQIT 142

Query: 87  KLGFTTHVFVATSLLYAY-VLNSFQLACLVFDEMPHKNTVTWNTMILGYSKTGDVDRARQ 146
           KLG+   V+   SL+ +Y V  +F+LA L+FD +P  + V+WN++I GY K G +D A  
Sbjct: 143 KLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALT 202

Query: 147 LFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIANGINPDQMAAGSILNGCAHMGS 206
           LF +M  ++ +SW+TMI+ Y+     +  L LF +M  + + PD ++  + L+ CA +G+
Sbjct: 203 LFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQLGA 262

Query: 207 LGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALI 266
           L    GK +H ++ K R  ++  LG VL+DMYAKCG ++ A ++F  + +K+V+ WTALI
Sbjct: 263 LE--QGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 322

Query: 267 CGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSACAHAGFVQEGRK-YFNMIEEYGL 326
            G A HG+ ++A+  F  M   G++PN +TFT VL+AC++ G V+EG+  +++M  +Y L
Sbjct: 323 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 382

Query: 327 EIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQHKSFDMAERVIE 386
           +  I+HYGC+VDLLG++GLL+EA   I+ M L+PN  +W +LL AC+ HK+ ++ E  I 
Sbjct: 383 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEE-IG 442

Query: 387 QILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLL-SQNVPKVRAYSLI 433
           +IL   +P  HGG Y    +++ +++KWD A + R L+  Q V KV   S I
Sbjct: 443 EILIAIDP-YHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI 490

BLAST of CcUC04G080720 vs. TAIR 10
Match: AT1G06150.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 280.0 bits (715), Expect = 5.2e-75
Identity = 149/416 (35.82%), Postives = 235/416 (56.49%), Query Frame = 0

Query: 39   RALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLL 98
            R+L +Y +M R SV   S++   ++KA + ++       L AHI K GF  HV + T+L+
Sbjct: 854  RSLELYVRMLRDSVSPSSYTYSSLVKASSFASRFG--ESLQAHIWKFGFGFHVKIQTTLI 913

Query: 99   YAY-VLNSFQLACLVFDEMPHKNTVTWNTM------------------------------ 158
              Y      + A  VFDEMP ++ + W TM                              
Sbjct: 914  DFYSATGRIREARKVFDEMPERDDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNC 973

Query: 159  -ILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIANGINPD 218
             I GY   G++++A  LF+QMP +D++SW+TMI  Y  N+ YR  + +F  M+  GI PD
Sbjct: 974  LINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPD 1033

Query: 219  QMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQI 278
            ++   ++++ CAH+G L +  GK VH + ++N + L++ +G+ LVDMY+KCG L+ A  +
Sbjct: 1034 EVTMSTVISACAHLGVLEI--GKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLV 1093

Query: 279  FLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSACAHAGFV 338
            F  + +KN+  W ++I GLA HG+ ++AL +F  M  E V+PN +TF  V +AC HAG V
Sbjct: 1094 FFNLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLV 1153

Query: 339  QEGRK-YFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLS 398
             EGR+ Y +MI++Y +   ++HYG MV L  K+GL+ EA  +I  M  EPN  +W +LL 
Sbjct: 1154 DEGRRIYRSMIDDYSIVSNVEHYGGMVHLFSKAGLIYEALELIGNMEFEPNAVIWGALLD 1213

Query: 399  ACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIRNLLSQ 422
             C+ HK+  +AE    +++   EP N  G Y L+  +Y  + +W D  +IR  + +
Sbjct: 1214 GCRIHKNLVIAEIAFNKLM-VLEPMN-SGYYFLLVSMYAEQNRWRDVAEIRGRMRE 1263

BLAST of CcUC04G080720 vs. TAIR 10
Match: AT2G44880.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 273.9 bits (699), Expect = 3.7e-73
Identity = 139/315 (44.13%), Postives = 207/315 (65.71%), Query Frame = 0

Query: 112 VFDEMPHKNTVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGG 171
           +FDEM HK  +TW TMI GY    D+D AR+LFD MP R+LVSW+TMI  Y  N+  + G
Sbjct: 198 LFDEMTHKTVITWTTMIHGYCNIKDIDAARKLFDAMPERNLVSWNTMIGGYCQNKQPQEG 257

Query: 172 LLLFQDMIA-NGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVL 231
           + LFQ+M A   ++PD +   S+L   +  G+L L  G+  H FV + + +  +++ T +
Sbjct: 258 IRLFQEMQATTSLDPDDVTILSVLPAISDTGALSL--GEWCHCFVQRKKLDKKVKVCTAI 317

Query: 232 VDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNE 291
           +DMY+KCG ++ A +IF  M EK V +W A+I G A +G  + AL LF TM  E  +P+E
Sbjct: 318 LDMYSKCGEIEKAKRIFDEMPEKQVASWNAMIHGYALNGNARAALDLFVTMMIE-EKPDE 377

Query: 292 LTFTGVLSACAHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKT 351
           +T   V++AC H G V+EGRK+F+++ E GL  +I+HYGCMVDLLG++G L+EA  +I  
Sbjct: 378 ITMLAVITACNHGGLVEEGRKWFHVMREMGLNAKIEHYGCMVDLLGRAGSLKEAEDLITN 437

Query: 352 MRLEPNVTVWSSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWD 411
           M  EPN  + SS LSAC Q+K  + AER++++ +E  EP+N G  Y L+ +LY  +++WD
Sbjct: 438 MPFEPNGIILSSFLSACGQYKDIERAERILKKAVE-LEPQNDGN-YVLLRNLYAADKRWD 497

Query: 412 DAEKIRNLLSQNVPK 426
           D   ++N++ +N  K
Sbjct: 498 DFGMVKNVMRKNQAK 507

BLAST of CcUC04G080720 vs. TAIR 10
Match: AT5G06540.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 272.7 bits (696), Expect = 8.3e-73
Identity = 141/397 (35.52%), Postives = 235/397 (59.19%), Query Frame = 0

Query: 39  RALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLL 98
           +A   Y+QM +  +  D+ +  F++KA +    +      H+ I + GF   V+V  SL+
Sbjct: 100 KAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLVGEQTHSQIVRFGFQNDVYVENSLV 159

Query: 99  YAYVLNSFQLAC-LVFDEMPHKNTVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWST 158
           + Y    F  A   +F +M  ++ V+W +M+ GY K G V+ AR++FD+MP R+L +WS 
Sbjct: 160 HMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCKCGMVENAREMFDEMPHRNLFTWSI 219

Query: 159 MIAAYINNRNYRGGLLLFQDMIANGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVK 218
           MI  Y  N  +   + LF+ M   G+  ++    S+++ CAH+G+L    G+  + +VVK
Sbjct: 220 MINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSVISSCAHLGALEF--GERAYEYVVK 279

Query: 219 NRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVL 278
           +   +NL LGT LVDM+ +CG ++ A  +F  + E +  +W+++I GLA HG+   A+  
Sbjct: 280 SHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPETDSLSWSSIIKGLAVHGHAHKAMHY 339

Query: 279 FETMSHEGVEPNELTFTGVLSACAHAGFVQEGRK-YFNMIEEYGLEIRIQHYGCMVDLLG 338
           F  M   G  P ++TFT VLSAC+H G V++G + Y NM +++G+E R++HYGC+VD+LG
Sbjct: 340 FSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKKDHGIEPRLEHYGCIVDMLG 399

Query: 339 KSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQHKSFDMAERVIEQILETTEPKNHGGIY 398
           ++G L EA   I  M ++PN  +  +LL ACK +K+ ++AERV   +++      H G Y
Sbjct: 400 RAGKLAEAENFILKMHVKPNAPILGALLGACKIYKNTEVAERVGNMLIKV--KPEHSGYY 459

Query: 399 SLICDLYVLEEKWDDAEKIRNLLSQN-VPKVRAYSLI 433
            L+ ++Y    +WD  E +R+++ +  V K   +SLI
Sbjct: 460 VLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLI 492

BLAST of CcUC04G080720 vs. TAIR 10
Match: AT1G74630.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 271.6 bits (693), Expect = 1.8e-72
Identity = 156/461 (33.84%), Postives = 252/461 (54.66%), Query Frame = 0

Query: 38  HRALAIYSQMHRQS-VPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATS 97
           H ++A++ +M R+  V  DSFS  F++KA     +L T   +H    K G  +H+FV T+
Sbjct: 87  HNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGFQMHCQALKHGLESHLFVGTT 146

Query: 98  LLYAY----------------------VLNSFQLACL----------VFDEMPHKNTVTW 157
           L+  Y                        N+   AC           +FD+M  +N  +W
Sbjct: 147 LIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDVAGAREIFDKMLVRNHTSW 206

Query: 158 NTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIANGIN 217
           N M+ GY K G+++ A+++F +MP RD VSWSTMI    +N ++    L F+++   G++
Sbjct: 207 NVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAHNGSFNESFLYFRELQRAGMS 266

Query: 218 PDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYAC 277
           P++++   +L+ C+  GS     GK +HGFV K  +   + +   L+DMY++CG +  A 
Sbjct: 267 PNEVSLTGVLSACSQSGSFEF--GKILHGFVEKAGYSWIVSVNNALIDMYSRCGNVPMAR 326

Query: 278 QIFLLMSEKN-VKTWTALICGLAQHGYCKDALVLFETMSHEGVEPNELTFTGVLSACAHA 337
            +F  M EK  + +WT++I GLA HG  ++A+ LF  M+  GV P+ ++F  +L AC+HA
Sbjct: 327 LVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYGVTPDGISFISLLHACSHA 386

Query: 338 GFVQEGRKYFN-MIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSS 397
           G ++EG  YF+ M   Y +E  I+HYGCMVDL G+SG L++AY  I  M + P   VW +
Sbjct: 387 GLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKAYDFICQMPIPPTAIVWRT 446

Query: 398 LLSACKQHKSFDMAERVIEQILETTEPKNHGGIYSLICDLYVLEEKWDDAEKIR-NLLSQ 456
           LL AC  H + ++AE+V +Q L   +P N G +  L+ + Y    KW D   IR +++ Q
Sbjct: 447 LLGACSSHGNIELAEQV-KQRLNELDPNNSGDLV-LLSNAYATAGKWKDVASIRKSMIVQ 506

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038877521.11.8e-22689.17pentatricopeptide repeat-containing protein At5g66520-like [Benincasa hispida][more]
XP_023544798.11.0e-22187.56pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita pepo subsp... [more]
XP_022978962.15.0e-22187.10pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita maxima][more]
KAA0052417.11.5e-22087.56pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_022925591.12.5e-22087.10pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita moschata] ... [more]
Match NameE-valueIdentityDescription
Q9FJY76.6e-8339.08Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Q56X057.3e-7435.82Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX... [more]
Q1PEU45.2e-7244.13Pentatricopeptide repeat-containing protein At2g44880 OS=Arabidopsis thaliana OX... [more]
Q9FG161.2e-7135.52Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX... [more]
Q9CA542.6e-7133.84Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1IMI12.4e-22187.10pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita maxima O... [more]
A0A5A7UD497.0e-22187.56Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1ECM31.2e-22087.10pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita moschata... [more]
A0A1S3AYC41.3e-21987.33pentatricopeptide repeat-containing protein At5g66520-like OS=Cucumis melo OX=36... [more]
A0A0A0KIK92.5e-21886.87Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G538690 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G66520.14.7e-8439.08Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G06150.15.2e-7535.82basic helix-loop-helix (bHLH) DNA-binding superfamily protein [more]
AT2G44880.13.7e-7344.13Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT5G06540.18.3e-7335.52Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G74630.11.8e-7233.84Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001179FKBP-type peptidyl-prolyl cis-trans isomerase domainPFAMPF00254FKBP_Ccoord: 612..705
e-value: 2.1E-21
score: 76.0
IPR001179FKBP-type peptidyl-prolyl cis-trans isomerase domainPROSITEPS50059FKBP_PPIASEcoord: 614..708
score: 25.258366
NoneNo IPR availableGENE3D3.10.50.40coord: 595..708
e-value: 1.3E-33
score: 117.9
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 21..433
NoneNo IPR availablePANTHERPTHR47928:SF104OS01G0800400 PROTEINcoord: 21..433
NoneNo IPR availableSUPERFAMILY54534FKBP-likecoord: 613..705
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 359..384
e-value: 0.22
score: 11.8
coord: 122..151
e-value: 3.3E-8
score: 33.2
coord: 328..352
e-value: 0.019
score: 15.2
coord: 153..183
e-value: 1.3E-4
score: 21.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 253..301
e-value: 1.1E-11
score: 44.7
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 122..153
e-value: 3.9E-8
score: 31.0
coord: 257..290
e-value: 1.5E-6
score: 26.0
coord: 153..186
e-value: 8.9E-7
score: 26.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 289..323
score: 8.670445
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 120..154
score: 12.58363
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 254..288
score: 11.388848
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 232..451
e-value: 4.6E-41
score: 143.2
coord: 62..231
e-value: 1.8E-30
score: 108.4

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CcUC04G080720.1CcUC04G080720.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003755 peptidyl-prolyl cis-trans isomerase activity
molecular_function GO:0005515 protein binding