Spg039607 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg039607
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Location: scaffold10: 45657058 .. 45660007 (-)
RNA-Seq Expression: Spg039607
Synteny: Spg039607
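The Location field can be turned into a span length with simple arithmetic. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a location such as 'scaffold10: 45657058 .. 45660007 (-)' into parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("scaffold10: 45657058 .. 45660007 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 2950 bp on the minus strand of scaffold10.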
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGGGGTCGAGGAATTTTCCCAGATCAGTTTGCTTATTCTGGTATCTTGCAGATATGTATTGGTTTGGAGTTCATTGAGTTGGGCAGAATGGTTCGTGCCCAGATTGTTATTAGAGGCTTTGCATTTCATACTTGTGTGTCTACTTCTCTTCTTAATATGTATGCAAAGTTACAAAAGGTTGAGGATTCAATCAAGGTGTTTAAGACCATGACTGAAGTTAATATAGTCTCATGGAATGCTATGATCTCAGGTTTCACATACAATGGTCCTTACTTAGAGGCTTTTGATAATTTTCTCAGAATGAAGGGAGAAGGAGTAATAACTGATGTACAAATGTCGATTGGTGTTGCAAAAAGCTATCGGTATGTTACGAGATGTGAACAAGGCAAAAGAAGTTAGCCATTTTGCTTCTGAGTTAGGTGTGGACTCTAATGCTCTAGCGGGAACTGCCCTCATTGATATGCATTCTAAACGTGGATCTTTGCAAGAGGCAAGATCTATTTTGACTCACGTTTTACAAATTTCGGGGTATTTACAGGGTGAGTGTAATGAAAAAGCCTTGGAATTGTTTGCCAAAATGTATCAAAACGACATACACTTGGACCATTACACTTATTGTAGTGTATTTAATGCTATGGCTGCTTTGAAGTGCTTGCTGTCGGGAAAGAAGGTTCATGCCAGGGCTAAAATTAGGATCGGAAGTGAATCATATAAGTATCTCCAATGCAGTGGCTAATGCGTATACTAAATGTGGATCGCTAGATGATGTAAGGAAGATCATTTACTACAGGATGGAAAAAGAGATTTAGTATCTTGGACCACCCTAGTGACGGCTTATTCTCAATGTTCTGAATGGGATAAAGCAATAGAGATCTTCTCAAATATGGGAGAAGAAGGTTTTACACCCAATCAATTTGCCTTTTCAAGTGTGCTCGTTTCATGTGCGAGCCTTTGCTTACTCGATGGTCAGCAAGTCCACGGGTTCCTCTGCAAGGTTGGCTTGGACATGAACAAATGCATAGGAAGTGGTCTAATTGACATGTATGCCAAATGTGGCAGTCTGGCTGAGGCAAAGAAGGTTTTCGAAAGAATCTCTAATGCCAATACAGTTTCATGGACCACTGTAACATCAGGGCATGCTCAATACAGTATTGTGGATGACGCCTTTGAACTCTTTAGAAGGATGGAGCAGTTAGGTGTGGAGCCGAATGCTGTTTACTTTTTTGTGTGTTCTATTTGCATGTAGCCATGGAGGTCGGGTAGAGGAAGGCCTACAGTACTTCAAGCTAATGAAGGAAACCTATCGTTTGGTGCCAGAGATGGAGCATTATTCCTGTATTGTTGATCTCTTAAGTCGTGTGGGACGTCTAAACAATGCAATGGAGTTTATTAGTAGGATGCCCATAGAGCTCAATGAAATGGTTTGATAGACCTTGTTGGGGGCATGCAGGGTCCATGGTAATGTTGAATTGGGAGAGCTTGCTGCTCAGAAGATGCCTTCTTTCAAAGCAGAAAACTCTGCTACCCATGTTCTTTTATCCAATACCTCAAATCAGGGATTTTCAAAGATGGACTTAATTTGCGGCATGTGATGAAAGAGCAGGGCGTAAAAAAGGAACCAGGGTGTAGTTGGATCTCTGTGAATGGTACATTGCACAAGTTTTATGCAGGAGATCAACAACATCCAGAAAAAGATAAAATTTATGCAAAGCTAGAAGAGTTGAGGTTGAAGGCCAATTCTTTAGGGGGATGTACCAGATTTGAGTTATGAGCTGTAAGTTGTGGACCTCGGATAAGTTATACAGATACGGATGGAAGCCCCACGTCTATTTGACCAAATTTCTTCAGAGAAAAAAAAAATGAAAAAAAAGCATCAGCTAAAGATATCAAATTAAATAGGGAAGGCAAACGGATGCTTAAGAACTAAATTTGCTGGCTAAAGAATATGGGACTTACAGATATGATTGTCTGCATTATTCATCTCCTGCAGTTTCTGA
ACTATGTATGATGCTGTACTGCATTAGATAAGGAAGAATGACGATTCGAGTTGGGGGAATGATAAAAAGCGCCACAGATCGATGATCGGTTCTATGAAAATATTTTACTTTTCGACAATATGATTGCTATGTTGGGTGAAGGTTATCAGAAGTTTGACCTCTTCTATAGTAATGCATTGTTATGGTGTGCTAATATGGAATCATAGAGCCTTTCTGCTTATGATTTTACTGGGGTCTACCGAAATTGTCATACTCGAAGTAGTAAAATTTATCATTACATTACTTGAATAAGTTTGGGAAGAAATTTCTCAGAATCATCAGCCAGTGGAAATGAAGACATTTCTCTTCTCTCTCCCTCTGATATATTTTTGTTTTCACTCTCTAAATTTGTCTTTTATCCTTCCTTTTTTTTTCTTGGAAACGTGGCCTAGTTGTTTTATTTCCTGAATTCTATTTATCATGCCAAGAAACTGTTTACTATTTCTCCCGTGAATTAGCTGAAATGAGTTTACTGTTTTATATTATTATTATTCTTGCATTTTCCTTTTTACCATAAGAGGATTTTGATTGTTCTTGACTGATTTCTTTCAGCTGCAGTGAAGTGATAACTGATATAAAATATTTTGACACGATCTCTCATAACTAATCAATATGCTTCTTCTTCTCCACCAATAAACTCACTAGGTGCTGCGGGTGTCATAAGACTGGTCTATGTCACTGAAAACCACTTTATGAGGATGCTCCTTCTCCAAGACCCAGGCATATAAAGTAAACCAAAACTACTGTGCAAAAACAGTGTCTATCCATTCTTAGATAATGCATTTAGTGAGTTAATCCTATATTCCATCTAAGGTTCTGGTTCTAAGGTGCCAAATCTTAGTGACTTAAAACTCTTGTTGTCATCAGAGCTGACTTTGGAGCAGGTGAAACAAGGTCTTTCTGAAGATTAG

mRNA sequence

ATGCGGGGTCGAGGAATTTTCCCAGATCAGTTTGCTTATTCTGGTATCTTGCAGATATGTATTGGTTTGGAGTTCATTGAGTTGGGCAGAATGGTTCGTGCCCAGATTGTTATTAGAGGCTTTGCATTTCATACTTGTGTGTCTACTTCTCTTCTTAATATGTATGCAAAGTTACAAAAGGTTGAGGATTCAATCAAGGTGTTTAAGACCATGACTGAAGTTAATATAGTCTCATGGAATGCTATGATCTCAGCTATCGGTATGTTACGAGATGTGAACAAGGCAAAAGAAGTTAGCCATTTTGCTTCTGAGTTAGGTGTGGACTCTAATGCTCTAGCGGGAACTGCCCTCATTGATATGCATTCTAAACGTGGATCTTTGCAAGAGGCAAGATCTATTTTGACTCACGTTTTACAAATTTCGGGGTATTTACAGGGTGAGTGTAATGAAAAAGCCTTGGAATTGTTTGCCAAAATGTATCAAAACGACATACACTTGGACCATTACACTTATTGTAGTGTATTTAATGCTATGGCTGCTTTGAAGTGCTTGCTGTCGGGAAAGAAGGATGGAAAAAGAGATTTAGTATCTTGGACCACCCTAGTGACGGCTTATTCTCAATGTTCTGAATGGGATAAAGCAATAGAGATCTTCTCAAATATGGGAGAAGAAGGTTTTACACCCAATCAATTTGCCTTTTCAAGTGTGCTCGTTTCATGTGCGAGCCTTTGCTTACTCGATGGTCAGCAAGTCCACGGGTTCCTCTGCAAGGTTGGCTTGGACATGAACAAATGCATAGGAAGTGGTCTAATTGACATGTATGCCAAATGTGGCAGTCTGGCTGAGGCAAAGAAGGTTTTCGAAAGAATCTCTAATGCCAATACAGTTTCATGGACCACTGTAACATCAGGGCATGCTCAATACAGTATTGTGGATGACGCCTTTGAACTCTTTAGAAGGATGGAGCAGTTAGGTGTGGAGCCGAATGCTGTTTACTTTTTTATAAGGAAGAATGACGATTCGAGTTGGGGGAATGATAAAAAGCGCCACAGATCGATGATCGGTTCTGGTTCTAAGGTGCCAAATCTTAGTGACTTAAAACTCTTGTTGTCATCAGAGCTGACTTTGGAGCAGGTGAAACAAGGTCTTTCTGAAGATTAG

Coding sequence (CDS)

ATGCGGGGTCGAGGAATTTTCCCAGATCAGTTTGCTTATTCTGGTATCTTGCAGATATGTATTGGTTTGGAGTTCATTGAGTTGGGCAGAATGGTTCGTGCCCAGATTGTTATTAGAGGCTTTGCATTTCATACTTGTGTGTCTACTTCTCTTCTTAATATGTATGCAAAGTTACAAAAGGTTGAGGATTCAATCAAGGTGTTTAAGACCATGACTGAAGTTAATATAGTCTCATGGAATGCTATGATCTCAGCTATCGGTATGTTACGAGATGTGAACAAGGCAAAAGAAGTTAGCCATTTTGCTTCTGAGTTAGGTGTGGACTCTAATGCTCTAGCGGGAACTGCCCTCATTGATATGCATTCTAAACGTGGATCTTTGCAAGAGGCAAGATCTATTTTGACTCACGTTTTACAAATTTCGGGGTATTTACAGGGTGAGTGTAATGAAAAAGCCTTGGAATTGTTTGCCAAAATGTATCAAAACGACATACACTTGGACCATTACACTTATTGTAGTGTATTTAATGCTATGGCTGCTTTGAAGTGCTTGCTGTCGGGAAAGAAGGATGGAAAAAGAGATTTAGTATCTTGGACCACCCTAGTGACGGCTTATTCTCAATGTTCTGAATGGGATAAAGCAATAGAGATCTTCTCAAATATGGGAGAAGAAGGTTTTACACCCAATCAATTTGCCTTTTCAAGTGTGCTCGTTTCATGTGCGAGCCTTTGCTTACTCGATGGTCAGCAAGTCCACGGGTTCCTCTGCAAGGTTGGCTTGGACATGAACAAATGCATAGGAAGTGGTCTAATTGACATGTATGCCAAATGTGGCAGTCTGGCTGAGGCAAAGAAGGTTTTCGAAAGAATCTCTAATGCCAATACAGTTTCATGGACCACTGTAACATCAGGGCATGCTCAATACAGTATTGTGGATGACGCCTTTGAACTCTTTAGAAGGATGGAGCAGTTAGGTGTGGAGCCGAATGCTGTTTACTTTTTTATAAGGAAGAATGACGATTCGAGTTGGGGGAATGATAAAAAGCGCCACAGATCGATGATCGGTTCTGGTTCTAAGGTGCCAAATCTTAGTGACTTAAAACTCTTGTTGTCATCAGAGCTGACTTTGGAGCAGGTGAAACAAGGTCTTTCTGAAGATTAG

Protein sequence

MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQKVEDSIKVFKTMTEVNIVSWNAMISAIGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSILTHVLQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKKDGKRDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLDGQQVHGFLCKVGLDMNKCIGSGLIDMYAKCGSLAEAKKVFERISNANTVSWTTVTSGHAQYSIVDDAFELFRRMEQLGVEPNAVYFFIRKNDDSSWGNDKKRHRSMIGSGSKVPNLSDLKLLLSSELTLEQVKQGLSED
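The relationship between the coding sequence and the protein sequence listed above can be checked by translating the CDS codon by codon. This is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and its `to_stop` parameter are hypothetical and not part of this database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon: end of the polypeptide
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
print(translate("ATGCGGGGTCGAGGAATTTTCCCAGATCAG"))
```

Applied to the first ten codons of the CDS, this yields MRGRGIFPDQ, matching the start of the protein sequence; the final codons GAA GAT TAG give the terminal residues ED followed by the stop.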
Homology
BLAST of Spg039607 vs. NCBI nr
Match: XP_022134356.1 (pentatricopeptide repeat-containing protein At2g33680-like [Momordica charantia])

HSP 1 Score: 490.3 bits (1261), Expect = 1.5e-134
Identity = 272/420 (64.76%), Positives = 297/420 (70.71%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M+ +GIFPDQFAYSGI++ICIGLE IELGRMV AQIV RGFA HT VST+LL+MY+KLQK
Sbjct: 151 MQNQGIFPDQFAYSGIVKICIGLESIELGRMVHAQIVSRGFASHTFVSTALLDMYSKLQK 210

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           +EDS KVF TMTEVN+VSWNAMIS                                   A
Sbjct: 211 IEDSYKVFNTMTEVNVVSWNAMISGFASNSLYSEAFDHFLRMKGEGAMPDAHTFIGVAKA 270

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSIL-THV------- 180
           IGMLRDV+KAKEVSH+AS+LGVDSN L GTALIDMHSK GSLQEARSI  +H        
Sbjct: 271 IGMLRDVSKAKEVSHYASKLGVDSNTLVGTALIDMHSKCGSLQEARSIFDSHFTNCGVNA 330

Query: 181 ---LQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKK----- 240
                ISGYLQ ECN KALE FAKM QNDIHLDHYTYCSVFNA+AALKCL +GKK     
Sbjct: 331 PWNAMISGYLQSECNGKALEFFAKMNQNDIHLDHYTYCSVFNAIAALKCLSTGKKVHARA 390

Query: 241 -----------------------------------DGKRDLVSWTTLVTAYSQCSEWDKA 300
                                                +RDLVSWTTLVTAYSQ SEWDKA
Sbjct: 391 IKSGFEVNHISISNAVANAYAKCGSLDDVRKVLYRMEERDLVSWTTLVTAYSQYSEWDKA 450

Query: 301 IEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDM 334
           IEIFSNM EEGF PNQF FSSVL+SCASLCLL+ GQQVHGFL KVGLDM+K I S LIDM
Sbjct: 451 IEIFSNMREEGFVPNQFTFSSVLLSCASLCLLEYGQQVHGFLFKVGLDMDKFIESALIDM 510
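The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the counts; `parse_hsp_stats` is a hypothetical helper written for this page's line layout, not part of any BLAST tool, and the pattern accepts whatever label precedes each `n/m` pair.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, percent)} for each 'label = n/m' pair."""
    stats = {}
    for name, hits, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        hits, total = int(hits), int(total)
        stats[name] = (hits, total, round(100 * hits / total, 2))
    return stats

line = "Identity = 272/420 (64.76%), Positives = 297/420 (70.71%), Query Frame = 0"
for name, (hits, total, pct) in parse_hsp_stats(line).items():
    print(f"{name}: {hits}/{total} = {pct}%")
```

For the hit above, 272/420 gives 64.76% identity and 297/420 gives 70.71% positives, in agreement with the reported values.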

BLAST of Spg039607 vs. NCBI nr
Match: XP_022939229.1 (pentatricopeptide repeat-containing protein At3g16610-like [Cucurbita moschata] >KAG6578466.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 488.0 bits (1255), Expect = 7.3e-134
Identity = 267/420 (63.57%), Positives = 298/420 (70.95%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M+ + IFPDQFAYSG+LQICIGLE IELG+MV AQIVIRGFA HT VST+LLNMYAKLQK
Sbjct: 159 MQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVSTALLNMYAKLQK 218

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           ++DS +VF TMTEVN+VSWNAMIS                                   A
Sbjct: 219 IDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTPDAQTFISIAKA 278

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSIL-THV------- 180
           IGMLRDVNKAKE+S +ASELG+DSN L GTALIDMHSK GSLQEARSI  +H        
Sbjct: 279 IGMLRDVNKAKEISRYASELGMDSNPLVGTALIDMHSKCGSLQEARSIFDSHFTNCRVNG 338

Query: 181 ---LQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKK----- 240
                ISGYLQ E NEKALELFAKM  N++HLD YTYCSVFNA+AALKCL  GKK     
Sbjct: 339 PWNAMISGYLQSEFNEKALELFAKMCLNNVHLDRYTYCSVFNAIAALKCLSLGKKVHARA 398

Query: 241 -----------------------------------DGKRDLVSWTTLVTAYSQCSEWDKA 300
                                                +RDLVSWTTLVTAYSQCSEWDKA
Sbjct: 399 IKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVTAYSQCSEWDKA 458

Query: 301 IEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDM 334
           IEIFSNM EEGF PNQFAFSSVL+SCASLCLL+ GQQVHGF+ KVGLDM+KCI S LIDM
Sbjct: 459 IEIFSNMREEGFAPNQFAFSSVLISCASLCLLEYGQQVHGFIGKVGLDMDKCIQSALIDM 518

BLAST of Spg039607 vs. NCBI nr
Match: KAG7016032.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 488.0 bits (1255), Expect = 7.3e-134
Identity = 267/420 (63.57%), Positives = 298/420 (70.95%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M+ + IFPDQFAYSG+LQICIGLE IELG+MV AQIVIRGFA HT VST+LLNMYAKLQK
Sbjct: 141 MQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVSTALLNMYAKLQK 200

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           ++DS +VF TMTEVN+VSWNAMIS                                   A
Sbjct: 201 IDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTPDAQTFISIAKA 260

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSIL-THV------- 180
           IGMLRDVNKAKE+S +ASELG+DSN L GTALIDMHSK GSLQEARSI  +H        
Sbjct: 261 IGMLRDVNKAKEISRYASELGMDSNPLVGTALIDMHSKCGSLQEARSIFDSHFTNCRVNG 320

Query: 181 ---LQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKK----- 240
                ISGYLQ E NEKALELFAKM  N++HLD YTYCSVFNA+AALKCL  GKK     
Sbjct: 321 PWNAMISGYLQSEFNEKALELFAKMCLNNVHLDRYTYCSVFNAIAALKCLSLGKKVHARA 380

Query: 241 -----------------------------------DGKRDLVSWTTLVTAYSQCSEWDKA 300
                                                +RDLVSWTTLVTAYSQCSEWDKA
Sbjct: 381 IKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVTAYSQCSEWDKA 440

Query: 301 IEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDM 334
           IEIFSNM EEGF PNQFAFSSVL+SCASLCLL+ GQQVHGF+ KVGLDM+KCI S LIDM
Sbjct: 441 IEIFSNMREEGFAPNQFAFSSVLISCASLCLLEYGQQVHGFIGKVGLDMDKCIQSALIDM 500

BLAST of Spg039607 vs. NCBI nr
Match: XP_023549692.1 (pentatricopeptide repeat-containing protein At3g16610-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 488.0 bits (1255), Expect = 7.3e-134
Identity = 266/420 (63.33%), Positives = 297/420 (70.71%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M+ + IFPDQFAYSG+LQICIGLE IELG+MV AQIVIRGFA HT VST+LLNMYAKLQK
Sbjct: 159 MQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVSTALLNMYAKLQK 218

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           ++DS +VF TMTEVN+VSWNAMIS                                   A
Sbjct: 219 IDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTPDAQTFISIAKA 278

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSIL-THV------- 180
           IGMLRDVNKAKE+S +ASELG+DSN L GT LIDMHSK GSLQEARSI  +H        
Sbjct: 279 IGMLRDVNKAKEISRYASELGMDSNTLVGTGLIDMHSKCGSLQEARSIFDSHFTNCRVNG 338

Query: 181 ---LQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKK----- 240
                ISGYLQ E NEKALELFAKM  N++HLD YTYCSVFNA+AALKCL  GKK     
Sbjct: 339 PWNAMISGYLQSEFNEKALELFAKMCLNNVHLDRYTYCSVFNAIAALKCLSLGKKVHARA 398

Query: 241 -----------------------------------DGKRDLVSWTTLVTAYSQCSEWDKA 300
                                                +RDLVSWTTLVTAYSQCSEWDKA
Sbjct: 399 IKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVTAYSQCSEWDKA 458

Query: 301 IEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDM 334
           IEIFSNM EEGF PNQFAFSSVLVSCASLCLL+ GQQVHG +CKVGLDM+KCI S LIDM
Sbjct: 459 IEIFSNMREEGFAPNQFAFSSVLVSCASLCLLEYGQQVHGVICKVGLDMDKCIQSALIDM 518

BLAST of Spg039607 vs. NCBI nr
Match: XP_022993736.1 (pentatricopeptide repeat-containing protein At2g27610-like [Cucurbita maxima])

HSP 1 Score: 487.3 bits (1253), Expect = 1.3e-133
Identity = 266/420 (63.33%), Positives = 298/420 (70.95%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M+ + IFPDQFAYSG+LQICIGLE IELG+MV AQIVIRGFA HT VST+LLNMYAKLQK
Sbjct: 159 MQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVSTALLNMYAKLQK 218

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           ++DS +VF TMTEVN+VSWNAMIS                                   A
Sbjct: 219 IDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTPDAQTFISIAKA 278

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSIL-THV------- 180
           IGMLRDVNKAKE+S +AS+LG+DSN L GTALIDMHSK GSLQEARSI  +H        
Sbjct: 279 IGMLRDVNKAKEISRYASKLGMDSNTLVGTALIDMHSKCGSLQEARSIFDSHFTNCRVNG 338

Query: 181 ---LQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKK----- 240
                ISGYLQ E NE+ALELFAKM  N++HLD YTYCSVFNA+AALKCL  GKK     
Sbjct: 339 PWNAMISGYLQSEFNEEALELFAKMCLNNVHLDRYTYCSVFNAIAALKCLSLGKKVHARA 398

Query: 241 -----------------------------------DGKRDLVSWTTLVTAYSQCSEWDKA 300
                                                +RDLVSWTTLVTAYSQCSEWDKA
Sbjct: 399 IKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVTAYSQCSEWDKA 458

Query: 301 IEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDM 334
           IEIFSNM EEGF PNQFAFSSVLVSCASLCLL+ GQQVHGF+CKVGLDM+KCI S LIDM
Sbjct: 459 IEIFSNMREEGFAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLDMDKCIQSALIDM 518

BLAST of Spg039607 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 9.6e-36
Identity = 111/418 (26.56%), Positives = 196/418 (46.89%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M   G+  D + +S + +    L  +  G  +   I+  GF     V  SL+  Y K Q+
Sbjct: 186 MMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQR 245

Query: 61  VEDSIKVFKTMTEVNIVSWNAMISA--------------IGML---RDVNKAKEVSHFAS 120
           V+ + KVF  MTE +++SWN++I+               + ML    +++ A  VS FA 
Sbjct: 246 VDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAG 305

Query: 121 -------ELGVDSNALAGTA-----------LIDMHSKRGSLQEA---------RSILTH 180
                   LG   +++   A           L+DM+SK G L  A         RS++++
Sbjct: 306 CADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSY 365

Query: 181 VLQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKKDGK---- 240
              I+GY +     +A++LF +M +  I  D YT  +V N  A  + L  GK+  +    
Sbjct: 366 TSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKE 425

Query: 241 -----------------------------------RDLVSWTTLVTAYSQCSEWDKAIEI 300
                                              +D++SW T++  YS+    ++A+ +
Sbjct: 426 NDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSL 485

Query: 301 FSNMGEE-GFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDMYA 334
           F+ + EE  F+P++   + VL +CASL   D G+++HG++ + G   ++ + + L+DMYA
Sbjct: 486 FNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYA 545

BLAST of Spg039607 vs. ExPASy Swiss-Prot
Match: O23169 (Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H5 PE=3 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 2.8e-35
Identity = 94/350 (26.86%), Positives = 165/350 (47.14%), Query Frame = 0

Query: 3   GRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQKVE 62
           GR   P    Y  ++Q+C     +E G+ V   I   GF     +   LL MYAK   + 
Sbjct: 78  GRAKKPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFVPGIVIWNRLLRMYAKCGSLV 137

Query: 63  DSIKVFKTMTEVNIVSWNAMISAIGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHS 122
           D+ KVF  M   ++ SWN M++      +V   +E      E+  + ++ + TA++  + 
Sbjct: 138 DARKVFDEMPNRDLCSWNVMVNGYA---EVGLLEEARKLFDEM-TEKDSYSWTAMVTGYV 197

Query: 123 KRGSLQEA--------------RSILTHVLQISGYLQGECNEKALELFAKMYQNDIHLDH 182
           K+   +EA               +I T  + ++     +C  +  E+   + +  +  D 
Sbjct: 198 KKDQPEEALVLYSLMQRVPNSRPNIFTVSIAVAAAAAVKCIRRGKEIHGHIVRAGLDSDE 257

Query: 183 YTYCSVFNAMAALKCLLSGK----KDGKRDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEE 242
             + S+ +      C+   +    K  ++D+VSWT+++  Y + S W +   +FS +   
Sbjct: 258 VLWSSLMDMYGKCGCIDEARNIFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFSELVGS 317

Query: 243 GFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDMYAKCGSLAEA 302
              PN++ F+ VL +CA L   + G+QVHG++ +VG D      S L+DMY KCG++  A
Sbjct: 318 CERPNEYTFAGVLNACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNIESA 377

Query: 303 KKVFERISNANTVSWTTVTSGHAQYSIVDDAFELFRRMEQLGVEPNAVYF 334
           K V +     + VSWT++  G AQ    D+A + F  + + G +P+ V F
Sbjct: 378 KHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTF 423

BLAST of Spg039607 vs. ExPASy Swiss-Prot
Match: Q9C866 (Pentatricopeptide repeat-containing protein At1g31430 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E55 PE=2 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 2.8e-35
Identity = 104/369 (28.18%), Positives = 174/369 (47.15%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           +RG+G++PD F    +L+    L  +  G  V    V  G  F + VS SL+ MYA L K
Sbjct: 37  LRGQGLYPDNFTLPVVLKSIGRLRKVIEGEKVHGYAVKAGLEFDSYVSNSLMGMYASLGK 96

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS----------AIGMLRDVNKAKE--------VSHFA 120
           +E + KVF  M + ++VSWN +IS          AIG+ + +++           VS  +
Sbjct: 97  IEITHKVFDEMPQRDVVSWNGLISSYVGNGRFEDAIGVFKRMSQESNLKFDEGTIVSTLS 156

Query: 121 S-------ELG----------VDSNALAGTALIDMHSKRGSLQEARSILTHVLQISGYLQ 180
           +       E+G           + +   G AL+DM  K G L +AR++   +        
Sbjct: 157 ACSALKNLEIGERIYRFVVTEFEMSVRIGNALVDMFCKCGCLDKARAVFDSM-------- 216

Query: 181 GECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKKDGKRDLVSWTTLVTAY 240
               +K ++ +  M    +         V    + +K           D+V WT ++  Y
Sbjct: 217 ---RDKNVKCWTSMVFGYVSTGRIDEARVLFERSPVK-----------DVVLWTAMMNGY 276

Query: 241 SQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNK 300
            Q + +D+A+E+F  M   G  P+ F   S+L  CA    L+ G+ +HG++ +  + ++K
Sbjct: 277 VQFNRFDEALELFRCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKWIHGYINENRVTVDK 336

Query: 301 CIGSGLIDMYAKCGSLAEAKKVFERISNANTVSWTTVTSGHAQYSIVDDAFELFRRMEQL 334
            +G+ L+DMYAKCG +  A +VF  I   +T SWT++  G A   +   A +L+  ME +
Sbjct: 337 VVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGMSGRALDLYYEMENV 383

BLAST of Spg039607 vs. ExPASy Swiss-Prot
Match: Q9S7F4 (Putative pentatricopeptide repeat-containing protein At2g01510 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H36 PE=3 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 1.8e-34
Identity = 98/335 (29.25%), Positives = 162/335 (48.36%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           MR  G  P  F +SG+L+  +GL    LG+ + A  V  GF+    V   +L+ Y+K  +
Sbjct: 241 MRQSGHQPSDFTFSGVLKAVVGLHDFALGQQLHALSVTTGFSRDASVGNQILDFYSKHDR 300

Query: 61  VEDSIKVFKTMTEVNIVSWNAMISAIGMLRDVNKAKEVSHFASE---LGVDSNALAGTAL 120
           V ++  +F  M E++ VS+N +IS+       ++ +   HF  E   +G D        +
Sbjct: 301 VLETRMLFDEMPELDFVSYNVVISSYS---QADQYEASLHFFREMQCMGFDRRNFPFATM 360

Query: 121 IDMHSKRGSLQEARSILTHVLQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNA 180
           + + +   SLQ  R +              C        + ++  +  +D Y  C +F  
Sbjct: 361 LSIAANLSSLQMGRQL-------------HCQALLATADSILHVGNSLVDMYAKCEMFE- 420

Query: 181 MAALKCLLSGKKDGKRDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVL 240
               +  L  K   +R  VSWT L++ Y Q       +++F+ M       +Q  F++VL
Sbjct: 421 ----EAELIFKSLPQRTTVSWTALISGYVQKGLHGAGLKLFTKMRGSNLRADQSTFATVL 480

Query: 241 VSCASLC-LLDGQQVHGFLCKVGLDMNKCIGSGLIDMYAKCGSLAEAKKVFERISNANTV 300
            + AS   LL G+Q+H F+ + G   N   GSGL+DMYAKCGS+ +A +VFE + + N V
Sbjct: 481 KASASFASLLLGKQLHAFIIRSGNLENVFSGSGLVDMYAKCGSIKDAVQVFEEMPDRNAV 540

Query: 301 SWTTVTSGHAQYSIVDDAFELFRRMEQLGVEPNAV 332
           SW  + S HA     + A   F +M + G++P++V
Sbjct: 541 SWNALISAHADNGDGEAAIGAFAKMIESGLQPDSV 554

BLAST of Spg039607 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 1.8e-34
Identity = 104/417 (24.94%), Positives = 179/417 (42.93%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           MR   + P  + ++ +L++C     + +G+ +   +V  GF+      T L NMYAK ++
Sbjct: 126 MRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQ 185

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           V ++ KVF  M E ++VSWN +++                                   A
Sbjct: 186 VNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPA 245

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSILTHVLQ------ 180
           +  LR ++  KE+  +A   G DS     TAL+DM++K GSL+ AR +   +L+      
Sbjct: 246 VSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSW 305

Query: 181 ---ISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGK-------- 240
              I  Y+Q E  ++A+ +F KM    +     +     +A A L  L  G+        
Sbjct: 306 NSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVE 365

Query: 241 -------------------------------KDGKRDLVSWTTLVTAYSQCSEWDKAIEI 300
                                          K   R LVSW  ++  ++Q      A+  
Sbjct: 366 LGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNY 425

Query: 301 FSNMGEEGFTPNQFAFSSVLVSCASLCLL-DGQQVHGFLCKVGLDMNKCIGSGLIDMYAK 334
           FS M      P+ F + SV+ + A L +    + +HG + +  LD N  + + L+DMYAK
Sbjct: 426 FSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAK 485

BLAST of Spg039607 vs. ExPASy TrEMBL
Match: A0A6J1BXL8 (pentatricopeptide repeat-containing protein At2g33680-like OS=Momordica charantia OX=3673 GN=LOC111006636 PE=4 SV=1)

HSP 1 Score: 490.3 bits (1261), Expect = 7.2e-135
Identity = 272/420 (64.76%), Positives = 297/420 (70.71%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M+ +GIFPDQFAYSGI++ICIGLE IELGRMV AQIV RGFA HT VST+LL+MY+KLQK
Sbjct: 151 MQNQGIFPDQFAYSGIVKICIGLESIELGRMVHAQIVSRGFASHTFVSTALLDMYSKLQK 210

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           +EDS KVF TMTEVN+VSWNAMIS                                   A
Sbjct: 211 IEDSYKVFNTMTEVNVVSWNAMISGFASNSLYSEAFDHFLRMKGEGAMPDAHTFIGVAKA 270

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSIL-THV------- 180
           IGMLRDV+KAKEVSH+AS+LGVDSN L GTALIDMHSK GSLQEARSI  +H        
Sbjct: 271 IGMLRDVSKAKEVSHYASKLGVDSNTLVGTALIDMHSKCGSLQEARSIFDSHFTNCGVNA 330

Query: 181 ---LQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKK----- 240
                ISGYLQ ECN KALE FAKM QNDIHLDHYTYCSVFNA+AALKCL +GKK     
Sbjct: 331 PWNAMISGYLQSECNGKALEFFAKMNQNDIHLDHYTYCSVFNAIAALKCLSTGKKVHARA 390

Query: 241 -----------------------------------DGKRDLVSWTTLVTAYSQCSEWDKA 300
                                                +RDLVSWTTLVTAYSQ SEWDKA
Sbjct: 391 IKSGFEVNHISISNAVANAYAKCGSLDDVRKVLYRMEERDLVSWTTLVTAYSQYSEWDKA 450

Query: 301 IEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDM 334
           IEIFSNM EEGF PNQF FSSVL+SCASLCLL+ GQQVHGFL KVGLDM+K I S LIDM
Sbjct: 451 IEIFSNMREEGFVPNQFTFSSVLLSCASLCLLEYGQQVHGFLFKVGLDMDKFIESALIDM 510

BLAST of Spg039607 vs. ExPASy TrEMBL
Match: A0A6J1FGJ6 (pentatricopeptide repeat-containing protein At3g16610-like OS=Cucurbita moschata OX=3662 GN=LOC111445205 PE=4 SV=1)

HSP 1 Score: 488.0 bits (1255), Expect = 3.6e-134
Identity = 267/420 (63.57%), Positives = 298/420 (70.95%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M+ + IFPDQFAYSG+LQICIGLE IELG+MV AQIVIRGFA HT VST+LLNMYAKLQK
Sbjct: 159 MQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVSTALLNMYAKLQK 218

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           ++DS +VF TMTEVN+VSWNAMIS                                   A
Sbjct: 219 IDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTPDAQTFISIAKA 278

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSIL-THV------- 180
           IGMLRDVNKAKE+S +ASELG+DSN L GTALIDMHSK GSLQEARSI  +H        
Sbjct: 279 IGMLRDVNKAKEISRYASELGMDSNPLVGTALIDMHSKCGSLQEARSIFDSHFTNCRVNG 338

Query: 181 ---LQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKK----- 240
                ISGYLQ E NEKALELFAKM  N++HLD YTYCSVFNA+AALKCL  GKK     
Sbjct: 339 PWNAMISGYLQSEFNEKALELFAKMCLNNVHLDRYTYCSVFNAIAALKCLSLGKKVHARA 398

Query: 241 -----------------------------------DGKRDLVSWTTLVTAYSQCSEWDKA 300
                                                +RDLVSWTTLVTAYSQCSEWDKA
Sbjct: 399 IKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVTAYSQCSEWDKA 458

Query: 301 IEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDM 334
           IEIFSNM EEGF PNQFAFSSVL+SCASLCLL+ GQQVHGF+ KVGLDM+KCI S LIDM
Sbjct: 459 IEIFSNMREEGFAPNQFAFSSVLISCASLCLLEYGQQVHGFIGKVGLDMDKCIQSALIDM 518

BLAST of Spg039607 vs. ExPASy TrEMBL
Match: A0A6J1JX63 (pentatricopeptide repeat-containing protein At2g27610-like OS=Cucurbita maxima OX=3661 GN=LOC111489649 PE=4 SV=1)

HSP 1 Score: 487.3 bits (1253), Expect = 6.1e-134
Identity = 266/420 (63.33%), Positives = 298/420 (70.95%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M+ + IFPDQFAYSG+LQICIGLE IELG+MV AQIVIRGFA HT VST+LLNMYAKLQK
Sbjct: 159 MQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVSTALLNMYAKLQK 218

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           ++DS +VF TMTEVN+VSWNAMIS                                   A
Sbjct: 219 IDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTPDAQTFISIAKA 278

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSIL-THV------- 180
           IGMLRDVNKAKE+S +AS+LG+DSN L GTALIDMHSK GSLQEARSI  +H        
Sbjct: 279 IGMLRDVNKAKEISRYASKLGMDSNTLVGTALIDMHSKCGSLQEARSIFDSHFTNCRVNG 338

Query: 181 ---LQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKK----- 240
                ISGYLQ E NE+ALELFAKM  N++HLD YTYCSVFNA+AALKCL  GKK     
Sbjct: 339 PWNAMISGYLQSEFNEEALELFAKMCLNNVHLDRYTYCSVFNAIAALKCLSLGKKVHARA 398

Query: 241 -----------------------------------DGKRDLVSWTTLVTAYSQCSEWDKA 300
                                                +RDLVSWTTLVTAYSQCSEWDKA
Sbjct: 399 IKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVTAYSQCSEWDKA 458

Query: 301 IEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDM 334
           IEIFSNM EEGF PNQFAFSSVLVSCASLCLL+ GQQVHGF+CKVGLDM+KCI S LIDM
Sbjct: 459 IEIFSNMREEGFAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLDMDKCIQSALIDM 518

BLAST of Spg039607 vs. ExPASy TrEMBL
Match: A0A5D3C2B1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold409G001490 PE=4 SV=1)

HSP 1 Score: 480.3 bits (1235), Expect = 7.4e-132
Identity = 263/420 (62.62%), Positives = 293/420 (69.76%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M+ +GIFPD FAYSGILQICIGL+ +ELG+MV AQIVIRGF  HT VST+LLNMYAKLQ+
Sbjct: 176 MQSQGIFPDHFAYSGILQICIGLDSVELGKMVHAQIVIRGFTSHTFVSTALLNMYAKLQE 235

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           +EDS KVF TMTEVN+VSWNAMI+                                   A
Sbjct: 236 IEDSCKVFNTMTEVNVVSWNAMITGFTSNGFYLDAFDLFLRMKGEGVTPDAQTFIGVAKA 295

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSIL-THVL------ 180
           IGMLRDVNKAKEVS +A ELGVDSN L GTALIDMHSK GSLQEARSI  +H +      
Sbjct: 296 IGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMHSKCGSLQEARSIFNSHFVTCRFNA 355

Query: 181 ----QISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKK----- 240
                ISGYLQ   NEKALELFAKM Q+DIHLD YTYCSVFNA+A+LKCLLSGKK     
Sbjct: 356 PWNAMISGYLQSGLNEKALELFAKMCQSDIHLDRYTYCSVFNAIASLKCLLSGKKVHARA 415

Query: 241 -----------------------------------DGKRDLVSWTTLVTAYSQCSEWDKA 300
                                                 RDL+SWT+LVTAYSQCSEWDKA
Sbjct: 416 IKSGLEVNCVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSLVTAYSQCSEWDKA 475

Query: 301 IEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDM 334
           IEIFSNM  EG+ PNQFAFSSVLVSCA+LCLL+ GQQVHG +CKVGLDM+KCI S L+DM
Sbjct: 476 IEIFSNMRAEGYAPNQFAFSSVLVSCANLCLLEYGQQVHGIICKVGLDMDKCIESALVDM 535

BLAST of Spg039607 vs. ExPASy TrEMBL
Match: A0A5A7UEQ9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold60G005280 PE=4 SV=1)

HSP 1 Score: 480.3 bits (1235), Expect = 7.4e-132
Identity = 263/420 (62.62%), Positives = 293/420 (69.76%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M+ +GIFPD FAYSGILQICIGL+ +ELG+MV AQIVIRGF  HT VST+LLNMYAKLQ+
Sbjct: 159 MQSQGIFPDHFAYSGILQICIGLDSVELGKMVHAQIVIRGFTSHTFVSTALLNMYAKLQE 218

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           +EDS KVF TMTEVN+VSWNAMI+                                   A
Sbjct: 219 IEDSCKVFNTMTEVNVVSWNAMITGFTSNGFYLDAFDLFLRMKGEGVTPDAQTFIGVAKA 278

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSIL-THVL------ 180
           IGMLRDVNKAKEVS +A ELGVDSN L GTALIDMHSK GSLQEARSI  +H +      
Sbjct: 279 IGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMHSKCGSLQEARSIFNSHFVTCRFNA 338

Query: 181 ----QISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKK----- 240
                ISGYLQ   NEKALELFAKM Q+DIHLD YTYCSVFNA+A+LKCLLSGKK     
Sbjct: 339 PWNAMISGYLQSGLNEKALELFAKMCQSDIHLDRYTYCSVFNAIASLKCLLSGKKVHARA 398

Query: 241 -----------------------------------DGKRDLVSWTTLVTAYSQCSEWDKA 300
                                                 RDL+SWT+LVTAYSQCSEWDKA
Sbjct: 399 IKSGLEVNCVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSLVTAYSQCSEWDKA 458

Query: 301 IEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDM 334
           IEIFSNM  EG+ PNQFAFSSVLVSCA+LCLL+ GQQVHG +CKVGLDM+KCI S L+DM
Sbjct: 459 IEIFSNMRAEGYAPNQFAFSSVLVSCANLCLLEYGQQVHGIICKVGLDMDKCIESALVDM 518

BLAST of Spg039607 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 152.5 bits (384), Expect = 6.8e-37
Identity = 111/418 (26.56%), Positives = 196/418 (46.89%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           M   G+  D + +S + +    L  +  G  +   I+  GF     V  SL+  Y K Q+
Sbjct: 186 MMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQR 245

Query: 61  VEDSIKVFKTMTEVNIVSWNAMISA--------------IGML---RDVNKAKEVSHFAS 120
           V+ + KVF  MTE +++SWN++I+               + ML    +++ A  VS FA 
Sbjct: 246 VDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAG 305

Query: 121 -------ELGVDSNALAGTA-----------LIDMHSKRGSLQEA---------RSILTH 180
                   LG   +++   A           L+DM+SK G L  A         RS++++
Sbjct: 306 CADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSY 365

Query: 181 VLQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKKDGK---- 240
              I+GY +     +A++LF +M +  I  D YT  +V N  A  + L  GK+  +    
Sbjct: 366 TSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKE 425

Query: 241 -----------------------------------RDLVSWTTLVTAYSQCSEWDKAIEI 300
                                              +D++SW T++  YS+    ++A+ +
Sbjct: 426 NDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSL 485

Query: 301 FSNMGEE-GFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDMYA 334
           F+ + EE  F+P++   + VL +CASL   D G+++HG++ + G   ++ + + L+DMYA
Sbjct: 486 FNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYA 545

BLAST of Spg039607 vs. TAIR 10
Match: AT1G31430.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 151.0 bits (380), Expect = 2.0e-36
Identity = 104/369 (28.18%), Positives = 174/369 (47.15%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           +RG+G++PD F    +L+    L  +  G  V    V  G  F + VS SL+ MYA L K
Sbjct: 37  LRGQGLYPDNFTLPVVLKSIGRLRKVIEGEKVHGYAVKAGLEFDSYVSNSLMGMYASLGK 96

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS----------AIGMLRDVNKAKE--------VSHFA 120
           +E + KVF  M + ++VSWN +IS          AIG+ + +++           VS  +
Sbjct: 97  IEITHKVFDEMPQRDVVSWNGLISSYVGNGRFEDAIGVFKRMSQESNLKFDEGTIVSTLS 156

Query: 121 S-------ELG----------VDSNALAGTALIDMHSKRGSLQEARSILTHVLQISGYLQ 180
           +       E+G           + +   G AL+DM  K G L +AR++   +        
Sbjct: 157 ACSALKNLEIGERIYRFVVTEFEMSVRIGNALVDMFCKCGCLDKARAVFDSM-------- 216

Query: 181 GECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGKKDGKRDLVSWTTLVTAY 240
               +K ++ +  M    +         V    + +K           D+V WT ++  Y
Sbjct: 217 ---RDKNVKCWTSMVFGYVSTGRIDEARVLFERSPVK-----------DVVLWTAMMNGY 276

Query: 241 SQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNK 300
            Q + +D+A+E+F  M   G  P+ F   S+L  CA    L+ G+ +HG++ +  + ++K
Sbjct: 277 VQFNRFDEALELFRCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKWIHGYINENRVTVDK 336

Query: 301 CIGSGLIDMYAKCGSLAEAKKVFERISNANTVSWTTVTSGHAQYSIVDDAFELFRRMEQL 334
            +G+ L+DMYAKCG +  A +VF  I   +T SWT++  G A   +   A +L+  ME +
Sbjct: 337 VVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGMSGRALDLYYEMENV 383

BLAST of Spg039607 vs. TAIR 10
Match: AT4G37170.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 151.0 bits (380), Expect = 2.0e-36
Identity = 94/350 (26.86%), Positives = 165/350 (47.14%), Query Frame = 0

Query: 3   GRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQKVE 62
           GR   P    Y  ++Q+C     +E G+ V   I   GF     +   LL MYAK   + 
Sbjct: 78  GRAKKPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFVPGIVIWNRLLRMYAKCGSLV 137

Query: 63  DSIKVFKTMTEVNIVSWNAMISAIGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHS 122
           D+ KVF  M   ++ SWN M++      +V   +E      E+  + ++ + TA++  + 
Sbjct: 138 DARKVFDEMPNRDLCSWNVMVNGYA---EVGLLEEARKLFDEM-TEKDSYSWTAMVTGYV 197

Query: 123 KRGSLQEA--------------RSILTHVLQISGYLQGECNEKALELFAKMYQNDIHLDH 182
           K+   +EA               +I T  + ++     +C  +  E+   + +  +  D 
Sbjct: 198 KKDQPEEALVLYSLMQRVPNSRPNIFTVSIAVAAAAAVKCIRRGKEIHGHIVRAGLDSDE 257

Query: 183 YTYCSVFNAMAALKCLLSGK----KDGKRDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEE 242
             + S+ +      C+   +    K  ++D+VSWT+++  Y + S W +   +FS +   
Sbjct: 258 VLWSSLMDMYGKCGCIDEARNIFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFSELVGS 317

Query: 243 GFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKCIGSGLIDMYAKCGSLAEA 302
              PN++ F+ VL +CA L   + G+QVHG++ +VG D      S L+DMY KCG++  A
Sbjct: 318 CERPNEYTFAGVLNACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNIESA 377

Query: 303 KKVFERISNANTVSWTTVTSGHAQYSIVDDAFELFRRMEQLGVEPNAVYF 334
           K V +     + VSWT++  G AQ    D+A + F  + + G +P+ V F
Sbjct: 378 KHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTF 423

BLAST of Spg039607 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 148.3 bits (373), Expect = 1.3e-35
Identity = 104/417 (24.94%), Positives = 179/417 (42.93%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           MR   + P  + ++ +L++C     + +G+ +   +V  GF+      T L NMYAK ++
Sbjct: 126 MRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQ 185

Query: 61  VEDSIKVFKTMTEVNIVSWNAMIS-----------------------------------A 120
           V ++ KVF  M E ++VSWN +++                                   A
Sbjct: 186 VNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPA 245

Query: 121 IGMLRDVNKAKEVSHFASELGVDSNALAGTALIDMHSKRGSLQEARSILTHVLQ------ 180
           +  LR ++  KE+  +A   G DS     TAL+DM++K GSL+ AR +   +L+      
Sbjct: 246 VSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSW 305

Query: 181 ---ISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNAMAALKCLLSGK-------- 240
              I  Y+Q E  ++A+ +F KM    +     +     +A A L  L  G+        
Sbjct: 306 NSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVE 365

Query: 241 -------------------------------KDGKRDLVSWTTLVTAYSQCSEWDKAIEI 300
                                          K   R LVSW  ++  ++Q      A+  
Sbjct: 366 LGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNY 425

Query: 301 FSNMGEEGFTPNQFAFSSVLVSCASLCLL-DGQQVHGFLCKVGLDMNKCIGSGLIDMYAK 334
           FS M      P+ F + SV+ + A L +    + +HG + +  LD N  + + L+DMYAK
Sbjct: 426 FSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAK 485

BLAST of Spg039607 vs. TAIR 10
Match: AT3G02010.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 148.3 bits (373), Expect = 1.3e-35
Identity = 98/335 (29.25%), Positives = 162/335 (48.36%), Query Frame = 0

Query: 1   MRGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVIRGFAFHTCVSTSLLNMYAKLQK 60
           MR  G  P  F +SG+L+  +GL    LG+ + A  V  GF+    V   +L+ Y+K  +
Sbjct: 241 MRQSGHQPSDFTFSGVLKAVVGLHDFALGQQLHALSVTTGFSRDASVGNQILDFYSKHDR 300

Query: 61  VEDSIKVFKTMTEVNIVSWNAMISAIGMLRDVNKAKEVSHFASE---LGVDSNALAGTAL 120
           V ++  +F  M E++ VS+N +IS+       ++ +   HF  E   +G D        +
Sbjct: 301 VLETRMLFDEMPELDFVSYNVVISSYS---QADQYEASLHFFREMQCMGFDRRNFPFATM 360

Query: 121 IDMHSKRGSLQEARSILTHVLQISGYLQGECNEKALELFAKMYQNDIHLDHYTYCSVFNA 180
           + + +   SLQ  R +              C        + ++  +  +D Y  C +F  
Sbjct: 361 LSIAANLSSLQMGRQL-------------HCQALLATADSILHVGNSLVDMYAKCEMFE- 420

Query: 181 MAALKCLLSGKKDGKRDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVL 240
               +  L  K   +R  VSWT L++ Y Q       +++F+ M       +Q  F++VL
Sbjct: 421 ----EAELIFKSLPQRTTVSWTALISGYVQKGLHGAGLKLFTKMRGSNLRADQSTFATVL 480

Query: 241 VSCASLC-LLDGQQVHGFLCKVGLDMNKCIGSGLIDMYAKCGSLAEAKKVFERISNANTV 300
            + AS   LL G+Q+H F+ + G   N   GSGL+DMYAKCGS+ +A +VFE + + N V
Sbjct: 481 KASASFASLLLGKQLHAFIIRSGNLENVFSGSGLVDMYAKCGSIKDAVQVFEEMPDRNAV 540

Query: 301 SWTTVTSGHAQYSIVDDAFELFRRMEQLGVEPNAV 332
           SW  + S HA     + A   F +M + G++P++V
Sbjct: 541 SWNALISAHADNGDGEAAIGAFAKMIESGLQPDSV 554
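
The per-HSP summary lines above carry an identity count and an alignment length, from which the reported percentages follow (for example, 111/418 gives 26.56%). The following is a minimal Python sketch for reading those lines, assuming the summary text has the layout shown on this page. The function name `parse_hsp` and the regular expression are illustrative and are not part of any BLAST tooling.

```python
import re

# Illustrative pattern for the two-line HSP summary as laid out on this page.
# The "Pos\w+" part matches the positives label loosely.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Pos\w+\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(text):
    """Return bit score, E-value and recomputed percentages for one HSP summary."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    length = int(m["length"])
    return {
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
        # Percentages are recomputed from the counts rather than read from the text.
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


summary = (
    "HSP 1 Score: 152.5 bits (384), Expect = 6.8e-37\n"
    "Identity = 111/418 (26.56%), Positives = 196/418 (46.89%), Query Frame = 0"
)
result = parse_hsp(summary)
print(result)
```

Recomputing the percentages from the counts gives a simple consistency check against the values printed in the report.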

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022134356.1  1.5e-134  64.76     pentatricopeptide repeat-containing protein At2g33680-like [Momordica charantia]
XP_022939229.1  7.3e-134  63.57     pentatricopeptide repeat-containing protein At3g16610-like [Cucurbita moschata] ...
KAG7016032.1    7.3e-134  63.57     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_023549692.1  7.3e-134  63.33     pentatricopeptide repeat-containing protein At3g16610-like [Cucurbita pepo subsp...
XP_022993736.1  1.3e-133  63.33     pentatricopeptide repeat-containing protein At2g27610-like [Cucurbita maxima]

Match Name  E-value  Identity  Description
Q9SN39      9.6e-36  26.56     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
O23169      2.8e-35  26.86     Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX...
Q9C866      2.8e-35  28.18     Pentatricopeptide repeat-containing protein At1g31430 OS=Arabidopsis thaliana OX...
Q9S7F4      1.8e-34  29.25     Putative pentatricopeptide repeat-containing protein At2g01510 OS=Arabidopsis th...
Q3E6Q1      1.8e-34  24.94     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...

Match Name  E-value   Identity  Description
A0A6J1BXL8  7.2e-135  64.76     pentatricopeptide repeat-containing protein At2g33680-like OS=Momordica charanti...
A0A6J1FGJ6  3.6e-134  63.57     pentatricopeptide repeat-containing protein At3g16610-like OS=Cucurbita moschata...
A0A6J1JX63  6.1e-134  63.33     pentatricopeptide repeat-containing protein At2g27610-like OS=Cucurbita maxima O...
A0A5D3C2B1  7.4e-132  62.62     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A5A7UEQ9  7.4e-132  62.62     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name   E-value  Identity  Description
AT4G18750.1  6.8e-37  26.56     Pentatricopeptide repeat (PPR) superfamily protein
AT1G31430.1  2.0e-36  28.18     Pentatricopeptide repeat (PPR-like) superfamily protein
AT4G37170.1  2.0e-36  26.86     Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1  1.3e-35  24.94     Pentatricopeptide repeat (PPR) superfamily protein
AT3G02010.1  1.3e-35  29.25     Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 49..74, e-value: 0.045, score: 14.0
    coord: 140..162, e-value: 0.74, score: 10.2
    coord: 77..98, e-value: 0.26, score: 11.6
    coord: 296..326, e-value: 0.0016, score: 18.5
    coord: 270..290, e-value: 0.037, score: 14.3
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 296..330, e-value: 2.4E-5, score: 22.2
    coord: 196..229, e-value: 6.0E-7, score: 27.2
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 194..241, e-value: 4.7E-8, score: 33.1
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 294..328, score: 10.972319
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 194..228, score: 12.309597
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 250..338, e-value: 3.2E-15, score: 57.9
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 1..138, e-value: 3.7E-19, score: 71.2
    coord: 139..247, e-value: 7.9E-18, score: 66.9
None  No IPR available  PANTHER  PTHR24015  OS07G0578800 PROTEIN-RELATED
    coord: 192..333
    coord: 3..83
None  No IPR available  PANTHER  PTHR24015:SF1875  PENTATRICOPEPTIDE (PPR) REPEAT PROTEIN
    coord: 192..333
    coord: 3..83
None  No IPR available  PANTHER  PTHR24015  OS07G0578800 PROTEIN-RELATED
    coord: 140..188
None  No IPR available  PANTHER  PTHR24015:SF1875  PENTATRICOPEPTIDE (PPR) REPEAT PROTEIN
    coord: 140..188
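
The InterPro hits above come from several member databases and are not listed in positional order. As a rough illustration only, the sketch below merges the PPR-related coordinate ranges (PF01535, TIGR00756, PF13041, PS51375) copied from the table into non-overlapping regions and counts the residues they cover. `merge_intervals` is a hypothetical helper, and treating the coordinates as 1-based and inclusive is an assumption.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) ranges; coordinates assumed 1-based inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# Coordinates copied from the InterPro table for this gene.
ppr_hits = [
    (49, 74), (140, 162), (77, 98), (296, 326), (270, 290),  # PFAM PF01535
    (296, 330), (196, 229),                                  # TIGRFAM TIGR00756
    (194, 241),                                              # PFAM PF13041
    (294, 328), (194, 228),                                  # PROSITE PS51375
]

regions = merge_intervals(ppr_hits)
covered = sum(end - start + 1 for start, end in regions)
print(regions, covered)
```

Merging across sources like this ignores differences in how each member database defines a repeat boundary, so the result is a coarse summary rather than a repeat count.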

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Spg039607.1   Spg039607.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding