CmoCh14G014620 (gene) Cucurbita moschata (Rifu)

Name: CmoCh14G014620
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Pentatricopeptide repeat-containing protein
Location: Cmo_Chr14 : 11851445 .. 11853175 (-)
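
The location value packs a sequence ID, a start, an end and a strand into one string. A minimal Python sketch of unpacking it is shown here. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Split a string like 'Cmo_Chr14 : 11851445 .. 11853175 (-)' into its parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr14 : 11851445 .. 11853175 (-)")
print(loc)
```

Under that assumption the feature spans 1731 bases on the minus strand of Cmo_Chr14.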
The following sequences are available for this feature:

Gene sequence (with intron)

AATAATTTCGTGGAAATCAGACTCCTACTGGAGCGTAAGGACACCATTCCGCCTTCACTGAACTGAGTTTCTCGCAAAAGGGCAGTCGAGATGAAATCTACGCTAATGGGTCGTCTTCAACTCCATTTCCCTCAATTGGGTTTCCGCCAAAACCTCACAAATCCAACCCTCCATTGTTGTACGGCAGGTCCACCTCCAGATACCATCTGTGGCCTCAGAAAGGGTCAGAGGAAGCCCTTAGGTAAGTCAAGGGTGCCCTCCACTGAGTCTATTCAAGCAGTTCAGTCGCTCAAGCTCGCTAAATCCGCCTCCAAAATGGAGGACGTAATCAATAGCAAGCTTAGCAGATTGTTGAAAGCAGACTTGTTTGATGCTCTGGCTGAATTACAGAGGCAAAATGAACTGGAACTATCGCTTCAGGTATTGCTTACTGCTGTTAATCGTATCGGAATATTGTAAAAACCAGATATCTTACCCTTAAATTATGAGATTGATCGAATTATACCTTAAACTCTCTTAGAGGCTGCCTTTAGTTCAACTCAATAGCAGTGGTTGCTTAGAATTCAAGGTTTAGGGTTTAATCGTCTGCGTTTTTTCAAGCTACAACTATCAGAGATTCAATTACAACGATGAGGAAAAGCCTAATGGTGTGATTGTAGCTATGTTTTTGCATCTTTTTCGCTATCATTTATTCTGTTGTAGGTTTTCCCTTACAAGAACTTAAATTATTTGACATTGGCTAATTTAGGAAATGATCATAGATTTATAATTTATAAATAAGGAATACATCTTTATTGGTATGAGTCCTTTCGGGGAAATCAAAAGTAAAGCCATCAAAGTTTATGCTCAAAGTGAACAATATCATACCATTGTGGAAGGTCGTGGTTCCTTACATTATATTAGAGTCATGCCCTTAACTTAGCCATATCCATAAAATCCTCAAATGTCGAACAAAGAAATTGTGAGCCTCAAAGGTGTAGTCAAAAGTGACTCAAGTGTCGAACAAAGGGTGTACTTTGTTCGAGGGTTCCAGAGAAAGGAGTCGAGCCTCGATTAAGGAGAGGATGTTCGAGGGCTACATAGGCCTCAGGGGAGGCTCTATGGTGTACTTTGTTCGAGTGGAGGATTGTTGAGGATTGTTGGGAGGGAGGTCCCACATTGGCTAATTTAGGGAATGATCATGAGTTTATAAGGAATACAAAGTCGACAATATCATACCATTGTAGAGGTTCGTGATTCCTAACAATAAGTTTACTGGAACGGTTCCATTTTATTCTCTTCTGCAGCCTATAGCAATCCTGCCACTTCAAATTTTATTTCATCTTAACTGAATTCAGGCTTTAGGTCTTCAAATTTATGAGGAATGAAGAGTGGTACGAGCCAGATTTAAACTTGTACCATGTGATGATTCAAATGATGGGGAAGAACAAAATGATTGAAATGGCTGAAGAGGTCTTCCATGAGTTAAAAAGGGATGGGTTAGAACCAGACACAAGAGCTTTTAATGAGATGATGGGAGCATATTTGCAAGTAGACATGGTGGAAAGAGCTGTTGAGACATATGAATTAATGAAAGCATCAGGTTGTATTCCAGATAAACTGACTTTCAAGATTTTGATCAAGAATCTTGAGAGATTTAGGGAAGAGTTTGCTGCAGTGGTCAAGAAAGAATGTGCTGAGTTCTTGGATTCTCCTGAGAAGTTCCTCAGAGGGATGTTGAACAGAAACTGA

mRNA sequence

AATAATTTCGTGGAAATCAGACTCCTACTGGAGCGTAAGGACACCATTCCGCCTTCACTGAACTGAGTTTCTCGCAAAAGGGCAGTCGAGATGAAATCTACGCTAATGGGTCGTCTTCAACTCCATTTCCCTCAATTGGGTTTCCGCCAAAACCTCACAAATCCAACCCTCCATTGTTGTACGGCAGGTCCACCTCCAGATACCATCTGTGGCCTCAGAAAGGGTCAGAGGAAGCCCTTAGGTAAGTCAAGGGTGCCCTCCACTGAGTCTATTCAAGCAGTTCAGTCGCTCAAGCTCGCTAAATCCGCCTCCAAAATGGAGGACGTAATCAATAGCAAGCTTAGCAGATTGTTGAAAGCAGACTTGTTTGATGCTCTGGCTGAATTACAGAGGCAAAATGAACTGGAACTATCGCTTCAGGTCTTCAAATTTATGAGGAATGAAGAGTGGTACGAGCCAGATTTAAACTTGTACCATGTGATGATTCAAATGATGGGGAAGAACAAAATGATTGAAATGGCTGAAGAGGTCTTCCATGAGTTAAAAAGGGATGGGTTAGAACCAGACACAAGAGCTTTTAATGAGATGATGGGAGCATATTTGCAAGTAGACATGGTGGAAAGAGCTGTTGAGACATATGAATTAATGAAAGCATCAGGTTGTATTCCAGATAAACTGACTTTCAAGATTTTGATCAAGAATCTTGAGAGATTTAGGGAAGAGTTTGCTGCAGTGGTCAAGAAAGAATGTGCTGAGTTCTTGGATTCTCCTGAGAAGTTCCTCAGAGGGATGTTGAACAGAAACTGA
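
The mRNA sequence is the gene sequence with the intron removed. The sketch below recovers the intron boundaries by comparing the two strings. It assumes the gene has exactly one intron, and `find_single_intron` is a hypothetical helper, not part of any database tooling. Because the bases at an exon edge can repeat at the intron edge, several cut positions may reproduce the same mRNA, so the sketch prefers a placement that gives a canonical GT...AG intron.

```python
def find_single_intron(gene, mrna):
    """Return (start, end) of the one intron in `gene`, 0-based half-open.

    Assumes `mrna` equals `gene` with exactly one contiguous block removed.
    """
    ilen = len(gene) - len(mrna)
    if ilen <= 0:
        raise ValueError("gene must be longer than mRNA")
    # Longest common prefix and suffix bound where the cut can lie.
    p = 0
    while p < len(mrna) and gene[p] == mrna[p]:
        p += 1
    q = 0
    while q < len(mrna) and gene[-1 - q] == mrna[-1 - q]:
        q += 1
    candidates = list(range(max(0, len(mrna) - q), p + 1))
    if not candidates:
        raise ValueError("sequences are not related by a single deletion")
    # Prefer a placement whose intron starts with GT and ends with AG.
    for s in candidates:
        intron = gene[s:s + ilen]
        if intron.startswith("GT") and intron.endswith("AG"):
            return s, s + ilen
    return candidates[0], candidates[0] + ilen

# Toy sequences (not taken from this record) to illustrate the idea.
toy_gene = "ATGGCCAG" + "GTAAGTTTTCAG" + "GTCTTCTGA"
toy_mrna = "ATGGCCAG" + "GTCTTCTGA"
toy_intron = find_single_intron(toy_gene, toy_mrna)
print(toy_intron)
```

The same call could be applied to the gene and mRNA strings above, provided the single-intron assumption holds for this gene.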

Coding sequence (CDS)

ATGAAATCTACGCTAATGGGTCGTCTTCAACTCCATTTCCCTCAATTGGGTTTCCGCCAAAACCTCACAAATCCAACCCTCCATTGTTGTACGGCAGGTCCACCTCCAGATACCATCTGTGGCCTCAGAAAGGGTCAGAGGAAGCCCTTAGGTAAGTCAAGGGTGCCCTCCACTGAGTCTATTCAAGCAGTTCAGTCGCTCAAGCTCGCTAAATCCGCCTCCAAAATGGAGGACGTAATCAATAGCAAGCTTAGCAGATTGTTGAAAGCAGACTTGTTTGATGCTCTGGCTGAATTACAGAGGCAAAATGAACTGGAACTATCGCTTCAGGTCTTCAAATTTATGAGGAATGAAGAGTGGTACGAGCCAGATTTAAACTTGTACCATGTGATGATTCAAATGATGGGGAAGAACAAAATGATTGAAATGGCTGAAGAGGTCTTCCATGAGTTAAAAAGGGATGGGTTAGAACCAGACACAAGAGCTTTTAATGAGATGATGGGAGCATATTTGCAAGTAGACATGGTGGAAAGAGCTGTTGAGACATATGAATTAATGAAAGCATCAGGTTGTATTCCAGATAAACTGACTTTCAAGATTTTGATCAAGAATCTTGAGAGATTTAGGGAAGAGTTTGCTGCAGTGGTCAAGAAAGAATGTGCTGAGTTCTTGGATTCTCCTGAGAAGTTCCTCAGAGGGATGTTGAACAGAAACTGA
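
The protein used as the BLAST query below is the translation of this CDS. A minimal, dependency-free Python sketch of that translation follows. It assumes the standard nuclear genetic code, and the `translate` helper is illustrative rather than the pipeline the database used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
print(translate("ATGAAATCTACGCTAATGGGTCGTCTTCAA"))  # MKSTLMGRLQ
```

The output matches residues 1 to 10 of the query in the alignments that start at position 1.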
BLAST of CmoCh14G014620 vs. Swiss-Prot
Match: PPR89_ARATH (Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana GN=At1g62350 PE=2 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 2.0e-17
Identity = 50/155 (32.26%), Positives = 93/155 (60.00%), Query Frame = 1

Query: 57  STESIQAVQSLKLAKSAS-KMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFM 116
           S E + A + LK  ++ S +++  I S +SRLLK+DL   LAE QRQN++ L +++++ +
Sbjct: 2   SKEGLIAAKELKRLQTQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYEVV 61

Query: 117 RNEEWYEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDM 176
           R E WY PD+  Y  M+ M+ +NK ++  ++V+ +LK++ +  D   F +++  +L  ++
Sbjct: 62  RREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEEVLFDQHTFGDLVRGFLDNEL 121

Query: 177 VERAVETYELMKASGCIPDKLTFKILIKNLERFRE 211
              A+  Y  M+ S   P  L F++++K L  + E
Sbjct: 122 PLEAMRLYGEMRESPDRPLSLPFRVILKGLVPYPE 156
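
Each HSP summary line reports counts as matches over alignment length, with the percentage in parentheses. The sketch below shows how such a line could be parsed and the percentages recomputed. The helper name `parse_hsp_stats` is hypothetical, and the regular expression is only an assumption about the line layout seen on this page.

```python
import re

def parse_hsp_stats(line):
    """Map each 'Label = n/m' pair to (n, m, percent rounded to 2 decimals)."""
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        n, m = int(num), int(den)
        stats[label.lower()] = (n, m, round(100 * n / m, 2))
    return stats

hsp = parse_hsp_stats(
    "Identity = 50/155 (32.26%), Positives = 93/155 (60.00%), Query Frame = 1"
)
print(hsp)
```

Fields without a fraction, such as the query frame, are skipped by the pattern.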

BLAST of CmoCh14G014620 vs. Swiss-Prot
Match: PP266_ARATH (Pentatricopeptide repeat-containing protein At3g46870 OS=Arabidopsis thaliana GN=At3g46870 PE=1 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 3.5e-17
Identity = 58/206 (28.16%), Positives = 112/206 (54.37%), Query Frame = 1

Query: 18  FRQNLT-NPTLHCCTAG------------PPPDTICGLRKGQRKPLGK----SRVPSTES 77
           F QN+T NP++H  +              P P T+   R    +P G      ++   E+
Sbjct: 18  FFQNITRNPSIHRISFSNLKPKTLLHPIPPKPFTVFVSRFHDGRPRGPLWRGKKLIGKEA 77

Query: 78  IQAVQSLK-LAKSASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEE 137
           +  +  LK L +   K++  I + + RLLK D+   + EL+RQ E  L++++F+ ++ +E
Sbjct: 78  LFVILGLKRLKEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQEETALAIKMFEVIQKQE 137

Query: 138 WYEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERA 197
           WY+PD+ +Y  +I  + K+K ++ A  ++ ++K++ L PD++ + E++  +L+      A
Sbjct: 138 WYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKENLFPDSQTYTEVIRGFLRDGCPADA 197

Query: 198 VETYELMKASGCIPDKLTFKILIKNL 206
           +  YE M  S   P++L F++L+K L
Sbjct: 198 MNVYEDMLKSPDPPEELPFRVLLKGL 223

BLAST of CmoCh14G014620 vs. Swiss-Prot
Match: PP279_ARATH (Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana GN=At3g53170 PE=3 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 1.1e-10
Identity = 33/112 (29.46%), Positives = 63/112 (56.25%), Query Frame = 1

Query: 92  LFDALAELQRQNELELSLQVFKFMRNEEWYEPDLNLYHVMIQMMGKNKMIEMAEEVFHEL 151
           + +AL E  ++N  + +L++F  +R + WYEP    Y  + +++G  K  + A  +F  +
Sbjct: 61  VLEALDEAIKENRWQSALKIFNLLRKQHWYEPRCKTYTKLFKVLGNCKQPDQASLLFEVM 120

Query: 152 KRDGLEPDTRAFNEMMGAYLQVDMVERAVETYELMKA-SGCIPDKLTFKILI 203
             +GL+P    +  ++  Y + +++++A  T E MK+ S C PD  TF +LI
Sbjct: 121 LSEGLKPTIDVYTSLISVYGKSELLDKAFSTLEYMKSVSDCKPDVFTFTVLI 172

BLAST of CmoCh14G014620 vs. Swiss-Prot
Match: PP424_ARATH (Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidopsis thaliana GN=At5g48730 PE=2 SV=2)

HSP 1 Score: 65.5 bits (158), Expect = 9.2e-10
Identity = 43/140 (30.71%), Positives = 73/140 (52.14%), Query Frame = 1

Query: 66  SLKLAKSASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEEWYEPDL 125
           S+ L + A+K          +LL   + ++L E       E ++QVF+ +R + WY+P++
Sbjct: 91  SIILRREATKSIIEKKKGSKKLLPRTVLESLHERITALRWESAIQVFELLREQLWYKPNV 150

Query: 126 NLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERAVETYEL 185
            +Y  +I M+GK K  E A E+F E+  +G   +   +  ++ AY +    + A    E 
Sbjct: 151 GIYVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVYTALVSAYSRSGRFDAAFTLLER 210

Query: 186 MKAS-GCIPDKLTFKILIKN 205
           MK+S  C PD  T+ ILIK+
Sbjct: 211 MKSSHNCQPDVHTYSILIKS 230

BLAST of CmoCh14G014620 vs. Swiss-Prot
Match: PP186_ARATH (Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana GN=At2g35130 PE=2 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 7.8e-09
Identity = 29/96 (30.21%), Positives = 56/96 (58.33%), Query Frame = 1

Query: 107 LSLQVFKFMRNEEWYEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEM 166
           +S +++  MR+ +  +P++  Y  ++    +  + E AEE+F +L+ DGLEPD   +N +
Sbjct: 282 MSWKLYCEMRSHQC-KPNICTYTALVNAFAREGLCEKAEEIFEQLQEDGLEPDVYVYNAL 341

Query: 167 MGAYLQVDMVERAVETYELMKASGCIPDKLTFKILI 203
           M +Y +      A E + LM+  GC PD+ ++ I++
Sbjct: 342 MESYSRAGYPYGAAEIFSLMQHMGCEPDRASYNIMV 376

BLAST of CmoCh14G014620 vs. TrEMBL
Match: A0A0A0KU23_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G011020 PE=4 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 3.5e-101
Identity = 188/230 (81.74%), Positives = 208/230 (90.43%), Query Frame = 1

Query: 1   MKSTLMGRLQLHFPQLGFRQNLTNPTLHCCTAGPPPDTICGLRKGQRKPLGKSRVPSTES 60
           MKSTL+G LQLHF QLG RQNLTN +L C TA PPP+ ICGLRKG  +PLG SRVPS E+
Sbjct: 1   MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEA 60

Query: 61  IQAVQSLKLAKSASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEEW 120
           IQAVQSLKLAKS SKMEDVIN+KL RLLKADLFDAL+ELQRQNELELSLQVFKFM+NEEW
Sbjct: 61  IQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEW 120

Query: 121 YEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERAV 180
           +EPDL LYH MI +MGKNKMIEMAEEVFH+L++DGLEPDTRAFNEMMGAYLQVDM+ERAV
Sbjct: 121 FEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAV 180

Query: 181 ETYELMKASGCIPDKLTFKILIKNLERFREEFAAVVKKECAEFLDSPEKF 231
           ETY LM ASGC PD+LTFKILIKNLE+FREEFA VVKK+C E+LD+P+KF
Sbjct: 181 ETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKKDCNEYLDNPQKF 230

BLAST of CmoCh14G014620 vs. TrEMBL
Match: M5XFP6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010483mg PE=4 SV=1)

HSP 1 Score: 278.5 bits (711), Expect = 7.7e-72
Identity = 150/232 (64.66%), Positives = 181/232 (78.02%), Query Frame = 1

Query: 6   MGRLQLHFPQLGFRQNLTNPTLHCCTAGP-----PPDTICGLRKGQRKPLGKSRVPSTES 65
           M  ++ H PQLGF++N   P L    A P     P   +CGLR G RKPL +SRV STE+
Sbjct: 1   MSSIKFHIPQLGFKKNYPEPHLRYRLALPSSSSTPSHIVCGLRGGPRKPLWRSRVLSTEA 60

Query: 66  IQAVQSLKLAKS-ASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEE 125
           IQAVQSLKL+KS  SK+E V+  +LSRLLKADL DALAELQRQNE+EL+L+VFKF+R E 
Sbjct: 61  IQAVQSLKLSKSNPSKLEQVMGGRLSRLLKADLLDALAELQRQNEVELALKVFKFVREEV 120

Query: 126 WYEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERA 185
           WY+PDL+LY  MI ++GKNK+IEMAEE+F  LK +GLE DTRAF EM+GAY+QV M E+A
Sbjct: 121 WYKPDLSLYCSMILLLGKNKLIEMAEELFSGLKEEGLEYDTRAFTEMIGAYIQVGMTEKA 180

Query: 186 VETYELMKASGCIPDKLTFKILIKNLERF-REEFAAVVKKECAEFLDSPEKF 231
           +ETYELMKASGC PDKLTF ILI+NLE+   EE AA +KK CAE++DSPEKF
Sbjct: 181 METYELMKASGCAPDKLTFTILIRNLEKVGEEELAAHIKKYCAEYVDSPEKF 232

BLAST of CmoCh14G014620 vs. TrEMBL
Match: A0A0D2PRX0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G125600 PE=4 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 2.8e-66
Identity = 141/244 (57.79%), Positives = 182/244 (74.59%), Query Frame = 1

Query: 1   MKSTLMGRLQL-HFPQLGF----RQNLTNPTLHCCTAGPPPDTICGLRKGQRKPLGKSRV 60
           MK +LMG L++ H PQ+GF      N    ++            CGLR G +KPL KSRV
Sbjct: 1   MKPSLMGSLRISHLPQMGFLKYPHHNHKQSSI----------ITCGLRGGTKKPLWKSRV 60

Query: 61  PSTESIQAVQSLKLAKSASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFM 120
            STE+IQAV SLKLA S SK+  V++++LSRLLKADL D LAELQRQNE  L+L+VF+F+
Sbjct: 61  LSTEAIQAVHSLKLANSNSKLHHVLSTRLSRLLKADLLDTLAELQRQNEFHLALKVFEFV 120

Query: 121 RNEEWYEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDM 180
           R E WY+PD+ LY  MIQ++GKNKM EMAE++F EL++DGL+PDTRAF E++GAYLQV M
Sbjct: 121 RKEVWYKPDMCLYCNMIQLLGKNKMTEMAEQLFTELEKDGLKPDTRAFTELIGAYLQVGM 180

Query: 181 VERAVETYELMKASGCIPDKLTFKILIKNLERF-REEFAAVVKKECAEFLDSPEKFLRGM 239
           +E+A+ETYE +KA GC PDKLTF ILI+NLE   +EE AAVVKK+C E+L+ PE+FL  +
Sbjct: 181 MEKAMETYERLKACGCSPDKLTFTILIRNLENVGKEELAAVVKKDCIEYLEFPERFLEDV 234

BLAST of CmoCh14G014620 vs. TrEMBL
Match: A0A061EK44_THECC (Vacuolar sorting protein 9 domain, putative isoform 1 OS=Theobroma cacao GN=TCM_019942 PE=4 SV=1)

HSP 1 Score: 259.2 bits (661), Expect = 4.8e-66
Identity = 140/232 (60.34%), Positives = 173/232 (74.57%), Query Frame = 1

Query: 1   MKSTLMGRLQLHFPQLGFRQNLTNPTLHCCTAGPPPDTICGLRKGQRKPLGKSRVPSTES 60
           MK +LMG L L F  +     L NP  H      P    CGLR G RK L +SRV S E+
Sbjct: 1   MKPSLMGTLNLKFSLIPLTGFLQNPQNH--KRQHPSTVTCGLRGGTRKHLWRSRVLSAEA 60

Query: 61  IQAVQSLKLAKSASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEEW 120
           IQAV SLKLA S SK++ V ++KLSRLLKADL D LAELQRQNE  L+L+VF+F+R E W
Sbjct: 61  IQAVHSLKLANSNSKLQHVFSNKLSRLLKADLLDTLAELQRQNEFHLALEVFEFVRKEVW 120

Query: 121 YEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERAV 180
           Y+PDL+LY  MIQ++GKN+M EMAE VF EL ++GL+PDTRAF EM+GAYL V M ++A+
Sbjct: 121 YKPDLSLYCDMIQLLGKNRMTEMAERVFTELDKEGLKPDTRAFTEMIGAYLIVGMTDKAM 180

Query: 181 ETYELMKASGCIPDKLTFKILIKNLERF-REEFAAVVKKECAEFLDSPEKFL 232
           ETYE++KASGC PDKLTF ILI+NLE   RE+ AAV+KK+C E+L+ PE+FL
Sbjct: 181 ETYEMLKASGCCPDKLTFTILIRNLENAGREDLAAVLKKDCTEYLEYPERFL 230

BLAST of CmoCh14G014620 vs. TrEMBL
Match: A5B6N8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_017782 PE=4 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 1.8e-65
Identity = 140/227 (61.67%), Positives = 171/227 (75.33%), Query Frame = 1

Query: 6   MGRLQLHFPQLGFRQNLTNPTLHCCTAGPPPDTICGLRKGQRKPLGKSRVPSTESIQAVQ 65
           MG L+   PQL F QN   P  H       P   CGLR G RKPL +SRV STE+IQ VQ
Sbjct: 1   MGSLKFQLPQLRFPQN---PQTH---KPQFPKIACGLRGGPRKPLWRSRVLSTEAIQVVQ 60

Query: 66  SLKLAKSASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEEWYEPDL 125
           SLKLAKS+ K+E+V +S++SRLLK+DL D LAELQRQ EL+L+L+VF+F+R E WY+PDL
Sbjct: 61  SLKLAKSSIKLEEVFSSRVSRLLKSDLLDTLAELQRQGELDLTLKVFEFIRKEVWYKPDL 120

Query: 126 NLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERAVETYEL 185
           +LY  MI ++GK K+IEMAE +F ELK++GLEPDTR + EM+GAYLQV M E+A+E Y L
Sbjct: 121 SLYSDMIMILGKKKLIEMAEGLFSELKKEGLEPDTRVYTEMIGAYLQVGMTEKAMEMYGL 180

Query: 186 MKASGCIPDKLTFKILIKNLERF-REEFAAVVKKECAEFLDSPEKFL 232
           MKASGC PDKLT  ILI+NLE    EE AA VKKEC E++D P+KFL
Sbjct: 181 MKASGCAPDKLTLTILIRNLENAGEEELAAGVKKECEEYVDYPKKFL 221

BLAST of CmoCh14G014620 vs. TAIR10
Match: AT5G09320.1 (AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain)

HSP 1 Score: 104.0 bits (258), Expect = 1.3e-22
Identity = 68/207 (32.85%), Positives = 113/207 (54.59%), Query Frame = 1

Query: 43  RKGQRKPLGKSRVPSTESIQAVQSLKLAK----------------SASKMEDVINSKLSR 102
           R   RKPL + R+ S E+IQAVQ+LK A                 S++ ++ VI SK  R
Sbjct: 492 RSKNRKPLQRGRMLSIEAIQAVQALKRANPLLPPPPVPSTSTTSSSSALLDRVIISKFRR 551

Query: 103 LLKADLFDALAELQRQNELELSLQVFKFMRNEEWYEPDLNLYHVMIQMMGKNKMIEMAEE 162
           LLK D+   L EL RQNE  L+L+VF+ +R E WY+P + +Y  MI +M  N ++E    
Sbjct: 552 LLKFDMVAVLRELLRQNECSLALKVFEEIRKEYWYKPQVRMYTDMITVMADNSLMEEVNY 611

Query: 163 VFHELKRD-GLEPDTRAFNEMMGAYLQVDMVERAVETYELMKASGCIPDKLTFKILIKNL 222
           ++  +K + GL  +   FN ++   L   + +  ++ Y  M++ G  PD+ +F++L+  L
Sbjct: 612 LYSAMKSEKGLMAEIEWFNTLLTILLNHKLFDLVMDCYAFMQSIGYEPDRASFRVLVLGL 671

Query: 223 ERFRE-EFAAVVKKECAEFLDSPEKFL 232
           E   E   +A+V+++  E+     +F+
Sbjct: 672 ESNGEMGLSAIVRQDAHEYYGESLEFI 698

BLAST of CmoCh14G014620 vs. TAIR10
Match: AT1G62350.1 (AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 90.9 bits (224), Expect = 1.1e-18
Identity = 50/155 (32.26%), Positives = 93/155 (60.00%), Query Frame = 1

Query: 57  STESIQAVQSLKLAKSAS-KMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFM 116
           S E + A + LK  ++ S +++  I S +SRLLK+DL   LAE QRQN++ L +++++ +
Sbjct: 2   SKEGLIAAKELKRLQTQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYEVV 61

Query: 117 RNEEWYEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDM 176
           R E WY PD+  Y  M+ M+ +NK ++  ++V+ +LK++ +  D   F +++  +L  ++
Sbjct: 62  RREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEEVLFDQHTFGDLVRGFLDNEL 121

Query: 177 VERAVETYELMKASGCIPDKLTFKILIKNLERFRE 211
              A+  Y  M+ S   P  L F++++K L  + E
Sbjct: 122 PLEAMRLYGEMRESPDRPLSLPFRVILKGLVPYPE 156

BLAST of CmoCh14G014620 vs. TAIR10
Match: AT3G46870.1 (AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 90.1 bits (222), Expect = 2.0e-18
Identity = 58/206 (28.16%), Positives = 112/206 (54.37%), Query Frame = 1

Query: 18  FRQNLT-NPTLHCCTAG------------PPPDTICGLRKGQRKPLGK----SRVPSTES 77
           F QN+T NP++H  +              P P T+   R    +P G      ++   E+
Sbjct: 18  FFQNITRNPSIHRISFSNLKPKTLLHPIPPKPFTVFVSRFHDGRPRGPLWRGKKLIGKEA 77

Query: 78  IQAVQSLK-LAKSASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEE 137
           +  +  LK L +   K++  I + + RLLK D+   + EL+RQ E  L++++F+ ++ +E
Sbjct: 78  LFVILGLKRLKEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQEETALAIKMFEVIQKQE 137

Query: 138 WYEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERA 197
           WY+PD+ +Y  +I  + K+K ++ A  ++ ++K++ L PD++ + E++  +L+      A
Sbjct: 138 WYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKENLFPDSQTYTEVIRGFLRDGCPADA 197

Query: 198 VETYELMKASGCIPDKLTFKILIKNL 206
           +  YE M  S   P++L F++L+K L
Sbjct: 198 MNVYEDMLKSPDPPEELPFRVLLKGL 223

BLAST of CmoCh14G014620 vs. TAIR10
Match: AT3G27750.1 (AT3G27750.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 70.1 bits (170), Expect = 2.1e-12
Identity = 60/192 (31.25%), Positives = 91/192 (47.40%), Query Frame = 1

Query: 28  HCCTAGPPPDTICGLRKGQRK---PLGKSRVPSTESIQAVQSLKLAKSASKMEDVINSKL 87
           H  +   P  T   +R G R    PL K R+ STE+IQ++QSLK A        +    L
Sbjct: 17  HTLSVIVPKRTFVSIRCGPRDNRGPLLKGRILSTEAIQSIQSLKRAHRTGVSLSLTLRPL 76

Query: 88  SRLLKADLFDALAELQRQNELELSLQVFKFMRNEEWYEP-DLNLYHVMIQMMGKNKMIEM 147
            RL+K+DL   L EL RQ+   L++ V   +R E  Y P DL LY  ++  + +NK  + 
Sbjct: 77  RRLIKSDLISVLRELLRQDYCTLAVHVLSTLRTE--YPPLDLVLYADIVNALTRNKEFDE 136

Query: 148 AEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERAVETYELMKASG-----CIPDKLTF 207
            + +  E+       D +A  +++ A +  +  E  V  Y LM+ SG        D+   
Sbjct: 137 IDRLIGEIDGIDQRSDDKALAKLIRAVVGAERRESVVRVYTLMRESGWGSESWEADEYVA 196

Query: 208 KILIKNLERFRE 211
           ++L K L R  E
Sbjct: 197 EVLSKGLLRLGE 206

BLAST of CmoCh14G014620 vs. TAIR10
Match: AT3G53170.1 (AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 65.9 bits (159), Expect = 4.0e-11
Identity = 53/203 (26.11%), Positives = 98/203 (48.28%), Query Frame = 1

Query: 10  QLHFPQLGFRQNLTNPTLHCC--TAGPPPDTICGLR-KGQRKPLGKSRVPSTE-----SI 69
           QL F +   +Q++++P    C  T    P T+C  +   +R     S + ST        
Sbjct: 21  QLGFSRSVVQQHISSPVYFRCIPTISITP-TMCSTKVPNERTEKMNSGLISTRHQVDPKK 80

Query: 70  QAVQSLKLAKSASKMEDVINS-KLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEEW 129
           +  + L+   +   +E   NS K   L    + +AL E  ++N  + +L++F  +R + W
Sbjct: 81  ELSRILRTDAAVKGIERKANSEKYLTLWPKAVLEALDEAIKENRWQSALKIFNLLRKQHW 140

Query: 130 YEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERAV 189
           YEP    Y  + +++G  K  + A  +F  +  +GL+P    +  ++  Y + +++++A 
Sbjct: 141 YEPRCKTYTKLFKVLGNCKQPDQASLLFEVMLSEGLKPTIDVYTSLISVYGKSELLDKAF 200

Query: 190 ETYELMKA-SGCIPDKLTFKILI 203
            T E MK+ S C PD  TF +LI
Sbjct: 201 STLEYMKSVSDCKPDVFTFTVLI 222

BLAST of CmoCh14G014620 vs. NCBI nr
Match: gi|659108216|ref|XP_008454079.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Cucumis melo])

HSP 1 Score: 392.5 bits (1007), Expect = 5.2e-106
Identity = 193/230 (83.91%), Positives = 212/230 (92.17%), Query Frame = 1

Query: 1   MKSTLMGRLQLHFPQLGFRQNLTNPTLHCCTAGPPPDTICGLRKGQRKPLGKSRVPSTES 60
           MKSTLMGRLQLHFP+LG RQNLTN +LHCCTA PPP+ ICGLRKG +KPLG+SRVPS E+
Sbjct: 1   MKSTLMGRLQLHFPELGLRQNLTNRSLHCCTAAPPPNIICGLRKGLKKPLGRSRVPSNEA 60

Query: 61  IQAVQSLKLAKSASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEEW 120
           IQAVQSLKLAK  SKMEDVIN+KLSRLLKADLFDAL ELQRQNELELSLQVFKFM+NEEW
Sbjct: 61  IQAVQSLKLAKCTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEW 120

Query: 121 YEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERAV 180
           YEPDL LYH MI MMGKNKMIEMAEEVFH+L++DGLEPDTRAFNEMMGAYLQVDM+ERA 
Sbjct: 121 YEPDLRLYHGMIMMMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAA 180

Query: 181 ETYELMKASGCIPDKLTFKILIKNLERFREEFAAVVKKECAEFLDSPEKF 231
           +TY LM ASGC PDKLTFKILIKNLE+F+EEFA VVKK+C E+LD+P+KF
Sbjct: 181 DTYRLMIASGCTPDKLTFKILIKNLEKFKEEFAIVVKKDCYEYLDNPQKF 230

BLAST of CmoCh14G014620 vs. NCBI nr
Match: gi|449469204|ref|XP_004152311.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Cucumis sativus])

HSP 1 Score: 375.9 bits (964), Expect = 5.1e-101
Identity = 188/230 (81.74%), Positives = 208/230 (90.43%), Query Frame = 1

Query: 1   MKSTLMGRLQLHFPQLGFRQNLTNPTLHCCTAGPPPDTICGLRKGQRKPLGKSRVPSTES 60
           MKSTL+G LQLHF QLG RQNLTN +L C TA PPP+ ICGLRKG  +PLG SRVPS E+
Sbjct: 1   MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEA 60

Query: 61  IQAVQSLKLAKSASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEEW 120
           IQAVQSLKLAKS SKMEDVIN+KL RLLKADLFDAL+ELQRQNELELSLQVFKFM+NEEW
Sbjct: 61  IQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEW 120

Query: 121 YEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERAV 180
           +EPDL LYH MI +MGKNKMIEMAEEVFH+L++DGLEPDTRAFNEMMGAYLQVDM+ERAV
Sbjct: 121 FEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAV 180

Query: 181 ETYELMKASGCIPDKLTFKILIKNLERFREEFAAVVKKECAEFLDSPEKF 231
           ETY LM ASGC PD+LTFKILIKNLE+FREEFA VVKK+C E+LD+P+KF
Sbjct: 181 ETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKKDCNEYLDNPQKF 230

BLAST of CmoCh14G014620 vs. NCBI nr
Match: gi|645231541|ref|XP_008222444.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Prunus mume])

HSP 1 Score: 287.7 bits (735), Expect = 1.8e-74
Identity = 154/241 (63.90%), Positives = 188/241 (78.01%), Query Frame = 1

Query: 5   LMGRLQLHFPQLGFRQNLTNPTLHCCTAGP-----PPDTICGLRKGQRKPLGKSRVPSTE 64
           LMG ++ H PQLGF+QN   P L    A P     P   +CGLR G RKPL +SRV STE
Sbjct: 4   LMGSIKFHIPQLGFKQNYPEPHLRYRLALPSSSSTPSHIVCGLRGGPRKPLWRSRVLSTE 63

Query: 65  SIQAVQSLKLAKS-ASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNE 124
           +IQAVQSLKL+KS  SK+E V+  +LSRLLKADL DALAELQRQNE+EL+L+VFKF+R E
Sbjct: 64  AIQAVQSLKLSKSNPSKLEQVMGGRLSRLLKADLLDALAELQRQNEVELALKVFKFVREE 123

Query: 125 EWYEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVER 184
            WY+PDL+LY  MI ++GKNK+IEMAEE+F  LK +GLE DTRAF EM+GAY+QV M E+
Sbjct: 124 VWYKPDLSLYCSMILLLGKNKLIEMAEELFSGLKEEGLEYDTRAFTEMIGAYIQVGMTEK 183

Query: 185 AVETYELMKASGCIPDKLTFKILIKNLERF-REEFAAVVKKECAEFLDSPEKFLRGMLNR 239
           A+ETYELMKASGC+PDKLTF ILI+NLE+   EE AA VKK+CAE++DSPEKF   +  +
Sbjct: 184 AMETYELMKASGCVPDKLTFTILIRNLEKVGEEELAAHVKKDCAEYVDSPEKFFEEVARK 243

BLAST of CmoCh14G014620 vs. NCBI nr
Match: gi|596203295|ref|XP_007223738.1| (hypothetical protein PRUPE_ppa010483mg [Prunus persica])

HSP 1 Score: 278.5 bits (711), Expect = 1.1e-71
Identity = 150/232 (64.66%), Positives = 181/232 (78.02%), Query Frame = 1

Query: 6   MGRLQLHFPQLGFRQNLTNPTLHCCTAGP-----PPDTICGLRKGQRKPLGKSRVPSTES 65
           M  ++ H PQLGF++N   P L    A P     P   +CGLR G RKPL +SRV STE+
Sbjct: 1   MSSIKFHIPQLGFKKNYPEPHLRYRLALPSSSSTPSHIVCGLRGGPRKPLWRSRVLSTEA 60

Query: 66  IQAVQSLKLAKS-ASKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMRNEE 125
           IQAVQSLKL+KS  SK+E V+  +LSRLLKADL DALAELQRQNE+EL+L+VFKF+R E 
Sbjct: 61  IQAVQSLKLSKSNPSKLEQVMGGRLSRLLKADLLDALAELQRQNEVELALKVFKFVREEV 120

Query: 126 WYEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMVERA 185
           WY+PDL+LY  MI ++GKNK+IEMAEE+F  LK +GLE DTRAF EM+GAY+QV M E+A
Sbjct: 121 WYKPDLSLYCSMILLLGKNKLIEMAEELFSGLKEEGLEYDTRAFTEMIGAYIQVGMTEKA 180

Query: 186 VETYELMKASGCIPDKLTFKILIKNLERF-REEFAAVVKKECAEFLDSPEKF 231
           +ETYELMKASGC PDKLTF ILI+NLE+   EE AA +KK CAE++DSPEKF
Sbjct: 181 METYELMKASGCAPDKLTFTILIRNLEKVGEEELAAHIKKYCAEYVDSPEKF 232

BLAST of CmoCh14G014620 vs. NCBI nr
Match: gi|657955513|ref|XP_008369226.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Malus domestica])

HSP 1 Score: 272.3 bits (695), Expect = 7.9e-70
Identity = 149/235 (63.40%), Positives = 185/235 (78.72%), Query Frame = 1

Query: 1   MKSTLMGRL---QLHFPQLGFRQNLTNPTLHCCTAGPPPDTICGLRKGQRKPLGKSRVPS 60
           MKS +MG L   + H PQLG +Q+ ++        G       GLR G RKPL KSRV S
Sbjct: 1   MKSAVMGSLGCLKFHVPQLGLKQDTSSSRKFTTVCG-------GLRGGPRKPLWKSRVLS 60

Query: 61  TESIQAVQSLKLAKSA-SKMEDVINSKLSRLLKADLFDALAELQRQNELELSLQVFKFMR 120
           TE+IQAVQSLKLAKS  SK+E+V +++LSRLLKADL DALAEL RQNE++L+L+VFKF+R
Sbjct: 61  TEAIQAVQSLKLAKSTPSKLEEVFDARLSRLLKADLLDALAELHRQNEVQLALKVFKFVR 120

Query: 121 NEEWYEPDLNLYHVMIQMMGKNKMIEMAEEVFHELKRDGLEPDTRAFNEMMGAYLQVDMV 180
            E WY+PDL+LY  MI ++GKNK+IEMAEE+F  LK +GLEPDTRAF EM+GAY+QV M 
Sbjct: 121 EEVWYKPDLSLYCSMILLLGKNKLIEMAEELFSGLKEEGLEPDTRAFTEMIGAYIQVGMT 180

Query: 181 ERAVETYELMKASGCIPDKLTFKILIKNLERFREE-FAAVVKKECAEFLDSPEKF 231
           E+A++TYELMKASGC PDKLTF ILI+NLE+  EE +AA VK++CAE++DSPEKF
Sbjct: 181 EKAMKTYELMKASGCAPDKLTFTILIRNLEKAGEEDWAASVKQDCAEYVDSPEKF 228

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value  Identity  Description
PPR89_ARATH  2.0e-17  32.26     Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana GN...
PP266_ARATH  3.5e-17  28.16     Pentatricopeptide repeat-containing protein At3g46870 OS=Arabidopsis thaliana GN...
PP279_ARATH  1.1e-10  29.46     Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana GN...
PP424_ARATH  9.2e-10  30.71     Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidop...
PP186_ARATH  7.8e-09  30.21     Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana GN...

TrEMBL
Match Name        E-value   Identity  Description
A0A0A0KU23_CUCSA  3.5e-101  81.74     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G011020 PE=4 SV=1
M5XFP6_PRUPE      7.7e-72   64.66     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010483mg PE=4 SV=1
A0A0D2PRX0_GOSRA  2.8e-66   57.79     Uncharacterized protein OS=Gossypium raimondii GN=B456_005G125600 PE=4 SV=1
A0A061EK44_THECC  4.8e-66   60.34     Vacuolar sorting protein 9 domain, putative isoform 1 OS=Theobroma cacao GN=TCM_...
A5B6N8_VITVI      1.8e-65   61.67     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_017782 PE=4 SV=1

TAIR10
Match Name   E-value  Identity  Description
AT5G09320.1  1.3e-22  32.85     Vacuolar sorting protein 9 (VPS9) domain
AT1G62350.1  1.1e-18  32.26     Pentatricopeptide repeat (PPR) superfamily protein
AT3G46870.1  2.0e-18  28.16     Pentatricopeptide repeat (PPR) superfamily protein
AT3G27750.1  2.1e-12  31.25     FUNCTIONS IN: molecular_function unknown
AT3G53170.1  4.0e-11  26.11     Tetratricopeptide repeat (TPR)-like superfamily protein

NCBI nr
Match Name                        E-value   Identity  Description
gi|659108216|ref|XP_008454079.1|  5.2e-106  83.91     PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Cucumis m...
gi|449469204|ref|XP_004152311.1|  5.1e-101  81.74     PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Cucumis s...
gi|645231541|ref|XP_008222444.1|  1.8e-74   63.90     PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Prunus mu...
gi|596203295|ref|XP_007223738.1|  1.1e-71   64.66     hypothetical protein PRUPE_ppa010483mg [Prunus persica]
gi|657955513|ref|XP_008369226.1|  7.9e-70   63.40     PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Malus dom...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh14G014620.1  CmoCh14G014620.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR                  coord: 128..156  score: 4.
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2                coord: 158..205  score: 3.8
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 162..194  score: 2.0E-6
                                                                                     coord: 127..160  score: 9.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 124..158  score: 11.093
                                                                                     coord: 159..193  score: 10
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED     coord: 58..211   score: 6.7
None       No IPR available          PANTHER   PTHR24015:SF525  SUBFAMILY NOT NAMED  coord: 58..211   score: 6.7