Cla97C02G047460.1 (mRNA) Watermelon (97103) v2

NameCla97C02G047460.1
TypemRNA
OrganismCitrullus lanatus (Watermelon (97103) v2)
Descriptionpentatricopeptide repeat-containing protein At2g28050-like
LocationCla97Chr02 : 35137272 .. 35137850 (-)
Sequence length579
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGATTCAAAACCTCCTCAAATCAGTCAAAGCCAATCCTCGATCCAATTCCTCCTCGAAGCCACTCATGGAAATCATTTCAACAATCTTAGCACCGGCATCGTTCGATTCTACTCCGGCATGCTTCATTTCTGAACTCAACCTCCATGTTCTCACTTCCATTCTCTCAAACCCAAATCTTAGATCTTTAGAATGCTTCCATTTCTTCGATTTTTGCCTTAAAAATCAGCCTCTCATGTCCTTCAAACCTGACCTTCAGTCCCATTTGATCGTCATTTGCCGGCTTCTCCAGGCGAGGATGTTCGCCGATGCCGAGATCCTCCTCAAAACTGTGTCAATTGATGGAAATCACCGCTACCCATTTGCTGTTATTGCTTCTAATGTTGAAATTTCTTGTCCAGAATGGAAGGTTAAGGTAAAGTTTTTCAAATTCATGCTTGCGCTGTACTCGGAGAACGGGTTCTTCGACTCTGTTTCTGAAACGTTTAGCTATATGAAGAATAATGGGATTATGATCGACGATCATACTTGTACTGTGCATTTACCATCCCTTAAAGGATCTAACAGACGCATTTAA

mRNA sequence

ATGTCGATTCAAAACCTCCTCAAATCAGTCAAAGCCAATCCTCGATCCAATTCCTCCTCGAAGCCACTCATGGAAATCATTTCAACAATCTTAGCACCGGCATCGTTCGATTCTACTCCGGCATGCTTCATTTCTGAACTCAACCTCCATGTTCTCACTTCCATTCTCTCAAACCCAAATCTTAGATCTTTAGAATGCTTCCATTTCTTCGATTTTTGCCTTAAAAATCAGCCTCTCATGTCCTTCAAACCTGACCTTCAGTCCCATTTGATCGTCATTTGCCGGCTTCTCCAGGCGAGGATGTTCGCCGATGCCGAGATCCTCCTCAAAACTGTGTCAATTGATGGAAATCACCGCTACCCATTTGCTGTTATTGCTTCTAATGTTGAAATTTCTTGTCCAGAATGGAAGGTTAAGGTAAAGTTTTTCAAATTCATGCTTGCGCTGTACTCGGAGAACGGGTTCTTCGACTCTGTTTCTGAAACGTTTAGCTATATGAAGAATAATGGGATTATGATCGACGATCATACTTGTACTGTGCATTTACCATCCCTTAAAGGATCTAACAGACGCATTTAA

Coding sequence (CDS)

ATGTCGATTCAAAACCTCCTCAAATCAGTCAAAGCCAATCCTCGATCCAATTCCTCCTCGAAGCCACTCATGGAAATCATTTCAACAATCTTAGCACCGGCATCGTTCGATTCTACTCCGGCATGCTTCATTTCTGAACTCAACCTCCATGTTCTCACTTCCATTCTCTCAAACCCAAATCTTAGATCTTTAGAATGCTTCCATTTCTTCGATTTTTGCCTTAAAAATCAGCCTCTCATGTCCTTCAAACCTGACCTTCAGTCCCATTTGATCGTCATTTGCCGGCTTCTCCAGGCGAGGATGTTCGCCGATGCCGAGATCCTCCTCAAAACTGTGTCAATTGATGGAAATCACCGCTACCCATTTGCTGTTATTGCTTCTAATGTTGAAATTTCTTGTCCAGAATGGAAGGTTAAGGTAAAGTTTTTCAAATTCATGCTTGCGCTGTACTCGGAGAACGGGTTCTTCGACTCTGTTTCTGAAACGTTTAGCTATATGAAGAATAATGGGATTATGATCGACGATCATACTTGTACTGTGCATTTACCATCCCTTAAAGGATCTAACAGACGCATTTAA

Protein sequence

MSIQNLLKSVKANPRSNSSSKPLMEIISTILAPASFDSTPACFISELNLHVLTSILSNPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTVSIDGNHRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIMIDDHTCTVHLPSLKGSNRRI
BLAST of Cla97C02G047460.1 vs. NCBI nr
Match: XP_022943381.1 (pentatricopeptide repeat-containing protein At2g28050 [Cucurbita moschata])

HSP 1 Score: 295.8 bits (756), Expect = 1.0e-76
Identity = 154/193 (79.79%), Postives = 166/193 (86.01%), Query Frame = 0

Query: 1   MSIQNL---LKSVKANPRSNSSSKPLMEIISTILAPASFDSTPACFISELNLHVLTSILS 60
           MSI NL   LK+VK N +SN SSKPL EIISTILAPA FDST +CFIS+LN HV TSILS
Sbjct: 1   MSIPNLLKTLKTVKGNCQSNPSSKPLAEIISTILAPAPFDSTESCFISQLNPHVFTSILS 60

Query: 61  NPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTVSIDGN 120
           NPNLRS ECFHFFDFCLKNQPLMSF PDLQ+HL VICRLL+ARMF+DA  LLKTVSIDGN
Sbjct: 61  NPNLRSSECFHFFDFCLKNQPLMSFTPDLQAHLTVICRLLKARMFSDAASLLKTVSIDGN 120

Query: 121 HRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIMIDDHT 180
            RY FA+IAS VE  C E KVKVKFF FMLALYSE+GFFDSVSETFSYMKNNGIMID+ +
Sbjct: 121 LRYSFAIIASTVESCCQERKVKVKFFNFMLALYSESGFFDSVSETFSYMKNNGIMIDEQS 180

Query: 181 CTVHLPSLKGSNR 191
           CTVHL SLKGSN+
Sbjct: 181 CTVHLLSLKGSNQ 193

BLAST of Cla97C02G047460.1 vs. NCBI nr
Match: XP_023511890.1 (pentatricopeptide repeat-containing protein At2g28050 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 294.3 bits (752), Expect = 3.0e-76
Identity = 154/193 (79.79%), Postives = 165/193 (85.49%), Query Frame = 0

Query: 1   MSIQNL---LKSVKANPRSNSSSKPLMEIISTILAPASFDSTPACFISELNLHVLTSILS 60
           MSI NL   LK+VK N +SN SS+PL EIISTILAPA FDST +CFIS+LN  VLTSILS
Sbjct: 1   MSIPNLLKTLKTVKGNRQSNPSSRPLAEIISTILAPAPFDSTVSCFISQLNPRVLTSILS 60

Query: 61  NPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTVSIDGN 120
           NPNLRS ECFHFFDFCLKNQPLMSF PDLQ+HL VICRLL+ARMF+DA  LLKTVSIDGN
Sbjct: 61  NPNLRSSECFHFFDFCLKNQPLMSFTPDLQAHLTVICRLLKARMFSDAASLLKTVSIDGN 120

Query: 121 HRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIMIDDHT 180
            RY FA+IAS VE  C E KVKVKFF FMLALYSENGFFDSVSETFSYMKNNGIMID+  
Sbjct: 121 LRYSFAIIASTVESCCQERKVKVKFFNFMLALYSENGFFDSVSETFSYMKNNGIMIDEQR 180

Query: 181 CTVHLPSLKGSNR 191
           CTVHL SLKGSN+
Sbjct: 181 CTVHLLSLKGSNQ 193

BLAST of Cla97C02G047460.1 vs. NCBI nr
Match: XP_022985570.1 (pentatricopeptide repeat-containing protein At2g28050 [Cucurbita maxima])

HSP 1 Score: 292.7 bits (748), Expect = 8.9e-76
Identity = 152/193 (78.76%), Postives = 165/193 (85.49%), Query Frame = 0

Query: 1   MSIQNL---LKSVKANPRSNSSSKPLMEIISTILAPASFDSTPACFISELNLHVLTSILS 60
           MSI NL   LK+VK N +SN SSKPL+EIISTILAPA F+S  +CFIS+L  HVLTSILS
Sbjct: 1   MSIPNLLKTLKTVKGNRQSNPSSKPLVEIISTILAPAPFNSNESCFISQLKPHVLTSILS 60

Query: 61  NPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTVSIDGN 120
           NPNLRS ECFHFFDFCLKNQPLMSF PDLQ+HL VICRLL+ARMF+DA  LLKTVSIDGN
Sbjct: 61  NPNLRSSECFHFFDFCLKNQPLMSFTPDLQAHLTVICRLLKARMFSDAASLLKTVSIDGN 120

Query: 121 HRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIMIDDHT 180
            RYPFA+IAS VE  C E KVKVKFF FML LYSE GFFDSVSETFSYMKNNGIMID+ +
Sbjct: 121 LRYPFAIIASTVESCCQERKVKVKFFNFMLTLYSEYGFFDSVSETFSYMKNNGIMIDEQS 180

Query: 181 CTVHLPSLKGSNR 191
           CTVHL SLKGSN+
Sbjct: 181 CTVHLLSLKGSNQ 193

BLAST of Cla97C02G047460.1 vs. NCBI nr
Match: XP_022140390.1 (pentatricopeptide repeat-containing protein At2g28050 [Momordica charantia])

HSP 1 Score: 262.3 bits (669), Expect = 1.3e-66
Identity = 140/193 (72.54%), Postives = 156/193 (80.83%), Query Frame = 0

Query: 1   MSIQNL---LKSVKANPRSNSSSKPLMEIISTILAPASFDSTPACFISELNLHVLTSILS 60
           MSIQNL   LK+VK N +SN SSKP++EIISTILAPA  DS     IS+LN H L SILS
Sbjct: 1   MSIQNLLKTLKTVKGNRQSNPSSKPILEIISTILAPAPSDSVATSLISQLNPHGLRSILS 60

Query: 61  NPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTVSIDGN 120
           +PNL S ECFHFFDF LKNQ L+SFKPDLQ+HL VICRLL+ RMF+DAE LLKTVSID N
Sbjct: 61  DPNLGSSECFHFFDFVLKNQSLVSFKPDLQAHLTVICRLLKERMFSDAERLLKTVSIDSN 120

Query: 121 HRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIMIDDHT 180
             YPFAVIAS VE  C E KVKVKFF FM+A+YS NGFFDSVSE FSYMKNNGI ID+ T
Sbjct: 121 CCYPFAVIASTVENCCRERKVKVKFFNFMMAMYSNNGFFDSVSEIFSYMKNNGIKIDEKT 180

Query: 181 CTVHLPSLKGSNR 191
           CTVHL +LKGS++
Sbjct: 181 CTVHLLALKGSDQ 193

BLAST of Cla97C02G047460.1 vs. NCBI nr
Match: XP_021277830.1 (pentatricopeptide repeat-containing protein At2g28050 [Herrania umbratica])

HSP 1 Score: 182.6 bits (462), Expect = 1.3e-42
Identity = 100/195 (51.28%), Postives = 134/195 (68.72%), Query Frame = 0

Query: 1   MSIQNLLKSVKANPRSNSSS------KPLMEIISTILAPASFDSTPACFISELNLHVLTS 60
           M++QN LK++K   +S  S       +PL ++IS IL  +      A  IS+LN      
Sbjct: 1   MTLQNFLKTLKTVKKSYPSGLSSKPYQPLPDLISQILTTSK--PLDASTISDLNPTTFQY 60

Query: 61  ILSNPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTVSI 120
           IL+NP+L++ +CF FF+  +KNQ L+SFKPDLQ+HL + CRLL+AR+F+DAE  LK+VS+
Sbjct: 61  ILTNPDLKASKCFRFFNLVIKNQSLVSFKPDLQAHLTLTCRLLKARLFSDAEATLKSVSV 120

Query: 121 DGNHRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIMID 180
           D + RYPF VIAS VE  C E KV  KF+  ML +YS+NG F  V +TF YMKNNGI ID
Sbjct: 121 DESLRYPFLVIASAVENCCFESKVITKFYNLMLKVYSDNGKFGEVLKTFDYMKNNGIKID 180

Query: 181 DHTCTVHLPSLKGSN 190
           + TCTVHL +LKG++
Sbjct: 181 ERTCTVHLIALKGAD 193

BLAST of Cla97C02G047460.1 vs. TrEMBL
Match: tr|A0A061GX52|A0A061GX52_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao OX=3641 GN=TCM_042124 PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 1.5e-42
Identity = 99/195 (50.77%), Postives = 133/195 (68.21%), Query Frame = 0

Query: 1   MSIQNLLKSVKANPRSNSSS------KPLMEIISTILAPASFDSTPACFISELNLHVLTS 60
           M++QN LK++K   +S  S       +PL + IS  L  +      A  IS+LN      
Sbjct: 95  MTLQNFLKTLKTVKKSYPSGLSSKPYQPLPDFISQFLTTSK--PLDASTISDLNPTTFHD 154

Query: 61  ILSNPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTVSI 120
           IL+NP+L++ +CF FF+  +KNQ L+SFKPDLQ+HL + CRLL+AR+F+DAE +LK+VS+
Sbjct: 155 ILTNPDLKASKCFRFFNLVIKNQSLVSFKPDLQAHLTLTCRLLKARLFSDAEAMLKSVSV 214

Query: 121 DGNHRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIMID 180
           D + RYPF VIAS VE  C E KV  KF+  ML +YS+NG F  V +TF YMKNNGI ID
Sbjct: 215 DESLRYPFLVIASAVENCCFESKVITKFYNLMLKVYSDNGKFGEVLKTFDYMKNNGIKID 274

Query: 181 DHTCTVHLPSLKGSN 190
           + TCTVHL +LKG++
Sbjct: 275 ERTCTVHLIALKGAD 287

BLAST of Cla97C02G047460.1 vs. TrEMBL
Match: tr|A0A0D2QDH4|A0A0D2QDH4_GOSRA (Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_009G110100 PE=4 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 1.2e-41
Identity = 98/192 (51.04%), Postives = 129/192 (67.19%), Query Frame = 0

Query: 1   MSIQNLLKSVKANPRSNSSS------KPLMEIISTILAPASFDSTPACFISELNLHVLTS 60
           M++QN L ++K   +S++S+      + L + IS  L PA+     A  +  L       
Sbjct: 1   MTLQNFLNTLKTVKKSSASTLSPKHFQHLPDFISHFLKPAT--PLDASALPSLTPTTFRD 60

Query: 61  ILSNPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTVSI 120
           ILSNP+L++ +CF FF+F   NQ L+SFKP LQ HLI+ICRLL+AR+FADAE +LKT+S+
Sbjct: 61  ILSNPDLKASKCFRFFNFVANNQSLLSFKPHLQDHLILICRLLKARLFADAEAMLKTLSV 120

Query: 121 DGNHRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIMID 180
           D N RYPF VIAS VE  C E KV  K F FML +YS+NG F   S+TF YMK+NGI I+
Sbjct: 121 DENLRYPFLVIASAVENCCFESKVTTKLFNFMLKVYSDNGNFSEASKTFDYMKDNGIKIN 180

Query: 181 DHTCTVHLPSLK 187
           + TCTVHL +LK
Sbjct: 181 ERTCTVHLNTLK 190

BLAST of Cla97C02G047460.1 vs. TrEMBL
Match: tr|A0A1R3HAK7|A0A1R3HAK7_COCAP (Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_20560 PE=4 SV=1)

HSP 1 Score: 175.3 bits (443), Expect = 1.4e-40
Identity = 102/197 (51.78%), Postives = 130/197 (65.99%), Query Frame = 0

Query: 1   MSIQNL---LKSVKANPRSNSSSKPLMEI---ISTILAPAS--FDSTPACFISELNLHVL 60
           M++QN+   LK+ K    S  SS P   I   IS +LA +    DST    +S+LN    
Sbjct: 1   MTLQNVARTLKTAKGRCPSTLSSNPYQHIPAFISQLLATSKPFNDST----LSDLNPATF 60

Query: 61  TSILSNPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTV 120
             IL+NP+L++ +CFHFFDF LKNQ L+ FKPDLQ+HL +I RLLQAR+F +AE L K+V
Sbjct: 61  HDILANPDLKASKCFHFFDFVLKNQSLVPFKPDLQAHLTLIGRLLQARLFRNAEALFKSV 120

Query: 121 SIDGNHRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIM 180
           S+D N RYPF  IAS VE  C E K+  K F  ML +YS+NG F  V + F YMKN GIM
Sbjct: 121 SVDENFRYPFLDIASAVENCCFEPKIMTKIFNSMLKVYSDNGKFGEVLKVFDYMKNKGIM 180

Query: 181 IDDHTCTVHLPSLKGSN 190
           ID+ TCTVHL +L G++
Sbjct: 181 IDERTCTVHLHALNGAD 193

BLAST of Cla97C02G047460.1 vs. TrEMBL
Match: tr|A0A1U8PQB6|A0A1U8PQB6_GOSHI (pentatricopeptide repeat-containing protein At2g28050-like OS=Gossypium hirsutum OX=3635 GN=LOC107960772 PE=4 SV=1)

HSP 1 Score: 173.3 bits (438), Expect = 5.2e-40
Identity = 98/194 (50.52%), Postives = 128/194 (65.98%), Query Frame = 0

Query: 1   MSIQNLLKSVKANPRSNSSSKP------LMEIISTILAPASFDSTP--ACFISELNLHVL 60
           M++QN L ++K   +S++S+ P      L ++IS  L P    STP  A  +  L     
Sbjct: 1   MTLQNFLNTLKTVKKSSASTLPPKHFQHLPDLISHFLKP----STPLDASALPSLTPTTF 60

Query: 61  TSILSNPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTV 120
             ILSNP+L++ +CF FF+F   NQ L+SFKP LQ HL +I  LL+AR+FADAE +LKT+
Sbjct: 61  RDILSNPDLKASKCFRFFNFVANNQSLLSFKPHLQDHLTLISGLLKARLFADAEAMLKTL 120

Query: 121 SIDGNHRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIM 180
           S+D N RYPF VIAS VE  C E KV  K F FML +YS+NG F   S+TF YMK+NGI 
Sbjct: 121 SVDENLRYPFLVIASAVENCCFESKVTTKLFNFMLKVYSDNGNFSEASKTFDYMKDNGIK 180

Query: 181 IDDHTCTVHLPSLK 187
           I++ TCTVHL +LK
Sbjct: 181 INERTCTVHLNTLK 190

BLAST of Cla97C02G047460.1 vs. TrEMBL
Match: tr|A0A2P5X4S4|A0A2P5X4S4_GOSBA (Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_AA22347 PE=4 SV=1)

HSP 1 Score: 173.3 bits (438), Expect = 5.2e-40
Identity = 98/194 (50.52%), Postives = 128/194 (65.98%), Query Frame = 0

Query: 1   MSIQNLLKSVKANPRSNSSS------KPLMEIISTILAPASFDSTP--ACFISELNLHVL 60
           M++QN L ++K   +S++S+      + L + IS  L P    STP  A  +  L     
Sbjct: 1   MTLQNFLNTLKTVKKSSASTLSPKHFQHLPDFISHFLKP----STPLDASALPSLTPTTF 60

Query: 61  TSILSNPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTV 120
             ILSNP+L++ +CF FF+F   NQ L+SFKP LQ HLI+I RLL+ R+FADAE +LKT+
Sbjct: 61  RDILSNPDLKASKCFRFFNFVANNQSLLSFKPHLQDHLILISRLLKTRLFADAEAMLKTL 120

Query: 121 SIDGNHRYPFAVIASNVEISCPEWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIM 180
           S+D N RYPF VIAS VE  C E KV  K F FML +YS+NG F   S+TF YMK+NGI 
Sbjct: 121 SVDENLRYPFLVIASAVENCCFESKVTTKLFNFMLKVYSDNGNFSEASKTFDYMKDNGIK 180

Query: 181 IDDHTCTVHLPSLK 187
           I++ TCTVHL +LK
Sbjct: 181 INERTCTVHLNTLK 190

BLAST of Cla97C02G047460.1 vs. Swiss-Prot
Match: sp|Q9ZUU7|PP174_ARATH (Pentatricopeptide repeat-containing protein At2g28050 OS=Arabidopsis thaliana OX=3702 GN=At2g28050 PE=3 SV=1)

HSP 1 Score: 133.7 bits (335), Expect = 2.3e-30
Identity = 82/190 (43.16%), Postives = 117/190 (61.58%), Query Frame = 0

Query: 1   MSIQNLLKSVKANPRSNSSSKPLMEIISTILAPASFDST---PACFISELNLHVLTSILS 60
           M+ Q  LK++K   ++ + S     I   +L  +S + T   P   +S+LNL  L  ILS
Sbjct: 1   MTPQCFLKTLKTAKQTYTFS-----IYKLLLNSSSINQTLASPETPLSDLNLSTLRRILS 60

Query: 61  NPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTVSIDGN 120
           +P+++S +C   F+F L+N  L SF+PDL++HL +  R+L  R F+ A+ LLK V+ID  
Sbjct: 61  DPDIKSWKCISLFNFILENPSLFSFQPDLRTHLSLTFRVLSERRFSYAKELLKPVAIDDI 120

Query: 121 HRYPFAVIASNVEISCP-EWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIMIDDH 180
            RYPF VI S+V   C  E KV  +FF  M+ +YS+NG F  V E F YMKNN + ID+ 
Sbjct: 121 LRYPFNVIVSSVIDECGCEKKVVGRFFNSMIMVYSDNGKFSEVVEVFEYMKNNEVKIDEK 180

Query: 181 TCTVHLPSLK 187
           TCT+HL +LK
Sbjct: 181 TCTLHLLNLK 185

BLAST of Cla97C02G047460.1 vs. TAIR10
Match: AT2G28050.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 133.7 bits (335), Expect = 1.2e-31
Identity = 82/190 (43.16%), Postives = 117/190 (61.58%), Query Frame = 0

Query: 1   MSIQNLLKSVKANPRSNSSSKPLMEIISTILAPASFDST---PACFISELNLHVLTSILS 60
           M+ Q  LK++K   ++ + S     I   +L  +S + T   P   +S+LNL  L  ILS
Sbjct: 1   MTPQCFLKTLKTAKQTYTFS-----IYKLLLNSSSINQTLASPETPLSDLNLSTLRRILS 60

Query: 61  NPNLRSLECFHFFDFCLKNQPLMSFKPDLQSHLIVICRLLQARMFADAEILLKTVSIDGN 120
           +P+++S +C   F+F L+N  L SF+PDL++HL +  R+L  R F+ A+ LLK V+ID  
Sbjct: 61  DPDIKSWKCISLFNFILENPSLFSFQPDLRTHLSLTFRVLSERRFSYAKELLKPVAIDDI 120

Query: 121 HRYPFAVIASNVEISCP-EWKVKVKFFKFMLALYSENGFFDSVSETFSYMKNNGIMIDDH 180
            RYPF VI S+V   C  E KV  +FF  M+ +YS+NG F  V E F YMKNN + ID+ 
Sbjct: 121 LRYPFNVIVSSVIDECGCEKKVVGRFFNSMIMVYSDNGKFSEVVEVFEYMKNNEVKIDEK 180

Query: 181 TCTVHLPSLK 187
           TCT+HL +LK
Sbjct: 181 TCTLHLLNLK 185

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022943381.11.0e-7679.79pentatricopeptide repeat-containing protein At2g28050 [Cucurbita moschata][more]
XP_023511890.13.0e-7679.79pentatricopeptide repeat-containing protein At2g28050 [Cucurbita pepo subsp. pep... [more]
XP_022985570.18.9e-7678.76pentatricopeptide repeat-containing protein At2g28050 [Cucurbita maxima][more]
XP_022140390.11.3e-6672.54pentatricopeptide repeat-containing protein At2g28050 [Momordica charantia][more]
XP_021277830.11.3e-4251.28pentatricopeptide repeat-containing protein At2g28050 [Herrania umbratica][more]
Match NameE-valueIdentityDescription
tr|A0A061GX52|A0A061GX52_THECC1.5e-4250.77Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao OX=3641... [more]
tr|A0A0D2QDH4|A0A0D2QDH4_GOSRA1.2e-4151.04Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_009G110100 PE=4 ... [more]
tr|A0A1R3HAK7|A0A1R3HAK7_COCAP1.4e-4051.78Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_20560 PE=4 ... [more]
tr|A0A1U8PQB6|A0A1U8PQB6_GOSHI5.2e-4050.52pentatricopeptide repeat-containing protein At2g28050-like OS=Gossypium hirsutum... [more]
tr|A0A2P5X4S4|A0A2P5X4S4_GOSBA5.2e-4050.52Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_AA22347 PE=4 SV... [more]
Match NameE-valueIdentityDescription
sp|Q9ZUU7|PP174_ARATH2.3e-3043.16Pentatricopeptide repeat-containing protein At2g28050 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
AT2G28050.11.2e-3143.16Pentatricopeptide repeat (PPR) superfamily protein[more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cla97C02G047460Cla97C02G047460gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C02G047460.1.CDS.1Cla97C02G047460.1.CDS.1CDS


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C02G047460.1.exon.1Cla97C02G047460.1.exon.1exon


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cla97C02G047460.1Cla97C02G047460.1-proteinpolypeptide


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 48..190
e-value: 3.6E-5
score: 25.1
NoneNo IPR availablePANTHERPTHR24015:SF418SUBFAMILY NOT NAMEDcoord: 44..189
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 44..189