Lsi01G017740 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi01G017740
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionPentatricopeptide repeat-containing protein, mitochondrial
Locationchr01 : 16706800 .. 16713606 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACTAGTTTGCCCTCCAAAGTAAAGTTATTCATTAATATTTTCTAAACCTCCCATTTTCTTCATTCCACACTCATCTCTTTCTCAATCTTTCTCCAACCACTCTAGTATCACTCTCCATTCTAAAAAAACTTTGGTAAGAAATTACATTTTTGTTAAAGCTTTTGAGTTTTCATGCTTATTTTTTTCTCAACATTTTTTGACAACCACACTTTTCTCTTACACTTTTTTTCATCACAAGAAAAATTATGTGAAAAAAATTACCAACCAATTATTCACGCGCTAGAATTTCCTCCATTTTTTGATGGACCTACCCATACTACTAAGACCAAAAGATATAAATTCAAATTGGGTTTTGAATCTTTTCTCATATCTTCTCGTGTGAAAAAAATTCATCGATAGGAAAGTGCAAGAATTACATAAACTCAAACTTTGTAGTGCTTGAATTTTTCTCGCAGTGTGGAAAAATTTACATAAATTCAAAACTTGAGGTGCTTGTGAGGGGAGAAAGTTCAAGAAAATTTATAGAAAATCAAAATTTAAGGTGCTACAATTTTTCTCGTTGTTTCTCGTAGCCAGAAAATGCGATAAAATTTATAGAAACTCAAAATTTAAGTTATTGGAATTTTTCTTGCATCTTCTCGCAGCGAAAAAATGTGAGAAAATTTACAAAACTCGAAATTTTTTGTGCTGAAATTATTATTACAACTTCTCGTAGCGAGAAAGTGAGAAAATGTACATTACATCAAAATTTTGAGTTGCTGGAACTTTTCTCGCAGCTTATTGCCGAGAAAATTTACAGAAACTCATAAGATTTTCAACTTTTATCTTCTATAAGTGAACAAATTGAAGATTTTATAGTAATTGTATTTAATTAGTCCAAATTCAATTATTTGTAATTTCGTCATTCATTTGAGAAATGATGTGTCAATTTGTGATTGGTCCAAAATTTCCCATTCAACGAAACCTTTGAGTTTGAATATGAGGAAGCGTCGATGAAGTCTTCAAACTAAAGCGAGCAACACCCTGCACTCAAAACTTATTTAAAGCAAGCAAAGAGAAATACAAGGTTTCTCCAAAAGCTACTTTATAAGCATCTAACTTACAGTGTCTTAATCTTAGTGCTATATACCCCTTTGATTTGTGCGTTGCCCCCAATGAGGCATCTTAAACCCACATTCTTTGACTATAATGAACACCATCAATTTTCTAGTGTCACGCTTTAGGCGATTGTGACACTTTTTTTTTTTGGACTGAACTTAAATTTCTTTAAGTTGCCTACATACCCTTGAAGAAAGGGACCAAGTCATAAAGTAGTTCAACTACGATTTTTTTCTCTTTTTACTAGACTGAACTTAAAGTTTCCTTCAATCTACCTACATACCCTTTATAATGTATAGGGATCAAGTCATAACGTATTTCAACTGAGTTCTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTGCTTTTTACTGAACTGAACCTAAAGTTTTCTTCAAGCTTCCTGGATACTCTTTATAAGGATATGGATCAAGTCATAACGTAGTTCAAAGTTTTTTTTTTTTTTTCCATTTACCTTTTAAACTCATGCGAGTTAGGAGCACTATGTAGTTTAGGCTCTTGCGATAAAGAGCTTCTACATTTAAGCTTTAGAAGCTCTTATTTCATCATGACTTTGTTCGTCTCCTTCATTTGTAGGATTCGTCAATATGATGAGTTTTGGCTTCACCATCAAGGAACCTTCTGTATTTATGTCAATAGACAACTTCCTCTTCTTGCCTGAAGGGACACGACTGTGAATCTTCTCGCCGTTGTTTTCTTCATGAAATAATTTTGCCTTCAAGATTTTTATCTTTCTTTCATGTTGACTGCTTGTCATTTTAAGACGATCAAAACTAGATGTTGAAGGTAGATCTTTCTTTTATATGGAGATACTTAACCTTTTGAAGCTTGAAGTTAGGGTGGAGGTGATTGCTAAACATTGATCTTCCTCTTTTATCTTGGTCATACCCGATCTTTGGAAGACTGAATATTGAGTAGTTGAAGGCTTGAGATGATCGAAGACAGAAGTTCATTGCTCAGGTTTTTTTTAATTGTCGACTTCTTCAACGGTTATGTGATTGTTGTTGGCAACTTTTGAGCATTCTGTAAATGGTGACTCACCCTTCTTACGTCTTGACAAATGAACATAGTGCAAGATTGAAAGACTCGGAGATTTCTCATCTTTTAAAATTCTAGTCTTTGCAGGGCTAGTGAATGCCTAACTCTTTTGAGAGGTGGAAGTACTTGTCTTTTTGCTTGACTCTTTTTTTGCATTCACATGTGATTTTAGCTTTGGATTATCCTCACCTTTCATTAGAGGGATTTTTGCAGGCATAGCTTTTCTGACATTATCAATCTTTGAGTAAAACTTTGCGTCTACAAAGTGAGACTCAGCTTCTGAGAATGCATTAGAGTCTGCTTCAACCTTCTTAATGTCATCTTGATAAAACTTGAAGCATTGGTGCAGTGTTGAAGTTATCACTCCATTTCTATGAATCCAAGGACGATCTAGTAACAACTTATAAGTGGTCTTTGAATCTATAACATGAAACAATGTGTTGGCCTTCAGTTCGCTAATGATGAGTTCTAAGGTGTCATGCCTATCGCTCTTTGGCTGCCTTGGTTGAAGCCTTGAATTACTAATTTGTTGTTTGACAGTTCTTCCTGTTGGGGTTGATGCCCTAAATCTCGTGGTCTTGTATTTTGTAAACTTGTATTGTACTAACTAAATTATTTATTTAATAAAATAATTGTTATTTTCTTTGATATTTGGTTGCATTAACCCAAAAACCAATAAACTAAGATCCTAGATTATTAGTAGTAACTTAAACACGTATGTAGAGACATACAAATGGATCGTGTTTAAGTGGTAACCTAAACGGTTTGTAGTAGATGGATAAAGCTAGGTACCTTATCCTAGTAACACTACGGATACAACCCACTTTGTGATTGTTACAAGTGTTGTAAAGTGTCACAAACGTTGATATCCTAAATGTTGATGTGGATTTGCTTGTGTACTTTTTTTAATTGCTTTCAGGAACCAAGTATTTGGGATTGGAGCTTCATGTACCTGTCTTAGATTAGATGTATTGACAACTTGCACAGATGTAAAACGTGGCATATCTACTTCTCATGTCTCATCTGAAAGTTTTCCAATTTCACAGATTGGTTTCACAACTTGATTCGTGGGCCTTCAGGACATGGTGTAGACAGATGCAGTCTTTCTTCACAAGCTGGTGCAAGAAGTAGTGGGGAGGAAGATGACTTGGAAGATGGGTTTTCTGAACTTGACTCACCCGAAAAAATAATGCCTACTCAAACTGCTGATTTAGAGGAAGATGACATGTTGATTTCTAATCTACAACTATTTGAAGACAGTGATGGTGATTCGGTAGGGCCCTCTCAAAATGATTTAGATTTATCAGAGGCTATGGATGATGTGGACGAAAAGATTCCAATAAATAGAAAGGCTTACTCGGAATTGTTTCAGGCCATAATGAGTAGTCCAGGCTTATCTGTTCATGGCGTTCTGGATAAATGGGTGAAAGAAGGAAAGGATTTAAGCAAATCTGAGATATCAATAACCATGCTTAATCTCCGTAAAAGTCGACGTTATGGAAGGGCATTACAAGTGATTTTCATCATTTTATCTGTAGTTTAACTAGTTTAGAAACTTGCTGATTTTCCTTGTAGTTTAAGTCTCAGGTAGATTACAATAGGTTTAGCCTTACACTTACACATGAAAACTCAATTTCCTAAGGTTTCAAAAAGAGAAAGAATGCATCTTTAGTCTATCCAGTTAAAAAAAAAGGGAGTAATTAGCATACAATTTTAAAGCTTGTGAATACGACCAATGCAATGAATTTACACTGTAATAAATATTATTTATTTATATGTAATTATATTTTGGAGAATAATTATGCTTTGCTCACCAGCACCGCCTTGAGGTTCTCAAAGGAAAAGATGTCTATTTGTATGTCATTTTTATTCATGAACCGCTTAAGGCATCAAATTATTATTTAATAATTTTGATGTAATTTAATAATTTTCAAATTTATCCTTGTTAATGACATTTTAAGTAGGATTATTTTAGACATTTTACTATGAATTATGATAAGTCATTAGGGTTTTATTAAGAGGCTATTTAAAGTCTTGCCTTTAGGTTGTTGGAAAACAGTTTTTGATGTTTTTAGAACTTGAATGAGAATTTTCTAGTAGATGACAGAGTTTTATCCCGCCACTTTTGCAATTGGCGATTCTTGAATCAGGGTTGCATGCCAAGAACATTCAGGTAAGTCTGATCATCTTGCTTGTGGAATGTTCGAACTTGGAGTAAGAAACTAGTTCTCCTTATCTCTTGGTCTTCGATCAAAAGGTAATCTAATTCTAGTCTTTTCCTTTGGATTGACATCAATTGATTTGGGGGTTTTGAAATTTGTTGTTAAGGGTTCAACAATTGAGTTCCTAATTTTCAAATTGCTAGGATTTTCATATAAGAACTTCTTGTCTCATAACACCTTGTGGTTACTCCCTTCACCAAAAGAACAAGATTCAAGTTCAGATTTACTACTTCTATTTTCCTTACCACACGTACCTAGATATTATTTGCACCACCATATGATGTATGGGCTATGAAGGTGATTGTAGACATCTAGCTACCTGATTAGTCTCTCTATAAATATGTTAATGTGAATCAAATCAGGGAAAAACTGTTTGTTACTTGGTCTGCTGCTACCTCATGATCATTCATTTAATTATGTGGCAGTTGACAAATATTATGGTTGCTTCTTGTATGACTTGTTATACCGATCGAATATCCCCAAAGGGATCCAAATACTTGTGCTCATGTTATATAATGCACTAACAAGTTTTGTAGTTGAAACTCCGTGATGCTATAACCCAATAGTTGTATTATCAAGAATTTTGTACTTTACCATGTTTACACCAATGTCAGTCTAAGATTACTTCATTCAATTTCCATACTCTTCTATGTGCCAGCTTGGTTGTGGCAAAAAAATTTGGTGCAAATTTGGAATATGGTTTCAAACGTGCTTAGCTTAGGGCAACTCTATGTTGGGTGACCTCTTGGNATGCTATAACCCAATAGTTGTATTATCAAGAATTTTGTACTTTACCATGTTTACACCAATGTCAGTCTAAGATTACTTCATTCAATTTCCATACTCTTCTATGTGCCAGCTTGGTTGTGGCAAAAAAAATTTGGTGCAAATTTGGAATATGGTTTCAAACGTGCTTAGCTTAGGGCAACTCTATGTTGGGTGACCTCTTGTAAATTTTCTTAGGAAACATGTGAGTGAACACAAAGCATGTTGTAAGGACCTGTATTGGTTTGTGGGGGTAGTCTTCACTCTTAAAAGCGGTTCAAGACAAGCAAGGATAACATTGCAAGGTCGTAAGTGTAAGAAGATCTGAGTTAGATGTTGAATAAAAGTATTTATTTGAGTGATAGACTTACAGTGTTGAAAGGTAGATAGGTTAAAGCGGACTGTAGATATCACAAAAGGGTAGTACTTACCGTCCTCACACCTCCTTCCAGGTGAAAAAGGTAAATTTGGGAGGGGATGTGACAGTAAGAGATGTAAGAAAATGCCGGTACCATTAGGTGCTAATTTTGAATTCTGCGCATGGGATGTTACAATTGAACTAGGGTTGAAGGTATGAAAAAAGAAAACAATTACAAAAACAAGAACTGGCGAAAGATTGAACTAGGAAGAACTTATATTGATAACTTATTCAGGAAATAAAAGTAATAGTCACAAGCGGCTCAACACCAAAATTGAAAAAACCAGCAAATAATATTTGATTTTTTTTAAAAAAGAAGATTTAAATAATAAAAAAAAATGAACAAACAATAATGAAAACCAAAAAGATAATGGAAATCGGAAAAAAAAAAATTGGATTTCAAATTGGAAAGAGAAGATAATTTAGATTTCAAATTATTTATAGGGAAAATTTTTTTAGTACTTTAGTTTTTAGAAATAGGTGCAATTAGTCCCTATAATTTTAATTTGACCATTTTAGTCCCTCACTTTTACAAAATAGGTTTAAAAGGTCCCTATTCTCCTTTACTAAATTTGTTTAATATTTTGTATTGTTTCAAAAACTAAAAAATGACATATTTAATTTTTAAAATAAAAAACAATATCTCTCCTCTCACTTTCCATCGTTCTCCTCTCTCTCCTCCTCTTCTTCCACCGTCCTTTTATCTGATCTTCATCTTTCTCTCCCATTGTCTGATCGATGAACTTGTACAACAAGGTAACAACAGTGAATGCGAATCATCCAAATTTCCGTCACCTTCTCCCCTCGTCAGAAGGCCATCACGCCGAAAAACTGCCGACACAAGTCGTTTGCTCGTCGGAAGATCAAAATACGATCACGAAGAGCAAAGAAGATAGATGA

mRNA sequence

ATGACTAATTGGTTTCACAACTTGATTCGTGGGCCTTCAGGACATGGTGTAGACAGATGCAGTCTTTCTTCACAAGCTGGTGCAAGAAGTAGTGGGGAGGAAGATGACTTGGAAGATGGGTTTTCTGAACTTGACTCACCCGAAAAAATAATGCCTACTCAAACTGCTGATTTAGAGGAAGATGACATGTTGATTTCTAATCTACAACTATTTGAAGACAGTGATGGTGATTCGGTAGGGCCCTCTCAAAATGATTTAGATTTATCAGAGGCTATGGATGATGTGGACGAAAAGATTCCAATAAATAGAAAGGCTTACTCGGAATTGTTTCAGGCCATAATGAGTAGTCCAGGCTTATCTGTTCATGGCGTTCTGGATAAATGGGTGAAAGAAGGAAAGGATTTAAGCAAATCTGAGATATCAATAACCATGCTTAATCTCCGTAAAAGTCGACGTTATGGAAGGGCATTACAAGTAACAACAGTGAATGCGAATCATCCAAATTTCCGTCACCTTCTCCCCTCGTCAGAAGGCCATCACGCCGAAAAACTGCCGACACAAGTCGTTTGCTCGTCGGAAGATCAAAATACGATCACGAAGAGCAAAGAAGATAGATGA

Coding sequence (CDS)

ATGACTAATTGGTTTCACAACTTGATTCGTGGGCCTTCAGGACATGGTGTAGACAGATGCAGTCTTTCTTCACAAGCTGGTGCAAGAAGTAGTGGGGAGGAAGATGACTTGGAAGATGGGTTTTCTGAACTTGACTCACCCGAAAAAATAATGCCTACTCAAACTGCTGATTTAGAGGAAGATGACATGTTGATTTCTAATCTACAACTATTTGAAGACAGTGATGGTGATTCGGTAGGGCCCTCTCAAAATGATTTAGATTTATCAGAGGCTATGGATGATGTGGACGAAAAGATTCCAATAAATAGAAAGGCTTACTCGGAATTGTTTCAGGCCATAATGAGTAGTCCAGGCTTATCTGTTCATGGCGTTCTGGATAAATGGGTGAAAGAAGGAAAGGATTTAAGCAAATCTGAGATATCAATAACCATGCTTAATCTCCGTAAAAGTCGACGTTATGGAAGGGCATTACAAGTAACAACAGTGAATGCGAATCATCCAAATTTCCGTCACCTTCTCCCCTCGTCAGAAGGCCATCACGCCGAAAAACTGCCGACACAAGTCGTTTGCTCGTCGGAAGATCAAAATACGATCACGAAGAGCAAAGAAGATAGATGA

Protein sequence

MTNWFHNLIRGPSGHGVDRCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDMLISNLQLFEDSDGDSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVKEGKDLSKSEISITMLNLRKSRRYGRALQVTTVNANHPNFRHLLPSSEGHHAEKLPTQVVCSSEDQNTITKSKEDR
BLAST of Lsi01G017740 vs. Swiss-Prot
Match: PP135_ARATH (Pentatricopeptide repeat-containing protein At1g80270, mitochondrial OS=Arabidopsis thaliana GN=At1g80270 PE=2 SV=1)

HSP 1 Score: 92.4 bits (228), Expect = 6.0e-18
Identity = 60/140 (42.86%), Postives = 85/140 (60.71%), Query Frame = 1

Query: 21  SLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDMLISNLQLFEDSDGDSVG 80
           +LSS AG +S  EEDDLEDGFSEL+  +    + ++D +E        +L  D +     
Sbjct: 61  ALSSSAGTKSDQEEDDLEDGFSELEGSKSGQGSTSSDEDEG-------KLSADEE----- 120

Query: 81  PSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVKEGKDLSKSEI 140
             + +LDL E   DV  K     K  SELF+ I+S+PGLS+   LDKWV+EG ++++ EI
Sbjct: 121 -EEEELDLIET--DVSRKTV--EKKQSELFKTIVSAPGLSIGSALDKWVEEGNEITRVEI 180

Query: 141 SITMLNLRKSRRYGRALQVT 161
           +  ML LR+ R YGRALQ++
Sbjct: 181 AKAMLQLRRRRMYGRALQMS 183

BLAST of Lsi01G017740 vs. Swiss-Prot
Match: PPR44_ARATH (Pentatricopeptide repeat-containing protein At1g15480, mitochondrial OS=Arabidopsis thaliana GN=At1g15480 PE=2 SV=2)

HSP 1 Score: 82.8 bits (203), Expect = 4.8e-15
Identity = 54/142 (38.03%), Postives = 78/142 (54.93%), Query Frame = 1

Query: 19  RCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDMLISNLQLFEDSDGDS 78
           R SLSS AGA+++G++DDLED   +L +P++                      +  DG+ 
Sbjct: 64  RRSLSSDAGAKTTGDDDDLEDKNVDLATPDETSS-------------------DSEDGEE 123

Query: 79  VGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVKEGKDLSKS 138
               + D++ +E    V E      K  SE+F+AI+S  GLSV   LDKWV++GKD ++ 
Sbjct: 124 FSGDEGDIEGAELELHVPES-----KRPSEMFKAIVSVSGLSVGSALDKWVEQGKDTNRK 181

Query: 139 EISITMLNLRKSRRYGRALQVT 161
           E    ML LRK R +GRALQ+T
Sbjct: 184 EFESAMLQLRKRRMFGRALQMT 181

BLAST of Lsi01G017740 vs. Swiss-Prot
Match: PP234_ARATH (Pentatricopeptide repeat-containing protein At3g15590, mitochondrial OS=Arabidopsis thaliana GN=At3g15590 PE=2 SV=1)

HSP 1 Score: 63.9 bits (154), Expect = 2.3e-09
Identity = 49/138 (35.51%), Postives = 79/138 (57.25%), Query Frame = 1

Query: 22  LSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDMLISNLQLFEDSDGDSVGP 81
           LSS A A+  G+E   E+   EL   E+ +P  + D+ E   ++ +  LFE      +G 
Sbjct: 71  LSSIADAKDKGDEVVREE---ELSESEEAVPV-SGDVPEG--VVDDDSLFEPE----LGS 130

Query: 82  SQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVKEGKDLSKSEIS 141
             +DL++ E       K P  ++  SEL+++I++    SV  VL+KWVKEGKDLS++E++
Sbjct: 131 DNDDLEIEEKHSKDGGK-PTKKRGQSELYESIVAYK--SVKHVLEKWVKEGKDLSQAEVT 190

Query: 142 ITMLNLRKSRRYGRALQV 160
           + + NLRK + Y   LQ+
Sbjct: 191 LAIHNLRKRKSYAMCLQL 195

BLAST of Lsi01G017740 vs. TrEMBL
Match: A0A061EW28_THECC (Pentatricopeptide repeat-containing protein isoform 2 (Fragment) OS=Theobroma cacao GN=TCM_024110 PE=4 SV=1)

HSP 1 Score: 135.6 bits (340), Expect = 6.9e-29
Identity = 76/158 (48.10%), Postives = 108/158 (68.35%), Query Frame = 1

Query: 5   FHNLIRGPSGHGVDRCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEE--DD 64
           F++         V RC LSSQAGA SSGEED+LEDGFSEL++P      +    ++  +D
Sbjct: 52  FYHSRHASCNFSVGRCGLSSQAGAESSGEEDELEDGFSELETPATAEKKENCSAQDKAED 111

Query: 65  MLISNLQLFEDSDGDSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVH 124
            LIS+ +L  D + D    ++N+L+LSE   D+ +K    R+  SELF+A++++PGLSVH
Sbjct: 112 GLISDPELSGDEE-DIEETAKNELELSEDETDLSDKKSSTRRIVSELFKAVIAAPGLSVH 171

Query: 125 GVLDKWVKEGKDLSKSEISITMLNLRKSRRYGRALQVT 161
            VLDKW++EGK  +++EIS+ MLNLRK R YGRALQ++
Sbjct: 172 KVLDKWLEEGKAFNRTEISVAMLNLRKRRMYGRALQLS 208

BLAST of Lsi01G017740 vs. TrEMBL
Match: A0A061EWQ2_THECC (Pentatricopeptide repeat-containing protein isoform 1 OS=Theobroma cacao GN=TCM_024110 PE=4 SV=1)

HSP 1 Score: 135.6 bits (340), Expect = 6.9e-29
Identity = 76/158 (48.10%), Postives = 108/158 (68.35%), Query Frame = 1

Query: 5   FHNLIRGPSGHGVDRCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEE--DD 64
           F++         V RC LSSQAGA SSGEED+LEDGFSEL++P      +    ++  +D
Sbjct: 54  FYHSRHASCNFSVGRCGLSSQAGAESSGEEDELEDGFSELETPATAEKKENCSAQDKAED 113

Query: 65  MLISNLQLFEDSDGDSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVH 124
            LIS+ +L  D + D    ++N+L+LSE   D+ +K    R+  SELF+A++++PGLSVH
Sbjct: 114 GLISDPELSGDEE-DIEETAKNELELSEDETDLSDKKSSTRRIVSELFKAVIAAPGLSVH 173

Query: 125 GVLDKWVKEGKDLSKSEISITMLNLRKSRRYGRALQVT 161
            VLDKW++EGK  +++EIS+ MLNLRK R YGRALQ++
Sbjct: 174 KVLDKWLEEGKAFNRTEISVAMLNLRKRRMYGRALQLS 210

BLAST of Lsi01G017740 vs. TrEMBL
Match: M5WFR3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa1027225mg PE=4 SV=1)

HSP 1 Score: 130.6 bits (327), Expect = 2.2e-27
Identity = 77/144 (53.47%), Postives = 101/144 (70.14%), Query Frame = 1

Query: 17  VDRCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDMLISNLQLFEDSDG 76
           VDR S SSQAGA SSGE+DDLEDGFSEL++    +P+  A   ED+ LIS  +L ED + 
Sbjct: 62  VDRRSFSSQAGAESSGEKDDLEDGFSELET----LPSAEASQHEDE-LISEPELSEDEE- 121

Query: 77  DSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVKEGKDLS 136
             V PSQ++L+LSE   D  EK    ++  SELF+AI++ P  SVHG LDKWVK G DL+
Sbjct: 122 -EVEPSQHELELSENGADSIEKRSPRKRNVSELFKAILAFPAFSVHGALDKWVKAGNDLN 181

Query: 137 KSEISITMLNLRKSRRYGRALQVT 161
           ++EIS+ M N RK + +GRALQ++
Sbjct: 182 RAEISLAMFNFRKRQMFGRALQLS 198

BLAST of Lsi01G017740 vs. TrEMBL
Match: A0A0D2NRM3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G225900 PE=4 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 1.8e-24
Identity = 74/156 (47.44%), Postives = 94/156 (60.26%), Query Frame = 1

Query: 5   FHNLIRGPSGHGVDRCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDML 64
           F+      S   V RC LSSQAGA SSGEEDDLEDGFSEL++          D    D  
Sbjct: 58  FYRSTHASSNFSVWRCGLSSQAGAESSGEEDDLEDGFSELETAGN--SENKRDHIAKDAT 117

Query: 65  ISNLQLFEDSDGDSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGV 124
              L   ++   D    + N+L+L  A  DV +K    R+  S LF+A++S+PGLS H V
Sbjct: 118 EDGLDSDQEFSVDVEETASNELELFGAETDVSDKKSSGRRTTSGLFKALISAPGLSTHKV 177

Query: 125 LDKWVKEGKDLSKSEISITMLNLRKSRRYGRALQVT 161
           LDKW++EGK LS++EIS   LNLRK R YGRALQ++
Sbjct: 178 LDKWLEEGKSLSRAEISSATLNLRKRRMYGRALQLS 211

BLAST of Lsi01G017740 vs. TrEMBL
Match: A0A0D2QAJ6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G225900 PE=4 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 1.8e-24
Identity = 74/156 (47.44%), Postives = 94/156 (60.26%), Query Frame = 1

Query: 5   FHNLIRGPSGHGVDRCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDML 64
           F+      S   V RC LSSQAGA SSGEEDDLEDGFSEL++          D    D  
Sbjct: 55  FYRSTHASSNFSVWRCGLSSQAGAESSGEEDDLEDGFSELETAGN--SENKRDHIAKDAT 114

Query: 65  ISNLQLFEDSDGDSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGV 124
              L   ++   D    + N+L+L  A  DV +K    R+  S LF+A++S+PGLS H V
Sbjct: 115 EDGLDSDQEFSVDVEETASNELELFGAETDVSDKKSSGRRTTSGLFKALISAPGLSTHKV 174

Query: 125 LDKWVKEGKDLSKSEISITMLNLRKSRRYGRALQVT 161
           LDKW++EGK LS++EIS   LNLRK R YGRALQ++
Sbjct: 175 LDKWLEEGKSLSRAEISSATLNLRKRRMYGRALQLS 208

BLAST of Lsi01G017740 vs. TAIR10
Match: AT1G80270.1 (AT1G80270.1 PENTATRICOPEPTIDE REPEAT 596)

HSP 1 Score: 92.4 bits (228), Expect = 3.4e-19
Identity = 60/140 (42.86%), Postives = 85/140 (60.71%), Query Frame = 1

Query: 21  SLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDMLISNLQLFEDSDGDSVG 80
           +LSS AG +S  EEDDLEDGFSEL+  +    + ++D +E        +L  D +     
Sbjct: 61  ALSSSAGTKSDQEEDDLEDGFSELEGSKSGQGSTSSDEDEG-------KLSADEE----- 120

Query: 81  PSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVKEGKDLSKSEI 140
             + +LDL E   DV  K     K  SELF+ I+S+PGLS+   LDKWV+EG ++++ EI
Sbjct: 121 -EEEELDLIET--DVSRKTV--EKKQSELFKTIVSAPGLSIGSALDKWVEEGNEITRVEI 180

Query: 141 SITMLNLRKSRRYGRALQVT 161
           +  ML LR+ R YGRALQ++
Sbjct: 181 AKAMLQLRRRRMYGRALQMS 183

BLAST of Lsi01G017740 vs. TAIR10
Match: AT1G15480.1 (AT1G15480.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 82.8 bits (203), Expect = 2.7e-16
Identity = 54/142 (38.03%), Postives = 78/142 (54.93%), Query Frame = 1

Query: 19  RCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDMLISNLQLFEDSDGDS 78
           R SLSS AGA+++G++DDLED   +L +P++                      +  DG+ 
Sbjct: 64  RRSLSSDAGAKTTGDDDDLEDKNVDLATPDETSS-------------------DSEDGEE 123

Query: 79  VGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVKEGKDLSKS 138
               + D++ +E    V E      K  SE+F+AI+S  GLSV   LDKWV++GKD ++ 
Sbjct: 124 FSGDEGDIEGAELELHVPES-----KRPSEMFKAIVSVSGLSVGSALDKWVEQGKDTNRK 181

Query: 139 EISITMLNLRKSRRYGRALQVT 161
           E    ML LRK R +GRALQ+T
Sbjct: 184 EFESAMLQLRKRRMFGRALQMT 181

BLAST of Lsi01G017740 vs. TAIR10
Match: AT3G15590.1 (AT3G15590.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 63.9 bits (154), Expect = 1.3e-10
Identity = 49/138 (35.51%), Postives = 79/138 (57.25%), Query Frame = 1

Query: 22  LSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDMLISNLQLFEDSDGDSVGP 81
           LSS A A+  G+E   E+   EL   E+ +P  + D+ E   ++ +  LFE      +G 
Sbjct: 71  LSSIADAKDKGDEVVREE---ELSESEEAVPV-SGDVPEG--VVDDDSLFEPE----LGS 130

Query: 82  SQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVKEGKDLSKSEIS 141
             +DL++ E       K P  ++  SEL+++I++    SV  VL+KWVKEGKDLS++E++
Sbjct: 131 DNDDLEIEEKHSKDGGK-PTKKRGQSELYESIVAYK--SVKHVLEKWVKEGKDLSQAEVT 190

Query: 142 ITMLNLRKSRRYGRALQV 160
           + + NLRK + Y   LQ+
Sbjct: 191 LAIHNLRKRKSYAMCLQL 195

BLAST of Lsi01G017740 vs. NCBI nr
Match: gi|470135586|ref|XP_004303594.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 146.0 bits (367), Expect = 7.4e-32
Identity = 78/150 (52.00%), Postives = 108/150 (72.00%), Query Frame = 1

Query: 13  SGHGVDRCSLSSQAGARSSGEEDDLEDGFSELDSPE--KIMPTQTADLEEDDMLISNLQL 72
           S   VDRCSLSSQAG  SSGEEDDLEDGFS+L++P   +++    A+ E +D L+S  ++
Sbjct: 65  SKFSVDRCSLSSQAGTESSGEEDDLEDGFSKLETPPTAEVVQASNANDENEDELVSGPEI 124

Query: 73  FEDSDGDSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVK 132
            +D     + PSQN+L+LSE   D+++K+   R+  S LF+AI+  PG SVH  LDKWVK
Sbjct: 125 SDDEA--DIEPSQNELELSEIETDLNDKLSPRRRDVSALFKAILEFPGFSVHSALDKWVK 184

Query: 133 EGKDLSKSEISITMLNLRKSRRYGRALQVT 161
           EG DLS++EIS+T +NLR+ R +GRALQ++
Sbjct: 185 EGNDLSRAEISLTKINLRRRRMFGRALQLS 212

BLAST of Lsi01G017740 vs. NCBI nr
Match: gi|590634360|ref|XP_007028355.1| (Pentatricopeptide repeat-containing protein isoform 1 [Theobroma cacao])

HSP 1 Score: 135.6 bits (340), Expect = 9.9e-29
Identity = 76/158 (48.10%), Postives = 108/158 (68.35%), Query Frame = 1

Query: 5   FHNLIRGPSGHGVDRCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEE--DD 64
           F++         V RC LSSQAGA SSGEED+LEDGFSEL++P      +    ++  +D
Sbjct: 54  FYHSRHASCNFSVGRCGLSSQAGAESSGEEDELEDGFSELETPATAEKKENCSAQDKAED 113

Query: 65  MLISNLQLFEDSDGDSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVH 124
            LIS+ +L  D + D    ++N+L+LSE   D+ +K    R+  SELF+A++++PGLSVH
Sbjct: 114 GLISDPELSGDEE-DIEETAKNELELSEDETDLSDKKSSTRRIVSELFKAVIAAPGLSVH 173

Query: 125 GVLDKWVKEGKDLSKSEISITMLNLRKSRRYGRALQVT 161
            VLDKW++EGK  +++EIS+ MLNLRK R YGRALQ++
Sbjct: 174 KVLDKWLEEGKAFNRTEISVAMLNLRKRRMYGRALQLS 210

BLAST of Lsi01G017740 vs. NCBI nr
Match: gi|590634363|ref|XP_007028356.1| (Pentatricopeptide repeat-containing protein isoform 2, partial [Theobroma cacao])

HSP 1 Score: 135.6 bits (340), Expect = 9.9e-29
Identity = 76/158 (48.10%), Postives = 108/158 (68.35%), Query Frame = 1

Query: 5   FHNLIRGPSGHGVDRCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEE--DD 64
           F++         V RC LSSQAGA SSGEED+LEDGFSEL++P      +    ++  +D
Sbjct: 52  FYHSRHASCNFSVGRCGLSSQAGAESSGEEDELEDGFSELETPATAEKKENCSAQDKAED 111

Query: 65  MLISNLQLFEDSDGDSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVH 124
            LIS+ +L  D + D    ++N+L+LSE   D+ +K    R+  SELF+A++++PGLSVH
Sbjct: 112 GLISDPELSGDEE-DIEETAKNELELSEDETDLSDKKSSTRRIVSELFKAVIAAPGLSVH 171

Query: 125 GVLDKWVKEGKDLSKSEISITMLNLRKSRRYGRALQVT 161
            VLDKW++EGK  +++EIS+ MLNLRK R YGRALQ++
Sbjct: 172 KVLDKWLEEGKAFNRTEISVAMLNLRKRRMYGRALQLS 208

BLAST of Lsi01G017740 vs. NCBI nr
Match: gi|645270967|ref|XP_008240694.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Prunus mume])

HSP 1 Score: 131.0 bits (328), Expect = 2.4e-27
Identity = 77/144 (53.47%), Postives = 102/144 (70.83%), Query Frame = 1

Query: 17  VDRCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDMLISNLQLFEDSDG 76
           VDR S SSQAGA SSGE+DDLEDGFSEL++    +P+  A   ED+ LIS  +L ED + 
Sbjct: 70  VDRRSFSSQAGAESSGEKDDLEDGFSELET----LPSAEASQHEDE-LISEPELSEDEE- 129

Query: 77  DSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVKEGKDLS 136
             V PSQ++L+LSE   D  EK    ++  SELF+AI++ PG SVHG LDKWVK G DL+
Sbjct: 130 -EVEPSQHELELSENGADSIEKRSPRKRNVSELFKAILAFPGFSVHGALDKWVKAGNDLN 189

Query: 137 KSEISITMLNLRKSRRYGRALQVT 161
           ++EIS+ M + RK + +GRALQ++
Sbjct: 190 RAEISLAMFHFRKRQMFGRALQLS 206

BLAST of Lsi01G017740 vs. NCBI nr
Match: gi|595818864|ref|XP_007204421.1| (hypothetical protein PRUPE_ppa1027225mg [Prunus persica])

HSP 1 Score: 130.6 bits (327), Expect = 3.2e-27
Identity = 77/144 (53.47%), Postives = 101/144 (70.14%), Query Frame = 1

Query: 17  VDRCSLSSQAGARSSGEEDDLEDGFSELDSPEKIMPTQTADLEEDDMLISNLQLFEDSDG 76
           VDR S SSQAGA SSGE+DDLEDGFSEL++    +P+  A   ED+ LIS  +L ED + 
Sbjct: 62  VDRRSFSSQAGAESSGEKDDLEDGFSELET----LPSAEASQHEDE-LISEPELSEDEE- 121

Query: 77  DSVGPSQNDLDLSEAMDDVDEKIPINRKAYSELFQAIMSSPGLSVHGVLDKWVKEGKDLS 136
             V PSQ++L+LSE   D  EK    ++  SELF+AI++ P  SVHG LDKWVK G DL+
Sbjct: 122 -EVEPSQHELELSENGADSIEKRSPRKRNVSELFKAILAFPAFSVHGALDKWVKAGNDLN 181

Query: 137 KSEISITMLNLRKSRRYGRALQVT 161
           ++EIS+ M N RK + +GRALQ++
Sbjct: 182 RAEISLAMFNFRKRQMFGRALQLS 198

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP135_ARATH6.0e-1842.86Pentatricopeptide repeat-containing protein At1g80270, mitochondrial OS=Arabidop... [more]
PPR44_ARATH4.8e-1538.03Pentatricopeptide repeat-containing protein At1g15480, mitochondrial OS=Arabidop... [more]
PP234_ARATH2.3e-0935.51Pentatricopeptide repeat-containing protein At3g15590, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A061EW28_THECC6.9e-2948.10Pentatricopeptide repeat-containing protein isoform 2 (Fragment) OS=Theobroma ca... [more]
A0A061EWQ2_THECC6.9e-2948.10Pentatricopeptide repeat-containing protein isoform 1 OS=Theobroma cacao GN=TCM_... [more]
M5WFR3_PRUPE2.2e-2753.47Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa1027225mg PE=4 SV=1[more]
A0A0D2NRM3_GOSRA1.8e-2447.44Uncharacterized protein OS=Gossypium raimondii GN=B456_002G225900 PE=4 SV=1[more]
A0A0D2QAJ6_GOSRA1.8e-2447.44Uncharacterized protein OS=Gossypium raimondii GN=B456_002G225900 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G80270.13.4e-1942.86 PENTATRICOPEPTIDE REPEAT 596[more]
AT1G15480.12.7e-1638.03 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G15590.11.3e-1035.51 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|470135586|ref|XP_004303594.1|7.4e-3252.00PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-... [more]
gi|590634360|ref|XP_007028355.1|9.9e-2948.10Pentatricopeptide repeat-containing protein isoform 1 [Theobroma cacao][more]
gi|590634363|ref|XP_007028356.1|9.9e-2948.10Pentatricopeptide repeat-containing protein isoform 2, partial [Theobroma cacao][more]
gi|645270967|ref|XP_008240694.1|2.4e-2753.47PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-... [more]
gi|595818864|ref|XP_007204421.1|3.2e-2753.47hypothetical protein PRUPE_ppa1027225mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0003677 DNA binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi01G017740.1Lsi01G017740.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 21..56
score: 4.0E-15coord: 79..158
score: 4.0
NoneNo IPR availablePANTHERPTHR24015:SF26SUBFAMILY NOT NAMEDcoord: 21..56
score: 4.0E-15coord: 79..158
score: 4.0

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Lsi01G017740Lsi03G016170Bottle gourd (USVL1VR-Ls)lsilsiB073