CsaV3_6G016010 (gene) Cucumber (Chinese Long) v3

NameCsaV3_6G016010
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionPentatricopeptide repeat-containing protein, chloroplastic
Locationchr6 : 11988337 .. 11992605 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTTTTTTTATTGAGAGAGAGAAAGAGTAACATCATATATATAATGAGTTGTCAATGTAGACTTTGTTTTGGTTGTTGTAAATTATGGATTAATGAAATGTTTCAAGCTTTCAATTTAATTATGATTTGTATAATCAAATAACTTGGTTTGTTTGGGATAATTCGAGTTCGGTGGAATACTAATTAATACGGTTGACTATTCAAGAATTTATTCGATAATTTTTTCTTTTTTGATTTTTTTTTTTCATTGATCAATATATCATGTTTGGAGATCACTCACGGTAATTTTGAATGGTAGTTTTTGTAGTTTGAAAGAGTAAAATGATATGTGTTGATAATATTAGATGTGAGGTAAAAGGTAAATGAATGTTTCGATGATAAAAGTTTGTATTTAGTTTTGTTGTATTTAGGAGAGTAGGTAATATTAATATTATTCATGAAATGAATTGAGAATTGAGAATTGAGAATTGAGATGGTGTGGTATGCTTTATAATATATATCCTATGAATTTAAACTCATTTAATTCACTTTACTTTACTTTACTTCTCTTCATGAACACGTAATCAAAATGAATAAATTAAGTTAAAGTTAAAAAGAAAAAAAAATGGTATAAAACTCCAACACGAATGCTTTGTCTTCCAGTGGCGCCAAAAACGCTGCCATTGCGACCACTCACTCCGACCTCCCACCCTTCACCCTTCACCCTTCACCCTTCACCCTTCTCTCTATTCTCCCCCACAACCTTAGGGTTCCTTTCTTTTCTTCTCCGATCATCAATTCACTTTAATTCAATCCAATTCACCGCTTCTTCAACCCCCTTCTCCCTTTCTCTCTCTCTATTCTTCTCTCTTCTCTCTTCCGCTTTTCGTTTCCTTCATTTTCTGTTTCTTTTGTTACTGTTTTTTAACTGCTTCAAATTTCTCTCCTAATTCATGCGCCGGGAAACCCTTTTCTTCATTTCTTCCTTTCTTTACTTTCAAGTTTAGTGCCTTTGGATTCAAACTTCGAAATGATTTGTGCCCAGGGCTTTACTCCCTTAACCCAATTTGGGTTTTCATTTTCTTTATCTTCTCCATTGGAATCTCAGAGATGTGGGTTTTCTACTCCCCGATTGTATATGGTTTCTCCTATTTCTTGCAATTACCAGGATTCTACTTTCTCTGTTTCCAGAGCTGCTAAGTTTCGGGATTTAAGGTTGTTCAAATCGGTTGAGTTGGACCAGTTCATCACCAGTGATGATGAAGATGAAATGGGTGATGGGTTTTTTGAGGCCATTGAGGAATTGGAACGGATGACGAGGGAACCATCGGATGTTCTTGAAGAAATGAACGATCGCCTTTCCGCTAGGGAAATTCAGCTTGTGTTGGTCTACTTCTCTCAAGAAGGTAGAGATTCATGGTGTGCTCTTGAGGTTTTTGAGTGGCTTCAAAAGGAAAATCGGGTTGATAAGGAGACCATGGAGCTTATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTGGAGGGACGACATAATGTTGGAGATGTTGTTGACCTTCTTGTGGATATGGATTGTGTTGGTTTGAAACCTCATTTTAGCATGATAGAAAAGGTTATCTCTTTGTACTGGGAAATGGGTGAGAAGGAAAAAGCGGTTTTCTTTGTGAAAGAGGTTTTGGGGCGCAATCTTGCTTTTATGAAGGATGATTGGGAGGGGCATAAAGGAGGACCGAGCGGTTATCTTGCATGGAAGATGATGGTAAGCCTTCAGCTAATCAAGATTTTCGTATCTTTAGCCATTAACTTGCTTCAGAGTCAACAATGGTATTCTTTTGCTGTGTCTATCTATTATTGCATTGAAAATTCATATCATGAAATCTAAATATCTTCAATCAATGTTTACTAGCTTTGAATGTCATATTTTGAATCTTGAAATTCCTGATTTTAGAGATTCCTTACGTTTACATTCTTTATTTTACTCCTGTAGTTTAATCATTTAGGTTAGTATCTTGATCTTATTAGATTTTTTTTGTTATTTGAGCATCAGATTTTTGTTGTGGTGACTTTGGTCAGTGTTGAAAAGAATATTTTCTGGGGTTGTGTTTGTATAATCATAATTGGAGGGGTTCAGGAAGAACTATGTAGGTTTTCTTTTTTATCAGAGAGCTAGGTGGGCGTTCAGTATGTACTATGATTTGGTATTAGTACTTTTGATACATATGGTTAAGCATGTTTGAGTCTATTGTTATGTGTGCCTCAAAAAAGAAAAAGAGAATCAGTGTTGGTGATACTGAAGTTTATTCCTGCAGACGTAGGATTTGGGAAACCCATGAGTATCTTAGGGACCTTTTCACTTGAACTTTGTTGTCGTTAGTTTCATTTATTGAGAACTTTTGGGCTTATCTTTGTTCATTGTCAATTTCTTTGGAGCTTCATCTATCATCAGTGAAAGTATCCATCTTTATGGTCTCTTCTAGAAGCCATTTTTAGATTGTTGAATTGCCTTTCTATGTTAGTATATGCGGTGTATTAAACTTCCATATCATAGTGCTACTTCTGGTATTCTCTTGCTTCTATTAGTTGCACTCCTGCATACATGATTTATTTTGAACAACATCGAGTAACTGTATGTGGCGCTGGAATCTAGTATTATAGCACACTACTGATACTTTTGTTCCCTTCACCAGTTAATGGTAGTTGATTAGTTGCACAAGGATTTTAAAAAGTTTTATCAAATTGTTATGTTTGTCCAGCACGACCCCAAGTGAAAATTCAATTGCTAATGATCTCGAAAATAAATTCTCTTAGGTTGATGGTGACTACAGAGGTGCAGTGAAAATGGTGCTGCATCTTAGAGAATCTGGGTTAAGGCCAGAGGTTTACTCCTATCTTATTGCCATGACTGCTGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTTCGGAAACTCAAAGGCTACGCAAGAGACGGGTTTGTGGCTGAACTTGATAAAAACAATGTTGAACTTGTTGCAAAGTATCAGACAGAGCTTCTAGCTGATGGAGTGCAGTTATCCAACTGGGTACTTGAAGAGGGAAGCTCTTCAATTCGTGGGGTGGTTCACGAGAGACTCCTTGCAATGTACATTTGTGCTGGGCAAGGAGTCGAGGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTTGGCAAGGAGGCCGATGCTGATCTCTACGATATTGTTCTAGCCATTTGTGCTTCACAGAAGGAGACAAAAGCAATGAAACGGTTGCTTACCAGGATCGAGATTACGAGTCCCATGATTAAGAAAAAGAGTTTGACATGGCTACTTAGGGGTTACATCAAAGGAGGGCATTTCCGTGATGCTGCAGGAACATTAGTAAAAATGATCAATTTGGGTTTTCTCCCAGAGTACTTGGACAGAGTAGCTGTACTGCAAGGATTAAGAAAAGAAATTCGTGAACCAGAAAGTGTTCATACGTACCTCGATCTCTGCAAGTGTCTTTCTGATGCCAATCTAATTGGACCTAGTCTTGTATATTTGCACTTACAGAAACACAAGCTCTGGATCATTAAAATGCTTTGAAGAAACTCCTCAACACCTACCTCTTTGCACAGGCAGCTAATAAAAGTGGATCAAAAACCATATTCATATCATACAGCACCAGCTCTTTTTTAGGTGCTTTTACATGTTGATCTTGTATAGTTTGAAGCTCTGTAAGCACTTGAAGAGGAATACTTGTGTATATATATAAATATACATGTATGAGCAACGACTGTGCACAAAGAACCAATGTTTCAATGTATAGATTGTATAAAGAAATTACATATTCTGATATTTTTAGTGTACATGCCTTGGTTTCTTCCGTGTTTTCATGTGTGAAGAATCTGGTTTTCATTGTAAAAGGAAAAAGACAACCAAATGGAGAAGAAGGAGAAGGAGAAGTTTGTGAACAGTCGTGGCCATATTTTGCTTCTTCCTGCCCAGAAAACAGCATGTGGTGTGTTCATTTGATGGATATGATATTCAGGTTTTACACCATTGCACTATTACATTATCATATCAATAATTTATTGGGTGTATGATTGATGCAATGTCTTTTACTTTTTTCTTATAATGGTTAGTAAATTGAAGTATCTGTTTGGAATCAAACCCATGTTTCCATGTTGTGGAGGTTTAATGTAATTGTCTCTTGGGTCCATTTATATCTTGTATGTTTGGTATGTTTTTTTATTCTCAACTTAGAAAAAATTTCAATTGGGAATTGATACAAACTTTCTTACTTTCTTT

mRNA sequence

ATGATTTGTGCCCAGGGCTTTACTCCCTTAACCCAATTTGGGTTTTCATTTTCTTTATCTTCTCCATTGGAATCTCAGAGATGTGGGTTTTCTACTCCCCGATTGTATATGGTTTCTCCTATTTCTTGCAATTACCAGGATTCTACTTTCTCTGTTTCCAGAGCTGCTAAGTTTCGGGATTTAAGGTTGTTCAAATCGGTTGAGTTGGACCAGTTCATCACCAGTGATGATGAAGATGAAATGGGTGATGGGTTTTTTGAGGCCATTGAGGAATTGGAACGGATGACGAGGGAACCATCGGATGTTCTTGAAGAAATGAACGATCGCCTTTCCGCTAGGGAAATTCAGCTTGTGTTGGTCTACTTCTCTCAAGAAGGTAGAGATTCATGGTGTGCTCTTGAGGTTTTTGAGTGGCTTCAAAAGGAAAATCGGGTTGATAAGGAGACCATGGAGCTTATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTGGAGGGACGACATAATGTTGGAGATGTTGTTGACCTTCTTGTGGATATGGATTGTGTTGGTTTGAAACCTCATTTTAGCATGATAGAAAAGGTTATCTCTTTGTACTGGGAAATGGGTGAGAAGGAAAAAGCGGTTTTCTTTGTGAAAGAGGTTTTGGGGCGCAATCTTGCTTTTATGAAGGATGATTGGGAGGGGCATAAAGGAGGACCGAGCGGTTATCTTGCATGGAAGATGATGGTTGATGGTGACTACAGAGGTGCAGTGAAAATGGTGCTGCATCTTAGAGAATCTGGGTTAAGGCCAGAGGTTTACTCCTATCTTATTGCCATGACTGCTGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTTCGGAAACTCAAAGGCTACGCAAGAGACGGGTTTGTGGCTGAACTTGATAAAAACAATGTTGAACTTGTTGCAAAGTATCAGACAGAGCTTCTAGCTGATGGAGTGCAGTTATCCAACTGGGTACTTGAAGAGGGAAGCTCTTCAATTCGTGGGGTGGTTCACGAGAGACTCCTTGCAATGTACATTTGTGCTGGGCAAGGAGTCGAGGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTTGGCAAGGAGGCCGATGCTGATCTCTACGATATTGTTCTAGCCATTTGTGCTTCACAGAAGGAGACAAAAGCAATGAAACGGTTGCTTACCAGGATCGAGATTACGAGTCCCATGATTAAGAAAAAGAGTTTGACATGGCTACTTAGGGGTTACATCAAAGGAGGGCATTTCCGTGATGCTGCAGGAACATTAGTAAAAATGATCAATTTGGGTTTTCTCCCAGAGTACTTGGACAGAGTAGCTGTACTGCAAGGATTAAGAAAAGAAATTCGTGAACCAGAAAGTGTTCATACGTACCTCGATCTCTGCAAGTGTCTTTCTGATGCCAATCTAATTGGACCTAGTCTTGTATATTTGCACTTACAGAAACACAAGCTCTGGATCATTAAAATGCTTTGA

Coding sequence (CDS)

ATGATTTGTGCCCAGGGCTTTACTCCCTTAACCCAATTTGGGTTTTCATTTTCTTTATCTTCTCCATTGGAATCTCAGAGATGTGGGTTTTCTACTCCCCGATTGTATATGGTTTCTCCTATTTCTTGCAATTACCAGGATTCTACTTTCTCTGTTTCCAGAGCTGCTAAGTTTCGGGATTTAAGGTTGTTCAAATCGGTTGAGTTGGACCAGTTCATCACCAGTGATGATGAAGATGAAATGGGTGATGGGTTTTTTGAGGCCATTGAGGAATTGGAACGGATGACGAGGGAACCATCGGATGTTCTTGAAGAAATGAACGATCGCCTTTCCGCTAGGGAAATTCAGCTTGTGTTGGTCTACTTCTCTCAAGAAGGTAGAGATTCATGGTGTGCTCTTGAGGTTTTTGAGTGGCTTCAAAAGGAAAATCGGGTTGATAAGGAGACCATGGAGCTTATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTGGAGGGACGACATAATGTTGGAGATGTTGTTGACCTTCTTGTGGATATGGATTGTGTTGGTTTGAAACCTCATTTTAGCATGATAGAAAAGGTTATCTCTTTGTACTGGGAAATGGGTGAGAAGGAAAAAGCGGTTTTCTTTGTGAAAGAGGTTTTGGGGCGCAATCTTGCTTTTATGAAGGATGATTGGGAGGGGCATAAAGGAGGACCGAGCGGTTATCTTGCATGGAAGATGATGGTTGATGGTGACTACAGAGGTGCAGTGAAAATGGTGCTGCATCTTAGAGAATCTGGGTTAAGGCCAGAGGTTTACTCCTATCTTATTGCCATGACTGCTGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTTCGGAAACTCAAAGGCTACGCAAGAGACGGGTTTGTGGCTGAACTTGATAAAAACAATGTTGAACTTGTTGCAAAGTATCAGACAGAGCTTCTAGCTGATGGAGTGCAGTTATCCAACTGGGTACTTGAAGAGGGAAGCTCTTCAATTCGTGGGGTGGTTCACGAGAGACTCCTTGCAATGTACATTTGTGCTGGGCAAGGAGTCGAGGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTTGGCAAGGAGGCCGATGCTGATCTCTACGATATTGTTCTAGCCATTTGTGCTTCACAGAAGGAGACAAAAGCAATGAAACGGTTGCTTACCAGGATCGAGATTACGAGTCCCATGATTAAGAAAAAGAGTTTGACATGGCTACTTAGGGGTTACATCAAAGGAGGGCATTTCCGTGATGCTGCAGGAACATTAGTAAAAATGATCAATTTGGGTTTTCTCCCAGAGTACTTGGACAGAGTAGCTGTACTGCAAGGATTAAGAAAAGAAATTCGTGAACCAGAAAGTGTTCATACGTACCTCGATCTCTGCAAGTGTCTTTCTGATGCCAATCTAATTGGACCTAGTCTTGTATATTTGCACTTACAGAAACACAAGCTCTGGATCATTAAAATGCTTTGA

Protein sequence

MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDANLIGPSLVYLHLQKHKLWIIKML
BLAST of CsaV3_6G016010 vs. NCBI nr
Match: XP_011657120.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucumis sativus] >KGN47058.1 hypothetical protein Csa_6G182120 [Cucumis sativus])

HSP 1 Score: 1003.4 bits (2593), Expect = 2.7e-289
Identity = 501/501 (100.00%), Postives = 501/501 (100.00%), Query Frame = 0

Query: 1   MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD 60
           MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD
Sbjct: 1   MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD 60

Query: 61  LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV 120
           LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV
Sbjct: 61  LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV 120

Query: 121 YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVD 180
           YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVD
Sbjct: 121 YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVD 180

Query: 181 MDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYLA 240
           MDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYLA
Sbjct: 181 MDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYLA 240

Query: 241 WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDGF 300
           WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDGF
Sbjct: 241 WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDGF 300

Query: 301 VAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEAE 360
           VAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEAE
Sbjct: 301 VAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEAE 360

Query: 361 RQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRGY 420
           RQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRGY
Sbjct: 361 RQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRGY 420

Query: 421 IKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDANL 480
           IKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDANL
Sbjct: 421 IKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDANL 480

Query: 481 IGPSLVYLHLQKHKLWIIKML 502
           IGPSLVYLHLQKHKLWIIKML
Sbjct: 481 IGPSLVYLHLQKHKLWIIKML 501

BLAST of CsaV3_6G016010 vs. NCBI nr
Match: XP_008465268.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucumis melo])

HSP 1 Score: 979.2 bits (2530), Expect = 5.4e-282
Identity = 488/501 (97.41%), Postives = 494/501 (98.60%), Query Frame = 0

Query: 1   MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD 60
           MICAQGFTPLTQFGFSFSLSSPLE+QR GFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD
Sbjct: 1   MICAQGFTPLTQFGFSFSLSSPLETQRYGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD 60

Query: 61  LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV 120
           LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV
Sbjct: 61  LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV 120

Query: 121 YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVD 180
           YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWI KLVEGRHNVGDVVDLLVD
Sbjct: 121 YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWINKLVEGRHNVGDVVDLLVD 180

Query: 181 MDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYLA 240
           MDCVGLKPHFSMIEKVISLYWEMGEKEKA+FFVKEVLGRNLAFMKDDWEGHKGGPSGYLA
Sbjct: 181 MDCVGLKPHFSMIEKVISLYWEMGEKEKAIFFVKEVLGRNLAFMKDDWEGHKGGPSGYLA 240

Query: 241 WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDGF 300
           WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLK YARDG+
Sbjct: 241 WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKSYARDGY 300

Query: 301 VAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEAE 360
           VAELDKNNVELVAKYQTELLADGV+LSNWVLEEGSSSI GVVHERLLAMYICAGQGVEAE
Sbjct: 301 VAELDKNNVELVAKYQTELLADGVRLSNWVLEEGSSSIHGVVHERLLAMYICAGQGVEAE 360

Query: 361 RQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRGY 420
           RQLWEMKL+GKEADADLYDIVLAICASQKE KAMKRLLTRIEITSPMIKKKSLTWLLRGY
Sbjct: 361 RQLWEMKLLGKEADADLYDIVLAICASQKEIKAMKRLLTRIEITSPMIKKKSLTWLLRGY 420

Query: 421 IKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDANL 480
           IKGGHFRDAAGT+VKMINLGFLPEYLDRVAVLQGLRK IREPE VHTYLDLCKCLSDANL
Sbjct: 421 IKGGHFRDAAGTVVKMINLGFLPEYLDRVAVLQGLRKGIREPEIVHTYLDLCKCLSDANL 480

Query: 481 IGPSLVYLHLQKHKLWIIKML 502
           IGPSLVYLHLQKHKLWIIKML
Sbjct: 481 IGPSLVYLHLQKHKLWIIKML 501

BLAST of CsaV3_6G016010 vs. NCBI nr
Match: XP_022944005.1 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita moschata])

HSP 1 Score: 907.1 bits (2343), Expect = 2.6e-260
Identity = 453/510 (88.82%), Postives = 479/510 (93.92%), Query Frame = 0

Query: 1   MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRL---------YMVSPISCNYQDSTFS 60
           MICAQGFTPLTQFGFSFSLSS L+S+R GFS P+L         +MVS I+CN+Q+STFS
Sbjct: 1   MICAQGFTPLTQFGFSFSLSSGLKSERLGFSAPQLCSRSPVNFCFMVSRITCNHQNSTFS 60

Query: 61  VSRAAKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLS 120
           VSRA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTR+PSDVLEEMNDRLS
Sbjct: 61  VSRAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTRDPSDVLEEMNDRLS 120

Query: 121 AREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNV 180
           ARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV
Sbjct: 121 AREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNV 180

Query: 181 GDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGH 240
            DVVDLLVDMDCVGLKPHFSMIEKVISLYW+MGEKEKA+ FVKEVLGR L FMKD+WEGH
Sbjct: 181 RDVVDLLVDMDCVGLKPHFSMIEKVISLYWDMGEKEKAISFVKEVLGRKLDFMKDNWEGH 240

Query: 241 KGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRK 300
           KGGPSGYLAWKMMVDGDYRGAVKMVL+LRESGL+PEVY YLIAMTAVVKELNEFAKALRK
Sbjct: 241 KGGPSGYLAWKMMVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK 300

Query: 301 LKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYI 360
           LK YARDG VAELDK+NVELV +YQ+ELLADGV+LSNWVL+EG SS  GVVHERLLAMYI
Sbjct: 301 LKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYI 360

Query: 361 CAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKK 420
           CAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM RLLTRIEITSP +KKK
Sbjct: 361 CAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKK 420

Query: 421 SLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDL 480
           SLTWLLRGYIKGGHFRDAA TLVKM+NLGFLPEYLDRVAVLQGLRK IREPE+V TYLDL
Sbjct: 421 SLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDL 480

Query: 481 CKCLSDANLIGPSLVYLHLQKHKLWIIKML 502
           CKCLSDANLIGPSLVYLHLQK+KLW+IKML
Sbjct: 481 CKCLSDANLIGPSLVYLHLQKYKLWVIKML 510

BLAST of CsaV3_6G016010 vs. NCBI nr
Match: XP_023512972.1 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 907.1 bits (2343), Expect = 2.6e-260
Identity = 453/510 (88.82%), Postives = 479/510 (93.92%), Query Frame = 0

Query: 1   MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRL---------YMVSPISCNYQDSTFS 60
           MICAQGFTPLTQFGFSFSLSS L+S+R GFS P+L         +MVS I+CN+Q+STFS
Sbjct: 1   MICAQGFTPLTQFGFSFSLSSGLKSERLGFSAPQLCSRSPVNFCFMVSRITCNHQNSTFS 60

Query: 61  VSRAAKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLS 120
           VSRA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTR+PSDVLEEMNDRLS
Sbjct: 61  VSRAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTRDPSDVLEEMNDRLS 120

Query: 121 AREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNV 180
           ARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV
Sbjct: 121 AREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNV 180

Query: 181 GDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGH 240
           GDVVDLLVDMDCVGLKPHFSMIEKVISLYW+MGEKEKA+ FVKEVLGR L FMKD+WEGH
Sbjct: 181 GDVVDLLVDMDCVGLKPHFSMIEKVISLYWDMGEKEKAISFVKEVLGRKLDFMKDNWEGH 240

Query: 241 KGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRK 300
           KGGPSGYLAWKMMVDGDYRGAVKMVL+LRESGL+PEVY YLIAMTAVVKELNEFAKALRK
Sbjct: 241 KGGPSGYLAWKMMVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK 300

Query: 301 LKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYI 360
           LK YARDG VAELDK+NVELV +YQ+ELLADGV+LSNWVL+EG SS   VVHERLLAMYI
Sbjct: 301 LKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHRVVHERLLAMYI 360

Query: 361 CAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKK 420
           CAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM RLLTRIEITSP +KKK
Sbjct: 361 CAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKK 420

Query: 421 SLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDL 480
           SLTWLLRGYIKGGHFRDAA TLVKM+NLGFLPEYLDRVAVLQGLRK IREPE+V TYLDL
Sbjct: 421 SLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDL 480

Query: 481 CKCLSDANLIGPSLVYLHLQKHKLWIIKML 502
           CKCLSDANLIGPSLVYLHLQK+KLW+IKML
Sbjct: 481 CKCLSDANLIGPSLVYLHLQKYKLWVIKML 510

BLAST of CsaV3_6G016010 vs. NCBI nr
Match: XP_022986849.1 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita maxima])

HSP 1 Score: 902.9 bits (2332), Expect = 4.9e-259
Identity = 450/510 (88.24%), Postives = 480/510 (94.12%), Query Frame = 0

Query: 1   MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRL---------YMVSPISCNYQDSTFS 60
           MICA GFTPLT+FGFSFSLSS L+S+R GFS P+L         ++VS I+CN+Q+STFS
Sbjct: 1   MICAPGFTPLTKFGFSFSLSSGLKSKRLGFSAPQLCSRSPVNFCFIVSRITCNHQNSTFS 60

Query: 61  VSRAAKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLS 120
           VSRA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTR+PSDVLEEMNDRLS
Sbjct: 61  VSRAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTRDPSDVLEEMNDRLS 120

Query: 121 AREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNV 180
           ARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV
Sbjct: 121 AREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNV 180

Query: 181 GDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGH 240
           GDVVDLLVDMDCVGLKPHFSMIEKVISLYW+MGEKEKA+ FVKEVLGR L FMKD+WEGH
Sbjct: 181 GDVVDLLVDMDCVGLKPHFSMIEKVISLYWDMGEKEKAISFVKEVLGRKLDFMKDNWEGH 240

Query: 241 KGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRK 300
           KGGPSGYLAWKMMVDGDYRGAVKMVL+LRESGL+PEVY +LIAMTAVVKELNEFAKALRK
Sbjct: 241 KGGPSGYLAWKMMVDGDYRGAVKMVLNLRESGLKPEVYCFLIAMTAVVKELNEFAKALRK 300

Query: 301 LKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYI 360
           LK YARDG VAELDK+NVELV +YQ+ELLADGV+LSNWVL+EGSSS  GVVHERLLAMYI
Sbjct: 301 LKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGSSSSHGVVHERLLAMYI 360

Query: 361 CAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKK 420
           CAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM RLL+RIEITSP +KKK
Sbjct: 361 CAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLSRIEITSPRLKKK 420

Query: 421 SLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDL 480
           SLTWLLRGYIKGGHFRDAA TLVKM+NLGFLPEYLDRVAVLQGLRK IREPE+V TYLDL
Sbjct: 421 SLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDL 480

Query: 481 CKCLSDANLIGPSLVYLHLQKHKLWIIKML 502
           CKCLSDANLIGPSLVYLHLQK+KLW+IKML
Sbjct: 481 CKCLSDANLIGPSLVYLHLQKYKLWVIKML 510

BLAST of CsaV3_6G016010 vs. TAIR10
Match: AT2G30100.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 602.8 bits (1553), Expect = 1.9e-172
Identity = 300/458 (65.50%), Postives = 366/458 (79.91%), Query Frame = 0

Query: 55  AAKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRL 114
           A KFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+LEEMN RL
Sbjct: 46  AGKFREMGLSRSVELDQFITSEEEEGEAEEIGEGFFEAIEELERMTREPSDILEEMNHRL 105

Query: 115 SAREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHN 174
           S+RE+QL+LVYF+QEGRDSWC LEVFEWL+KENRVD+E MELMVSIMC W+KKL+E   N
Sbjct: 106 SSRELQLMLVYFAQEGRDSWCTLEVFEWLKKENRVDEEIMELMVSIMCGWVKKLIEDECN 165

Query: 175 VGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKD---- 234
              V DLL++MDCVGLKP FSM++KVI+LY EMG+KE AV FVKEVL R   F       
Sbjct: 166 AHQVFDLLIEMDCVGLKPGFSMMDKVIALYCEMGKKESAVLFVKEVLRRRDGFGYSVVGG 225

Query: 235 -DWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEF 294
              EG KGGP GYLAWK MVDGDYR AV MV+ LR SGL+PE YSYLIAMTA+VKELN  
Sbjct: 226 GGSEGRKGGPVGYLAWKFMVDGDYRKAVDMVMELRLSGLKPEAYSYLIAMTAIVKELNSL 285

Query: 295 AKALRKLKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEG--SSSIRGVVH 354
            K LR+LK +AR GFVAE+D ++  L+ KYQ+E L+ G+QL+ W +EEG  + SI GVVH
Sbjct: 286 GKTLRELKRFARAGFVAEIDDHDRVLIEKYQSETLSRGLQLATWAVEEGQENDSIIGVVH 345

Query: 355 ERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEI 414
           ERLLAMYICAG+G EAE+QLW+MKL G+E +ADL+DIV+AICASQKE  A+ RLLTR+E 
Sbjct: 346 ERLLAMYICAGRGPEAEKQLWKMKLAGREPEADLHDIVMAICASQKEVNAVSRLLTRVEF 405

Query: 415 TSPMIKKKSLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPE 474
                KKK+L+WLLRGY+KGGHF +AA TLV MI+ G  PEY+DRVAV+QG+ ++I+ P 
Sbjct: 406 MGSQRKKKTLSWLLRGYVKGGHFEEAAETLVSMIDSGLHPEYIDRVAVMQGMTRKIQRPR 465

Query: 475 SVHTYLDLCKCLSDANLIGPSLVYLHLQKHKLWIIKML 502
            V  Y+ LCK L DA L+GP LVY+++ K+KLWI+KM+
Sbjct: 466 DVEAYMSLCKRLFDAGLVGPCLVYMYIDKYKLWIVKMM 503

BLAST of CsaV3_6G016010 vs. Swiss-Prot
Match: sp|Q0WNN7|PP176_ARATH (Pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g30100 PE=2 SV=2)

HSP 1 Score: 602.8 bits (1553), Expect = 3.4e-171
Identity = 300/458 (65.50%), Postives = 366/458 (79.91%), Query Frame = 0

Query: 55  AAKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRL 114
           A KFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+LEEMN RL
Sbjct: 46  AGKFREMGLSRSVELDQFITSEEEEGEAEEIGEGFFEAIEELERMTREPSDILEEMNHRL 105

Query: 115 SAREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHN 174
           S+RE+QL+LVYF+QEGRDSWC LEVFEWL+KENRVD+E MELMVSIMC W+KKL+E   N
Sbjct: 106 SSRELQLMLVYFAQEGRDSWCTLEVFEWLKKENRVDEEIMELMVSIMCGWVKKLIEDECN 165

Query: 175 VGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKD---- 234
              V DLL++MDCVGLKP FSM++KVI+LY EMG+KE AV FVKEVL R   F       
Sbjct: 166 AHQVFDLLIEMDCVGLKPGFSMMDKVIALYCEMGKKESAVLFVKEVLRRRDGFGYSVVGG 225

Query: 235 -DWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEF 294
              EG KGGP GYLAWK MVDGDYR AV MV+ LR SGL+PE YSYLIAMTA+VKELN  
Sbjct: 226 GGSEGRKGGPVGYLAWKFMVDGDYRKAVDMVMELRLSGLKPEAYSYLIAMTAIVKELNSL 285

Query: 295 AKALRKLKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEG--SSSIRGVVH 354
            K LR+LK +AR GFVAE+D ++  L+ KYQ+E L+ G+QL+ W +EEG  + SI GVVH
Sbjct: 286 GKTLRELKRFARAGFVAEIDDHDRVLIEKYQSETLSRGLQLATWAVEEGQENDSIIGVVH 345

Query: 355 ERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEI 414
           ERLLAMYICAG+G EAE+QLW+MKL G+E +ADL+DIV+AICASQKE  A+ RLLTR+E 
Sbjct: 346 ERLLAMYICAGRGPEAEKQLWKMKLAGREPEADLHDIVMAICASQKEVNAVSRLLTRVEF 405

Query: 415 TSPMIKKKSLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPE 474
                KKK+L+WLLRGY+KGGHF +AA TLV MI+ G  PEY+DRVAV+QG+ ++I+ P 
Sbjct: 406 MGSQRKKKTLSWLLRGYVKGGHFEEAAETLVSMIDSGLHPEYIDRVAVMQGMTRKIQRPR 465

Query: 475 SVHTYLDLCKCLSDANLIGPSLVYLHLQKHKLWIIKML 502
            V  Y+ LCK L DA L+GP LVY+++ K+KLWI+KM+
Sbjct: 466 DVEAYMSLCKRLFDAGLVGPCLVYMYIDKYKLWIVKMM 503

BLAST of CsaV3_6G016010 vs. TrEMBL
Match: tr|A0A0A0KC35|A0A0A0KC35_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G182120 PE=4 SV=1)

HSP 1 Score: 1003.4 bits (2593), Expect = 1.8e-289
Identity = 501/501 (100.00%), Postives = 501/501 (100.00%), Query Frame = 0

Query: 1   MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD 60
           MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD
Sbjct: 1   MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD 60

Query: 61  LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV 120
           LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV
Sbjct: 61  LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV 120

Query: 121 YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVD 180
           YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVD
Sbjct: 121 YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVD 180

Query: 181 MDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYLA 240
           MDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYLA
Sbjct: 181 MDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYLA 240

Query: 241 WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDGF 300
           WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDGF
Sbjct: 241 WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDGF 300

Query: 301 VAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEAE 360
           VAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEAE
Sbjct: 301 VAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEAE 360

Query: 361 RQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRGY 420
           RQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRGY
Sbjct: 361 RQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRGY 420

Query: 421 IKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDANL 480
           IKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDANL
Sbjct: 421 IKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDANL 480

Query: 481 IGPSLVYLHLQKHKLWIIKML 502
           IGPSLVYLHLQKHKLWIIKML
Sbjct: 481 IGPSLVYLHLQKHKLWIIKML 501

BLAST of CsaV3_6G016010 vs. TrEMBL
Match: tr|A0A1S3CNE0|A0A1S3CNE0_CUCME (pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502924 PE=4 SV=1)

HSP 1 Score: 979.2 bits (2530), Expect = 3.6e-282
Identity = 488/501 (97.41%), Postives = 494/501 (98.60%), Query Frame = 0

Query: 1   MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD 60
           MICAQGFTPLTQFGFSFSLSSPLE+QR GFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD
Sbjct: 1   MICAQGFTPLTQFGFSFSLSSPLETQRYGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD 60

Query: 61  LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV 120
           LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV
Sbjct: 61  LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV 120

Query: 121 YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVD 180
           YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWI KLVEGRHNVGDVVDLLVD
Sbjct: 121 YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWINKLVEGRHNVGDVVDLLVD 180

Query: 181 MDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYLA 240
           MDCVGLKPHFSMIEKVISLYWEMGEKEKA+FFVKEVLGRNLAFMKDDWEGHKGGPSGYLA
Sbjct: 181 MDCVGLKPHFSMIEKVISLYWEMGEKEKAIFFVKEVLGRNLAFMKDDWEGHKGGPSGYLA 240

Query: 241 WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDGF 300
           WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLK YARDG+
Sbjct: 241 WKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKSYARDGY 300

Query: 301 VAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEAE 360
           VAELDKNNVELVAKYQTELLADGV+LSNWVLEEGSSSI GVVHERLLAMYICAGQGVEAE
Sbjct: 301 VAELDKNNVELVAKYQTELLADGVRLSNWVLEEGSSSIHGVVHERLLAMYICAGQGVEAE 360

Query: 361 RQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRGY 420
           RQLWEMKL+GKEADADLYDIVLAICASQKE KAMKRLLTRIEITSPMIKKKSLTWLLRGY
Sbjct: 361 RQLWEMKLLGKEADADLYDIVLAICASQKEIKAMKRLLTRIEITSPMIKKKSLTWLLRGY 420

Query: 421 IKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDANL 480
           IKGGHFRDAAGT+VKMINLGFLPEYLDRVAVLQGLRK IREPE VHTYLDLCKCLSDANL
Sbjct: 421 IKGGHFRDAAGTVVKMINLGFLPEYLDRVAVLQGLRKGIREPEIVHTYLDLCKCLSDANL 480

Query: 481 IGPSLVYLHLQKHKLWIIKML 502
           IGPSLVYLHLQKHKLWIIKML
Sbjct: 481 IGPSLVYLHLQKHKLWIIKML 501

BLAST of CsaV3_6G016010 vs. TrEMBL
Match: tr|A0A2P5EK76|A0A2P5EK76_9ROSA (Pentatricopeptide repeat OS=Trema orientalis OX=63057 GN=TorRG33x02_183690 PE=4 SV=1)

HSP 1 Score: 740.7 bits (1911), Expect = 2.1e-210
Identity = 371/512 (72.46%), Postives = 430/512 (83.98%), Query Frame = 0

Query: 1   MICAQGFTPLTQFGFSFSLSSPLESQRC-GFSTPRLY----------MVSPISCNYQDST 60
           M  AQGFTPLT+ G S S    L   R  G+   + +          + S I C  Q+  
Sbjct: 1   MASAQGFTPLTELGISSSSCISLRRNRSFGYQVRKSFWGRTCCASSRVCSIICCKQQNPG 60

Query: 61  FSVSRAAKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDR 120
           F V + +KFR+ RLFKSVELDQF+TSDDE+EMG+GFFEAIEELERMTREPSDVLEEMNDR
Sbjct: 61  FIVVKPSKFREFRLFKSVELDQFLTSDDEEEMGEGFFEAIEELERMTREPSDVLEEMNDR 120

Query: 121 LSAREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRH 180
           LSARE+QLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMV++MCSWIKKL+EG H
Sbjct: 121 LSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVTLMCSWIKKLIEGEH 180

Query: 181 NVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWE 240
           +VGDVVDLLVDMDCVGLKP FSM+EKVISLYW+MGEKE+AV FVK+VL R + +  DD +
Sbjct: 181 DVGDVVDLLVDMDCVGLKPSFSMMEKVISLYWDMGEKERAVLFVKDVLRRGITYSDDDGD 240

Query: 241 GHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKAL 300
           G+KGGP+GYLAWKMMV+G Y  AVK+V+ LRESGL+PEVYSYLIAMTA VKELNEFAKAL
Sbjct: 241 GNKGGPTGYLAWKMMVEGKYMDAVKLVVDLRESGLKPEVYSYLIAMTAAVKELNEFAKAL 300

Query: 301 RKLKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAM 360
           RKLKG+AR    AELD+ + EL+ KYQ++LLADGV+LSNW  EEGSSS+ GVVHERLLAM
Sbjct: 301 RKLKGFARGRLTAELDEESTELIEKYQSDLLADGVRLSNWATEEGSSSLYGVVHERLLAM 360

Query: 361 YICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIK 420
           YICAG+G+EAERQLWEMKLVGKEADADL+DIVLAICASQKE  A+ R+LTR+EI+S + K
Sbjct: 361 YICAGRGLEAERQLWEMKLVGKEADADLHDIVLAICASQKEASAIARMLTRVEISSSLRK 420

Query: 421 KKSLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYL 480
           KKSL+WLLRGY+KGGHF  AA T+VKM++LG  PEYLDR AVLQGLRK I+ P SV TYL
Sbjct: 421 KKSLSWLLRGYVKGGHFDKAAETVVKMLDLGLCPEYLDRAAVLQGLRKRIQGPGSVETYL 480

Query: 481 DLCKCLSDANLIGPSLVYLHLQKHKLWIIKML 502
            LCK LSD NL+GP LVYL+++K+KLWIIKM+
Sbjct: 481 KLCKHLSDNNLVGPCLVYLYIKKYKLWIIKMV 512

BLAST of CsaV3_6G016010 vs. TrEMBL
Match: tr|M5VXS0|M5VXS0_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G052600 PE=4 SV=1)

HSP 1 Score: 740.0 bits (1909), Expect = 3.6e-210
Identity = 370/502 (73.71%), Postives = 425/502 (84.66%), Query Frame = 0

Query: 1   MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKFRD 60
           M  AQG   LT   F+      +  +  GFS      V P  C +Q   F V++++K RD
Sbjct: 1   MASAQGLASLTHSLFAVKRQRFMGLR--GFSAQSCGRVFPRICKHQKPNFIVAKSSKVRD 60

Query: 61  LRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLV 120
            RLFKSVELDQF+TSDDEDEMG+GFFEAIEELERMTREPSDVLEEMNDRLSARE+QLVLV
Sbjct: 61  FRLFKSVELDQFLTSDDEDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSARELQLVLV 120

Query: 121 YFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVD 180
           YFSQEGRDSWCALEVFEWL+KENRVDKETM+LMVSIMCSW+KKL++  H++GDVVDLLVD
Sbjct: 121 YFSQEGRDSWCALEVFEWLRKENRVDKETMDLMVSIMCSWVKKLIQREHDIGDVVDLLVD 180

Query: 181 MDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFM-KDDWEGHKGGPSGYL 240
           MDCVGLKP FSM+EKVISLYWEMGEKEKAV FVKEVL R + +  +DD +GHKGGP+GYL
Sbjct: 181 MDCVGLKPSFSMMEKVISLYWEMGEKEKAVLFVKEVLKRGIVYSEEDDTDGHKGGPTGYL 240

Query: 241 AWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDG 300
           AWKMMV+G+YR +VK+V+HLRESGL+PEVYSYLIAMTAVVKELNE AKALRKLKG+ R G
Sbjct: 241 AWKMMVEGNYRDSVKLVIHLRESGLKPEVYSYLIAMTAVVKELNELAKALRKLKGFTRAG 300

Query: 301 FVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEA 360
            +AE D  NV L+ KYQ++LL+DGVQLSNWV++EGSSS+ GVVHERLLAMYIC+G G+EA
Sbjct: 301 LIAEFDTENVGLIEKYQSDLLSDGVQLSNWVIQEGSSSLHGVVHERLLAMYICSGHGLEA 360

Query: 361 ERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRG 420
           ERQLWEMKLVGKEADADLYDIVLAICASQKE  A+ RLLTR E+TS + KKKSL+WLLRG
Sbjct: 361 ERQLWEMKLVGKEADADLYDIVLAICASQKEASAIGRLLTRTEVTSSLRKKKSLSWLLRG 420

Query: 421 YIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDAN 480
           YIKGGHF DAA T++KM++LG  PE+LDR AVLQGLRK I+E   V TYL LCK LSDA+
Sbjct: 421 YIKGGHFDDAAETVIKMLDLGLCPEFLDRAAVLQGLRKSIQESGGVDTYLKLCKRLSDAS 480

Query: 481 LIGPSLVYLHLQKHKLWIIKML 502
           LIGP LVYL ++K+KLWI KML
Sbjct: 481 LIGPCLVYLFIRKYKLWITKML 500

BLAST of CsaV3_6G016010 vs. TrEMBL
Match: tr|A0A2P5AFX8|A0A2P5AFX8_PARAD (Pentatricopeptide repeat OS=Parasponia andersonii OX=3476 GN=PanWU01x14_336330 PE=4 SV=1)

HSP 1 Score: 733.8 bits (1893), Expect = 2.6e-208
Identity = 367/512 (71.68%), Postives = 427/512 (83.40%), Query Frame = 0

Query: 1   MICAQGFTPLTQFGFSFSLSSPLESQRC-GFSTPRLY----------MVSPISCNYQDST 60
           M  AQGFTPLT+ G S S    L   R  G+   + +          + S I C  Q+  
Sbjct: 2   MASAQGFTPLTELGISSSPCISLRRNRSFGYQVRKSFWGRTCCASSRVCSIICCKQQNPG 61

Query: 61  FSVSRAAKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDR 120
           F V + +KFR+ RLFKSVELDQF+TSDDE+EMG+GFFEAIEELERM REPSDVLEEMNDR
Sbjct: 62  FVVVKPSKFREFRLFKSVELDQFLTSDDEEEMGEGFFEAIEELERMRREPSDVLEEMNDR 121

Query: 121 LSAREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRH 180
           LSARE+QLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMV++MCSWIKKL+EG H
Sbjct: 122 LSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVTLMCSWIKKLIEGEH 181

Query: 181 NVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWE 240
           +VGDVVDLLVDMDCVGLKP FSM+EKVISLYW+MGEKE+AV FVK+VL   + +  DD  
Sbjct: 182 DVGDVVDLLVDMDCVGLKPSFSMMEKVISLYWDMGEKERAVLFVKDVLRHGITYSDDDGN 241

Query: 241 GHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKAL 300
           G+KGGP+GYLAWKMMV+G Y  AVK+V+ LRESGL+PEVYSYLIAMTA VKELNEFAKAL
Sbjct: 242 GNKGGPTGYLAWKMMVEGKYMDAVKLVVDLRESGLKPEVYSYLIAMTAAVKELNEFAKAL 301

Query: 301 RKLKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAM 360
           RKLKG+AR    AELD+ + EL+ KYQ++LLADGV+LSNW +EEGSS + GV+HERLLAM
Sbjct: 302 RKLKGFARGRLTAELDEESTELIEKYQSDLLADGVRLSNWAIEEGSSLLYGVIHERLLAM 361

Query: 361 YICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIK 420
           YICAG+G+EAERQLWEMKLVGKEADADL+DIVLAICASQKE  A+ R+LTR+EI+S + K
Sbjct: 362 YICAGRGLEAERQLWEMKLVGKEADADLHDIVLAICASQKEASAIARMLTRVEISSSLPK 421

Query: 421 KKSLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYL 480
           KKSL+WLLRGY+KGGHF  AA T+VKM++LG  PEYLDR AVLQGLRK I+ P SV TYL
Sbjct: 422 KKSLSWLLRGYVKGGHFDKAAETVVKMLDLGLCPEYLDRAAVLQGLRKRIQGPGSVETYL 481

Query: 481 DLCKCLSDANLIGPSLVYLHLQKHKLWIIKML 502
            LCK LSD NL+GP LVYL+++K+KLWIIKM+
Sbjct: 482 KLCKHLSDNNLVGPCLVYLYIKKYKLWIIKMV 513

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011657120.12.7e-289100.00PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic ... [more]
XP_008465268.15.4e-28297.41PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic ... [more]
XP_022944005.12.6e-26088.82pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita ... [more]
XP_023512972.12.6e-26088.82pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita ... [more]
XP_022986849.14.9e-25988.24pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita ... [more]
Match NameE-valueIdentityDescription
AT2G30100.11.9e-17265.50pentatricopeptide (PPR) repeat-containing protein[more]
Match NameE-valueIdentityDescription
sp|Q0WNN7|PP176_ARATH3.4e-17165.50Pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0KC35|A0A0A0KC35_CUCSA1.8e-289100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G182120 PE=4 SV=1[more]
tr|A0A1S3CNE0|A0A1S3CNE0_CUCME3.6e-28297.41pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucumis ... [more]
tr|A0A2P5EK76|A0A2P5EK76_9ROSA2.1e-21072.46Pentatricopeptide repeat OS=Trema orientalis OX=63057 GN=TorRG33x02_183690 PE=4 ... [more]
tr|M5VXS0|M5VXS0_PRUPE3.6e-21073.71Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G052600 PE=4 SV=1[more]
tr|A0A2P5AFX8|A0A2P5AFX8_PARAD2.6e-20871.68Pentatricopeptide repeat OS=Parasponia andersonii OX=3476 GN=PanWU01x14_336330 P... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_6G016010.1CsaV3_6G016010.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 85..227
e-value: 1.2E-5
score: 26.7
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 77..492
NoneNo IPR availablePANTHERPTHR24015:SF683SUBFAMILY NOT NAMEDcoord: 77..492

The following gene(s) are paralogous to this gene:

None