Cp4.1LG15g05020.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG15g05020.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG15 : 5890431 .. 5894635 (-)
Sequence length: 1482
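
The Location field can be unpacked mechanically. The following is a minimal sketch, not part of the database's own tooling: `parse_location` is a hypothetical helper, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Location string copied from the record above.
LOCATION = "Cp4.1LG15 : 5890431 .. 5894635 (-)"

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location(LOCATION)

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, strand, span)  # prints: Cp4.1LG15 - 4205
```

Under that assumption the feature spans 4205 bases on the minus strand, longer than the listed sequence length of 1482, which is what one would expect if the genomic span includes introns that the mRNA sequence does not.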
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGCCGCGTTAAGAAAATGGGCCTAGGTATAAGCCCAATGTAAGTTTGTGAAAAACAGTTGTCTAGTTTCTTTTCTTCGCCTCTCAAGCTTGCATTAACGGAGGTTTCTTTGAACCTTTCCTTAGGGTTTAGGCCTTTTCCGATCAGAAATGGCGTCTCTCATGGCGGTCCGGCGTGTTCGAACCCCCATTCATATCTCCTCCTTCATCAAGGTACTGTCTCCTCTCCCTTCAACTTTCACCTTCTCTTGCGGAAACCGAACAGAGACCCTAATCAAAGCCCTAAGCACCTCGGCGTTCCCTGACGATTTCTCCAATTTCCCTACACCGCCGCAACAACCTTCTTCGTCTGACCATCGATATCCTCAGGCTCAGTGGGGTTCGCCGAGCCAGGTTCATCGTTCGAGTGGAAATTTTAATCACCAGTCGTTCTCGGAGTTTCAGAATCGCGACTATGTTCAACAGGGAAGTCCTAGTAATCAAATGATTAATCGGAGTCAAAATCAGAGCTCGTACCCCAATCTTGGATTTCCCCGGCAGGGTCAGAGCTATACTCAAGGCGGTAACCCTAATTCGTGGAATCCTCCAAATCAGAGCTACCCGCAGTATCCAAATCCTTCGCAGCCGAACCCTCAAAATTTCAATTATCAGCAACAAAGAGGCCCTAACCAATGGAGCAATCAAAATCAGGGACTCCCACAATTTGGAAAGCCTGGTCAGCGGAACCTACAAGCAGAGAATTCTTATCAGTTGAATAATCAGGCTGGGATTCAAGGGCATGGTGCTCAAAATCACGCACCCAATGCCCTTGTATCTCCTATTGATGAATTGCGGCGCTTTTGTGGAGAGGGGAAGTTGAAAGAAGCTGTTGAATTATTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATGAGTTGTTTGAATTATGTGGGAAGTCGAAGTCATTTGAAAATGCTAAAGTAGTTCATGATTACTTTCTACAGTCAACTTGTAGAAGCGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGGGAAATGTGGGAGCATGAGCGATGCACAGAGAGTGTTCGACCATATGCTTGATAGGAGTATTGAGTCTTGGCATCTGATGATAAAAGGATATGCGGATAATGGATTGGGTGATGATGGTCTGGAGTTATTTGAGAACATGAAGAAGTTGGGATTGCATCCCAATTCACAAACTTTCCTTTTTGTTATGTCGGCTTGTGCTAGTGCCAGTGCTGTGGAAGAAGGATTTATGTACTTTGAGTCTATGAAAAATGATTACCATATCAACCCAGACATGGATCATTATCTGGGGCTTTTAGGTATTCTTGGAGAACCTGGACACATCAATGAGGCTTTCGAGTATGTCGAAAAACTGCCCATGGAGCCCACAGTCGAGGTATGGGAGACTTTGAAGAACTATGCTAGAATGCATGGAGATGTTGATCTTGAGGACTATGCAGAGGAGCTAATTGTTGATCTGGACCCAACGAAAGCTGCCTCTAACAAGATACCGACGCCACCTCCCAAGAAGCGATCTGCAATTAGCATGCTTGATGGGAAGAACAGGATTGTTGAATTCAGAAATCCAACTCTCTACAAAGATGATGAAAAATTAAAGGCTTTGAAGGCAATGAAAGAACAAGGGTATGTGCCAGATACTAGATATGTTCTTCACGATATTGATCAAGAGGCCAAAGAGCAGGCATTGCTGTATCATAGTGAACGATTGGCAATTGCATATGGATTGATCAGTACACCGGCGCGAACGCCTCTTAGGATCATTAAGAACCTACGGATCTGTGGTGACTGTCACAATGCCATCAAAATCATGTCTAGAATTGTTGGGAGAGAGTTGATTGTAAGGGACAACAAACGGTTTCATCATTTTAAGGATGGTAAATGTTCATGTGGGGATTACTGGTAAACCATCATTCTTGAACACACCACAACATGTACCTGATGAGTTTATCTTCCCAATAGCA
GCTCGATGTAAAGTGTTAGACCATCAATATCCTTATGCGAACCACGAAACAAGGTACCGATTGGAGACGAGCTAGCCATTGATTTATAATGTACGCTCTTCTTATCCTGATATTTAATCATTCGTCTTATCATCATTTTTTTTTTCTTCTTTTCCTGCTGTTTATTGTTGGAACTGAATTGGAAAGTGCTAGTTTTCTTCTCCTGATAGATGATTAGTAAGCTTTTAGCAAAAGAAAACGTCTCAGTGGTAATTACTGGGTTTGGAATTTGTTTTCTCCTCGGTTTCAATTAGGTTGAAAGAATCACTTGAATCCTAGCTAGTTCTAAACCTTGACTTGGTTTATTAAAACATTAGTAGAAGAAAGTGGTAACAAAACATAGACTTGCGAGTGAAAGTAGTGTTTGTAAGCTTATCAAAAATGACATTGTTATCGTGTTTCGATCCACCCGAAGCAGTTTAACTAAAAGAGAAGTTGGAATTTATACAAAGCCTTTACATTATACTAACACTTTTTAACAATCAAACGCCATTAAAGAACCTTAAAAGTTGAAACACTCATCCTCTAAACATACTTTGCAGACCGAGTCATCATTGTCATAAACTCATCCATGTCCACCATGCCATCGCCGTTCCTGTCCACCGCCCTGACCATCCGCCGGCAATCCTCAATGCTACACTTTTCCCCCAATCCCTTCAACACTCTCATCACTTCCTCTGCACTAATCTTCCGATCCCCATTCACATCAAACGTCTCGAAAGCAGATTCCACCTCTCCAGTCTGAACTCCGCTCCGGTGAACCTCCATAAACTCGTTCAAGTCTATGTAACCATCGCCGTCTGTGTCCACTGCCTTGAAAATCTTCTGCACCTCTTCCATGGAGCTCCCTCTCCCCAGTGCCTCCAGAATTCCCCTGTATTCATGCTTTGAGATCCTCCCATCTCTGTTAGTGTCGAACCATTTGAAGATCTGCTTTATCTCCTCCGTGCTCGGCTGCAACGCCCGCCGGAGTCCTGAGCTCTGTTGGTTCTTGAAGGAGCACAGACGGGAGGGTTCCCGGAGGAACTTTTTCTTTAAGACACTGTATTGCAGGTCCAGGAGGTTCGAGTTCATCGGCATTTTTTTGTTTGGGTTACTCTGTTTTTGTTTTTTTTTTTTTAATGAAAAGAAGATTGATCTTCAAATTTCAGACGGACCCAGATGAACTTTCAGTGGTTCACTACCGAAAACAGGGTAGCTCTGTTTTTTTGTTGTTGAAACAGAGGACTCGATCCAGATGAATTTAAAGGGTTTAAGCACTGAAAAATTGCTTGCTCTGGTCTGTTTTTTTCTTTGTTGTAAGAAACAAGGAAGATCGATCTTGAAATTTAAGATGAACCCAGATGGGTTTGAAGGGTTTAAATAGTGAAAAATGGGTTTGATTCACTGAAAACTTGTGTTGGAAGAAAGAAAAGAATATGATTCTTTACAAAGTTTCTACTTTTCTCTGCCCTTCTTCATCCTCAGCTTGGTTTCTATTTCTATGGCTGATTTTACCCACATTTCACTTCCATTCTATTTAATCTTTTTACTCTTGGATGTCTCCTAGCATTGCTTTTAACAATTGTTTTAGAGACGTCTTCGAGACGTAATCTCTTATTTCAATCTCCATATTTGCAAAAATATTTTGAGGAGAATATTTATTTCTGTTCTTGATCCCATTAGCTAATAGTGAAAAATCTCTTCACTCTCTTCTCCATTCCACATTTAAATAAAAATCTCACTCCGTTTGAGGTAGGTCTCCACAAGGTCGAGTAAATACAGATCTTCAATTAGAGTCACTAACTTATAAGGGTCAAAATATTTGTCGAACTAAACAAAATAGAGCGGATAAAATCCTCTATGCTGCTTTAGTCTCTTCGTATCTTTTCATTTCCCTTTACTTTATTCTTCTTTTTTATACAAATTTGGCACCAGAGTTAGAAAAGGTTAGCCATATTGCTTCTCGAATAATATAATTT
TTAGACACATGGCATGAATTTGAGTATGCATAGATTCTTAATCGATTGAGTTATGAACTCATAGATTCTTAATCGATTGAGTTATGTTCTGATTCACGATGTACTAATGTAAGTAAATTGTAAAATTATTTACATGCCGTAAAAGTGAGTGGTTTATTGTGATCTTTAGGAATAAACTCTATTTTTTTTTAGAATGTTCGAATAA

mRNA sequence

ATGGGCCGCGGTTTAGGCCTTTTCCGATCAGAAATGGCGTCTCTCATGGCGGTCCGGCGTGTTCGAACCCCCATTCATATCTCCTCCTTCATCAAGGTACTGTCTCCTCTCCCTTCAACTTTCACCTTCTCTTGCGGAAACCGAACAGAGACCCTAATCAAAGCCCTAAGCACCTCGGCGTTCCCTGACGATTTCTCCAATTTCCCTACACCGCCGCAACAACCTTCTTCGTCTGACCATCGATATCCTCAGGCTCAGTGGGGTTCGCCGAGCCAGGTTCATCGTTCGAGTGGAAATTTTAATCACCAGTCGTTCTCGGAGTTTCAGAATCGCGACTATGTTCAACAGGGAAGTCCTAGTAATCAAATGATTAATCGGAGTCAAAATCAGAGCTCGTACCCCAATCTTGGATTTCCCCGGCAGGGTCAGAGCTATACTCAAGGCGGTAACCCTAATTCGTGGAATCCTCCAAATCAGAGCTACCCGCAGTATCCAAATCCTTCGCAGCCGAACCCTCAAAATTTCAATTATCAGCAACAAAGAGGCCCTAACCAATGGAGCAATCAAAATCAGGGACTCCCACAATTTGGAAAGCCTGGTCAGCGGAACCTACAAGCAGAGAATTCTTATCAGTTGAATAATCAGGCTGGGATTCAAGGGCATGGTGCTCAAAATCACGCACCCAATGCCCTTGTATCTCCTATTGATGAATTGCGGCGCTTTTGTGGAGAGGGGAAGTTGAAAGAAGCTGTTGAATTATTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATGAGTTGTTTGAATTATGTGGGAAGTCGAAACCGAGTCATCATTGTCATAAACTCATCCATGTCCACCATGCCATCGCCGTTCCTGTCCACCGCCCTGACCATCCGCCGGCAATCCTCAATGCTACACTTTTCCCCCAATCCCTTCAACACTCTCATCACTTCCTCTGCACTAATCTTCCGATCCCCATTCACATCAAACGTCTCGAAAGCAGATTCCACCTCTCCAGTCTGAACTCCGCTCCGGTGAACCTCCATAAACTCGTTCAAGTCTATGTAACCATCGCCGTCTGTGTCCACTGCCTTGAAAATCTTCTGCACCTCTTCCATGGAGCTCCCTCTCCCCAGTGCCTCCAGAATTCCCCTATCTGCTTTATCTCCTCCGTGCTCGGCTGCAACGCCCGCCGGAGTCCTGAGCTCTGTTGGTTCTTGAAGGAGCACAGACGGGAGGGTTCCCGGAGGAACTTTTTCTTTAAGACACTGTATTGCAGGTCCAGGAGAAGATTGATCTTCAAATTTCAGACGGACCCAGATGAACTTTCAGTGGTTCACTACCGAAAACAGGAGGACTCGATCCAGATGAATTTAAAGGGTTTAAGCACTGAAAAATTGCTTGCTCTGAAACAAGGAAGATCGATCTTGAAATTTAAGATGAACCCAGATGGGTTTGAAGGAATGTTCGAATAA

Coding sequence (CDS)

ATGGGCCGCGGTTTAGGCCTTTTCCGATCAGAAATGGCGTCTCTCATGGCGGTCCGGCGTGTTCGAACCCCCATTCATATCTCCTCCTTCATCAAGGTACTGTCTCCTCTCCCTTCAACTTTCACCTTCTCTTGCGGAAACCGAACAGAGACCCTAATCAAAGCCCTAAGCACCTCGGCGTTCCCTGACGATTTCTCCAATTTCCCTACACCGCCGCAACAACCTTCTTCGTCTGACCATCGATATCCTCAGGCTCAGTGGGGTTCGCCGAGCCAGGTTCATCGTTCGAGTGGAAATTTTAATCACCAGTCGTTCTCGGAGTTTCAGAATCGCGACTATGTTCAACAGGGAAGTCCTAGTAATCAAATGATTAATCGGAGTCAAAATCAGAGCTCGTACCCCAATCTTGGATTTCCCCGGCAGGGTCAGAGCTATACTCAAGGCGGTAACCCTAATTCGTGGAATCCTCCAAATCAGAGCTACCCGCAGTATCCAAATCCTTCGCAGCCGAACCCTCAAAATTTCAATTATCAGCAACAAAGAGGCCCTAACCAATGGAGCAATCAAAATCAGGGACTCCCACAATTTGGAAAGCCTGGTCAGCGGAACCTACAAGCAGAGAATTCTTATCAGTTGAATAATCAGGCTGGGATTCAAGGGCATGGTGCTCAAAATCACGCACCCAATGCCCTTGTATCTCCTATTGATGAATTGCGGCGCTTTTGTGGAGAGGGGAAGTTGAAAGAAGCTGTTGAATTATTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATGAGTTGTTTGAATTATGTGGGAAGTCGAAACCGAGTCATCATTGTCATAAACTCATCCATGTCCACCATGCCATCGCCGTTCCTGTCCACCGCCCTGACCATCCGCCGGCAATCCTCAATGCTACACTTTTCCCCCAATCCCTTCAACACTCTCATCACTTCCTCTGCACTAATCTTCCGATCCCCATTCACATCAAACGTCTCGAAAGCAGATTCCACCTCTCCAGTCTGAACTCCGCTCCGGTGAACCTCCATAAACTCGTTCAAGTCTATGTAACCATCGCCGTCTGTGTCCACTGCCTTGAAAATCTTCTGCACCTCTTCCATGGAGCTCCCTCTCCCCAGTGCCTCCAGAATTCCCCTATCTGCTTTATCTCCTCCGTGCTCGGCTGCAACGCCCGCCGGAGTCCTGAGCTCTGTTGGTTCTTGAAGGAGCACAGACGGGAGGGTTCCCGGAGGAACTTTTTCTTTAAGACACTGTATTGCAGGTCCAGGAGAAGATTGATCTTCAAATTTCAGACGGACCCAGATGAACTTTCAGTGGTTCACTACCGAAAACAGGAGGACTCGATCCAGATGAATTTAAAGGGTTTAAGCACTGAAAAATTGCTTGCTCTGAAACAAGGAAGATCGATCTTGAAATTTAAGATGAACCCAGATGGGTTTGAAGGAATGTTCGAATAA

Protein sequence

MGRGLGLFRSEMASLMAVRRVRTPIHISSFIKVLSPLPSTFTFSCGNRTETLIKALSTSAFPDDFSNFPTPPQQPSSSDHRYPQAQWGSPSQVHRSSGNFNHQSFSEFQNRDYVQQGSPSNQMINRSQNQSSYPNLGFPRQGQSYTQGGNPNSWNPPNQSYPQYPNPSQPNPQNFNYQQQRGPNQWSNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHAPNALVSPIDELRRFCGEGKLKEAVELLKEGVKADADCFHELFELCGKSKPSHHCHKLIHVHHAIAVPVHRPDHPPAILNATLFPQSLQHSHHFLCTNLPIPIHIKRLESRFHLSSLNSAPVNLHKLVQVYVTIAVCVHCLENLLHLFHGAPSPQCLQNSPICFISSVLGCNARRSPELCWFLKEHRREGSRRNFFFKTLYCRSRRRLIFKFQTDPDELSVVHYRKQEDSIQMNLKGLSTEKLLALKQGRSILKFKMNPDGFEGMFE
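
The protein sequence is the translation of the coding sequence listed above it. As a hedged illustration, the sketch below translates the first 39 bases of the CDS with the standard genetic code; `translate` and `CODON_TABLE` are names chosen for this example, and the use of the standard (nuclear) code is an assumption rather than something the record states.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 39 bases of the CDS shown above.
cds_prefix = "ATGGGCCGCGGTTTAGGCCTTTTCCGATCAGAAATGGCG"
print(translate(cds_prefix))  # prints: MGRGLGLFRSEMA
```

The output matches the first 13 residues of the protein sequence above. Passing the full CDS string to `translate` would be the natural way to check the remainder.
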
BLAST of Cp4.1LG15g05020.1 vs. Swiss-Prot
Match: PP153_ARATH (Pentatricopeptide repeat-containing protein At2g15690 OS=Arabidopsis thaliana GN=PCMP-H66 PE=2 SV=2)

HSP 1 Score: 62.4 bits (150), Expect = 1.6e-08
Identity = 93/295 (31.53%), Positives = 129/295 (43.73%), Query Frame = 1

Query: 12  MASLMAVRRVRTP--IHISSFIKVLSPLP---STFTFSCGNRTETLIKALSTSAFPDDFS 71
           M+SLMA+R  RT   + I S +++ S  P   S F FS G      IK LSTSA  +D+ 
Sbjct: 1   MSSLMAIRCARTQNIVTIGSLLQLRSSFPRLSSQFHFS-GTLNSIPIKHLSTSAAANDY- 60

Query: 72  NFPTPPQQPSSSDHRYPQAQWGSPSQVHRSSGNFNHQSFSEFQNRDYVQQGSPSNQMINR 131
                        H+ PQ+  GSPSQ  R    +  QSF   QN+    Q  P +     
Sbjct: 61  -------------HQNPQS--GSPSQHQRP---YPPQSFDS-QNQTNTNQRVPQSPNQWS 120

Query: 132 SQNQSSYPNLGFPRQGQSYTQGGNPNSWNPPN-QSYPQYPNPSQPNPQNFNYQQQRGPNQ 191
           +Q+    P  G    GQ+   GG    +   N Q   Q       NPQ+  ++ Q G  +
Sbjct: 121 TQHGGQIPQYG----GQNPQHGGQRPPYGGQNPQQGGQMSQYGGHNPQHGGHRPQYGGQR 180

Query: 192 WSNQNQGLPQFGKPGQ----RNLQAENS---YQLNNQAGIQGHGAQNHAPNAL--VSP-- 251
                   PQ+G PG     +N+Q  N    Y    Q   Q   + N +PN +  V+P  
Sbjct: 181 --------PQYGGPGNNYQNQNVQQSNQSQYYTPQQQQQPQPPRSSNQSPNQMNEVAPPP 240

Query: 252 -IDELRRFCGEGKLKEAVELLKEGVKADADCFHELFELCGKSKPSHHCHKLIHVH 289
            ++E+ R C     K+A+ELL +G   D +CF  LFE C   K   H  K +H H
Sbjct: 241 SVEEVMRLCQRRLYKDAIELLDKGAMPDRECFVLLFESCANLKSLEHSKK-VHDH 261
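
The percentages in each HSP header are the ratio of matching columns to alignment length. A small sketch of how such a header line could be parsed and the percentages recomputed follows; `parse_hsp_stats` is a hypothetical helper written for this example, not a function of any BLAST package.

```python
import re

# Statistics line in the format used by the HSP headers in this record.
HSP_STATS = "Identity = 93/295 (31.53%), Positives = 129/295 (43.73%), Query Frame = 1"

def parse_hsp_stats(line):
    """Return {label: (matched, aligned_length, percent)} for each 'x/y' field."""
    stats = {}
    for label, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matched, length = int(matched), int(length)
        stats[label] = (matched, length, round(100 * matched / length, 2))
    return stats

stats = parse_hsp_stats(HSP_STATS)
print(stats["Identity"])   # prints: (93, 295, 31.53)
print(stats["Positives"])  # prints: (129, 295, 43.73)
```

The "Query Frame" field has no `x/y` ratio, so the pattern skips it.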

BLAST of Cp4.1LG15g05020.1 vs. TrEMBL
Match: A0A0A0K6L3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G073730 PE=4 SV=1)

HSP 1 Score: 223.4 bits (568), Expect = 6.1e-55
Identity = 138/226 (61.06%), Positives = 155/226 (68.58%), Query Frame = 1

Query: 72  PQQPSSSDHRYPQAQWGSPSQVHRSSGNFNHQSFSEFQNRDYVQQGSPSNQMINRSQNQS 131
           P Q +  +  YPQ Q  +PSQ      N  +QS+ ++QN     Q +P NQ  ++ QN S
Sbjct: 323 PSQSNPPNQSYPQYQ--NPSQ-----SNPPNQSYPQYQNPS---QTNPPNQSYSQYQNPS 382

Query: 132 S-------YPNLGFPRQ----GQSYTQGGNPNSWNPPNQSYPQYPNPSQPNPQNFNYQQQ 191
                   YP    P Q     QS+ Q  NP+  NPPNQSYPQY NPSQPNP NFNYQQQ
Sbjct: 383 QPNAPNQRYPQYQNPSQPNPPNQSHPQYQNPSQSNPPNQSYPQYQNPSQPNPPNFNYQQQ 442

Query: 192 RGPNQWSNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHAPNALVSPIDELRR 251
           RGPNQW+NQNQ  PQFG+P  RN QAENS QLNNQAGIQ  G QN APNALVSPIDELRR
Sbjct: 443 RGPNQWNNQNQEHPQFGRPEHRNPQAENSNQLNNQAGIQRDGTQNQAPNALVSPIDELRR 502

Query: 252 FCGEGKLKEAVELLKEGVKADADCFHELFELCGKSKPSHHCHKLIH 287
           FCGEGKLKEAVELLK+GVKAD DCFH LFELCGKSK S    K++H
Sbjct: 503 FCGEGKLKEAVELLKQGVKADVDCFHLLFELCGKSK-SFDNAKVVH 537

BLAST of Cp4.1LG15g05020.1 vs. TrEMBL
Match: W9S6A2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_027540 PE=4 SV=1)

HSP 1 Score: 175.3 bits (443), Expect = 1.9e-40
Identity = 129/282 (45.74%), Positives = 160/282 (56.74%), Query Frame = 1

Query: 12  MASLMAVRRVRTPIHISSFIKVLSPLPSTFTFSCGNR--TETLIKALSTSAFPDDFSNFP 71
           MASLMA+RR R    ISSF KV    PS F     N    +TL+K LSTSAF D++ + P
Sbjct: 1   MASLMAIRRARCQ-KISSFFKVRPLHPSHFASINANNHNLQTLVKTLSTSAFTDEYQSPP 60

Query: 72  TPPQQPSSSDHRYPQAQ------------WGSPSQVHRSSGNFNHQSFSEFQNRDYVQQG 131
           TP    +   H  PQ Q            W S +Q+  S+  F++Q+    QN+DY  +G
Sbjct: 61  TPSDPRAVPHHGKPQRQGYPQTGNPNPNQWNSQNQI--SNNQFSYQN----QNQDYSNRG 120

Query: 132 SPSNQMINRSQNQSSYPNLGFPR--QGQSYTQGGNPNSWNPPNQSYPQYPNPSQPNPQNF 191
              NQ  N       +PN G+P   Q QSY Q GN    NP NQS+PQY NP+Q N QN 
Sbjct: 121 YYPNQGQN-------FPNRGYPNPVQNQSYPQHGNAQR-NPQNQSFPQYQNPNQTNIQNP 180

Query: 192 NYQQQRGPNQWSNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHAPNALVSPI 251
           N+QQ R PNQW+NQNQ  PQ   P QRN Q ++  Q   QA      + N   +  ++ I
Sbjct: 181 NFQQPRSPNQWNNQNQAYPQRANPNQRNQQVQSPNQRTPQA-----QSANQITDVKLTSI 240

Query: 252 DELRRFCGEGKLKEAVELL-KEGVKADADCFHELFELCGKSK 277
            +LR  C EGK+KEA+EL+ KEGVKADADCF  LFELCGK K
Sbjct: 241 SDLRTLCREGKVKEAIELMDKEGVKADADCFCALFELCGKLK 262

BLAST of Cp4.1LG15g05020.1 vs. TrEMBL
Match: M5VN21_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002958mg PE=4 SV=1)

HSP 1 Score: 165.6 bits (418), Expect = 1.5e-37
Identity = 132/292 (45.21%), Positives = 167/292 (57.19%), Query Frame = 1

Query: 12  MASLMAVRRVRTPIHISSFIKVLSPLPSTFTFSCGNRTE----------TLIKALSTSAF 71
           MASLMA+RR R+P  +S F KV     S F FS G+ T           T+ K+LSTSA 
Sbjct: 1   MASLMAIRRARSPA-LSPFFKVRPLHLSHFAFSHGSNTIAINITNLETLTIAKSLSTSAV 60

Query: 72  PDDFSNFPTPPQQPSSSDHRY--PQA---QWGSPSQVHRSSGNFNHQSFSEFQNRDYVQQ 131
           P+++   P P QQP  SD R    QA   QW +  Q + +S  +N Q+ ++  N  Y Q 
Sbjct: 61  PNEYQR-PPPQQQPPPSDPRAFDDQANPNQWAAQGQGYGNSNQWNPQTQNQTPNNQYNQN 120

Query: 132 GSPSNQMINRSQ------NQS-SYPNLGFP---RQGQSYTQGGNPNSWNPPNQSYPQYPN 191
            S   Q  N+S       NQ+ S+PN G+P    Q QSY Q GN N W+P  QS PQY N
Sbjct: 121 QSYPGQ--NQSYPGRGYPNQAPSFPNRGYPNQNNQNQSYPQRGNSNEWSPQVQSPPQYQN 180

Query: 192 PSQPN-PQNFNYQQQRGPNQWSNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQN 251
           P+Q N P + ++QQ R PNQW+N NQG  Q   P Q + QA+N  Q +N      + A N
Sbjct: 181 PNQVNPPPSPSFQQPRSPNQWNNPNQGYQQPRNPNQWSPQAQNPAQWSNNN--NNNQAVN 240

Query: 252 HAPNALVSPIDELRRFCGEGKLKEAVELL-KEGVKADADCFHELFELCGKSK 277
             P  +   ID+LRR C EGK KEA+EL+ KEGVKADADCF  LFELCG+ K
Sbjct: 241 QTPVVVPPSIDDLRRLCQEGKAKEALELMDKEGVKADADCFQSLFELCGRLK 286

BLAST of Cp4.1LG15g05020.1 vs. TrEMBL
Match: F6HQB8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0063g00540 PE=4 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 2.6e-29
Identity = 116/298 (38.93%), Positives = 143/298 (47.99%), Query Frame = 1

Query: 12  MASLMAVRRVRTPIHISSFIKVLSPLPSTFTFSCGNRTETLIKALSTSAFPDDFSNFPTP 71
           MASL+++RR RTP+  S   KV SP  S F F       TL K LSTSA P+D+     P
Sbjct: 1   MASLLSIRRARTPL-FSFLSKVPSPYSSHFIF-------TLTKTLSTSAVPNDYQR---P 60

Query: 72  PQQPSSS-----DHRYPQAQWGSPSQVHRSSGNFNHQSFSEFQNRDYVQQGSPSNQMINR 131
            QQP S      D R P   W S +Q          QS+                Q +N 
Sbjct: 61  QQQPPSEPRDFQDQRNPSYNWNSQTQ---------SQSYP---------------QHMNY 120

Query: 132 SQNQSSYPNLGFPRQGQSYTQGGNPNSWNPPNQSYPQYPNPSQPNPQN------------ 191
                SYPN G+P QGQ Y Q  NPN WN    +YPQ  NPS+PN QN            
Sbjct: 121 GDQNQSYPNRGYPNQGQGYPQHENPNQWNRQTPTYPQPQNPSRPNHQNQYYPPTGNPSLG 180

Query: 192 FNYQQQRGPNQWSNQNQG-------LPQFGKPGQRNLQA--ENSY-------QLNNQAGI 251
             Y QQR PNQW+ Q+Q           + +PG RNL +    SY       Q NNQ   
Sbjct: 181 QGYPQQRSPNQWNPQHQNPSHLNNQNQNYPQPGSRNLPSNQNQSYPHQGSPSQWNNQNPN 240

Query: 252 QGHGAQNHAPNALVSPIDELRRFCGEGKLKEAVELLKEGVKADADCFHELFELCGKSK 277
           Q    +N   +A  S + +L   C EGK+KEAVEL+++GV+ADA CF+ LF  CG  K
Sbjct: 241 QAQIVENQVSHAPPS-VADLMNLCQEGKVKEAVELMEKGVRADAQCFYALFNSCGSPK 262

BLAST of Cp4.1LG15g05020.1 vs. TrEMBL
Match: A5BI58_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_021220 PE=4 SV=1)

HSP 1 Score: 127.1 bits (318), Expect = 5.9e-26
Identity = 106/276 (38.41%), Positives = 135/276 (48.91%), Query Frame = 1

Query: 32  KVLSPLPSTFTFSCGNRTETLIKALSTSAFPDDFSNFPTPPQQPSSSDHRYPQAQWGSPS 91
           +V SP  S F F       TL K LSTSA P+D+     P QQP S    +         
Sbjct: 54  RVPSPYSSHFIF-------TLTKTLSTSAVPNDYQR---PQQQPPSEPRDFQ-------- 113

Query: 92  QVHRSSGNFNHQSFSEFQNRDYVQQGSPSNQMINRSQNQSSYPNLGFPRQGQSYTQGGNP 151
             H+ + N+N  S  + Q++ Y Q        +N  +   SYPN G+P QGQ Y Q G+P
Sbjct: 114 --HQRNPNYNWNS--QTQSQSYPQH-------MNYGEQNQSYPNRGYPNQGQGYPQHGSP 173

Query: 152 NSWNPPNQSYPQYPNPSQPNPQN------------FNYQQQRGPNQW----------SNQ 211
           N WN    +YPQ  NPS+PN QN              Y QQR PNQW          +NQ
Sbjct: 174 NQWNRQTPTYPQPQNPSRPNHQNQYYPPTGNPSLGQGYPQQRSPNQWNPQHQNPSPLNNQ 233

Query: 212 NQGLPQFGKPGQRNLQA--ENSY-------QLNNQAGIQGHGAQNHAPNALVSPIDELRR 271
           N+  PQ   PG RNL +    SY       Q NNQ   Q    +N   +A  S + +L  
Sbjct: 234 NENYPQ---PGSRNLPSNQNQSYPHQGSPSQWNNQNTNQAQIVENQVSHAPPS-VADLMN 293

Query: 272 FCGEGKLKEAVELLKEGVKADADCFHELFELCGKSK 277
            C EGK+KEAVEL+++GV+ADA CF+ LF  CG  K
Sbjct: 294 LCQEGKVKEAVELMEKGVRADAQCFYALFNSCGSPK 296

BLAST of Cp4.1LG15g05020.1 vs. TAIR10
Match: AT2G15690.1 (AT2G15690.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 62.4 bits (150), Expect = 9.1e-10
Identity = 93/295 (31.53%), Positives = 129/295 (43.73%), Query Frame = 1

Query: 12  MASLMAVRRVRTP--IHISSFIKVLSPLP---STFTFSCGNRTETLIKALSTSAFPDDFS 71
           M+SLMA+R  RT   + I S +++ S  P   S F FS G      IK LSTSA  +D+ 
Sbjct: 1   MSSLMAIRCARTQNIVTIGSLLQLRSSFPRLSSQFHFS-GTLNSIPIKHLSTSAAANDY- 60

Query: 72  NFPTPPQQPSSSDHRYPQAQWGSPSQVHRSSGNFNHQSFSEFQNRDYVQQGSPSNQMINR 131
                        H+ PQ+  GSPSQ  R    +  QSF   QN+    Q  P +     
Sbjct: 61  -------------HQNPQS--GSPSQHQRP---YPPQSFDS-QNQTNTNQRVPQSPNQWS 120

Query: 132 SQNQSSYPNLGFPRQGQSYTQGGNPNSWNPPN-QSYPQYPNPSQPNPQNFNYQQQRGPNQ 191
           +Q+    P  G    GQ+   GG    +   N Q   Q       NPQ+  ++ Q G  +
Sbjct: 121 TQHGGQIPQYG----GQNPQHGGQRPPYGGQNPQQGGQMSQYGGHNPQHGGHRPQYGGQR 180

Query: 192 WSNQNQGLPQFGKPGQ----RNLQAENS---YQLNNQAGIQGHGAQNHAPNAL--VSP-- 251
                   PQ+G PG     +N+Q  N    Y    Q   Q   + N +PN +  V+P  
Sbjct: 181 --------PQYGGPGNNYQNQNVQQSNQSQYYTPQQQQQPQPPRSSNQSPNQMNEVAPPP 240

Query: 252 -IDELRRFCGEGKLKEAVELLKEGVKADADCFHELFELCGKSKPSHHCHKLIHVH 289
            ++E+ R C     K+A+ELL +G   D +CF  LFE C   K   H  K +H H
Sbjct: 241 SVEEVMRLCQRRLYKDAIELLDKGAMPDRECFVLLFESCANLKSLEHSKK-VHDH 261

BLAST of Cp4.1LG15g05020.1 vs. NCBI nr
Match: gi|659109903|ref|XP_008454941.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g15690 [Cucumis melo])

HSP 1 Score: 367.9 bits (943), Expect = 2.9e-98
Identity = 208/307 (67.75%), Positives = 224/307 (72.96%), Query Frame = 1

Query: 12  MASLMAVRRVRTPIHISSFIKVLSPLPSTFTFSCGNRTETLIKALSTSAFPDDFSNFPTP 71
           MASLMAVRRVRTPI +SSF KV  PL S+FTF+  N+TETLIK LSTSA P DFSNFP+ 
Sbjct: 1   MASLMAVRRVRTPITVSSFFKVRYPLSSSFTFTFRNQTETLIKTLSTSAIPSDFSNFPSS 60

Query: 72  PQQPSSSDHRYPQAQWGSPSQVHRSSGNFNHQSFSEFQNRDYVQQGSPSNQMINRSQNQS 131
           PQQPSSS   Y Q QWGSPSQV+  S NFN QSFSEFQN DY QQG+PSNQ+  RSQ+QS
Sbjct: 61  PQQPSSSSPPYRQPQWGSPSQVNPPSENFNRQSFSEFQNHDYAQQGTPSNQLNYRSQHQS 120

Query: 132 SYPNLGFPRQGQSYTQGGNPNSW----------------NP----------------PNQ 191
             PN GF R+GQSYTQ G  NSW                NP                PNQ
Sbjct: 121 PQPNPGFSREGQSYTQVGKTNSWNPPNQSYPQYQNPSQPNPPNQSYPQYQNPSQPNPPNQ 180

Query: 192 SYPQYPNPSQPNPQNFNYQQQRGPNQWSNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQ 251
           SYPQY NPSQPNP NFNYQQQRGPNQW+NQNQG PQFG+   RN Q ENS QLNNQA IQ
Sbjct: 181 SYPQYQNPSQPNPPNFNYQQQRGPNQWNNQNQGHPQFGRSEHRNPQPENSNQLNNQAEIQ 240

Query: 252 GHGAQNHAPNALVSPIDELRRFCGEGKLKEAVELLKEGVKADADCFHELFELCGKSKPSH 287
            HG QN APNALVSPIDELRRFCGEGKLKEAVELLK+GVKAD DCFH LFELCGKSK   
Sbjct: 241 RHGTQNQAPNALVSPIDELRRFCGEGKLKEAVELLKQGVKADVDCFHLLFELCGKSKSLD 300

BLAST of Cp4.1LG15g05020.1 vs. NCBI nr
Match: gi|778725128|ref|XP_011658903.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like [Cucumis sativus])

HSP 1 Score: 231.5 bits (589), Expect = 3.2e-57
Identity = 140/217 (64.52%), Positives = 157/217 (72.35%), Query Frame = 1

Query: 72  PQQPSSSDHRYPQAQWGSPSQVHRSSGNFNHQSFSEFQNRDYVQQGSPSNQMINRSQN-- 131
           P QP++ + RYPQ Q  +PSQ      N  +QS+ ++QN     Q +P NQ   + QN  
Sbjct: 275 PSQPNAPNQRYPQYQ--NPSQ-----SNPPNQSYPQYQNPS---QSNPPNQSYPQYQNPS 334

Query: 132 QSSYPNLGFPRQGQSYTQGGNPNSWNPPNQSYPQYPNPSQPNPQNFNYQQQRGPNQWSNQ 191
           QS+ PN       QSY Q  NP+  NPPNQSYPQY NPSQPNP NFNYQQQRGPNQW+NQ
Sbjct: 335 QSNPPN-------QSYPQYQNPSQSNPPNQSYPQYQNPSQPNPPNFNYQQQRGPNQWNNQ 394

Query: 192 NQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHAPNALVSPIDELRRFCGEGKLKE 251
           NQ  PQFG+P  RN QAENS QLNNQAGIQ  G QN APNALVSPIDELRRFCGEGKLKE
Sbjct: 395 NQEHPQFGRPEHRNPQAENSNQLNNQAGIQRDGTQNQAPNALVSPIDELRRFCGEGKLKE 454

Query: 252 AVELLKEGVKADADCFHELFELCGKSKPSHHCHKLIH 287
           AVELLK+GVKAD DCFH LFELCGKSK S    K++H
Sbjct: 455 AVELLKQGVKADVDCFHLLFELCGKSK-SFDNAKVVH 473

BLAST of Cp4.1LG15g05020.1 vs. NCBI nr
Match: gi|700188704|gb|KGN43937.1| (hypothetical protein Csa_7G073730 [Cucumis sativus])

HSP 1 Score: 223.4 bits (568), Expect = 8.7e-55
Identity = 138/226 (61.06%), Positives = 155/226 (68.58%), Query Frame = 1

Query: 72  PQQPSSSDHRYPQAQWGSPSQVHRSSGNFNHQSFSEFQNRDYVQQGSPSNQMINRSQNQS 131
           P Q +  +  YPQ Q  +PSQ      N  +QS+ ++QN     Q +P NQ  ++ QN S
Sbjct: 323 PSQSNPPNQSYPQYQ--NPSQ-----SNPPNQSYPQYQNPS---QTNPPNQSYSQYQNPS 382

Query: 132 S-------YPNLGFPRQ----GQSYTQGGNPNSWNPPNQSYPQYPNPSQPNPQNFNYQQQ 191
                   YP    P Q     QS+ Q  NP+  NPPNQSYPQY NPSQPNP NFNYQQQ
Sbjct: 383 QPNAPNQRYPQYQNPSQPNPPNQSHPQYQNPSQSNPPNQSYPQYQNPSQPNPPNFNYQQQ 442

Query: 192 RGPNQWSNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHAPNALVSPIDELRR 251
           RGPNQW+NQNQ  PQFG+P  RN QAENS QLNNQAGIQ  G QN APNALVSPIDELRR
Sbjct: 443 RGPNQWNNQNQEHPQFGRPEHRNPQAENSNQLNNQAGIQRDGTQNQAPNALVSPIDELRR 502

Query: 252 FCGEGKLKEAVELLKEGVKADADCFHELFELCGKSKPSHHCHKLIH 287
           FCGEGKLKEAVELLK+GVKAD DCFH LFELCGKSK S    K++H
Sbjct: 503 FCGEGKLKEAVELLKQGVKADVDCFHLLFELCGKSK-SFDNAKVVH 537

BLAST of Cp4.1LG15g05020.1 vs. NCBI nr
Match: gi|703143188|ref|XP_010107950.1| (hypothetical protein L484_027540 [Morus notabilis])

HSP 1 Score: 175.3 bits (443), Expect = 2.7e-40
Identity = 129/282 (45.74%), Positives = 160/282 (56.74%), Query Frame = 1

Query: 12  MASLMAVRRVRTPIHISSFIKVLSPLPSTFTFSCGNR--TETLIKALSTSAFPDDFSNFP 71
           MASLMA+RR R    ISSF KV    PS F     N    +TL+K LSTSAF D++ + P
Sbjct: 1   MASLMAIRRARCQ-KISSFFKVRPLHPSHFASINANNHNLQTLVKTLSTSAFTDEYQSPP 60

Query: 72  TPPQQPSSSDHRYPQAQ------------WGSPSQVHRSSGNFNHQSFSEFQNRDYVQQG 131
           TP    +   H  PQ Q            W S +Q+  S+  F++Q+    QN+DY  +G
Sbjct: 61  TPSDPRAVPHHGKPQRQGYPQTGNPNPNQWNSQNQI--SNNQFSYQN----QNQDYSNRG 120

Query: 132 SPSNQMINRSQNQSSYPNLGFPR--QGQSYTQGGNPNSWNPPNQSYPQYPNPSQPNPQNF 191
              NQ  N       +PN G+P   Q QSY Q GN    NP NQS+PQY NP+Q N QN 
Sbjct: 121 YYPNQGQN-------FPNRGYPNPVQNQSYPQHGNAQR-NPQNQSFPQYQNPNQTNIQNP 180

Query: 192 NYQQQRGPNQWSNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHAPNALVSPI 251
           N+QQ R PNQW+NQNQ  PQ   P QRN Q ++  Q   QA      + N   +  ++ I
Sbjct: 181 NFQQPRSPNQWNNQNQAYPQRANPNQRNQQVQSPNQRTPQA-----QSANQITDVKLTSI 240

Query: 252 DELRRFCGEGKLKEAVELL-KEGVKADADCFHELFELCGKSK 277
            +LR  C EGK+KEA+EL+ KEGVKADADCF  LFELCGK K
Sbjct: 241 SDLRTLCREGKVKEAIELMDKEGVKADADCFCALFELCGKLK 262

BLAST of Cp4.1LG15g05020.1 vs. NCBI nr
Match: gi|595798576|ref|XP_007201410.1| (hypothetical protein PRUPE_ppa002958mg [Prunus persica])

HSP 1 Score: 165.6 bits (418), Expect = 2.2e-37
Identity = 132/292 (45.21%), Positives = 167/292 (57.19%), Query Frame = 1

Query: 12  MASLMAVRRVRTPIHISSFIKVLSPLPSTFTFSCGNRTE----------TLIKALSTSAF 71
           MASLMA+RR R+P  +S F KV     S F FS G+ T           T+ K+LSTSA 
Sbjct: 1   MASLMAIRRARSPA-LSPFFKVRPLHLSHFAFSHGSNTIAINITNLETLTIAKSLSTSAV 60

Query: 72  PDDFSNFPTPPQQPSSSDHRY--PQA---QWGSPSQVHRSSGNFNHQSFSEFQNRDYVQQ 131
           P+++   P P QQP  SD R    QA   QW +  Q + +S  +N Q+ ++  N  Y Q 
Sbjct: 61  PNEYQR-PPPQQQPPPSDPRAFDDQANPNQWAAQGQGYGNSNQWNPQTQNQTPNNQYNQN 120

Query: 132 GSPSNQMINRSQ------NQS-SYPNLGFP---RQGQSYTQGGNPNSWNPPNQSYPQYPN 191
            S   Q  N+S       NQ+ S+PN G+P    Q QSY Q GN N W+P  QS PQY N
Sbjct: 121 QSYPGQ--NQSYPGRGYPNQAPSFPNRGYPNQNNQNQSYPQRGNSNEWSPQVQSPPQYQN 180

Query: 192 PSQPN-PQNFNYQQQRGPNQWSNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQN 251
           P+Q N P + ++QQ R PNQW+N NQG  Q   P Q + QA+N  Q +N      + A N
Sbjct: 181 PNQVNPPPSPSFQQPRSPNQWNNPNQGYQQPRNPNQWSPQAQNPAQWSNNN--NNNQAVN 240

Query: 252 HAPNALVSPIDELRRFCGEGKLKEAVELL-KEGVKADADCFHELFELCGKSK 277
             P  +   ID+LRR C EGK KEA+EL+ KEGVKADADCF  LFELCG+ K
Sbjct: 241 QTPVVVPPSIDDLRRLCQEGKAKEALELMDKEGVKADADCFQSLFELCGRLK 286

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
PP153_ARATH  1.6e-08  31.53         Pentatricopeptide repeat-containing protein At2g15690 OS=Arabidopsis thaliana GN...

Match Name        E-value  Identity (%)  Description
A0A0A0K6L3_CUCSA  6.1e-55  61.06         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G073730 PE=4 SV=1
W9S6A2_9ROSA      1.9e-40  45.74         Uncharacterized protein OS=Morus notabilis GN=L484_027540 PE=4 SV=1
M5VN21_PRUPE      1.5e-37  45.21         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002958mg PE=4 SV=1
F6HQB8_VITVI      2.6e-29  38.93         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0063g00540 PE=4 SV=...
A5BI58_VITVI      5.9e-26  38.41         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_021220 PE=4 SV=1

Match Name   E-value  Identity (%)  Description
AT2G15690.1  9.1e-10  31.53         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                        E-value  Identity (%)  Description
gi|659109903|ref|XP_008454941.1|  2.9e-98  67.75         PREDICTED: pentatricopeptide repeat-containing protein At2g15690 [Cucumis melo]
gi|778725128|ref|XP_011658903.1|  3.2e-57  64.52         PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like [Cucumis s...
gi|700188704|gb|KGN43937.1|       8.7e-55  61.06         hypothetical protein Csa_7G073730 [Cucumis sativus]
gi|703143188|ref|XP_010107950.1|  2.7e-40  45.74         hypothetical protein L484_027540 [Morus notabilis]
gi|595798576|ref|XP_007201410.1|  2.2e-37  45.21         hypothetical protein PRUPE_ppa002958mg [Prunus persica]

GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cp4.1LG15g05020  Cp4.1LG15g05020  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name       Unique Name                Type
Cp4.1LG15g05020.1  Cp4.1LG15g05020.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name               Unique Name                Type
Cp4.1LG15g05020.1:cds:008  Cp4.1LG15g05020.1:cds:008  CDS
Cp4.1LG15g05020.1:cds:007  Cp4.1LG15g05020.1:cds:007  CDS
Cp4.1LG15g05020.1:cds:006  Cp4.1LG15g05020.1:cds:006  CDS
Cp4.1LG15g05020.1:cds:005  Cp4.1LG15g05020.1:cds:005  CDS
Cp4.1LG15g05020.1:cds:004  Cp4.1LG15g05020.1:cds:004  CDS
Cp4.1LG15g05020.1:cds:003  Cp4.1LG15g05020.1:cds:003  CDS
Cp4.1LG15g05020.1:cds:002  Cp4.1LG15g05020.1:cds:002  CDS
Cp4.1LG15g05020.1:cds:001  Cp4.1LG15g05020.1:cds:001  CDS