Cp4.1LG14g02140 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g02140
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: RNase P Rpr2/Rpp21 subunit domain protein
Location: Cp4.1LG14 : 3187143 .. 3195743 (+)
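As a quick check on the location field, the length of the genomic span can be computed from the two coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common convention for GFF-style annotations, but not stated in this record); `parse_location` and `span_length` are hypothetical helper names.

```python
import re

def parse_location(text):
    """Split a location string such as 'Cp4.1LG14 : 3187143 .. 3195743 (+)'
    into (sequence id, start, end, strand)."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG14 : 3187143 .. 3195743 (+)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG14 + 8601
```

Under that assumption the gene spans 8601 bp on the plus strand of Cp4.1LG14.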
The following sequences are available for this feature:

Gene sequence (with intron)

AGGAAATAAGAAATAAAATCTAAAAACAAAGCTATTAGACAGGATTTGTTCTTCGCGCCGTTCGCTTCGTCTGGGTTTCCCGTTTCCGCCTTTGTCTCCGATTGTCACTTTTTTCAGTCTTTCTTCTTCGCGCCGTTCGCTTCGTCTCTCTTTCTCTCTCAAGAGTTCGTTGTTCGTCAAGGAATCATGCTGTAAGCTCATGAGTTGATTGAGCTGAATTTCTGTTCTTGACATGGCGAAGAGGAAGGGAAATACGAAGAAAGGAGCGTCTAACCCCACATCCGGTCCGCAAGATTCGATCACTATTAGACAGGAAATCACTGGGAAATTCAAACCCAAAGTCTCTAACAATGTCAAAACTTATTTGAATCATTTGGAAAACCTAGCGACTTGGGCTAGTGGGAAAGCCTCTATTCCTTCATTGGCTGCTTTCTTTGGGCAGCGCCTCGCTACTGCAGCAGAGTCCTTGGCGGTCGCTCCCGACGCTTCTTTGTTTACTTGTCAGAGGTCCGTATGTTTTCTGCCCTTTACATTCGTTTGGCTTTGCGCATCTAGATTTGTTGTTTTCGTAGTGTTTATACATGGGTAAGTCCATGAAGATCGATTGTATTTGGGATGGTTTAAATTTTCTGATTAGTTCTAAATGGCACCAATCTGATGTAAACGACATTTGAATATCGAGATTTGACGCAAATTTTGTTTAACTGCAAATTATGATGTGGAAGGGATAGGGTAAGGTGCTTGGAATTATGGCTGTATTGTGAGTGATTTTAGGGGTTCTGTTGTGCAATGTACCAGCAGTTTGTGGACGCTTTGGCTCTTCTAGCTTCTTATTACATACTTAGGTTTTCATATTTTGTCCTGGAGTTTGTATTGTAGCAAGGTTGGGAGTGTAAGCTTGTGAATTGTACTATCTTATTTGGGCATATGCCCTTTGGTGACAATCAGATAACCAAAGAGGCTTAAGTTTTGGGAACTCATGAAGGAAAAAGTCTTAATCAAGCTTCATAAATGGAGGTAATCATCCAAAGAGGGATGCTGGACTTTGTTTGTCGTAGTCCTAGTAAAATGCCTACCTTTTTTATGTGCATCTCGTGATGCTGAGGAAAATGAGGAACTTTCTTTGGAATGGTTCTTTAGAGAGAAAGCGCAGCGATCTAGTGAATAGGAAGGCAGTATCCTTGTCCTTTTTAAATGAAAGGTTCGGAATTGGAAACTTGAGACGAGGCGATGAAGCCCTTATGGAAAACTGGGTTTGGTATTGCTAAGGAGAACCTGGCTTAATCAAGCACAAAATAGTGCATAGAGTTTGGAGAGGATGCAAGAAGGAATGAGCCATGCACACACTGGAATATAACGACTAAGAGAAATCGTTCGAATGAAGAATTGAAGAGTTCCATGCTATTTTGAGACTGCTGGAAAACAGGGTTATTGGAGTGGATGAAATAAGCTGGAATAAGGTAGCTCTGGTGTTCTCATTCAAATCTCTAGCTTTGTCAGACAGAGAGTTCATGAGAGGTATAGAAATAGAGCTTTCAGAGGTAGTGTAAAAAAGAACATGCCACAAATAGATCAAAACGTTAGAGTGAATAGCCTGCCTAAGTGGTCTTTTTTGTTGTTTTTATTTTTTTAGGACGTTACAATAAAAACGTCCTATGGTGATCTTGAATCTGTATATGCTCATTATCCTAGGAAGAGTATAAAGGAACCAACCACCTTTTCTTCCATTTCTCATTTGTCGATTGAATTTGGAGCTCGTTAGTCATTATTATAAAATTTTCGGATGCCTTGGGGTTTTTCATTGATCTTTGAAGGCTTACATATCCCTTCTACTAATGGGGTCCGTCTCTGAAGGGAAAAACCAGGAAGCTATGGCCAGTGCAATCAAATCTGTTCTTTGGAGCCTGTGGATTGAGAAGAAAAAAGAAAGCATTTAACATCGAGGCTAAAGACTGGTTTGAGGTGCATGAGTTTGTTTAGTTTCATTGTGTGAGATTCCA
CGTCGATTGGAGAGGGGAACGAAACATTCCTTATATGAGTGTGGAAACCTCTCAGATGCGTTTTAAAAACCTGGAAGAGAAACCCGAAAGAGAAAAACCAAAGAGGACAATATTTGCTAGTAGTGAGATTGAGTTGTTACAAATGGTATCAGAGCTAGACACGGGCGGTGTGCCAGCGAGGATGCTGGGCCACCAAGGGGGGTGAATTGTGAGATCCCACGTTGGTTAGAGAGGGGAACAAACCATTCCTTAAAAGAGAGTGGAAACCTCTCCCTAGTAGACACGTTTTAAAAACTTAAAACCTTGAGGGGATGCCTGAAAGGGAAAGACCAAAGAGGACAATATTTGCTAGCGGTGGGCTTGGGCTGTTACAAATGGTATCAGAGCTAGACACCGAGTCATATGCTAGCGAGGACGCTGACTCTCAAGGGGGGTGGATTGTGAGATCCCACATTGGTTGGTGAGGGGAACAAAATATTTCTTATAAGGGTGTGAAAACCTCTACCTAGCATACGTGTTTTAAAAACCTTGAAGGGAAGCCCGAAAGAGAAAGCCCAAAGAAGACAATATTTACTAGCAGTGGGCTTGGGCTTGGGCTGTTACACATTCTTATTTAATATTTCATATTTCCTTAGATAGTGGGTTCAACAACTCTATTCCCCTTGTAGCTAATAATAATTGAGAGGTGGGTTTTTGTAGTAACTTCAACTAACACCTTAGCTATGATGACAGAATTCTCTTCTATCTCCTATCACTTCCATGGAACTCATAATCAAAAATGTTTGCACGTATCTCATTAGACAAGCCTGAGCATTATAATATGCTCATTGACGCTTATTGTTAAATTTTTACCCTTACTTCGGAAAAAGAAACCTACTGATGTTTTAATTCAATGAATGGAGTGGGCAATCAACTCGAACCATATAGCAGTTTCATGTTAAGGACTTTTATCATGTCACATTTGAATGTTGTCTGCTTTGGCTACAAAACTAGCCAATCTTACCTGCTCTATTATATTGCATTCAGGTGTGAAACGATTCTTCAACCTGGCTCTAACTGTTCTATACGAATAGAGAAGAATAACACCAAGAGACGTCGAAGACAGAAGAAATGTAGTAATTCGACACAGAACAATGTGGCGTATTATTGCCACCACTGCTCGTGTAGGAACATAAAGAGAGGAACTCCCAAAGGCCATATGAAAGTGCTATACGACGCAGCGTTTGAAAGAAGGGTGAAGCCTGTGGATGTCAAGGATGGTCAAGAATGTGAGACATCTGCAGTGGAGAAGCCAACTGAGATTCTTACCATTGATGCTCCTAAAATTCCTGATGCTTCTGCAATTCCTCCTCCAACTGGGGACATCACTGCTCTTGATAACCCTGCAATTCAGCTTCGAACCAAGGCCATTCTTAATATTAATTCTCCAGCAACTCCATCCACCCTGAGCGTAACGACTTTGTCGAAATCGCAGAAACAGGAAATGACGACATTATCTGAGAAACATATAGGACACGAGATTAGAACAGACAAGGAGAAGAAAACTGGGGCTGTTCCTACTGTTGATACACCCGCCACCCCTTCCACCTCGACCGGAGTGACTCTGTTGGATTCGAAGAAGAGAAAGAGGAACAAGCCATCGTCTAAGAATCAAACTGATCCCGGAAGTTGCTCTGCTCCAACAGCAGATGGGGATAGAAGTGAAGGCACATCCAAAAGGAATCGTAAAAGAAAATCATGGACAAGTTTGAAGGAAGTTGCTCGGACGAATGAACAGAGTGGTAAACAAAAAAACATGGCTGAATTGGCAATTCCATTCTCCTTGTATTAAGGCACTGATTTAATGAATGTTTGTGTGAGGAATTGATTCTATTTTATACCTTCTCTTTCAGCATTTCTTTGGTTTTCAAGTGGGATGATTTGAATTTGCCATTGTAGACTGATTGAGGGTAGATCTTCGAATTAGGGTTTTAGCTCGATTATAAAAAGTACCCGTTTGA
AAATTACTCCTATTCTTTTTGAAATTGTAATACATTTGACGATTCATAAACGTTTCAACTTATAGTAGAAATTCATTTGGCTGTTTACGGACTAGTTATTGAGATCTGAAGCATGTTTTAAGATGAATTTGGGAGCCATTAGAGTTATAATTTTTTTCAATCCAATTTCATCCTTTAATATTACTTAGTTTTTCAGACTTTATGGATTAGAATTAAGGAGACAAATGGAATACTTTATGATCGATTTTCATTATATTTCACTAATTTCCTAAACGAGGCGTAGGTCGGAAAACATCGAGAAATGGAGCAATCCCCATCGGATTCTTCGACGAGCATCGAACCCGGACGTCAGATTCCGGCGACATCCCCAAATTCGGCCATTCCCGAGACGCCCACTTGTTCTGTTCTTGAGGACTGCACCAGACCCAGATCTTCGTTGAGCCATACCGAGATTTTCAAAGCCATTGATGTCGTCGAGAAGGACTCTCTCGCCATTGCCGAGAGCTTTACCTCTCTCTTCGCTTCCCTGCGTTCGACTCTCTCCGAGGTCTCATAGTTTTTGCATCCTTCTTCTTTTACGCCCCATTTTCATTTTGCTTTAATATGTTGAAGATCGGATGAATGAGTTTTGAGTTTTGAGTTTGAGTACTTCACCAGATTTGATTATTACTGGACGAACCTGATTTTATCCCCTTTAATAATGACTGGAATTTACCACTCTAGCATGAATTTCTGGAGCAATGATTCTGGGTTTCCTCTTGTCAAGCAGTTTATCTGAGTGGAAATAGTATAATCTTGTAGGTCACCAGCAACTCTATTGATCATATGCATTGCTTCAACGATGCTGCAGGCCGCCTTCAAGAATCTGGTAAGTCAACATAGTAATGATTTTGATTCACAGGAATGGCTGAGTATGATGGCAGTTTAGATGTGTCATTGTCTACTACTGTTACTACTAAACTGATTACATTCATGGGTACTTCTTCCTGTATGAATCAAAATTCTATCTGTTTATCTTTATGTTCTGTGAGATCCCACATCGGTCGGAGAGGGGAACGAAACATTCCTTATAAGGATGTGGAAACCTCTTCCTAGCAGACGTGTTTTAAAACCATGAGGCTAACGGCGATGTGTAACGGGCCAAAGCAGACAATGTCTGCTAATAGTAGACTTGGGCTATTACATGTCCATTATTCTGTTCTAGAATCTGTGTTATGTGATCGGCCTTTGTGCCAGTTTTTGTACTAAATTTGGAAAGAGATTAGTTGCAATCAGTTCTCTCCCATACTACTAACAAAGACCTTGTTTTTCATCTGATGGTCTTTGACATGAGTTCTCTTCTATTCTTTCAACCTTGTAGTGCTTGATGCGGCAACAAAGGGTAATCGATACATAAATTCTTCCTTGAGGTATGCTTCAATGTCTTCTGGTATCTCCTTTGCTCTATAGGTTTCAATTTCTTTGCTGTCAGTATTTTCTATGACTAATAACTTAAATATTTTACTCATTTGTCAAAATGTGATTGCTCTGTGAATGTTGTAGCTTTTTACAACCTGCTGCATAAAATGATACATGAAAATTCTTGAACCAAGTTCTAGTAGGAGATTAGATTGAAACACGCACATAAAAAAAAGTTTCTTTATTGTTAAAAGATTCTATCCCAAGGGTCCAAAATTATAGTTTAATCTAAAAGAGGTTGTTTGTAACGGCTCAAGCCCACCGCTAGCAGATATTGTCTCTTTGGGCTTTCCCTTAAGATTTTTAAAACACGTCTGCTAGGGAAAGGTTTCCACACCCTTATAAAGAATGTTTCGTTCCCCTCTCCAACTGATGTGGGATCTCACAATGTACCCCCTTTGGGACCCAGCGTCCTCGCTAACACACGTTCCCCTCTCCAATCGATGTGGGACCCCAATCTACCCCCCTTCGGGACCCAACGTCCTTGTTAGCACACCGCCTCGTGTTCACTCCCTTCGGGACCCAACGTCCTTGTTGTTACT
CTTTTGCTTGCATATATAACACATCTTTCAAAACCTCTTCTTCGAAAATCTCTCTCTGCCCTCATTTTCTAAAACCTTCTTCTTAAACCGAGGTCAAAGGCTCGAGCATACGTCGCTTAGCCATAAGTGACGTAAGAACAAATTGGCTTAGCTCACTTGTGGTTGGGAAGTGAGTGCCACACAATCGCAAGTCCGCTGCCATCAAAAGTACGATTGTGACACCAGGCACCTAAATAATTGTGGATATAACTGTTACTTGGTAGTCTTCTGCAAAAATTATTAAGAAATGAAAGAACAGCATAAAAGTTTTGAGAGATATTTATTAATGTAATGATAAAAACTACAACTTAGGCGCTTTGAGAAACTATTTGGTGTATCTCTGTGATAATGAGGATGGGAAGTGAGTGCCACACAATCGCAAGTCCGCTGCCATCAAAAGTACGATTGTGACACCAGGCACCTAAATAATTGTGGATATAACTGTTACTTGGTAGTCTTCTGCAAAAATTATTAAGAAATGAAAGAACAGCATAAAAGTTTTGAGAGATATTTATTAATGTAATGATAAAAACTACAACTTAGGCGCTTTGAGAAACTATTTGGTGTATCTCTGTGATAATGAGGAGTAAGTCCTCTTGATGAGTCCTCTGTTAGTTTGATTGTGTTCATAGTTTTTTTAGTGTGAAATTATATAGTTCTTCAATAGATGTTGGATGGCTGTCCCTGACTACAAATCTGCATAGCTATCACATAGAACTCTCTTTAATGTGATCCGAAGGGAACATAAGATTTAGAAAAGTTTAGTATATTGGCAGAGTTGTGTTTGTACTATCAATGATTGATCATAAGTTAGTAGAAATCAAATCAATGTGTCTTTTTACTTGTATTAATTATGACTCCGACTATGTTGTTTCACAAGTGAAGTTAGTCAATGTGAGATCCCACATCGGTTGGGGAGGAGAACGAAGCATTCCTTATAAGGGTGTGGAAACCTCTCCCTAACAAATGCGTTTTAAAAACCTTGAGGGGAAAATCAAAAGGGAAAGGCCAAAGAGGACAGTATCTATTAGCGGTGGGCTTGGATTGTTACAAGTAGTATTAGAGCCAGACACCAGGCGATGTGCCAACGAAGAGGTTGAGCCCCAAAGAGGGGTGGACACGAGGCGGTGTGCCAGCAAGGACACTGGGTCTCGAAGAGGGGTGGATTAGGGGGTTCCACATTGATTGGAGAAGGGAACAAGTGTCAGCAAGGACGCTGAGTCCCGAACGGGGGTGGATTGTAAGATCCCACATCGGTTGGGGAGGAGAACAAAGCATTCCTTATAAAGGTGTGGAACCCTCTCCCTCACAAACGCGCTTTAAAAACCTTGAGGGGAAACTCAAAAAGGAAAGGCCAAAGAGGACAGTATCTATTAGCGGTGGGCTTGGATTGTTACAAATAGTATTAGAGCCAGACACCGGATGATGTGCCAACAATGAGGCTGAGCCCCGAAGAGGGGTGGACACGAGGGGGTGTGCCAGCAAGGACGCTGGGTCTCGAAGAGGGGTGGATTAAGGGGTCCCACATTGATTGGAGAAAGAAACGAGTGCCAGCGAGGATGCTGGGCCCTGAAGGGGGTGGATTGTGATATCCCACATTAGTTGGGGAGGAGAACGAAACACCCTTTATAAGGGTGTGGAAACCTTTTCCTAGCAGACGCGTTTTTGGTAAAAACTCAAAGAGAACAGTATCTGCTAGTGGTGGGCTTGAACAGTCAACTAATGGAATTAGTCTAGTTCATCTGTATGATTCAAAATCATTAAGATCTCTTATAAATTTCAAGTCCATTCGCCCTTAATTTCCATTATACATATCTTTATACTATCTGATAATGATTAGTAAATCACAATTTCACATCTTAAAGATAATCATATGGCCGATGCTGTAACTTTGGACTGTATTTTGTTGTGTTTTCAGATTGAATCAGGAAATGAAAGGCATGGACAATCTAGCTGCTCAG
CTGTATCCTTTCTATTTCATGCCTAGTAGACCATTTTAATGCTTTAGCTCTTTAGAATTGGGAACCTGAGCATACTAATACTTCTTTATCTGATATCTAAGTCCTCTGCATTACCCATTTCCTTAATTCTTGGTTAGAAAGATTTTGAGGAAGAATGTTGATGAACTAGACTTGGCTGTGAACAAGCTCCTAAATTTTCCATGAGGCTTGGCTTGGTCGTAGTTCTTGCAAGCTCGAGGATTATCGTTTCAGTAGAGAGCGAGAACACGGTTCATGTCGAAGAAGTCACTAGGCAGCTAGAATACAGTACATGAAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANTGCATAGCTTCCTTCACATTTTACTTTAGGAAGCATCTCATTGCATGGTTGATGTACTTATGGGGATATGAATAAGCTTACTGGATTCAATAATCAGTGTCTCTTTTCTTTACAGGAGGTTATTCAGCTTTTGAAAATTGCCCATCTTGCATATTTTTTACTGTGGTGTTTTATTTGAGATCACTGTGGTGTTTTATTTGAGATCGAATGTGATGTAACATATTGGTCGGGGAGGAGAATGAAACACC

mRNA sequence

AGGAAATAAGAAATAAAATCTAAAAACAAAGCTATTAGACAGGATTTGTTCTTCGCGCCGTTCGCTTCGTCTGGGTTTCCCGTTTCCGCCTTTGTCTCCGATTGTCACTTTTTTCAGTCTTTCTTCTTCGCGCCGTTCGCTTCGTCTCTCTTTCTCTCTCAAGAGTTCGTTGTTCGTCAAGGAATCATGCTGTAAGCTCATGAGTTGATTGAGCTGAATTTCTGTTCTTGACATGGCGAAGAGGAAGGGAAATACGAAGAAAGGAGCGTCTAACCCCACATCCGGTCCGCAAGATTCGATCACTATTAGACAGGAAATCACTGGGAAATTCAAACCCAAAGTCTCTAACAATGTCAAAACTTATTTGAATCATTTGGAAAACCTAGCGACTTGGGCTAGTGGGAAAGCCTCTATTCCTTCATTGGCTGCTTTCTTTGGGCAGCGCCTCGCTACTGCAGCAGAGTCCTTGGCGGTCGCTCCCGACGCTTCTTTGTTTACTTGTCAGAGGTGTGAAACGATTCTTCAACCTGGCTCTAACTGTTCTATACGAATAGAGAAGAATAACACCAAGAGACGTCGAAGACAGAAGAAATGTAGTAATTCGACACAGAACAATGTGGCGTATTATTGCCACCACTGCTCGTGTAGGAACATAAAGAGAGGAACTCCCAAAGGCCATATGAAAGTGCTATACGACGCAGCGTTTGAAAGAAGGGTGAAGCCTGTGGATGTCAAGGATGGTCAAGAATGTGAGACATCTGCAGTGGAGAAGCCAACTGAGATTCTTACCATTGATGCTCCTAAAATTCCTGATGCTTCTGCAATTCCTCCTCCAACTGGGGACATCACTGCTCTTGATAACCCTGCAATTCAGCTTCGAACCAAGGCCATTCTTAATATTAATTCTCCAGCAACTCCATCCACCCTGAGCGTAACGACTTTGTCGAAATCGCAGAAACAGGAAATGACGACATTATCTGAGAAACATATAGGACACGAGATTAGAACAGACAAGGAGAAGAAAACTGGGGCTGTTCCTACTGTTGATACACCCGCCACCCCTTCCACCTCGACCGGAGTGACTCTGTTGGATTCGAAGAAGAGAAAGAGGAACAAGCCATCGTCTAAGAATCAAACTGATCCCGGAAGTTGCTCTGCTCCAACAGCAGATGGGGATAGAAGTGAAGGCACATCCAAAAGGAATCGTAAAAGAAAATCATGGACAAGTTTGAAGGAAGTTGCTCGGACGAATGAACAGAGTGGTAAACAAAAAAACATGGCTGAATTGGCAATTCCATTCTCCTTGTATTAAGGCACTGATTTAATGAATGTTTGTGTGAGGAATTGATTCTATTTTATACCTTCTCTTTCAGCATTTCTTTGGTTTTCAAGTGGGATGATTTGAATTTGCCATTGTAGACTGATTGAGGGTAGATCTTCGAATTAGGGTTTTAGCTCGATTATAAAAAGTACCCGTTTGAAAATTACTCCTATTCTTTTTGAAATTGTAATACATTTGACGATTCATAAACGTTTCAACTTATAGTAGAAATTCATTTGGCTGTTTACGGACTAGTTATTGAGATCTGAAGCATGTTTTAAGATGAATTTGGGAGCCATTAGAGTTATAATTTTTTTCAATCCAATTTCATCCTTTAATATTACTTAGTTTTTCAGACTTTATGGATTAGAATTAAGGAGACAAATGGAATACTTTATGATCGATTTTCATTATATTTCACTAATTTCCTAAACGAGGCGTAGGTCGGAAAACATCGAGAAATGGAGCAATCCCCATCGGATTCTTCGACGAGCATCGAACCCGGACGTCAGATTCCGGCGACATCCCCAAATTCGGCCATTCCCGAGACGCCCACTTGTTCTGTTCTTGAGGACTGCACCAGACCCAGATCTTCGTTGAGCCATACCGAGATTTTCAAAGCCATTGATGTCGTCGAGAAGGACTCTCTCGCCATTGCCGAGAGCTTTACCTCTCTCTT
CGCTTCCCTGCGTTCGACTCTCTCCGAGGTCACCAGCAACTCTATTGATCATATGCATTGCTTCAACGATGCTGCAGGCCGCCTTCAAGAATCTGTGCTTGATGCGGCAACAAAGGGTAATCGATACATAAATTCTTCCTTGAGATTGAATCAGGAAATGAAAGGCATGGACAATCTAGCTGCTCAGCTAAAGATTTTGAGGAAGAATGTTGATGAACTAGACTTGGCTGTGAACAAGCTCCTAAATTTTCCATGAGGCTTGGCTTGGTCGTAGTTCTTGCAAGCTCGAGGATTATCGTTTCAGTAGAGAGCGAGAACACGGTTCATGTCGAAGAAGTCACTAGGCAGCTAGAATACAGTACATGAAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANTGCATAGCTTCCTTCACATTTTACTTTAGGAAGCATCTCATTGCATGGTTGATGTACTTATGGGGATATGAATAAGCTTACTGGATTCAATAATCAGTGTCTCTTTTCTTTACAGGAGGTTATTCAGCTTTTGAAAATTGCCCATCTTGCATATTTTTTACTGTGGTGTTTTATTTGAGATCACTGTGGTGTTTTATTTGAGATCGAATGTGATGTAACATATTGGTCGGGGAGGAGAATGAAACACC

Coding sequence (CDS)

ATGGCGAAGAGGAAGGGAAATACGAAGAAAGGAGCGTCTAACCCCACATCCGGTCCGCAAGATTCGATCACTATTAGACAGGAAATCACTGGGAAATTCAAACCCAAAGTCTCTAACAATGTCAAAACTTATTTGAATCATTTGGAAAACCTAGCGACTTGGGCTAGTGGGAAAGCCTCTATTCCTTCATTGGCTGCTTTCTTTGGGCAGCGCCTCGCTACTGCAGCAGAGTCCTTGGCGGTCGCTCCCGACGCTTCTTTGTTTACTTGTCAGAGGTGTGAAACGATTCTTCAACCTGGCTCTAACTGTTCTATACGAATAGAGAAGAATAACACCAAGAGACGTCGAAGACAGAAGAAATGTAGTAATTCGACACAGAACAATGTGGCGTATTATTGCCACCACTGCTCGTGTAGGAACATAAAGAGAGGAACTCCCAAAGGCCATATGAAAGTGCTATACGACGCAGCGTTTGAAAGAAGGGTGAAGCCTGTGGATGTCAAGGATGGTCAAGAATGTGAGACATCTGCAGTGGAGAAGCCAACTGAGATTCTTACCATTGATGCTCCTAAAATTCCTGATGCTTCTGCAATTCCTCCTCCAACTGGGGACATCACTGCTCTTGATAACCCTGCAATTCAGCTTCGAACCAAGGCCATTCTTAATATTAATTCTCCAGCAACTCCATCCACCCTGAGCGTAACGACTTTGTCGAAATCGCAGAAACAGGAAATGACGACATTATCTGAGAAACATATAGGACACGAGATTAGAACAGACAAGGAGAAGAAAACTGGGGCTGTTCCTACTGTTGATACACCCGCCACCCCTTCCACCTCGACCGGAGTGACTCTGTTGGATTCGAAGAAGAGAAAGAGGAACAAGCCATCGTCTAAGAATCAAACTGATCCCGGAAGTTGCTCTGCTCCAACAGCAGATGGGGATAGAAGTGAAGGCACATCCAAAAGGAATCGTAAAAGAAAATCATGGACAAGTTTGAAGGAAGTTGCTCGGACGAATGAACAGAGTGGTAAACAAAAAAACATGGCTGAATTGGCAATTCCATTCTCCTTGTATTAA

Protein sequence

MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKASIPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRRRQKKCSNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGQECETSAVEKPTEILTIDAPKIPDASAIPPPTGDITALDNPAIQLRTKAILNINSPATPSTLSVTTLSKSQKQEMTTLSEKHIGHEIRTDKEKKTGAVPTVDTPATPSTSTGVTLLDSKKRKRNKPSSKNQTDPGSCSAPTADGDRSEGTSKRNRKRKSWTSLKEVARTNEQSGKQKNMAELAIPFSLY
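The protein sequence is the conceptual translation of the coding sequence listed above it. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, to_stop=True):
    """Translate a nucleotide coding sequence in frame 1.

    Unknown codons (for example, those containing N) become 'X'.
    With to_stop=True, translation ends at the first stop codon.
    """
    seq = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First 33 nt and last 12 nt of the CDS listed in this record.
print(translate("ATGGCGAAGAGGAAGGGAAATACGAAGAAAGGA"))  # MAKRKGNTKKG
print(translate("TCCTTGTATTAA"))                       # SLY
```

The two fragments reproduce the first eleven and last three residues of the protein sequence, which is consistent with a frame-1 translation that ends at the terminal TAA stop codon.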
BLAST of Cp4.1LG14g02140 vs. TrEMBL
Match: A0A0A0KUP1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G048010 PE=4 SV=1)

HSP 1 Score: 397.9 bits (1021), Expect = 1.3e-107
Identity = 230/378 (60.85%), Positives = 264/378 (69.84%), Query Frame = 1

Query: 1   MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60
           MA++K NT +G+SNP  GPQ+SIT+RQE TGK KPKVSNN K YLNHLENLATWASG+ S
Sbjct: 1   MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60

Query: 61  IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRRRQKK 120
           +PSLAAFFGQRLA AAESLAV+PD SLF C RCETILQPGSNC+IRIEKN  K+RRR KK
Sbjct: 61  LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCNIRIEKNTAKKRRRHKK 120

Query: 121 CSNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGQECETSAVEK 180
            SN TQN VAYYCH+CSCRNIKRGTPKGHMKVLY      +VK V VKDG+ECE      
Sbjct: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180

Query: 181 PT---------EILTIDAPKIP-----------DASAIPPPTGDITALDNPAIQLRTKAI 240
            T         + LTID P IP           D SAI  PTGDI+ +D PAI       
Sbjct: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAI-SPTGDISVVDGPAISSPR--- 240

Query: 241 LNINSPATPSTLSVTTLSKSQKQEMTTLSEKHIGHEIRTDKEKKTGAVPTVDTPATPSTS 300
               +PA  STLSVT++S+SQ ++                       +PT+D PATP T 
Sbjct: 241 ---TTPAISSTLSVTSISRSQVRD-----------------------IPTLDAPATPLTL 300

Query: 301 TGVTLLDSKKRKRNKPSSKNQTDPGSCSAPTADGDRSEGTSKRNRKRKSWTSLKEVARTN 359
           TG+TLLDSK+RKR KPSSKNQT+P SCS PT+ G+ SEGTSKR R RKSWTSLKE+A+  
Sbjct: 301 TGMTLLDSKRRKRKKPSSKNQTEPESCSGPTSHGETSEGTSKRKRNRKSWTSLKEIAQRE 347
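The identity and positives percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below parses such a line and recomputes the percentages; it is an illustration under assumed formatting, `parse_hsp_stats` and `percent` are hypothetical names, and the regular expression accepts both "Positives" and the "Postives" spelling that some report renderings emit.

```python
import re

# Matches e.g. "Identity = 230/378 (60.85%)" or "Positives = 264/378 (69.84%)".
_STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {'identity': (matches, length, pct), 'positives': (...)}."""
    stats = {}
    for label, num, den, pct in _STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(num), int(den), float(pct))
    return stats

def percent(num, den):
    """Percentage rounded to two decimals, as shown in the report."""
    return round(100.0 * num / den, 2)

line = "Identity = 230/378 (60.85%), Positives = 264/378 (69.84%), Query Frame = 1"
stats = parse_hsp_stats(line)
for key, (num, den, reported) in stats.items():
    print(key, reported, percent(num, den))
```

For the Cucumis sativus hit, 230 identical positions over a 378-column alignment gives 60.85%, matching the reported value; the denominator is the alignment length including gaps, not the query length of 359 residues.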

BLAST of Cp4.1LG14g02140 vs. TrEMBL
Match: M5WDZ3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa018166mg PE=4 SV=1)

HSP 1 Score: 202.6 bits (514), Expect = 8.1e-49
Identity = 152/400 (38.00%), Positives = 216/400 (54.00%), Query Frame = 1

Query: 2   AKRKGNTKKGASNPTSGPQDSITIRQEITGKFK---PKVSNNVKTYLNHLENLATWASGK 61
           AKR GNT  G++N       +I++R+E+TGK +   P V+      L HL+ LA WA G+
Sbjct: 7   AKRTGNTAYGSNN-------TISLREELTGKKQTKGPAVNAKSALKLEHLQRLAVWAGGE 66

Query: 62  ASIPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRRRQ 121
           AS+PSL AFF   LA+A E+L V PD SLFTCQRCETILQPG NC++RIEKN +K+RR+ 
Sbjct: 67  ASVPSLGAFFAHTLASAQEALGVPPDPSLFTCQRCETILQPGLNCTVRIEKNRSKKRRKS 126

Query: 122 KKCSNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFER--RVKPV----------- 181
           KK ++ +QNNV Y CH CS RN+KRGTP GHMKV+     +   ++KP            
Sbjct: 127 KKPTSFSQNNVVYTCHFCSHRNLKRGTPHGHMKVICPTKTKTTSKLKPAKSISQKSVSSK 186

Query: 182 -------DVKDGQECETSAVEKPTEILT------IDAPKIPDASAIPPPTGDITA--LDN 241
                  +V+   E   S + +  EI +      I A +I  +       G+ TA  ++N
Sbjct: 187 KSIVAEDEVRKANEIAASEIIQANEIASSEITREIQANEIASSEIAREIQGNETASEIEN 246

Query: 242 PAIQLRTKAILNINSPATPSTLSVTTLSKSQKQEMTTL--SEKHIGHEIRTDKEKKT--- 301
                     + +N      T S     + Q+ E  +L  + ++   E   D+++ T   
Sbjct: 247 ETASSEIAREIQVNE-----TASSEIAREFQENETASLEIARENCIVETWADEDEVTIVN 306

Query: 302 --------GAVPTVDTPATPSTSTGVT-LLDSKKRKRNKPSSKNQTDPGSCSAPTADGDR 357
                   G +P VD+P TP   TG T LL  K+RKRNK  SK   +P +   PT D + 
Sbjct: 307 EMASSAMAGEIPMVDSPETPKVRTGPTLLLGGKRRKRNKSVSKKPAEPENSPNPT-DAEN 366

BLAST of Cp4.1LG14g02140 vs. TrEMBL
Match: A0A068UGM5_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00024934001 PE=4 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 1.2e-41
Identity = 140/379 (36.94%), Positives = 198/379 (52.24%), Query Frame = 1

Query: 1   MAKRKGNTKKGASNPTSGPQ----DSITIRQEITGKFKPKVSNNVKTYLNHLENLATWAS 60
           M K+K ++   A+  +S P      S T+R+E +GK +  V+      L+HL+NLA WA+
Sbjct: 1   MGKKKNSSTAAAAAISSQPHLQQAPSGTMREESSGKKQSSVNPKSMLKLDHLKNLAIWAT 60

Query: 61  GKASIPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRR 120
            +AS+ SL AFFG RLA  AE+L V PD SLF+CQRCE+ILQPG NC++RIEK     +R
Sbjct: 61  AEASVSSLGAFFGHRLAATAEALGVPPDPSLFSCQRCESILQPGYNCTVRIEKIKPNPKR 120

Query: 121 RQKKCSNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGQ----E 180
           R+KK S S +N+V Y CH CS RN+KRGTPKG+MK L  +  +   K    K       +
Sbjct: 121 RRKKRSLSIENSVVYSCHFCSHRNLKRGTPKGYMKELCPSKPKVSSKSDPTKSSHPKFAK 180

Query: 181 CETSAVEKPTEILTIDA-------PKIPDASAIPPPTGDITALDNPAIQLRT----KAIL 240
            +     K   +  +D         K P   +   P+  +  L +   + RT    K I+
Sbjct: 181 AKVITASKNDPMSKVDGIASQKIINKDPVNDSSTTPSAKVETLLDTRKRKRTRSGSKKIV 240

Query: 241 NINSPATPSTLSVTTL--SKSQKQEMTTLSEKHIGHEIRTDKEKKTGAVPTVDTPATPST 300
              S + P      ++  SK +++  TT     +        +K     P  D+  TPS 
Sbjct: 241 ESESSSVPVDAKKASIPSSKRKRKSWTTSKNDPMSKVDGIASQKIINKDPVNDSSTTPSA 300

Query: 301 STGVTLLDSKKRKRNKPSSKNQTDPGSCSAPTADGDRSEGTSKRNRKRKSWTSLKEVART 359
               TLLD++KRKR +  SK   +  S S P      S  +SK  RKRKSWTSLK++A  
Sbjct: 301 KVE-TLLDTRKRKRTRSGSKKIVESESSSVPVDAKKASIPSSK--RKRKSWTSLKDIAEC 360

BLAST of Cp4.1LG14g02140 vs. TrEMBL
Match: M1A5Y5_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG403005999 PE=4 SV=1)

HSP 1 Score: 174.1 bits (440), Expect = 3.1e-40
Identity = 144/375 (38.40%), Positives = 198/375 (52.80%), Query Frame = 1

Query: 9   KKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLN--HLENLATWASGKASIPSLAA 68
           K    N       SIT+R+E +G  K +   N K+ L   H++++ATWASG+ SIPSL A
Sbjct: 4   KSSRKNGVGKALGSITLREE-SGVKKKQTHVNAKSMLKLEHIKDIATWASGEGSIPSLGA 63

Query: 69  FFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRRRQKKCSNSTQ 128
           FFGQRLA AAESL V PD SLFTCQRCE+ILQ G NC+ RIEKN  K R++ K      +
Sbjct: 64  FFGQRLAVAAESLGVPPDPSLFTCQRCESILQAGYNCTTRIEKNKRKARKKLKTSGIPPK 123

Query: 129 NNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGQECETSAVEKPTE--- 188
           N+V Y CH CS RN+KRGTP+G+MK L  A    + K + V   +     A   PTE   
Sbjct: 124 NSVVYECHFCSHRNLKRGTPRGYMKSLNPA----KPKTLTVDPAKSATRKAKVDPTESAM 183

Query: 189 ---------ILTIDAPKIPDASAIPPPT----GDITALDNPAIQLRTKAILNINSPATPS 248
                    + +ID  K+    +I   +      + + D  ++   TK+    +     S
Sbjct: 184 QRSEHLGTLVASIDKAKVDPTESIMHKSEHLDTSVASTDKASVD-PTKSAKQKSEHLDTS 243

Query: 249 TLSV------TTLSKSQKQEMTTLSEKHIGHEIRTDKEKKTGAVPTVDTPATP-STSTGV 308
             S+      TT S +QK E     +   G         +      +  PATP ST T  
Sbjct: 244 VASIDKARVDTTESATQKSEQ---FDALGGSTTDVIVSSELVGEDAMAGPATPLSTVTVT 303

Query: 309 TLLDSKKRKRNKPSSKNQTDPGSCSAPTADGDRSEGTSKRNRKRKSWTSLKEVARTNEQS 359
           +LLDSKKRKRN+  SK + +P   S+ T D +++  TS + RK+ SWTSLKE+A +  QS
Sbjct: 304 SLLDSKKRKRNRTGSKKK-EPQDGSSMT-DAEKTVSTSSK-RKKTSWTSLKEIAES--QS 363

BLAST of Cp4.1LG14g02140 vs. TrEMBL
Match: B9GWI5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s12730g PE=4 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 9.0e-40
Identity = 123/301 (40.86%), Positives = 168/301 (55.81%), Query Frame = 1

Query: 45  LNHLENLATWASGKASIPSLAAFFGQRLATAAESLAVAPDAS-LFTCQRCETILQPGSNC 104
           L HL+NLA+WA+ +ASIPSLAAFFG++ A++AE+L V  D S LF CQRC T L+PG NC
Sbjct: 42  LEHLQNLASWATEEASIPSLAAFFGRQFASSAEALGVPLDPSALFQCQRCGTFLRPGFNC 101

Query: 105 SIRIEKNNTKRRRRQKKCSNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVK 164
           + +IEKN +K RRR K+ S ST+NNV Y CHHC   N+KRGTPKGHMK +     + + K
Sbjct: 102 TTQIEKNQSKARRRHKRFSTSTKNNVVYKCHHCLHINLKRGTPKGHMKEICPPKPKPQAK 161

Query: 165 PVD--VKDGQECETSAVEKPTEILTIDAPKIPDASAIPPPTGDITALDNPAIQLRTKAIL 224
           P    ++     E     K  EI+ ID P +P   AI   T       NP  +++T  I 
Sbjct: 162 PTKSVLQKSANLEKGTSSK-GEIVKIDGPALP---AISLGTSMTNNPANPFPRIKTDEIY 221

Query: 225 NINSPATPSTLSVTTLSKSQKQEMTTLSEKHIGHEIRTDKEKKTGAVPTVDTPATPSTST 284
            I   A P+    T+++ S       +    I     T     + +     +PATP  S 
Sbjct: 222 KIAETALPAISVDTSITDSPATPFPRVKTDEIIKIDETALPAISMSASITSSPATPLPSG 281

Query: 285 GVTLLDSKKRKRNKPSSKNQTDPGSCSAPTADGDRSEGTSKRNRKRKSWTSLKEVARTNE 343
             +LLD+ K+KRN+ + K +   G  +A   D + +  TS + RKRKSWTSLKE+     
Sbjct: 282 RFSLLDATKKKRNRSAKKPEQSEGDSAA--MDAENTVSTSSK-RKRKSWTSLKEIVEKRA 335

BLAST of Cp4.1LG14g02140 vs. TAIR10
Match: AT5G41270.1 (AT5G41270.1 RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175))

HSP 1 Score: 115.9 bits (289), Expect = 5.0e-26
Identity = 66/123 (53.66%), Positives = 83/123 (67.48%), Query Frame = 1

Query: 40  NVKTYLNH--LENLATWA-SGKASIPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETI 99
           N+K+ L H  L+NLA W+ +G   IPSLA+  G+RLA   ES  +  D  L +CQRCETI
Sbjct: 19  NLKSVLRHEHLKNLALWSSTGDTPIPSLASLLGRRLAADTESTGITTDPDLVSCQRCETI 78

Query: 100 LQPGSNCSIRIEK---NNTKRRRRQKKCSN--STQNNVAYYCHHCSCRNIKRGTPKGHMK 155
           L+PG NC++RIEK   N  K+R R KK +N    QNNV Y+C+ CS RN+KRGT KG MK
Sbjct: 79  LKPGFNCNVRIEKVSANVKKKRNRCKKSNNICFPQNNVVYHCNFCSHRNLKRGTAKGQMK 138

BLAST of Cp4.1LG14g02140 vs. NCBI nr
Match: gi|659102242|ref|XP_008452026.1| (PREDICTED: uncharacterized protein LOC103493157 [Cucumis melo])

HSP 1 Score: 406.4 bits (1043), Expect = 5.3e-110
Identity = 233/378 (61.64%), Positives = 270/378 (71.43%), Query Frame = 1

Query: 1   MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60
           MA++KGNTK+G+SNPTSGPQ+SIT+RQE TGK KPKVSNN K YLNHLENLATWASG+ S
Sbjct: 1   MARKKGNTKRGSSNPTSGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60

Query: 61  IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRRRQKK 120
           +PSLAAFFGQRLA AAESLAVAPD SLF C RCET+LQPGSNC IRIEKNN K+RRR KK
Sbjct: 61  LPSLAAFFGQRLAAAAESLAVAPDPSLFLCARCETVLQPGSNCYIRIEKNNAKKRRRHKK 120

Query: 121 CSNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGQECETS--AV 180
            SN TQN VAYYCH+CSCRNIKRGTPKGHMKVLY      +VK V VKDG+ECE     V
Sbjct: 121 ASNVTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKILTV 180

Query: 181 EKPT-------EILTIDAPKIP-----------DASAIPPPTGDITALDNPAIQLRTKAI 240
           + PT       + LTID P IP           D +A+ PPT DI+  D PAI       
Sbjct: 181 DAPTTPPLTTVDCLTIDTPAIPSLSTTRDDVAVDTTAV-PPTEDISVDDGPAISSPR--- 240

Query: 241 LNINSPATPSTLSVTTLSKSQKQEMTTLSEKHIGHEIRTDKEKKTGAVPTVDTPATPSTS 300
               +PA PST SVT++S+SQ ++                       +PT+D PATP T 
Sbjct: 241 ---TTPAIPSTSSVTSMSRSQVRD-----------------------IPTLDAPATPLTL 300

Query: 301 TGVTLLDSKKRKRNKPSSKNQTDPGSCSAPTADGDRSEGTSKRNRKRKSWTSLKEVARTN 359
           T +TLLDSK+RKR KPSSKN+T+P SCSAPT+ G++SE TSKR R RKSWTSLKE+A+  
Sbjct: 301 TAMTLLDSKRRKRKKPSSKNRTEPESCSAPTSHGEKSEDTSKRKRNRKSWTSLKEIAQRE 347

BLAST of Cp4.1LG14g02140 vs. NCBI nr
Match: gi|449457658|ref|XP_004146565.1| (PREDICTED: uncharacterized protein LOC101220608 [Cucumis sativus])

HSP 1 Score: 397.9 bits (1021), Expect = 1.9e-107
Identity = 230/378 (60.85%), Positives = 264/378 (69.84%), Query Frame = 1

Query: 1   MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60
           MA++K NT +G+SNP  GPQ+SIT+RQE TGK KPKVSNN K YLNHLENLATWASG+ S
Sbjct: 1   MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60

Query: 61  IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRRRQKK 120
           +PSLAAFFGQRLA AAESLAV+PD SLF C RCETILQPGSNC+IRIEKN  K+RRR KK
Sbjct: 61  LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCNIRIEKNTAKKRRRHKK 120

Query: 121 CSNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGQECETSAVEK 180
            SN TQN VAYYCH+CSCRNIKRGTPKGHMKVLY      +VK V VKDG+ECE      
Sbjct: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180

Query: 181 PT---------EILTIDAPKIP-----------DASAIPPPTGDITALDNPAIQLRTKAI 240
            T         + LTID P IP           D SAI  PTGDI+ +D PAI       
Sbjct: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAI-SPTGDISVVDGPAISSPR--- 240

Query: 241 LNINSPATPSTLSVTTLSKSQKQEMTTLSEKHIGHEIRTDKEKKTGAVPTVDTPATPSTS 300
               +PA  STLSVT++S+SQ ++                       +PT+D PATP T 
Sbjct: 241 ---TTPAISSTLSVTSISRSQVRD-----------------------IPTLDAPATPLTL 300

Query: 301 TGVTLLDSKKRKRNKPSSKNQTDPGSCSAPTADGDRSEGTSKRNRKRKSWTSLKEVARTN 359
           TG+TLLDSK+RKR KPSSKNQT+P SCS PT+ G+ SEGTSKR R RKSWTSLKE+A+  
Sbjct: 301 TGMTLLDSKRRKRKKPSSKNQTEPESCSGPTSHGETSEGTSKRKRNRKSWTSLKEIAQRE 347

BLAST of Cp4.1LG14g02140 vs. NCBI nr
Match: gi|645266916|ref|XP_008238833.1| (PREDICTED: uncharacterized protein LOC103337456 [Prunus mume])

HSP 1 Score: 208.0 bits (528), Expect = 2.8e-50
Identity = 151/386 (39.12%), Positives = 214/386 (55.44%), Query Frame = 1

Query: 2   AKRKGNTKKGASNPTSGPQDSITIRQEITGKFK---PKVSNNVKTYLNHLENLATWASGK 61
           AKR GNT  G++N       +I++R+E+TGK +   P V+      L HL+ LA WA G+
Sbjct: 7   AKRTGNTAYGSNN-------TISLREELTGKKQTKGPAVNAKSALKLEHLQRLAVWAGGE 66

Query: 62  ASIPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRRRQ 121
            S+PSL AFF   LA+A E+L V PD SLFTCQRCETILQPG NC++RIEKN +K+RR+ 
Sbjct: 67  TSVPSLGAFFAHTLASAQEALGVPPDPSLFTCQRCETILQPGLNCTVRIEKNRSKKRRKS 126

Query: 122 KKCSNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGQECETSA- 181
           KK ++ +QNNV Y CH CS RN+KRGTP GHMKV+     +  +K    K   +   S+ 
Sbjct: 127 KKPTSFSQNNVVYTCHFCSHRNLKRGTPHGHMKVICPTKTKTTLKLKPAKSISQKFVSSK 186

Query: 182 --------VEKPTEILTIDAPKIPDASAIPPPTGDITALDNPAIQLRTKAILNINSPATP 241
                   V K  EI   +  K+ + +     + +IT          ++ +  I    T 
Sbjct: 187 KSIVAEDEVRKANEIAASEIIKVNEIA-----SSEITREIQANEIASSEIVREIQGNETA 246

Query: 242 STLSVTTLSKS-----QKQEMTTL--SEKHIGHEIRTDKEKKT-----------GAVPTV 301
           S +   T S       Q++E  +L  + ++   E R D+++ T           G +PTV
Sbjct: 247 SEIENETASSEIAREIQEKETASLEIARENRIMETRADEDEVTIVNEMASSAIGGEIPTV 306

Query: 302 DTPATPSTSTGVT-LLDSKKRKRNKPSSKNQTDPGSCSAPTADGDRSEGTSKRNRKRKSW 357
           D+P TP   TG T LL  K+RKRNK  SK   +P +   PT D + + G+    R+RK W
Sbjct: 307 DSPETPKVRTGPTLLLGGKRRKRNKSVSKKPAEPENSPNPT-DAENT-GSMSNKRRRKKW 366

BLAST of Cp4.1LG14g02140 vs. NCBI nr
Match: gi|595858677|ref|XP_007210793.1| (hypothetical protein PRUPE_ppa018166mg [Prunus persica])

HSP 1 Score: 202.6 bits (514), Expect = 1.2e-48
Identity = 152/400 (38.00%), Positives = 216/400 (54.00%), Query Frame = 1

Query: 2   AKRKGNTKKGASNPTSGPQDSITIRQEITGKFK---PKVSNNVKTYLNHLENLATWASGK 61
           AKR GNT  G++N       +I++R+E+TGK +   P V+      L HL+ LA WA G+
Sbjct: 7   AKRTGNTAYGSNN-------TISLREELTGKKQTKGPAVNAKSALKLEHLQRLAVWAGGE 66

Query: 62  ASIPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRRRQ 121
           AS+PSL AFF   LA+A E+L V PD SLFTCQRCETILQPG NC++RIEKN +K+RR+ 
Sbjct: 67  ASVPSLGAFFAHTLASAQEALGVPPDPSLFTCQRCETILQPGLNCTVRIEKNRSKKRRKS 126

Query: 122 KKCSNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFER--RVKPV----------- 181
           KK ++ +QNNV Y CH CS RN+KRGTP GHMKV+     +   ++KP            
Sbjct: 127 KKPTSFSQNNVVYTCHFCSHRNLKRGTPHGHMKVICPTKTKTTSKLKPAKSISQKSVSSK 186

Query: 182 -------DVKDGQECETSAVEKPTEILT------IDAPKIPDASAIPPPTGDITA--LDN 241
                  +V+   E   S + +  EI +      I A +I  +       G+ TA  ++N
Sbjct: 187 KSIVAEDEVRKANEIAASEIIQANEIASSEITREIQANEIASSEIAREIQGNETASEIEN 246

Query: 242 PAIQLRTKAILNINSPATPSTLSVTTLSKSQKQEMTTL--SEKHIGHEIRTDKEKKT--- 301
                     + +N      T S     + Q+ E  +L  + ++   E   D+++ T   
Sbjct: 247 ETASSEIAREIQVNE-----TASSEIAREFQENETASLEIARENCIVETWADEDEVTIVN 306

Query: 302 --------GAVPTVDTPATPSTSTGVT-LLDSKKRKRNKPSSKNQTDPGSCSAPTADGDR 357
                   G +P VD+P TP   TG T LL  K+RKRNK  SK   +P +   PT D + 
Sbjct: 307 EMASSAMAGEIPMVDSPETPKVRTGPTLLLGGKRRKRNKSVSKKPAEPENSPNPT-DAEN 366

BLAST of Cp4.1LG14g02140 vs. NCBI nr
Match: gi|697095628|ref|XP_009612264.1| (PREDICTED: uncharacterized protein LOC104105616 [Nicotiana tomentosiformis])

HSP 1 Score: 198.0 bits (502), Expect = 2.9e-47
Identity = 145/357 (40.62%), Positives = 195/357 (54.62%), Query Frame = 1

Query: 5   KGNTKKGASNPTSGPQDSITIRQEITGKFKPK-VSNNVKTYLNHLENLATWASGKASIPS 64
           KG  KK     TSG   SIT+R E++GK K   V+      L H++NLATWASG+ASI S
Sbjct: 3   KGRAKKKGGGTTSG---SITLRDELSGKKKQSHVNAKSMLKLEHIKNLATWASGEASIHS 62

Query: 65  LAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRRRQKKCSN 124
           L AFFGQRLA +AESL V PD SLFTCQRCE+ILQ G NC++RIEK   K R R+KK   
Sbjct: 63  LGAFFGQRLAASAESLGVPPDPSLFTCQRCESILQVGYNCTVRIEKKKRKARNRRKKPGI 122

Query: 125 STQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFER-RVKPVDVKDGQECETSAVEKPT 184
             +N+V Y CH CS RN+KRGTP+G+MK LY A     RV P      +     + +  T
Sbjct: 123 PPKNSVVYECHFCSHRNLKRGTPRGYMKELYPAKITTSRVDPT-----KSATRKSEQLDT 182

Query: 185 EILTIDAPKIPDASAIPPPTGDITALDNPAIQLRTKAILNINSPATPSTLSVTTLSKSQK 244
            + +ID  ++    +    +  +  L+            NI+      T S T     QK
Sbjct: 183 VVSSIDKARVDRTESATQKSEQLDTLE-----------ANIDETRVDLTESAT-----QK 242

Query: 245 QEMTTLSEKHIGHEIRTDKEKKTGAVPTVDTPATP-STSTGVTLLDSKKRKRNKPSSKNQ 304
            E    S      +   D   +         PATP ST T  +LLDSK++KRN+  SK +
Sbjct: 243 SEQFDTSVDSTNRDNNADVSSEIVGDDPTTGPATPLSTVTVTSLLDSKRKKRNRTVSKKK 302

Query: 305 TDPGSCSAPTADGDRSEGTSKRNRKRKSWTSLKEVARTNEQSGKQKNMAELAIPFSL 359
            +P   S+ T D +++  TS + RKRKSWTSLKE+A +  +    +  + +++PF L
Sbjct: 303 VEPQDGSSAT-DAEKTVSTSSK-RKRKSWTSLKEIAES--EGSNSRKFSNISVPFVL 331

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A0A0KUP1_CUCSA | 1.3e-107 | 60.85 | Uncharacterized protein OS=Cucumis sativus GN=Csa_4G048010 PE=4 SV=1
M5WDZ3_PRUPE | 8.1e-49 | 38.00 | Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa018166mg PE=4 SV=1
A0A068UGM5_COFCA | 1.2e-41 | 36.94 | Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00024934001 PE=4 SV=1
M1A5Y5_SOLTU | 3.1e-40 | 38.40 | Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG403005999 PE=4 SV=1
B9GWI5_POPTR | 9.0e-40 | 40.86 | Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s12730g PE=4 SV=1

Match Name | E-value | Identity (%) | Description
AT5G41270.1 | 5.0e-26 | 53.66 | RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175)

Match Name | E-value | Identity (%) | Description
gi|659102242|ref|XP_008452026.1| | 5.3e-110 | 61.64 | PREDICTED: uncharacterized protein LOC103493157 [Cucumis melo]
gi|449457658|ref|XP_004146565.1| | 1.9e-107 | 60.85 | PREDICTED: uncharacterized protein LOC101220608 [Cucumis sativus]
gi|645266916|ref|XP_008238833.1| | 2.8e-50 | 39.12 | PREDICTED: uncharacterized protein LOC103337456 [Prunus mume]
gi|595858677|ref|XP_007210793.1| | 1.2e-48 | 38.00 | hypothetical protein PRUPE_ppa018166mg [Prunus persica]
gi|697095628|ref|XP_009612264.1| | 2.9e-47 | 40.62 | PREDICTED: uncharacterized protein LOC104105616 [Nicotiana tomentosiformis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
IPR007175 | Rpr2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG14g02140.1 | Cp4.1LG14g02140.1 | mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR007175 | RNAse P, Rpr2/Rpp21 subunit | PFAM | PF04032 | Rpr2 | coord: 45..136; score: 1.7
None | No IPR available | PANTHER | PTHR36072 | FAMILY NOT NAMED | coord: 7..359; score: 4.9

The following gene(s) are paralogous to this gene:

None