Cp4.1LG07g08510.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG07g08510.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: RNA polymerase II-associated 3
Location: Cp4.1LG07 : 7747225 .. 7752969 (+)
Sequence length: 1445
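
The location field packs the contig name, start, end and strand into one string. As a minimal sketch, the hypothetical helper `parse_location` below (not part of the database) splits such a string and reports the genomic span, assuming both coordinates are 1-based and inclusive.

```python
import re

def parse_location(text):
    """Split a 'contig : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "contig": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
        "span": end - start + 1,
    }

loc = parse_location("Cp4.1LG07 : 7747225 .. 7752969 (+)")
print(loc)
```

Under that assumption the feature spans 5745 bp on the contig. The spliced mRNA is shorter than the genomic span because the introns are removed.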
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
GGGGTTTTACGCATTATGAATCCACGGCGAAGTGCTGACCGGAGCGAGGGGGACTGCTTTCAGCGGACGGCTCTGAGTGGACGCTTCGAGAACCACTTTGTGCTACAATCTCCGTTTATCCTATTGTGGGTTTCCTCTGGTCTCCCATGGCGGATTCATCCGGCAAGCACGGGCGTGATCAGCCTCTGGTATTTCCATTTATCCTAAGCTGCTGTAGTTCAGAGATTAACCGCCTTTTTGTTTCTTTCTTGCTTGAAGAACGAGCTTTTAATTTTGGGTTGAGTTCTAATTTCTGTTTAACTCGTTGTTTGTGGAAGCTAAGCGGAATGATTTCGTCTGCAATTTCTAGGTGAATGGAATTGATTAATTGGGATTCAGTGGGGTATGATAATGTGTAGTTTTGTTCACTTGCATTGCGGTGATTAGTAAGCTGAATGTGTTGAAATATGGGTTAGAAATTGTAATCTGACTTACTAATAGGTTTGTTTTTGCCACGAAATCGTTAGTTTTGGTATGTTGAGTTAAGTGCATGGGTGTGTTTACTGGCTTTTCTTATTGAATTGTGTGTGTTTCGGGTGCTTTCTGTTTCTTCAGGATTTCCAGGGGTTTTTGAATGACTTGCAGGATTGGGAACTCTCCCTTAATGGAAGAGACAAGAAATTGAAGCCACATGCCATTAGTAAAGAAAAGGAGGTATTACTCTATCAAGCTTTCTGTTGGGTAGCGAAATGCACATTTTCGAACAATAAACTGCTATCTCGGACGTTAATATTTATATTGGTTTATGTTTTATAAGGTCGATAGATTTAACTGCTTGTTCATTAGGTGTTTAAATTTCGTCGCAAGTGTGGATACGTTAATAGTAAATGTATGATGCTATTATGGACTATCTGAGGGGCACTTTAAGTGATGTCATGGCTAAGCAGGGTGGAAGGCAGCCAGTTAAAGCTACAGCAGCTGATTACTTGAAGCACTATGATGCAGTTAAGAGTCTATCAACAAAATCTCATACCGAGCAGAGTTTTGTTGATGCTGCTTCAGAGAAAGAACAGGTAGGTTACATTATTTTGTCCATGTTTTTATGATAAAGACATAATTGGTGCAAAGCTGGCATGAAACGACTTTACTCTTGACTACATGGTGTGTGTAATTTATCGTTATATTTCAGGGTAACGAGTATTTTAAGCAAAAGAAGTTTAAGGAAGCTATTGGCTGCTATTCAAGAAGCATTGCTTTGTCTCCAACAGCTGTAGCCTTTGCAAATAGGGCCATGGCCTACCTAAAAATCAGAAGGCAAGTTATAGATTCAAATTTTTCTTAGTAATTTTCTGCTATAAGTTGTTTTATCTGTTAAATAGTGCTCTTGATGTGGAAAGCTACAACTGTTAAATATGCTAAAATAGCTGTGTTCTCTGCGTTCTATTTCCACCTCGCTGGAGTCAGATTTCAGGAAGCTGAGGATGACTGTACAGAGGCCTTAAATTTAGATGATCGATATATTAAAGCATATTCACGCAGAGCAACAGCTAGAAAGGAACTTGGGAAGGCTAAAGAAGCCTTGGAAGGTATTAAAATAAGCAGCCAGCTTAATTGCACATTGATCTTTTATGCTTTTTATCCATCTGATATATTTGACTTTGCTGATTGTAGATGCTGAATTTGCTCAGAGGTTGGAGCCTAACAACCAAGAGATCAAGAAGCAACATGCTGAGCTCAGAGCTTTTGTTGGAAAAGTAAGTTTAGTAGAATTATCATTTCAGATAACTTTTGTTAAATATCTGGGTCTCTCTAAAACTTATATCCTGGTGAATGTGGAAGAGGGAGTTGTGATATTACAGATGAGTTATGCTGACAAGATATTTTCATTTAATTTTTATTGTACATGAATCCATCTTGCAGGCAATTCTTGAGAAGGCATCTGGTGCTTCGAGAAGCTCCACGAAAGAGAAGAAGATGGTTGGAAAATCTGACTCTGAAGCAAAAATTCAGGACATCCCTCC
AGTCTCAAGCAGCACACTAAGGTCGGGGTTATCGGCAGCTCAAGACCACATTGAAGTAAGGATTACTAATTTGTTGTTGTCCATGGCATCGAGTATTTTCTCTTGAAACAAATATAGTAGGGTCCGACAGTTGTCATGTGAGAGCAATTGAGGTGCATGCAGTATAAGAAAATCAATAAATAAGCAAACAAATGCTTATCCATATCCTGTAATTTTTCAATGACGATTCAACCTACCTGCATCTTTTACAATTGACATTCAACAGATCTCTCTCTAGGTTTCTCTCTTTTTTCTCTAGTTTTCTGGAAACTATATTTTGCTATAATGTAAAACAATACTAATGTATGGTAACTTCCTTGTCCTGCTCTCTTTTGTAGAGCTCTGGTTTAATCTATTCCTTTGTTCAAATGCGAAATGGAGAGAAAATTTTCTGATAATTAATCACTTAAACTGCTCATATCTTGGCTGTAGCGTCTTTTTTTATTTTTTTATATTTGTATTTCTTTCCTCCAATTGACCAAATATGTTTGGCTTCTTTTGCATGTTTTTAGAGTACTAAATTAACTGATTGAAGAAAACTGGTTTAGTGCACCCCCCCTTCCCCCCAATAATTTTAAGATAATAATGCTTATTCTGAAGCTGGATGCTTGCAATATGATGTTTGAGCTTTTGGACTCTATTCTTAGGAAAATGGTGGAGAAAAAGCTGTCAAGGAATCTGCACGTTTAGAGGGAACCGAGGGCAGAAGTACAGGAGCTGAAATCAGATACAAAAGAGAAGCAACAAATGGTGTTCACAAAGACCCAAATTCGAATTTGGGGGCACTGGAGGTCTCTTAAGATCCTATAAACTCTTCTTTCTGCCTTGTACTCTTTCTTTCGATTGTTGGTTCTCAACTTTATGCTAACATATACCTTTTTTGTTTAAAGAAACACAAATAAGATATCTGCAATAGTTTCTTGTCTGCAAACTATTGTGGGGTTCATCTAAAAAATATTTCTGAAATCAAATAGAAGTGTGCCAATTTTCCTATTTCGGTTGTGGGGTTCAGCTTAAAATGTTCGTTACTAAAAAAAAAAAAAAAAAAAAGGGTAAAAGAAGAAATGAAGTTCTCATGAGGTAGGCCAAATGTAGCCAAGTCTCCCATCATTTGTGCCTAAGATGAGAGTCAATGTCAATCATACATTTATTCAGGAATTGGGACTAGGAATAAGAGTGCTCTCTTTCAGCTTTCTTTGAGGCATTACTTTAAAGACAATCTCGTACTGACTTGCTCACGCCTTATTCATGACTCAAAAGCCAATAGATTACTTTCATGATTAGTCGAATTAAAGCTCATCCATTTAAAGTCTAAAATTCTATGAGAACTTTATATCATATCATAGATTGAGCAGATGGGAAGATAGAAGGCTTCTACTTAACCTTTTTCTCTCACCTTTTATCAATTTGTGAACCATGTAATCATATCATGCTCATTTGGATGCCATAGCCTATTCCCTATGTATTTTTGCCACTAAACCTTTCTGACACAATGATATCTAAATTTTTTTGTTTTCACACCTATTCTGCTCCTGCCAAAACTTTCACAGAGAAATCATGTTTCAAGAAAGCAGGAACTGAAGCCTTCAGTTCAGGAACTTGCTTCTCGAGCAGCTTCTAGAAGTATGGTTGAAGCTGCAAAAAACATTGTAGCCCCAACCACTGCCTATCAATTTGAAGTTTCTTGGCGAGGATTCTCTGGTGATCGTGCACTGCAGGCTCAACTTCTAAAGGTATGTGATATAGACCAACTATGTTGTCAATGACACTGTGTTCATTTGCCTATCCAAGAGAATATGAATATCTGGGTATTTTCCTATGTATAAAAGTTGTTGTCTGATAAATATTAGAAGTGTGCTGCCATATTAAAGACGGCAACGCTAGTTTCGACTACCTGATATTGTAGGTCATATAGTTGTCTCCTCGTTTTCTATCCATGGGATCATCAATTGGCTGATATT
TTAGGGTAGGAAAAAAAAAACACTCCTTTTCATTTTTGTGGTAGCGAGCACTCTGCATATTGGCATGTCCTTGGCAGCAAAGAATTCTGTTTGATATAGCGAGCACCCAATGTTTGTCACTTTATTCGGCTCCTCTCCACCGTTTTTTTTCCTTCTCTCTTTTTCACGCAATTGTTTCTCTTACTTGCTAGGCCATCTCTCCAGCCAAGTTGCCTCCGATATTCAAGAATGCACTATCAGCTCCAATTTTAATAGACATTGTCAAGTGCGTGGCTACCTTTTTCACGTAAGTTAAAAATTTCTAACTTGTCCTGGGGGAACATTTAGAAAATGATTTCAAACATCAATGGTTAACAGACCATTCTGGCCATTCAATTTGCAGCGAAGAGATGGCTTTGGCTATCAGTTTCTTAGAAAATTTAGCCAAGGTCCCAAGATTCAGCATACTCATGATGTGTCTTCCATCCAATGAAAAGTCTGGTAAGTTGTAGAAACCCTTCCCTCTTCTCGCCTTCTTGCATGGTAACCTTCATTTCATATAACATTTTCCCCATATCTCTAACCTTAGTGATTAATAGTTTAGCATCTACATTTGCAACCAAATTTATTAGGATAGAAAATGTTGGTTATGATTGATTTGGTGTATTGCACTTGCACATGTCTCTAAACATCATAGACGTTCACTCTGAAGCAGCCTGCCTTCTGGTTATTGTTACTCGCTCGATGATATATGTCAATTTGATAGGCTTTTCATATTCTTTTGTCTCTCTTGGTGGATTCGTTTCTTTTGCGTTTGCTCTTCTTTATTTTATTTTGGATAATCGTTTTTCTCAAAAAAGATAAAAGAAAAAAGAAAAGAAAAGAAAAGATAATGAAATTCCCATGTGGGACAAGAATGCTTCATACTGAGTATGGGCAGTGAACTTCTTGGCCCATGTTTGAGCTGTGAATTATGATCACCGCCTTGTTTTGTTATCGTCTGGTATCCTGTCTTTAAGAAATCATTTCATTAATGCCTTGAATTTTTTTTTTAATATGAAACAATCTCATTGTTATCTTGAAGAAGTAATCTAGTGACTGGATTTACATGTCTTGAGTTATAGCCTGTACCTCTATTGTTAAAACCAATCTTAAAAGGTGAGGAGTTGAATGCAAAAGGCTTTAATTCATATTTTATTGGATACATTTGTTTAGAAACTTGAATGTTACAAAGCCTTCTAATACTCGGCCTATCGACGGTGAGGTCCGAGACATTAATTTTGATAGGAACTGAAGTATATATCGACACAGCCCACAAAATAATCGGAAATTTGTGAGCTTATCATGGTGAGTGTTTTATATATTTGTATTCTAACTTCAAAGCCATGTGAAACAGATCTCCTCAAGATTTGGGATGGAGTATTTTGTGATGAGGCTGTTCCAATTGAGTACGCAGAAATGCTCGACAGCTTGCGTTCAAAGTATTTCCTTAAATTATGACAAACGTTGCTTGAATGAGAAGAAGCACTTGCCTCTCTTAAAGTCAACCCTGCTGCTTTTCTTCAGTGTAAAGTATCCACTTACTTGCTCCTCTGTTTTTGGGGTCACAGTTTGACAATTGGCATGTTAACATATGTGACTAAAAAAATGTCCCATTTCTAGAGCAATCTACCTGTCAATTCTCTTCTATAACCTTTTGAATTTACTCGAAAATTATCAGAATTGACTCGGACTTAGAATTCTCGACAACATTTCTGAACCTCGAC

mRNA sequence

GGGGTTTTACGCATTATGAATCCACGGCGAAGTGCTGACCGGAGCGAGGGGGACTGCTTTCAGCGGACGGCTCTGAGTGGACGCTTCGAGAACCACTTTGTGCTACAATCTCCGTTTATCCTATTGTGGGTTTCCTCTGGTCTCCCATGGCGGATTCATCCGGCAAGCACGGGCGTGATCAGCCTCTGGATTTCCAGGGGTTTTTGAATGACTTGCAGGATTGGGAACTCTCCCTTAATGGAAGAGACAAGAAATTGAAGCCACATGCCATTAGTAAAGAAAAGGAGGGTGGAAGGCAGCCAGTTAAAGCTACAGCAGCTGATTACTTGAAGCACTATGATGCAGTTAAGAGTCTATCAACAAAATCTCATACCGAGCAGAGTTTTGTTGATGCTGCTTCAGAGAAAGAACAGGGTAACGAGTATTTTAAGCAAAAGAAGTTTAAGGAAGCTATTGGCTGCTATTCAAGAAGCATTGCTTTGTCTCCAACAGCTGAAGCTGAGGATGACTGTACAGAGGCCTTAAATTTAGATGATCGATATATTAAAGCATATTCACGCAGAGCAACAGCTAGAAAGGAACTTGGGAAGGCTAAAGAAGCCTTGGAAGATGCTGAATTTGCTCAGAGGTTGGAGCCTAACAACCAAGAGATCAAGAAGCAACATGCTGAGCTCAGAGCTTTTGTTGGAAAAGCAATTCTTGAGAAGGCATCTGGTGCTTCGAGAAGCTCCACGAAAGAGAAGAAGATGGTTGGAAAATCTGACTCTGAAGCAAAAATTCAGGACATCCCTCCAGAAAATGGTGGAGAAAAAGCTGTCAAGGAATCTGCACGTTTAGAGGGAACCGAGGGCAGAAGTACAGGAGCTGAAATCAGATACAAAAGAGAAGCAACAAATGGTGTTCACAAAGACCCAAATTCGAATTTGGGGGCACTGGAGAGAAATCATGTTTCAAGAAAGCAGGAACTGAAGCCTTCAGTTCAGGAACTTGCTTCTCGAGCAGCTTCTAGAAGTATGGTTGAAGCTGCAAAAAACATTGTAGCCCCAACCACTGCCTATCAATTTGAAGTTTCTTGGCGAGGATTCTCTGGTGATCGTGCACTGCAGGCTCAACTTCTAAAGGCCATCTCTCCAGCCAAGTTGCCTCCGATATTCAAGAATGCACTATCAGCTCCAATTTTAATAGACATTGTCAAGTGCGTGGCTACCTTTTTCACCGAAGAGATGGCTTTGGCTATCAGTTTCTTAGAAAATTTAGCCAAGGTCCCAAGATTCAGCATACTCATGATGTGTCTTCCATCCAATGAAAAGTCTGATCTCCTCAAGATTTGGGATGGAGTATTTTGTGATGAGGCTGTTCCAATTGAGTACGCAGAAATGCTCGACAGCTTGCGTTCAAAAATTGACTCGGACTTAGAATTCTCGACAACATTTCTGAACCTCGAC

Coding sequence (CDS)

ATGGCGGATTCATCCGGCAAGCACGGGCGTGATCAGCCTCTGGATTTCCAGGGGTTTTTGAATGACTTGCAGGATTGGGAACTCTCCCTTAATGGAAGAGACAAGAAATTGAAGCCACATGCCATTAGTAAAGAAAAGGAGGGTGGAAGGCAGCCAGTTAAAGCTACAGCAGCTGATTACTTGAAGCACTATGATGCAGTTAAGAGTCTATCAACAAAATCTCATACCGAGCAGAGTTTTGTTGATGCTGCTTCAGAGAAAGAACAGGGTAACGAGTATTTTAAGCAAAAGAAGTTTAAGGAAGCTATTGGCTGCTATTCAAGAAGCATTGCTTTGTCTCCAACAGCTGAAGCTGAGGATGACTGTACAGAGGCCTTAAATTTAGATGATCGATATATTAAAGCATATTCACGCAGAGCAACAGCTAGAAAGGAACTTGGGAAGGCTAAAGAAGCCTTGGAAGATGCTGAATTTGCTCAGAGGTTGGAGCCTAACAACCAAGAGATCAAGAAGCAACATGCTGAGCTCAGAGCTTTTGTTGGAAAAGCAATTCTTGAGAAGGCATCTGGTGCTTCGAGAAGCTCCACGAAAGAGAAGAAGATGGTTGGAAAATCTGACTCTGAAGCAAAAATTCAGGACATCCCTCCAGAAAATGGTGGAGAAAAAGCTGTCAAGGAATCTGCACGTTTAGAGGGAACCGAGGGCAGAAGTACAGGAGCTGAAATCAGATACAAAAGAGAAGCAACAAATGGTGTTCACAAAGACCCAAATTCGAATTTGGGGGCACTGGAGAGAAATCATGTTTCAAGAAAGCAGGAACTGAAGCCTTCAGTTCAGGAACTTGCTTCTCGAGCAGCTTCTAGAAGTATGGTTGAAGCTGCAAAAAACATTGTAGCCCCAACCACTGCCTATCAATTTGAAGTTTCTTGGCGAGGATTCTCTGGTGATCGTGCACTGCAGGCTCAACTTCTAAAGGCCATCTCTCCAGCCAAGTTGCCTCCGATATTCAAGAATGCACTATCAGCTCCAATTTTAATAGACATTGTCAAGTGCGTGGCTACCTTTTTCACCGAAGAGATGGCTTTGGCTATCAGTTTCTTAGAAAATTTAGCCAAGGTCCCAAGATTCAGCATACTCATGATGTGTCTTCCATCCAATGAAAAGTCTGATCTCCTCAAGATTTGGGATGGAGTATTTTGTGATGAGGCTGTTCCAATTGAGTACGCAGAAATGCTCGACAGCTTGCGTTCAAAAATTGACTCGGACTTAGAATTCTCGACAACATTTCTGAACCTCGAC

Protein sequence

MADSSGKHGRDQPLDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGRQPVKATAADYLKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPTAEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPNNQEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAKIQDIPPENGGEKAVKESARLEGTEGRSTGAEIRYKREATNGVHKDPNSNLGALERNHVSRKQELKPSVQELASRAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQLLKAISPAKLPPIFKNALSAPILIDIVKCVATFFTEEMALAISFLENLAKVPRFSILMMCLPSNEKSDLLKIWDGVFCDEAVPIEYAEMLDSLRSKIDSDLEFSTTFLNLD
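
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this with the standard genetic code, which is assumed here because the page does not state which translation table was used. `translate` is a hypothetical helper; it ignores a trailing partial codon and writes unknown codons as `X`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; stop codons appear as '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First eight and last five codons of the CDS listed above.
cds_start = "ATGGCGGATTCATCCGGCAAGCAC"
cds_end = "TTTCTGAACCTCGAC"
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to `MADSSGKH` and `FLNLD`, matching the start and end of the listed protein.
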
BLAST of Cp4.1LG07g08510.1 vs. Swiss-Prot
Match: OE64C_ARATH (Outer envelope protein 64, chloroplastic OS=Arabidopsis thaliana GN=OEP64 PE=1 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 5.5e-13
Identity = 50/127 (39.37%), Positives = 71/127 (55.91%), Query Frame = 1

Query: 71  STKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPT--------------- 130
           S K+ T++   + A  KE+GN+ FK+K +++AIG YS +I LS                 
Sbjct: 464 SKKAITKEESAEIA--KEKGNQAFKEKLWQKAIGLYSEAIKLSDNNATYYSNRAAAYLEL 523

Query: 131 ---AEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPNNQEIKKQ 180
               +AE+DCT+A+ LD + +KAY RR TAR+ LG  K A+ED  +A  LEPNN+     
Sbjct: 524 GGFLQAEEDCTKAITLDKKNVKAYLRRGTAREMLGDCKGAIEDFRYALVLEPNNKRASLS 583
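
The percentages on each HSP statistics line are the identical (or positive-scoring) positions divided by the alignment length. A minimal sketch that recomputes them from the line text follows. `hsp_percentages` is a hypothetical helper, and its pattern also accepts the misspelling "Postives" that some BLAST reports carry.

```python
import re

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from an HSP statistics line."""
    m = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

stats = hsp_percentages(
    "Identity = 50/127 (39.37%), Positives = 71/127 (55.91%), Query Frame = 1"
)
print(stats)
```

For the OE64C_ARATH hit this gives 39.37% identity and 55.91% positives over 127 aligned columns, in agreement with the reported values.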

BLAST of Cp4.1LG07g08510.1 vs. Swiss-Prot
Match: RPAP3_MOUSE (RNA polymerase II-associated protein 3 OS=Mus musculus GN=Rpap3 PE=1 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 1.6e-12
Identity = 109/409 (26.65%), Positives = 175/409 (42.79%), Query Frame = 1

Query: 35  KKLKPHAISKEKEGGRQPVKATAADYLKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYF 94
           +K+     SKE  G   P  A AA+           S  +  +Q    A +EK+ GN +F
Sbjct: 243 RKINQALTSKENSG---PGAAAAAES----KPAAGESKPTGGQQGRQKAIAEKDLGNGFF 302

Query: 95  KQKKFKEAIGCYSRSIALSPT------------------AEAEDDCTEALNLDDRYIKAY 154
           K+ K+++AI CY+R IA   T                   EAE DCT+A+ LD  Y KA+
Sbjct: 303 KEGKYEQAIECYTRGIAADRTNALLPANRAMAYLKIQRYEEAERDCTQAIVLDGSYSKAF 362

Query: 155 SRRATARKELGKAKEALEDAEFAQRLEPNNQEIKKQHAELRAFVGKAILEKA--SGASRS 214
           +RR TAR  LGK  EA +D E    LEP N    KQ A   + + K ++EK         
Sbjct: 363 ARRGTARTFLGKINEAKQDFETVLLLEPGN----KQAATELSRIKKELIEKGHWDDVFLD 422

Query: 215 STKEKKMVGKSDSEAKIQDIPPENGGEKAVKE------SARLEGTEGRSTGAEIRYKREA 274
           ST+   +V   D+        P  G  KA+K+         +E  +   + A +     A
Sbjct: 423 STQRHHVVKAVDN--------PPRGSPKALKKVFIEETGNLIETVDAPDSSATVPESDRA 482

Query: 275 T----NGVHKDPNSNLGALERNHVSRKQELK-PSVQELASRAASRSMVEAAKNIVAPTTA 334
           T     G  K+P+  + +L      R + LK  +V + ++  A   + + A+   +   +
Sbjct: 483 TAAVGTGTKKNPSEGV-SLPAGDRPRAKVLKIEAVSDTSAPQAQVGVKQDARQPGSEKAS 542

Query: 335 YQFEV----------------SWRGFSGDRALQA------QLLKAISPAKLPPIFKNALS 389
            + E                 S++  S  R L++      Q +K I P+  P +F+  L 
Sbjct: 543 VRAEQMPGQLAAAGLPPVPANSFQLESDFRQLRSSPEMLYQYVKNIEPSLYPKLFQKNLD 602

BLAST of Cp4.1LG07g08510.1 vs. Swiss-Prot
Match: RPAP3_HUMAN (RNA polymerase II-associated protein 3 OS=Homo sapiens GN=RPAP3 PE=1 SV=2)

HSP 1 Score: 73.9 bits (180), Expect = 4.7e-12
Identity = 58/170 (34.12%), Positives = 87/170 (51.18%), Query Frame = 1

Query: 77  EQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIA------------------LSPTAEA 136
           +Q+   A SEK++GN +FK+ K++ AI CY+R IA                  +    EA
Sbjct: 276 QQNKQQAISEKDRGNGFFKEGKYERAIECYTRGIAADGANALLPANRAMAYLKIQKYEEA 335

Query: 137 EDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPNNQEIKKQHAELRA 196
           E DCT+A+ LD  Y KA++RR TAR  LGK  EA +D E    LEP N++   + ++++ 
Sbjct: 336 EKDCTQAILLDGSYSKAFARRGTARTFLGKLNEAKQDFETVLLLEPGNKQAVTELSKIK- 395

Query: 197 FVGKAILEKA--SGASRSSTKEKKMVGKSDSEAKIQDIPPENGGEKAVKE 227
              K ++EK         ST+ + +V       K  D PP  G  K +K+
Sbjct: 396 ---KELIEKGHWDDVFLDSTQRQNVV-------KPIDNPPHPGSTKPLKK 434

BLAST of Cp4.1LG07g08510.1 vs. Swiss-Prot
Match: RPAP3_RAT (RNA polymerase II-associated protein 3 OS=Rattus norvegicus GN=Rpap3 PE=1 SV=1)

HSP 1 Score: 71.6 bits (174), Expect = 2.3e-11
Identity = 83/302 (27.48%), Positives = 134/302 (44.37%), Query Frame = 1

Query: 117 EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPNNQE-------I 176
           EAE DCT+A+ LD  Y KA++RR TAR  LGK  EA +D E    LEP N++       I
Sbjct: 334 EAERDCTQAILLDGSYSKAFARRGTARTFLGKINEAKQDFETVLLLEPGNKQAVTELSRI 393

Query: 177 KKQHAE--------LRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAKIQDI-PPENGG 236
           KK+  E        L +     +++      R S K  K V   ++   I+ +  PE+  
Sbjct: 394 KKELIEKGRWDDVFLDSTQRHNVVKPVDSPHRGSPKALKKVFIEETGNLIESVDAPESSA 453

Query: 237 EKAVKESARLEGTEGR----STGAEIRYKREATNGVHKDPNSNLGALERNHVSRKQELKP 296
                + A +    GR    S G  +         V K       +  +  V  KQ ++ 
Sbjct: 454 TVPESDRAAVAVDTGRKKDFSQGDSVSSGETPRAKVLKIEAVGDSSAPQAQVDVKQGVRQ 513

Query: 297 SVQELASRAASRSMVEAAKNIVAPTTA--YQFEVSWRGFSGDRALQAQLLKAISPAKLPP 356
           SV E  S   +++  + A  ++ P  A  +Q E  +R       +  Q +K I P+  P 
Sbjct: 514 SVSEKTSVRVAQTPGQLAAVVLPPVPANSFQLESDFRQLRSSPEMLYQYVKKIEPSLYPK 573

Query: 357 IFKNALSAPILIDIVKCVATFFT--EEMALAISFLENLAKVPRFSILMMCLPSNEKSDLL 395
           +F+  L   +   I+K +  F+   E+ AL    LE L+++ RF + +M +   E+ +L 
Sbjct: 574 LFQKNLDPDVFNQIIKILHDFYVEREKPALIFEVLERLSQLRRFDMAVMFMSGTER-ELT 633

BLAST of Cp4.1LG07g08510.1 vs. Swiss-Prot
Match: RPAP3_CHICK (RNA polymerase II-associated protein 3 OS=Gallus gallus GN=RPAP3 PE=2 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 2.0e-10
Identity = 82/306 (26.80%), Positives = 131/306 (42.81%), Query Frame = 1

Query: 117 EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPNNQEIKKQHAEL 176
           EAE+DCT+AL LD  Y KA++RR  AR  LGK KEA++D E   +LEP N++   +  ++
Sbjct: 335 EAENDCTQALLLDASYSKAFARRGAARVALGKLKEAMQDFEAVLKLEPGNKQAINELTKI 394

Query: 177 RAFVGK----------AILEKASGASR-----------SSTKEKKMVG--KSDSEAKIQD 236
           R  + +          A+L K S                STK  + +   + D +    D
Sbjct: 395 RNELAEKEQSCHEEYPAVLIKESEIKNIVKLTHNPLNLKSTKPLRRIAVEEVDDDVLNSD 454

Query: 237 IPPENGGEKAVKESARLEGTEG-----RSTGAEI----RYKREATNGVHKDPNSNLGALE 296
                      K S  +E TE      + T  +I    + K E    V   P    GA  
Sbjct: 455 FSSSTSLVNNWKNSVNIETTENLDQDDQLTSMDIPKAKQLKIEEITDV-SSPQLPAGAKG 514

Query: 297 RNHVSRKQELKPSVQELASRAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQLL 356
            + V     L PS+ +   R    S   A+     P  ++Q E  +R            L
Sbjct: 515 VSSV-----LHPSMNKQIERENKASFRSASPVPAIPANSFQLESDFRKLKDCPEKMYLYL 574

Query: 357 KAISPAKLPPIFKNALSAPILIDIVKCVATFF--TEEMALAISFLENLAKVPRFSILMMC 389
           K I P+  P +F+ +L   +   I++ +  F+   EE +L +  L+ L+++ RF + +M 
Sbjct: 575 KQIEPSIYPKLFQKSLDPDLFNQILRILHDFYIEKEEPSLILEILQRLSELKRFDMAVMF 634

BLAST of Cp4.1LG07g08510.1 vs. TrEMBL
Match: A0A0A0LGI8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G034580 PE=4 SV=1)

HSP 1 Score: 630.9 bits (1626), Expect = 1.1e-177
Identity = 351/453 (77.48%), Positives = 371/453 (81.90%), Query Frame = 1

Query: 1   MADSSGKHGRDQPLDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGRQPVKATAADY 60
           MADSS KHGRDQ LDFQGFLNDLQDWE+S  G+DKKLKP AI KEKE  RQ  KA+AADY
Sbjct: 1   MADSSAKHGRDQLLDFQGFLNDLQDWEVSFKGKDKKLKPQAIGKEKEDRRQTEKASAADY 60

Query: 61  LKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPTA---- 120
           +K YDAV  LS    TE SFVDAASEKEQGNEYFKQKKFKEAI CYSRSIALSPTA    
Sbjct: 61  MKQYDAVNRLSRNFQTEGSFVDAASEKEQGNEYFKQKKFKEAIDCYSRSIALSPTAVAFA 120

Query: 121 -------------EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 180
                        EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE
Sbjct: 121 NRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 180

Query: 181 PNNQEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAKIQDIPP------- 240
           PNNQEIKKQHA+LRAFVGKAILEKASGASRSSTK KK + KSDS+AKIQDIPP       
Sbjct: 181 PNNQEIKKQHADLRAFVGKAILEKASGASRSSTKNKKTLKKSDSDAKIQDIPPVSSSTSR 240

Query: 241 -----------ENGGEKAVKESARLEGTEGRSTGAEIRYKREATNGVHKDPNSNLGALER 300
                      ENGG  AVK SARLE +E  S+GAEI  K+ ATNG HKD +S L ALER
Sbjct: 241 TGLLAARERVEENGGGNAVKTSARLEESEDTSSGAEITSKKVATNGFHKDSSSYLSALER 300

Query: 301 NHVSRKQELKPSVQELASRAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQLLK 360
           +H+ RKQELK SV ELAS+AASRSMVEAAKNI+APTTAYQFEVSWRGFSGD+ALQA+LLK
Sbjct: 301 DHLPRKQELKASVYELASQAASRSMVEAAKNIIAPTTAYQFEVSWRGFSGDQALQARLLK 360

Query: 361 AISPAKLPPIFKNALSAPILIDIVKCVATFFTEEMALAISFLENLAKVPRFSILMMCLPS 419
            ISPAKLP IFK+AL+APILIDIVKCVATFF EE ALAISFLENL  VPRFSILMMCL S
Sbjct: 361 TISPAKLPQIFKDALTAPILIDIVKCVATFFIEEPALAISFLENLVNVPRFSILMMCLSS 420

BLAST of Cp4.1LG07g08510.1 vs. TrEMBL
Match: A0A067JV15_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23222 PE=4 SV=1)

HSP 1 Score: 447.6 bits (1150), Expect = 1.7e-122
Identity = 258/465 (55.48%), Positives = 321/465 (69.03%), Query Frame = 1

Query: 1   MADSSGKHGRDQPLDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGRQPVKATAAD- 60
           MA    KHGRDQ LDFQGFLNDLQDWELSL  +DKK+KPH+  K  E  R   K +A D 
Sbjct: 1   MARVPTKHGRDQALDFQGFLNDLQDWELSLKDKDKKMKPHSSDKNSESRRSTGKTSAIDS 60

Query: 61  ---------YLKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSI 120
                    YL+++DA++ + +    E+S VDA SEKE GNEYFKQKK+KEAI CYSRSI
Sbjct: 61  SRGSSGQHDYLRNFDAIERIPSSFIAEESAVDATSEKELGNEYFKQKKYKEAIECYSRSI 120

Query: 121 ALSPT-----------------AEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEAL 180
           ALSPT                  EAEDDCTEALNLDDRYIKAYSRRATARKELG+ K++L
Sbjct: 121 ALSPTAVAYANRAMAYLKIRKFQEAEDDCTEALNLDDRYIKAYSRRATARKELGRFKKSL 180

Query: 181 EDAEFAQRLEPNNQEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEA---K 240
           ED+EFA RLEPNNQE+KKQ+AE++    K IL+KASG +RSS +  +  G+S+++    +
Sbjct: 181 EDSEFALRLEPNNQEVKKQYAEVKMLYDKEILQKASGVARSSVQGTQKGGRSETKVNGYE 240

Query: 241 IQDIPP---------------ENGGEKAVKESARLEGTEGRSTGAEIRYKREATNGVHKD 300
           +  +P                E G  +  ++S  +E  E +        + +A N  + D
Sbjct: 241 VNSVPSITQRTSVSTSQKGKNEQGAVEVPEKSTSVEEMENKILRVGSSIEGQA-NYSYTD 300

Query: 301 --PNSNLGALERNHVSRKQELKPSVQELASRAASRSMVEAAKNIVAPTTAYQFEVSWRGF 360
             P+SN  + +R +  RKQ+LK SVQELASRAASR+M EAAKNI  PT+AYQFEVSWRGF
Sbjct: 301 AMPSSNADSFQRYNRMRKQDLKASVQELASRAASRAMAEAAKNITPPTSAYQFEVSWRGF 360

Query: 361 SGDRALQAQLLKAISPAKLPPIFKNALSAPILIDIVKCVATFFTEEMALAISFLENLAKV 419
           SGDR LQ +LLKA+SP+ LP I KNALSA +LIDI+KCVATFF ++  LA+ +LENLAKV
Sbjct: 361 SGDRVLQTRLLKAMSPSALPQILKNALSASMLIDIIKCVATFFIDDTNLAVKYLENLAKV 420

BLAST of Cp4.1LG07g08510.1 vs. TrEMBL
Match: U5GD67_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s01710g PE=4 SV=1)

HSP 1 Score: 406.0 bits (1042), Expect = 5.8e-110
Identity = 244/468 (52.14%), Positives = 309/468 (66.03%), Query Frame = 1

Query: 1   MADSSGKHGRDQPLDFQGFLNDLQDWELSLNGRDKKLKPHAISKEK---EGGRQPVKATA 60
           MA   GKHGRDQ LD          WEL L   DKK+K  + + +    E GR   K +A
Sbjct: 1   MARVPGKHGRDQALD----------WEL-LKDTDKKMKKKSRASDVKIGEDGRSEGKTSA 60

Query: 61  AD----------YLKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYS 120
           AD          Y +++ A+  LS+   T++  VDA +EKE GNEYFKQKKF EAI CYS
Sbjct: 61  ADSSRSGSGQYEYSRNFGAINRLSSSFTTDEITVDATTEKELGNEYFKQKKFNEAIECYS 120

Query: 121 RSIALSPT-----------------AEAEDDCTEALNLDDRYIKAYSRRATARKELGKAK 180
           RSIALSPT                  EAEDDCTEALNLDDRYIKAYSRRATARKELGK K
Sbjct: 121 RSIALSPTAVAYANRAMAYLKIKRFREAEDDCTEALNLDDRYIKAYSRRATARKELGKLK 180

Query: 181 EALEDAEFAQRLEPNNQEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAK 240
           E++ED+EFA +LEPNNQEIKKQ+AE+++   K IL+KASG  RSS +  +  G+S++   
Sbjct: 181 ESIEDSEFALKLEPNNQEIKKQYAEVKSLYEKEILQKASGTLRSSLQGTQQGGRSEASVN 240

Query: 241 IQDIPP------ENGGEKAVKESARLEGTEGRSTGA--------EIRYKREATNGVHKD- 300
              + P      + G   + K++ +LE  E              E+R + ++   V  D 
Sbjct: 241 GHAVHPVSIATQKTGVSASKKDNTKLEQEENDGNNLVKKSVHVKELRNRSKSDGHVGNDS 300

Query: 301 -----PNSNLGALERNHVSRKQELKPSVQELASRAASRSMVEAAKNIVAPTTAYQFEVSW 360
                P+S++ ++++N+ +R+QELK SV ELAS+AASR+M EAAKNI  P +AYQFEVSW
Sbjct: 301 PANATPSSSVESVQKNNRTRRQELKTSVIELASQAASRAMAEAAKNITPPNSAYQFEVSW 360

Query: 361 RGFSGDRALQAQLLKAISPAKLPPIFKNALSAPILIDIVKCVATFFTEEMALAISFLENL 419
           +GFSGDRALQA LLK  SP+ LP IFKNALS PILIDI+KCVA+FF ++M  A+ +LENL
Sbjct: 361 QGFSGDRALQAHLLKVTSPSALPQIFKNALSVPILIDIIKCVASFFNDDMDFAVKYLENL 420

BLAST of Cp4.1LG07g08510.1 vs. TrEMBL
Match: A0A0R0E971_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_20G096900 PE=4 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.4e-108
Identity = 241/450 (53.56%), Positives = 296/450 (65.78%), Query Frame = 1

Query: 14  LDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGRQPVKATAADYLKHYDAVKS---- 73
           +DF GFLNDLQDWE S   + +  K +A S    G     KA+  D +   +A  S    
Sbjct: 1   MDFHGFLNDLQDWEFSRKDKARPQKENASSSRITGSVGVEKASKGDTISFDNARNSPGQY 60

Query: 74  -LSTKSHTEQSFV-----DAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPTA------- 133
            LS  +H   SFV     DAASEK+ GNE+FKQKKFKEA  CYSRSIALSPTA       
Sbjct: 61  DLSRINHLHSSFVPEDVPDAASEKDLGNEFFKQKKFKEARDCYSRSIALSPTAVAYANRA 120

Query: 134 ----------EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPNN 193
                     EAEDDCTEALNLDDRYIKAYSRRATARKELGK KE+++DAEFA RLEPNN
Sbjct: 121 MANIKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKIKESMDDAEFALRLEPNN 180

Query: 194 QEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAKIQDIPP---------- 253
           QEIKKQ+A+ ++F  K IL+KASG  RS+ +  + VGKS+ +     I P          
Sbjct: 181 QEIKKQYADAKSFYEKDILQKASGVLRSTVQGTQKVGKSEEKVNGDSIHPISRSTQKSGL 240

Query: 254 --------ENGGEKAVKESARLEGTEGRSTGAEIRYKREATNGVHKDPNSNLGALERNHV 313
                   +N  +  VKES   E  +GR   A    + +  +G     +++    +RNH 
Sbjct: 241 AEVHHHKKDNKRQILVKESLLTEDVDGREIKARSWPQSQGDDGSKGGLSASNSLEQRNHR 300

Query: 314 SRKQELKPSVQELASRAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQLLKAIS 373
             K E+K SVQ+LASRAASR+M EAAKN+  PTTAYQFEVSWR FSGD ALQA+LLKAIS
Sbjct: 301 ITKPEMKASVQQLASRAASRAMSEAAKNVTPPTTAYQFEVSWRAFSGDLALQARLLKAIS 360

Query: 374 PAKLPPIFKNALSAPILIDIVKCVATFFTEEMALAISFLENLAKVPRFSILMMCLPSNEK 419
           P +LP IFKNALS+ ILI+I+KC+A+ FTE+M L +S+LE+L KVPRF +++MCL S  K
Sbjct: 361 PHELPKIFKNALSSTILIEIIKCLASLFTEDMDLVVSYLEHLTKVPRFDVIVMCLSSTNK 420

BLAST of Cp4.1LG07g08510.1 vs. TrEMBL
Match: A0A0B2QTW7_GLYSO (RNA polymerase II-associated protein 3 OS=Glycine soja GN=glysoja_046388 PE=4 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 5.4e-108
Identity = 241/455 (52.97%), Positives = 296/455 (65.05%), Query Frame = 1

Query: 14  LDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGRQPVKATAADYLKHYDAVKS---- 73
           +DF GFLNDLQDWE S   + +  K +A S    G     KA+  D +   +A  S    
Sbjct: 1   MDFHGFLNDLQDWEFSRKDKARPQKENASSSRITGSVGVEKASKGDTISFDNARNSPGQY 60

Query: 74  -LSTKSHTEQSFV-----DAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPTA------- 133
            LS  +H   SFV     DAASEK+ GNE+FKQKKFKEA  CYSRSIALSPTA       
Sbjct: 61  DLSRINHLHSSFVPEDVPDAASEKDLGNEFFKQKKFKEARDCYSRSIALSPTAVAYANRA 120

Query: 134 ---------------EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQR 193
                          EAEDDCTEALNLDDRYIKAYSRRATARKELGK KE+++DAEFA R
Sbjct: 121 MANIKLRRQAYVLFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKIKESMDDAEFALR 180

Query: 194 LEPNNQEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAKIQDIPP----- 253
           LEPNNQEIKKQ+A+ ++F  K IL+KASG  RS+ +  + VGKS+ +     I P     
Sbjct: 181 LEPNNQEIKKQYADAKSFYEKDILQKASGVLRSTVQGTQKVGKSEEKVNGDSIHPISRST 240

Query: 254 -------------ENGGEKAVKESARLEGTEGRSTGAEIRYKREATNGVHKDPNSNLGAL 313
                        +N  +  VKES   E  +GR   A    + +  +G     +++    
Sbjct: 241 QKSGLAEVHHHKKDNKRQILVKESLLTEDVDGREIKARSWPQSQGDDGSKGGLSASNSLE 300

Query: 314 ERNHVSRKQELKPSVQELASRAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQL 373
           +RNH   K E+K SVQ+LASRAASR+M EAAKN+  PTTAYQFEVSWR FSGD ALQA+L
Sbjct: 301 QRNHRITKPEMKASVQQLASRAASRAMSEAAKNVTPPTTAYQFEVSWRAFSGDLALQARL 360

Query: 374 LKAISPAKLPPIFKNALSAPILIDIVKCVATFFTEEMALAISFLENLAKVPRFSILMMCL 419
           LKAISP +LP IFKNALS+ ILI+I+KC+A+ FTE+M L +S+LE+L KVPRF +++MCL
Sbjct: 361 LKAISPHELPKIFKNALSSTILIEIIKCLASLFTEDMDLVVSYLEHLTKVPRFDVIVMCL 420

BLAST of Cp4.1LG07g08510.1 vs. TAIR10
Match: AT1G56440.2 (AT1G56440.2 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 317.8 bits (813), Expect = 1.1e-86
Identity = 209/495 (42.22%), Positives = 282/495 (56.97%), Query Frame = 1

Query: 1   MADSSGKHGRDQPLDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGR--QPVKATAA 60
           MA S  KHGRDQ  DFQGF NDLQDWELSL  +DKK+K    +         +P  +   
Sbjct: 1   MARSPSKHGRDQTQDFQGFFNDLQDWELSLKDKDKKIKQQPANSSNPSSETFRPSGSGKY 60

Query: 61  DYLKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPT--- 120
           D+ K Y +++ LS+ S   +S +D++SEKEQGNE+FKQKKF EAI CYSRSIALSP    
Sbjct: 61  DFAKKYRSIRDLSS-SLIGESLLDSSSEKEQGNEFFKQKKFNEAIDCYSRSIALSPNAVT 120

Query: 121 --------------AEAEDDCTEALNLDDRYIKAY-----------------------SR 180
                          EAE DCTEALNLDDRYIKAY                        R
Sbjct: 121 YANRAMAYLKIKRYREAEVDCTEALNLDDRYIKAYSRRATARKELGMIKEAKEDAEFALR 180

Query: 181 RATARKELGK------------------------AKEALEDAEFAQRLEPNNQEIKKQHA 240
                +EL K                        A+E L+ +   ++++    E+  +  
Sbjct: 181 LEPESQELKKQYADIKSLLEKEIIEKATGAMQSTAQELLKTSGLDKKIQKPKTEMTSKPV 240

Query: 241 ELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAK----------IQDIPPENGGEKAV 300
            L A   + I++   G++ SS K  K++     E +          +     E   E ++
Sbjct: 241 TLVAKTNRDIVQPVLGSNESSGK--KLIENIQPEHRWNFRELISLYLSCYLQEKSKEGSM 300

Query: 301 KESARLEGTEGRS-TGAEIRYKREATNGVHKDPNSNLGALERNHVSRKQELKPSVQELAS 360
           K  A  E  + +  T     Y++EA      D N    +   N VS++ ELKPSVQELA+
Sbjct: 301 KIPAITEILDSKKVTPGSQSYEKEAKPS---DRNGTQPSGPENQVSKQLELKPSVQELAA 360

Query: 361 RAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQLLKAISPAKLPPIFKNALSAP 419
            AAS +M EA+KNI  P +AY+FE SWR FSGD AL++QLLK  +P+ LP IFKNAL++P
Sbjct: 361 HAASLAMTEASKNIKTPKSAYEFENSWRSFSGDSALRSQLLKVTTPSSLPQIFKNALTSP 420

BLAST of Cp4.1LG07g08510.1 vs. TAIR10
Match: AT3G17970.1 (AT3G17970.1 translocon at the outer membrane of chloroplasts 64-III)

HSP 1 Score: 77.0 bits (188), Expect = 3.1e-14
Identity = 50/127 (39.37%), Positives = 71/127 (55.91%), Query Frame = 1

Query: 71  STKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPT--------------- 130
           S K+ T++   + A  KE+GN+ FK+K +++AIG YS +I LS                 
Sbjct: 464 SKKAITKEESAEIA--KEKGNQAFKEKLWQKAIGLYSEAIKLSDNNATYYSNRAAAYLEL 523

Query: 131 ---AEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPNNQEIKKQ 180
               +AE+DCT+A+ LD + +KAY RR TAR+ LG  K A+ED  +A  LEPNN+     
Sbjct: 524 GGFLQAEEDCTKAITLDKKNVKAYLRRGTAREMLGDCKGAIEDFRYALVLEPNNKRASLS 583

BLAST of Cp4.1LG07g08510.1 vs. TAIR10
Match: AT5G09420.1 (AT5G09420.1 translocon at the outer membrane of chloroplasts 64-V)

HSP 1 Score: 58.2 bits (139), Expect = 1.5e-08
Identity = 41/130 (31.54%), Positives = 64/130 (49.23%), Query Frame = 1

Query: 69  SLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPT------------- 128
           +L+  S T  +   +   KE+GN  +K K++ +A+  Y+ +I L+               
Sbjct: 474 NLAPVSDTNGNMEASEVMKEKGNAAYKGKQWNKAVNFYTEAIKLNGANATYYCNRAAAFL 533

Query: 129 -----AEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPNNQEIK 181
                 +AE DCT+A+ +D + +KAY RR TAR+ L + KEA  D   A  LEP N+  K
Sbjct: 534 ELCCFQQAEQDCTKAMLIDKKNVKAYLRRGTARESLVRYKEAAADFRHALVLEPQNKTAK 593

BLAST of Cp4.1LG07g08510.1 vs. TAIR10
Match: AT2G42810.2 (AT2G42810.2 protein phosphatase 5.2)

HSP 1 Score: 54.3 bits (129), Expect = 2.2e-07
Identity = 47/152 (30.92%), Positives = 71/152 (46.71%), Query Frame = 1

Query: 73  KSHTEQSFVDAASE-KEQGNEYFKQKKFKEAIGCYSRSIALSPT---------------- 132
           ++  E S V  A E K Q NE FK  K+  AI  Y+++I L+                  
Sbjct: 2   ETKNENSDVSRAEEFKSQANEAFKGHKYSSAIDLYTKAIELNSNNAVYWANRAFAHTKLE 61

Query: 133 --AEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPNNQEIKKQH 192
               A  D ++A+ +D RY K Y RR  A   +GK K+AL+D +  +RL PN+ +  ++ 
Sbjct: 62  EYGSAIQDASKAIEVDSRYSKGYYRRGAAYLAMGKFKDALKDFQQVKRLSPNDPDATRKL 121

Query: 193 AELRAFVGKAILEKASGASRSSTKEKKMVGKS 206
            E    V K   E+A     S   E++ V +S
Sbjct: 122 KECEKAVMKLKFEEAISVPVS---ERRSVAES 150

BLAST of Cp4.1LG07g08510.1 vs. NCBI nr
Match: gi|659070087|ref|XP_008453348.1| (PREDICTED: RNA polymerase II-associated protein 3 isoform X1 [Cucumis melo])

HSP 1 Score: 639.4 bits (1648), Expect = 4.5e-180
Identity = 356/453 (78.59%), Positives = 373/453 (82.34%), Query Frame = 1

Query: 1   MADSSGKHGRDQPLDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGRQPVKATAADY 60
           MADSSGKHGRD PLDFQGFLNDLQDWE+S  G+DKKLKP AI KEKE  RQ  KA+AADY
Sbjct: 67  MADSSGKHGRDHPLDFQGFLNDLQDWEVSFKGKDKKLKPQAIGKEKEDRRQTEKASAADY 126

Query: 61  LKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPTA---- 120
           LK YDAV  LS    TEQSFVDAASEKEQGNEYFKQKKFKEAI CYSRSIALSPTA    
Sbjct: 127 LKQYDAVNRLSRNFQTEQSFVDAASEKEQGNEYFKQKKFKEAIDCYSRSIALSPTAVAFA 186

Query: 121 -------------EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 180
                        EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE
Sbjct: 187 NRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 246

Query: 181 PNNQEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAKIQDIPP------- 240
           PNNQEIKKQHA+LRAFVGKAILEKASGASRSS K+KKM+ KSDS+AKIQDIPP       
Sbjct: 247 PNNQEIKKQHADLRAFVGKAILEKASGASRSSPKDKKMLKKSDSDAKIQDIPPVSSSTSR 306

Query: 241 -----------ENGGEKAVKESARLEGTEGRSTGAEIRYKREATNGVHKDPNSNLGALER 300
                      ENGGE AVK SAR+E  E  ST AEI  K+ ATNG HKD +S L ALER
Sbjct: 307 TGLLAARERVEENGGENAVKTSARVEENEDISTRAEIACKKVATNGFHKDSSSYLWALER 366

Query: 301 NHVSRKQELKPSVQELASRAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQLLK 360
           +H+ RK ELK SVQELAS+AASRSMVEAAKNI+APTTAYQFEVSWRGFS D+ALQA LLK
Sbjct: 367 DHLPRKPELKASVQELASQAASRSMVEAAKNIIAPTTAYQFEVSWRGFSDDQALQACLLK 426

Query: 361 AISPAKLPPIFKNALSAPILIDIVKCVATFFTEEMALAISFLENLAKVPRFSILMMCLPS 419
            ISPAKLP IFKNAL+APILIDIVKCVATFFTEE ALAISFLENL KVPRFSILMMCL S
Sbjct: 427 TISPAKLPQIFKNALTAPILIDIVKCVATFFTEEPALAISFLENLVKVPRFSILMMCLSS 486

BLAST of Cp4.1LG07g08510.1 vs. NCBI nr
Match: gi|449454004|ref|XP_004144746.1| (PREDICTED: RNA polymerase II-associated protein 3 isoform X1 [Cucumis sativus])

HSP 1 Score: 630.9 bits (1626), Expect = 1.6e-177
Identity = 351/453 (77.48%), Positives = 371/453 (81.90%), Query Frame = 1

Query: 1   MADSSGKHGRDQPLDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGRQPVKATAADY 60
           MADSS KHGRDQ LDFQGFLNDLQDWE+S  G+DKKLKP AI KEKE  RQ  KA+AADY
Sbjct: 1   MADSSAKHGRDQLLDFQGFLNDLQDWEVSFKGKDKKLKPQAIGKEKEDRRQTEKASAADY 60

Query: 61  LKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPTA---- 120
           +K YDAV  LS    TE SFVDAASEKEQGNEYFKQKKFKEAI CYSRSIALSPTA    
Sbjct: 61  MKQYDAVNRLSRNFQTEGSFVDAASEKEQGNEYFKQKKFKEAIDCYSRSIALSPTAVAFA 120

Query: 121 -------------EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 180
                        EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE
Sbjct: 121 NRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 180

Query: 181 PNNQEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAKIQDIPP------- 240
           PNNQEIKKQHA+LRAFVGKAILEKASGASRSSTK KK + KSDS+AKIQDIPP       
Sbjct: 181 PNNQEIKKQHADLRAFVGKAILEKASGASRSSTKNKKTLKKSDSDAKIQDIPPVSSSTSR 240

Query: 241 -----------ENGGEKAVKESARLEGTEGRSTGAEIRYKREATNGVHKDPNSNLGALER 300
                      ENGG  AVK SARLE +E  S+GAEI  K+ ATNG HKD +S L ALER
Sbjct: 241 TGLLAARERVEENGGGNAVKTSARLEESEDTSSGAEITSKKVATNGFHKDSSSYLSALER 300

Query: 301 NHVSRKQELKPSVQELASRAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQLLK 360
           +H+ RKQELK SV ELAS+AASRSMVEAAKNI+APTTAYQFEVSWRGFSGD+ALQA+LLK
Sbjct: 301 DHLPRKQELKASVYELASQAASRSMVEAAKNIIAPTTAYQFEVSWRGFSGDQALQARLLK 360

Query: 361 AISPAKLPPIFKNALSAPILIDIVKCVATFFTEEMALAISFLENLAKVPRFSILMMCLPS 419
            ISPAKLP IFK+AL+APILIDIVKCVATFF EE ALAISFLENL  VPRFSILMMCL S
Sbjct: 361 TISPAKLPQIFKDALTAPILIDIVKCVATFFIEEPALAISFLENLVNVPRFSILMMCLSS 420

BLAST of Cp4.1LG07g08510.1 vs. NCBI nr
Match: gi|659070089|ref|XP_008453357.1| (PREDICTED: RNA polymerase II-associated protein 3 isoform X2 [Cucumis melo])

HSP 1 Score: 610.5 bits (1573), Expect = 2.2e-171
Identity = 346/453 (76.38%), Positives = 363/453 (80.13%), Query Frame = 1

Query: 1   MADSSGKHGRDQPLDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGRQPVKATAADY 60
           MADSSGKHGRD PLD          WE+S  G+DKKLKP AI KEKE  RQ  KA+AADY
Sbjct: 67  MADSSGKHGRDHPLD----------WEVSFKGKDKKLKPQAIGKEKEDRRQTEKASAADY 126

Query: 61  LKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPTA---- 120
           LK YDAV  LS    TEQSFVDAASEKEQGNEYFKQKKFKEAI CYSRSIALSPTA    
Sbjct: 127 LKQYDAVNRLSRNFQTEQSFVDAASEKEQGNEYFKQKKFKEAIDCYSRSIALSPTAVAFA 186

Query: 121 -------------EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 180
                        EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE
Sbjct: 187 NRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 246

Query: 181 PNNQEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAKIQDIPP------- 240
           PNNQEIKKQHA+LRAFVGKAILEKASGASRSS K+KKM+ KSDS+AKIQDIPP       
Sbjct: 247 PNNQEIKKQHADLRAFVGKAILEKASGASRSSPKDKKMLKKSDSDAKIQDIPPVSSSTSR 306

Query: 241 -----------ENGGEKAVKESARLEGTEGRSTGAEIRYKREATNGVHKDPNSNLGALER 300
                      ENGGE AVK SAR+E  E  ST AEI  K+ ATNG HKD +S L ALER
Sbjct: 307 TGLLAARERVEENGGENAVKTSARVEENEDISTRAEIACKKVATNGFHKDSSSYLWALER 366

Query: 301 NHVSRKQELKPSVQELASRAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQLLK 360
           +H+ RK ELK SVQELAS+AASRSMVEAAKNI+APTTAYQFEVSWRGFS D+ALQA LLK
Sbjct: 367 DHLPRKPELKASVQELASQAASRSMVEAAKNIIAPTTAYQFEVSWRGFSDDQALQACLLK 426

Query: 361 AISPAKLPPIFKNALSAPILIDIVKCVATFFTEEMALAISFLENLAKVPRFSILMMCLPS 419
            ISPAKLP IFKNAL+APILIDIVKCVATFFTEE ALAISFLENL KVPRFSILMMCL S
Sbjct: 427 TISPAKLPQIFKNALTAPILIDIVKCVATFFTEEPALAISFLENLVKVPRFSILMMCLSS 486

BLAST of Cp4.1LG07g08510.1 vs. NCBI nr
Match: gi|659070091|ref|XP_008453365.1| (PREDICTED: RNA polymerase II-associated protein 3 isoform X3 [Cucumis melo])

HSP 1 Score: 585.5 bits (1508), Expect = 7.7e-164
Identity = 328/435 (75.40%), Positives = 344/435 (79.08%), Query Frame = 1

Query: 1   MADSSGKHGRDQPLDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGRQPVKATAADY 60
           MADSSGKHGRD PLDFQGFLNDLQDWE+S  G+DKKLKP AI KEKE  RQ  KA+AADY
Sbjct: 67  MADSSGKHGRDHPLDFQGFLNDLQDWEVSFKGKDKKLKPQAIGKEKEDRRQTEKASAADY 126

Query: 61  LKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPTA---- 120
           LK YDAV  LS    TEQSFVDAASEKEQGNEYFKQKKFKEAI CYSRSIALSPTA    
Sbjct: 127 LKQYDAVNRLSRNFQTEQSFVDAASEKEQGNEYFKQKKFKEAIDCYSRSIALSPTAVAFA 186

Query: 121 -------------EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 180
                        EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE
Sbjct: 187 NRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 246

Query: 181 PNNQEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAKIQDIPPENGGEKA 240
           PNNQEIKKQHA+LRAFVGKAILEKASGASRSS K+KKM+ KSDS+AKIQDIPP +     
Sbjct: 247 PNNQEIKKQHADLRAFVGKAILEKASGASRSSPKDKKMLKKSDSDAKIQDIPPVSSSTSR 306

Query: 241 VKESARLEGTEGRSTGAEIRYKREATNGVHKDPNSNLGALERNHVSRKQELKPSVQELAS 300
               A  E                               +ER+H+ RK ELK SVQELAS
Sbjct: 307 TGLLAARE------------------------------RVERDHLPRKPELKASVQELAS 366

Query: 301 RAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQLLKAISPAKLPPIFKNALSAP 360
           +AASRSMVEAAKNI+APTTAYQFEVSWRGFS D+ALQA LLK ISPAKLP IFKNAL+AP
Sbjct: 367 QAASRSMVEAAKNIIAPTTAYQFEVSWRGFSDDQALQACLLKTISPAKLPQIFKNALTAP 426

Query: 361 ILIDIVKCVATFFTEEMALAISFLENLAKVPRFSILMMCLPSNEKSDLLKIWDGVFCDEA 419
           ILIDIVKCVATFFTEE ALAISFLENL KVPRFSILMMCL S+EK DLLKIWD VFCDEA
Sbjct: 427 ILIDIVKCVATFFTEEPALAISFLENLVKVPRFSILMMCLSSSEKFDLLKIWDEVFCDEA 471

BLAST of Cp4.1LG07g08510.1 vs. NCBI nr
Match: gi|778667104|ref|XP_011648870.1| (PREDICTED: RNA polymerase II-associated protein 3 isoform X2 [Cucumis sativus])

HSP 1 Score: 575.1 bits (1481), Expect = 1.0e-160
Identity = 323/435 (74.25%), Positives = 341/435 (78.39%), Query Frame = 1

Query: 1   MADSSGKHGRDQPLDFQGFLNDLQDWELSLNGRDKKLKPHAISKEKEGGRQPVKATAADY 60
           MADSS KHGRDQ LDFQGFLNDLQDWE+S  G+DKKLKP AI KEKE  RQ  KA+AADY
Sbjct: 1   MADSSAKHGRDQLLDFQGFLNDLQDWEVSFKGKDKKLKPQAIGKEKEDRRQTEKASAADY 60

Query: 61  LKHYDAVKSLSTKSHTEQSFVDAASEKEQGNEYFKQKKFKEAIGCYSRSIALSPTA---- 120
           +K YDAV  LS    TE SFVDAASEKEQGNEYFKQKKFKEAI CYSRSIALSPTA    
Sbjct: 61  MKQYDAVNRLSRNFQTEGSFVDAASEKEQGNEYFKQKKFKEAIDCYSRSIALSPTAVAFA 120

Query: 121 -------------EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 180
                        EAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE
Sbjct: 121 NRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLE 180

Query: 181 PNNQEIKKQHAELRAFVGKAILEKASGASRSSTKEKKMVGKSDSEAKIQDIPPENGGEKA 240
           PNNQEIKKQHA+LRAFVGKAILEKASGASRSSTK KK + KSDS+AKIQDIPP +     
Sbjct: 181 PNNQEIKKQHADLRAFVGKAILEKASGASRSSTKNKKTLKKSDSDAKIQDIPPVSSSTSR 240

Query: 241 VKESARLEGTEGRSTGAEIRYKREATNGVHKDPNSNLGALERNHVSRKQELKPSVQELAS 300
               A  E                               +ER+H+ RKQELK SV ELAS
Sbjct: 241 TGLLAARE------------------------------RVERDHLPRKQELKASVYELAS 300

Query: 301 RAASRSMVEAAKNIVAPTTAYQFEVSWRGFSGDRALQAQLLKAISPAKLPPIFKNALSAP 360
           +AASRSMVEAAKNI+APTTAYQFEVSWRGFSGD+ALQA+LLK ISPAKLP IFK+AL+AP
Sbjct: 301 QAASRSMVEAAKNIIAPTTAYQFEVSWRGFSGDQALQARLLKTISPAKLPQIFKDALTAP 360

Query: 361 ILIDIVKCVATFFTEEMALAISFLENLAKVPRFSILMMCLPSNEKSDLLKIWDGVFCDEA 419
           ILIDIVKCVATFF EE ALAISFLENL  VPRFSILMMCL S+EK DLLKIWD VFCDEA
Sbjct: 361 ILIDIVKCVATFFIEEPALAISFLENLVNVPRFSILMMCLSSSEKFDLLKIWDEVFCDEA 405
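
Each HSP above is headed by two statistics lines (bit score and E-value, then identity and positives counts). The following is a minimal sketch, assuming exactly those two line shapes, of how they could be read programmatically and the percentages recomputed from the counts. The function name `parse_hsp_header` and both regular expressions are illustrative assumptions, not part of BLAST or of this database; the pattern accepts the spelling "Positives" as well as the variant "Postives" that appears in some report renderings.

```python
import re

# Illustrative patterns for the two statistics lines that head each HSP.
# These names are hypothetical helpers, not part of BLAST or the database.
SCORE_RE = re.compile(
    r"HSP\s+(?P<hsp>\d+)\s+Score:\s+(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),"
    r"\s+Expect\s+=\s+(?P<evalue>[\d.eE+-]+)"
)
IDENT_RE = re.compile(
    r"Identity\s+=\s+(?P<ident>\d+)/(?P<length>\d+)\s+\([\d.]+%\),"
    r"\s+Posi?tives\s+=\s+(?P<pos>\d+)/\d+"
)


def parse_hsp_header(score_line, identity_line):
    """Return the HSP statistics as a dict, recomputing the percentages."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    length = int(i["length"])
    return {
        "hsp": int(s["hsp"]),
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        "alignment_length": length,
        # Percentages are derived from the counts, not read from the text.
        "identity_pct": 100.0 * int(i["ident"]) / length,
        "positives_pct": 100.0 * int(i["pos"]) / length,
    }


stats = parse_hsp_header(
    "HSP 1 Score: 630.9 bits (1626), Expect = 1.6e-177",
    "Identity = 351/453 (77.48%), Positives = 371/453 (81.90%), Query Frame = 1",
)
print(f"{stats['identity_pct']:.2f}% identity over {stats['alignment_length']} columns")
```

Recomputing the percentages from the raw counts gives a quick consistency check against the values printed in the report.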

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
OE64C_ARATH  5.5e-13  39.37     Outer envelope protein 64, chloroplastic OS=Arabidopsis thaliana GN=OEP64 PE=1 S...
RPAP3_MOUSE  1.6e-12  26.65     RNA polymerase II-associated protein 3 OS=Mus musculus GN=Rpap3 PE=1 SV=1
RPAP3_HUMAN  4.7e-12  34.12     RNA polymerase II-associated protein 3 OS=Homo sapiens GN=RPAP3 PE=1 SV=2
RPAP3_RAT    2.3e-11  27.48     RNA polymerase II-associated protein 3 OS=Rattus norvegicus GN=Rpap3 PE=1 SV=1
RPAP3_CHICK  2.0e-10  26.80     RNA polymerase II-associated protein 3 OS=Gallus gallus GN=RPAP3 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0LGI8_CUCSA  1.1e-177  77.48     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G034580 PE=4 SV=1
A0A067JV15_JATCU  1.7e-122  55.48     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23222 PE=4 SV=1
U5GD67_POPTR      5.8e-110  52.14     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s01710g PE=4 SV=1
A0A0R0E971_SOYBN  1.4e-108  53.56     Uncharacterized protein OS=Glycine max GN=GLYMA_20G096900 PE=4 SV=1
A0A0B2QTW7_GLYSO  5.4e-108  52.97     RNA polymerase II-associated protein 3 OS=Glycine soja GN=glysoja_046388 PE=4 SV...

Match Name   E-value  Identity  Description
AT1G56440.2  1.1e-86  42.22     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G17970.1  3.1e-14  39.37     translocon at the outer membrane of chloroplasts 64-III
AT5G09420.1  1.5e-08  31.54     translocon at the outer membrane of chloroplasts 64-V
AT2G42810.2  2.2e-07  30.92     protein phosphatase 5.2

Match Name                        E-value   Identity  Description
gi|659070087|ref|XP_008453348.1|  4.5e-180  78.59     PREDICTED: RNA polymerase II-associated protein 3 isoform X1 [Cucumis melo]
gi|449454004|ref|XP_004144746.1|  1.6e-177  77.48     PREDICTED: RNA polymerase II-associated protein 3 isoform X1 [Cucumis sativus]
gi|659070089|ref|XP_008453357.1|  2.2e-171  76.38     PREDICTED: RNA polymerase II-associated protein 3 isoform X2 [Cucumis melo]
gi|659070091|ref|XP_008453365.1|  7.7e-164  75.40     PREDICTED: RNA polymerase II-associated protein 3 isoform X3 [Cucumis melo]
gi|778667104|ref|XP_011648870.1|  1.0e-160  74.25     PREDICTED: RNA polymerase II-associated protein 3 isoform X2 [Cucumis sativus]
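
E-values as small as those in the NCBI nr hit list are easier to compare on a negative log10 scale. The sketch below copies the five nr accessions, E-values and identities from the list above and ranks them; `NR_HITS`, `neg_log10_evalue` and `rank_hits` are hypothetical names introduced only for illustration.

```python
import math

# (accession, E-value as printed, percent identity) copied from the nr hit list.
NR_HITS = [
    ("XP_008453348.1", "4.5e-180", 78.59),
    ("XP_004144746.1", "1.6e-177", 77.48),
    ("XP_008453357.1", "2.2e-171", 76.38),
    ("XP_008453365.1", "7.7e-164", 75.40),
    ("XP_011648870.1", "1.0e-160", 74.25),
]


def neg_log10_evalue(text):
    """Convert a printed E-value to -log10(E); an E-value of 0 maps to infinity."""
    value = float(text)
    if value == 0.0:
        return math.inf
    return -math.log10(value)


def rank_hits(hits):
    """Sort hits from most to least significant (smallest E-value first)."""
    return sorted(hits, key=lambda hit: float(hit[1]))


for accession, evalue, identity in rank_hits(NR_HITS):
    print(f"{accession}\t{neg_log10_evalue(evalue):.1f}\t{identity:.2f}")
```

All five values are well within double-precision range, so a plain `float` conversion is enough here; the guard for 0 is only for reports that round extremely small E-values down to zero.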
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR025986   RPAP3-like_C
IPR019734   TPR_repeat
IPR013026   TPR-contain_dom
IPR011990   TPR-like_helical_dom_sf
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cp4.1LG07g08510  Cp4.1LG07g08510  gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                          Unique Name                           Type
Cp4.1LG07g08510.1:five_prime_utr:001  Cp4.1LG07g08510.1:five_prime_utr:001  five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name               Unique Name                Type
Cp4.1LG07g08510.1:cds:001  Cp4.1LG07g08510.1:cds:001  CDS
Cp4.1LG07g08510.1:cds:002  Cp4.1LG07g08510.1:cds:002  CDS
Cp4.1LG07g08510.1:cds:003  Cp4.1LG07g08510.1:cds:003  CDS
Cp4.1LG07g08510.1:cds:004  Cp4.1LG07g08510.1:cds:004  CDS
Cp4.1LG07g08510.1:cds:005  Cp4.1LG07g08510.1:cds:005  CDS
Cp4.1LG07g08510.1:cds:006  Cp4.1LG07g08510.1:cds:006  CDS
Cp4.1LG07g08510.1:cds:007  Cp4.1LG07g08510.1:cds:007  CDS
Cp4.1LG07g08510.1:cds:008  Cp4.1LG07g08510.1:cds:008  CDS
Cp4.1LG07g08510.1:cds:009  Cp4.1LG07g08510.1:cds:009  CDS
Cp4.1LG07g08510.1:cds:010  Cp4.1LG07g08510.1:cds:010  CDS
Cp4.1LG07g08510.1:cds:011  Cp4.1LG07g08510.1:cds:011  CDS
Cp4.1LG07g08510.1:cds:012  Cp4.1LG07g08510.1:cds:012  CDS
Cp4.1LG07g08510.1:cds:013  Cp4.1LG07g08510.1:cds:013  CDS


The following polypeptide feature(s) derives from this mRNA:

Feature Name       Unique Name                Type
Cp4.1LG07g08510.1  Cp4.1LG07g08510.1-protein  polypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                                 Source   Source Term       Source Description                          Alignment
IPR011990  Tetratricopeptide-like helical domain                           GENE3D   G3DSA:1.25.40.10                                              coord: 82..187 score: 2.2
IPR011990  Tetratricopeptide-like helical domain                           unknown  SSF48452          TPR-like                                    coord: 84..183 score: 9.02
IPR013026  Tetratricopeptide repeat-containing domain                      PROFILE  PS50293           TPR_REGION                                  coord: 83..166 score: 9
IPR019734  Tetratricopeptide repeat                                        SMART    SM00028           tpr_5                                       coord: 83..116 score: 0.0012; coord: 133..166 score: 0
IPR019734  Tetratricopeptide repeat                                        PROFILE  PS50005           TPR                                         coord: 83..116 score: 9.529; coord: 133..166 score: 8
IPR025986  RNA-polymerase II-associated protein 3-like, C-terminal domain  PFAM     PF13877           RPAP3_C                                     coord: 299..388 score: 4.2
None       No IPR available                                                unknown  Coil              Coil                                        coord: 139..159 score: ...
None       No IPR available                                                PANTHER  PTHR22904         TPR REPEAT CONTAINING PROTEIN               coord: 264..349 score: 2.5E-56; coord: 76..197 score: 2.5
None       No IPR available                                                PANTHER  PTHR22904:SF323   CARBOXYLATE CLAMP-TETRATRICOPEPTIDE REPEAT  coord: 264..349 score: 2.5E-56; coord: 76..197 score: 2.5
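
The `coord: start..end` values in the InterPro annotations can be compared directly to see how the predicted features relate to one another. The sketch below assumes the coordinates are 1-based and inclusive residue positions on the polypeptide (an assumption, not stated on this page); the `Hit` class and the helper functions are hypothetical names used only for illustration.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One annotated span; coordinates assumed 1-based and inclusive."""
    source_term: str
    start: int
    end: int

    @property
    def length(self):
        return self.end - self.start + 1


def contains(outer, inner):
    """True if `inner` lies entirely within `outer`."""
    return outer.start <= inner.start and inner.end <= outer.end


def overlaps(a, b):
    """True if the two spans share at least one residue."""
    return a.start <= b.end and b.start <= a.end


# Spans copied from the InterPro annotation rows.
TPR_REGION = Hit("PS50293", 83, 166)
TPR_REPEATS = [Hit("PS50005", 83, 116), Hit("PS50005", 133, 166)]
RPAP3_C = Hit("PF13877", 299, 388)

for repeat in TPR_REPEATS:
    print(repeat.source_term, repeat.length, contains(TPR_REGION, repeat))
print("TPR region overlaps RPAP3_C:", overlaps(TPR_REGION, RPAP3_C))
```

Under that coordinate assumption, both PS50005 repeats fall inside the PS50293 region and each spans 34 residues, while the C-terminal PF13877 domain is separate from the TPR region.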