Cmc03g0072951 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc03g0072951
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Integrase catalytic domain-containing protein
Location: CMiso1.1chr03: 20303584 .. 20304375 (-)
RNA-Seq Expression: Cmc03g0072951
Synteny: Cmc03g0072951
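The Location field gives 1-based, inclusive coordinates on the minus strand. As a minimal sketch (the function name `feature_length` is illustrative and not part of the database), the feature length follows directly from the two coordinates. Dividing by three assumes the whole span is coding, which is an assumption made here for the arithmetic only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Location field of this record.
length_bp = feature_length(20303584, 20304375)

# Assumption: the entire span is coding, so codons = length / 3
# (the last codon would then be the stop codon).
codons, remainder = divmod(length_bp, 3)

print(length_bp, codons, remainder)
```

The strand sign does not change the length; it only indicates that the sequence is read from the reverse complement of the reference.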
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACACAACCAGAAGGCTTTAAAATCTCTGGGCAAGAAAACAAAGTGTGTAAACTGAGAAAATCCCTATACGGTCTCAAGCAAGCTCCCAAGCAGTGGTATGACAAATTTAACAATACGTTGATAACCAACGGATTTAAAATAAATTCCTCTGACACGTGTGTTTATTCAAAGATGTTTGGAGCTGATTGCATATTAATATGTCTATATGTTGATGACATGTTAATCTTTGGAACAAACATGGAGTTAATAACTGATACTAAGTTTTTCCTCTCGTCACACTTTGAAATGAAAGACCTGGGAGAAGCAGACGTAATCCTAGGTGTTATTAGGAAAAACAAAACTAGTTTGTCTCTATGTCAATCTCACTACGTGGAGAAAATACTAAAGAAGTTTGATTCCTTTGATGTTTCTCCTGTGAGAACTCCCTTTGACACTAGTAAATATCTTAAGAAGAATAAAGGAGATAGTGTGTCTCAACCTGAATATGCAAAGATCATAGGTAGTGTGATGTATTTAATGAATTACACTAGATCGGATATTGCATATGATGTCAGTAGATTAAGTAGATATACACACAATCCTGATAGATACCACGGGGATGCCTTATGCTATATGTTGAGATATCTTAAAGGGACGATAGATTACTGTCTACACTTCAACAAATTTTCTGCCGTATTAGAAGGATATTGTGATGCAAACTGGGTCACAGATAATGATGAAGTTAACTTTACTAGTGGGTATGTATTTTTGCTCGGAGGTGGAGCAATATCTTGGAAGTCTACAAAATAG

mRNA sequence

ATGACACAACCAGAAGGCTTTAAAATCTCTGGGCAAGAAAACAAAGTGTGTAAACTGAGAAAATCCCTATACGGTCTCAAGCAAGCTCCCAAGCAGTGGTATGACAAATTTAACAATACGTTGATAACCAACGGATTTAAAATAAATTCCTCTGACACGTGTGTTTATTCAAAGATGTTTGGAGCTGATTGCATATTAATATGTCTATATGTTGATGACATGTTAATCTTTGGAACAAACATGGAGTTAATAACTGATACTAAGTTTTTCCTCTCGTCACACTTTGAAATGAAAGACCTGGGAGAAGCAGACGTAATCCTAGGTGTTATTAGGAAAAACAAAACTAGTTTGTCTCTATGTCAATCTCACTACGTGGAGAAAATACTAAAGAAGTTTGATTCCTTTGATGTTTCTCCTGTGAGAACTCCCTTTGACACTAGTAAATATCTTAAGAAGAATAAAGGAGATAGTGTGTCTCAACCTGAATATGCAAAGATCATAGGTAGTGTGATGTATTTAATGAATTACACTAGATCGGATATTGCATATGATGTCAGTAGATTAAGTAGATATACACACAATCCTGATAGATACCACGGGGATGCCTTATGCTATATGTTGAGATATCTTAAAGGGACGATAGATTACTGTCTACACTTCAACAAATTTTCTGCCGTATTAGAAGGATATTGTGATGCAAACTGGGTCACAGATAATGATGAAGTTAACTTTACTAGTGGGTATGTATTTTTGCTCGGAGGTGGAGCAATATCTTGGAAGTCTACAAAATAG

Coding sequence (CDS)

ATGACACAACCAGAAGGCTTTAAAATCTCTGGGCAAGAAAACAAAGTGTGTAAACTGAGAAAATCCCTATACGGTCTCAAGCAAGCTCCCAAGCAGTGGTATGACAAATTTAACAATACGTTGATAACCAACGGATTTAAAATAAATTCCTCTGACACGTGTGTTTATTCAAAGATGTTTGGAGCTGATTGCATATTAATATGTCTATATGTTGATGACATGTTAATCTTTGGAACAAACATGGAGTTAATAACTGATACTAAGTTTTTCCTCTCGTCACACTTTGAAATGAAAGACCTGGGAGAAGCAGACGTAATCCTAGGTGTTATTAGGAAAAACAAAACTAGTTTGTCTCTATGTCAATCTCACTACGTGGAGAAAATACTAAAGAAGTTTGATTCCTTTGATGTTTCTCCTGTGAGAACTCCCTTTGACACTAGTAAATATCTTAAGAAGAATAAAGGAGATAGTGTGTCTCAACCTGAATATGCAAAGATCATAGGTAGTGTGATGTATTTAATGAATTACACTAGATCGGATATTGCATATGATGTCAGTAGATTAAGTAGATATACACACAATCCTGATAGATACCACGGGGATGCCTTATGCTATATGTTGAGATATCTTAAAGGGACGATAGATTACTGTCTACACTTCAACAAATTTTCTGCCGTATTAGAAGGATATTGTGATGCAAACTGGGTCACAGATAATGATGAAGTTAACTTTACTAGTGGGTATGTATTTTTGCTCGGAGGTGGAGCAATATCTTGGAAGTCTACAAAATAG

Protein sequence

MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMFGADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGVIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRSDIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDNDEVNFTSGYVFLLGGGAISWKSTK
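The protein sequence above can be related to the CDS by translating codons with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code applies; the names `CODON_TABLE` and `translate` are illustrative, and only the first ten and last four codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS shown in this record.
print(translate("ATGACACAACCAGAAGGCTTTAAAATCTCT"))  # MTQPEGFKIS
# Last four codons of the CDS, ending in the TAG stop codon.
print(translate("TCTACAAAATAG"))  # STK
```

Both fragments agree with the start (MTQPEGFKIS) and end (STK) of the listed protein sequence.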
Homology
BLAST of Cmc03g0072951 vs. NCBI nr
Match: TYK06518.1 (ty1-copia retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 509.6 bits (1311), Expect = 1.6e-140
Identity = 248/264 (93.94%), Positives = 254/264 (96.21%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWY+KFNNTLITNGFKINSSDTCVYSKM 
Sbjct: 961  MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKMV 1020

Query: 61   GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSL 120
            GADCILICLYVDDMLIFGTNMELITDTK+FLSSHFEMKDLGEADVILGV IRKNKTSLSL
Sbjct: 1021 GADCILICLYVDDMLIFGTNMELITDTKYFLSSHFEMKDLGEADVILGVKIRKNKTSLSL 1080

Query: 121  CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRS 180
            CQSHYVEKILKKFDSFDVSPVRTPFD SK+LKKNKGDSVSQPEYAKIIGSVMYLMNYTR 
Sbjct: 1081 CQSHYVEKILKKFDSFDVSPVRTPFDASKHLKKNKGDSVSQPEYAKIIGSVMYLMNYTRP 1140

Query: 181  DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDN 240
            DIAY VSRLSRYTHNP+RYH DAL ++LRYLKGTIDYCLHF KF AVLEGYCDANWVTDN
Sbjct: 1141 DIAYAVSRLSRYTHNPNRYHWDALRHLLRYLKGTIDYCLHFKKFPAVLEGYCDANWVTDN 1200

Query: 241  DEVNFTSGYVFLLGGGAISWKSTK 264
            DEVN TSGYVFLLGGGAISWKSTK
Sbjct: 1201 DEVNSTSGYVFLLGGGAISWKSTK 1224
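The Identity and Positives percentages in each HSP header are the match counts divided by the number of aligned columns, gaps included. A minimal sketch of that arithmetic follows; the helper names `percent` and `count_identities` are illustrative, and the two sequences are the first 60-column block of the alignment above.

```python
def percent(matches: int, aligned_columns: int) -> float:
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_columns, 2)


def count_identities(query: str, sbjct: str) -> int:
    """Count columns where both aligned sequences carry the same residue."""
    return sum(q == s and q != "-" for q, s in zip(query, sbjct))


# Counts reported in the HSP header for TYK06518.1.
print(percent(248, 264))  # 93.94
print(percent(254, 264))  # 96.21

# First alignment block (query columns 1-60) of the same HSP.
query = "MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF"
sbjct = "MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKMV"
print(count_identities(query, sbjct))  # 58
```

The two mismatches in this block (D/E and F/V) correspond to the `+` and blank positions in the alignment midline.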

BLAST of Cmc03g0072951 vs. NCBI nr
Match: KAD6453934.1 (hypothetical protein E3N88_08640 [Mikania micrantha])

HSP 1 Score: 386.7 bits (992), Expect = 1.6e-103
Identity = 184/264 (69.70%), Positives = 221/264 (83.71%), Query Frame = 0

Query: 1   MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
           M QPEGF +SG E+KVCKLRKSLYGLKQAPK+WY+KF+ TL  +G+ +N+SD+CVYSK  
Sbjct: 312 MLQPEGFVVSGLESKVCKLRKSLYGLKQAPKKWYEKFDRTLKQDGYIVNNSDSCVYSKRS 371

Query: 61  GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSL 120
               +LICLYVDDMLIFG +M  I  TK FLSS FEMKDLGEADVILGV I++    +SL
Sbjct: 372 KTGYVLICLYVDDMLIFGADMHDINQTKAFLSSKFEMKDLGEADVILGVKIKRTWNGISL 431

Query: 121 CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRS 180
           CQSHY+E++LKKFD F+++PV+TP+D S  LKKN  +SVSQ EYAKIIGSVM+LMNYTR 
Sbjct: 432 CQSHYIEQMLKKFDCFELNPVKTPYDPSILLKKNNHESVSQSEYAKIIGSVMFLMNYTRP 491

Query: 181 DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDN 240
           DIAY VSRLSRYTHNP + H  A+  ++RYL+GT++ CLH+NKF AVLEGYCDANWVTDN
Sbjct: 492 DIAYTVSRLSRYTHNPSKEHWSAIHRLMRYLRGTMELCLHYNKFPAVLEGYCDANWVTDN 551

Query: 241 DEVNFTSGYVFLLGGGAISWKSTK 264
           DEV+ TSGYVF++GGGAISWKS+K
Sbjct: 552 DEVSSTSGYVFIMGGGAISWKSSK 575

BLAST of Cmc03g0072951 vs. NCBI nr
Match: KAE8670806.1 (hypothetical protein F3Y22_tig00112079pilonHSYRG00011 [Hibiscus syriacus])

HSP 1 Score: 364.8 bits (935), Expect = 6.4e-97
Identity = 177/264 (67.05%), Positives = 210/264 (79.55%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            M QP GF+  G E KV +L+KSLYGLKQAPKQWY+KF+ T+++ GF +N SD CVYSKMF
Sbjct: 784  MEQPLGFEAPGMEGKVYRLKKSLYGLKQAPKQWYEKFHKTILSFGFVVNGSDACVYSKMF 843

Query: 61   GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSL 120
              +C++I LYVDDMLIF +N+E I   K FLS+ FEM  LGE DVILGV + K +   SL
Sbjct: 844  DTECVIISLYVDDMLIFSSNIESINKVKNFLSTKFEMTYLGEVDVILGVEVTKTEKGFSL 903

Query: 121  CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRS 180
            CQ+HY++K+LKKFDSFDV PVRTP+D S +L KNKG SVSQ EYAK+IGS+M+LMNYTR 
Sbjct: 904  CQAHYIDKVLKKFDSFDVVPVRTPYDPSIHLVKNKGSSVSQTEYAKLIGSLMFLMNYTRP 963

Query: 181  DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDN 240
            DIAY VSRLSRYTHNP   H  AL  +L+YLKGT+D+ L F  F AVLEGYCDANWV+DN
Sbjct: 964  DIAYAVSRLSRYTHNPSGEHWIALKRLLKYLKGTLDWKLEFAGFPAVLEGYCDANWVSDN 1023

Query: 241  DEVNFTSGYVFLLGGGAISWKSTK 264
            DEV+ TSGYVF LGG AISWKS+K
Sbjct: 1024 DEVSSTSGYVFTLGGAAISWKSSK 1047

BLAST of Cmc03g0072951 vs. NCBI nr
Match: KAG7571733.1 (Integrase catalytic core [Arabidopsis suecica])

HSP 1 Score: 356.7 bits (914), Expect = 1.7e-94
Identity = 174/264 (65.91%), Positives = 210/264 (79.55%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            M QPEGF I GQENKVCKL KSLYGLKQAPKQW++KF+NTL+ NGF  N  DTCV+SK+ 
Sbjct: 902  MMQPEGFIIEGQENKVCKLIKSLYGLKQAPKQWFEKFSNTLLENGFVSNEGDTCVFSKVH 961

Query: 61   GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSL 120
                ++ICLYVDDMLI GT++E++ DTK FLSS F+MKDLGEADVILG+ + K  +  SL
Sbjct: 962  EHGYVIICLYVDDMLILGTSLEIVCDTKVFLSSKFDMKDLGEADVILGIKVVKTDSGFSL 1021

Query: 121  CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRS 180
             QSHY+EKILKKF  +D    ++P+D+S +L +N+G+SV+Q EYAK+IGSVMYLMN TR 
Sbjct: 1022 NQSHYIEKILKKFGYWDEPSAKSPYDSSLHLCQNRGESVNQSEYAKVIGSVMYLMNCTRP 1081

Query: 181  DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDN 240
            DIAY VSRLSRYTHNP   H  AL  ++RYLKGTID+ L ++  S VLE YCDANW +DN
Sbjct: 1082 DIAYAVSRLSRYTHNPGSNHWSALNRLMRYLKGTIDWNLCYSGTSCVLEAYCDANWSSDN 1141

Query: 241  DEVNFTSGYVFLLGGGAISWKSTK 264
            DEVN TSG+VF L GGAI+WKSTK
Sbjct: 1142 DEVNSTSGFVFTLAGGAIAWKSTK 1165

BLAST of Cmc03g0072951 vs. NCBI nr
Match: KAG7551885.1 (Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa])

HSP 1 Score: 356.3 bits (913), Expect = 2.3e-94
Identity = 174/264 (65.91%), Positives = 210/264 (79.55%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            M QPEGF I GQENKVCKL KSLYGLKQAPKQW++KF+NTL+ NGF  N  DTCV+SK+ 
Sbjct: 933  MMQPEGFIIEGQENKVCKLIKSLYGLKQAPKQWFEKFSNTLLENGFVSNEGDTCVFSKVH 992

Query: 61   GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSL 120
                ++ICLYVDDMLI GT++E++ DTK FLSS F+MKDLGEADVILG+ + K  +  SL
Sbjct: 993  EHGYVIICLYVDDMLILGTSLEIVCDTKVFLSSKFDMKDLGEADVILGIKVVKTDSGFSL 1052

Query: 121  CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRS 180
             QSHY+EKILKKF  +D    ++P+D+S +L +N+G+SV+Q EYAK+IGSVMYLMN TR 
Sbjct: 1053 NQSHYIEKILKKFGYWDEPSAKSPYDSSLHLCQNRGESVNQSEYAKVIGSVMYLMNCTRP 1112

Query: 181  DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDN 240
            DIAY VSRLSRYTHNP   H  AL  ++RYLKGTID+ L ++  S VLE YCDANW +DN
Sbjct: 1113 DIAYVVSRLSRYTHNPGSNHWSALNRLMRYLKGTIDWNLCYSGTSCVLEAYCDANWSSDN 1172

Query: 241  DEVNFTSGYVFLLGGGAISWKSTK 264
            DEVN TSG+VF L GGAI+WKSTK
Sbjct: 1173 DEVNSTSGFVFTLAGGAIAWKSTK 1196

BLAST of Cmc03g0072951 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 1.4e-54
Identity = 121/273 (44.32%), Positives = 168/273 (61.54%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            M QPEGF+++G+++ VCKL KSLYGLKQAP+QWY KF++ + +  +    SD CVY K F
Sbjct: 938  MEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRF 997

Query: 61   GA-DCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTS-- 120
               + I++ LYVDDMLI G +  LI   K  LS  F+MKDLG A  ILG+ I + +TS  
Sbjct: 998  SENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRK 1057

Query: 121  LSLCQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKK--------NKGDSVSQPEYAKIIG 180
            L L Q  Y+E++L++F+  +  PV TP      L K         KG+    P Y+  +G
Sbjct: 1058 LWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVP-YSSAVG 1117

Query: 181  SVMYLMNYTRSDIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLE 240
            S+MY M  TR DIA+ V  +SR+  NP + H +A+ ++LRYL+GT   CL F     +L+
Sbjct: 1118 SLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILK 1177

Query: 241  GYCDANWVTDNDEVNFTSGYVFLLGGGAISWKS 262
            GY DA+   D D    ++GY+F   GGAISW+S
Sbjct: 1178 GYTDADMAGDIDNRKSSTGYLFTFSGGAISWQS 1209

BLAST of Cmc03g0072951 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 9.1e-46
Identity = 108/266 (40.60%), Positives = 149/266 (56.02%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            M+QP GF    + N VCKLRK+LYGLKQAP+ WY +  N L+T GF  + SDT ++    
Sbjct: 1081 MSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQR 1140

Query: 61   GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGVIRKN-KTSLSL 120
            G   + + +YVDD+LI G +  L+ +T   LS  F +KD  E    LG+  K   T L L
Sbjct: 1141 GKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGLHL 1200

Query: 121  CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQP-EYAKIIGSVMYLMNYTR 180
             Q  Y+  +L + +     PV TP   S  L    G  ++ P EY  I+GS+ YL  +TR
Sbjct: 1201 SQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLA-FTR 1260

Query: 181  SDIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAV-LEGYCDANWVT 240
             DI+Y V+RLS++ H P   H  AL  +LRYL GT ++ +   K + + L  Y DA+W  
Sbjct: 1261 PDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAG 1320

Query: 241  DNDEVNFTSGYVFLLGGGAISWKSTK 264
            D D+   T+GY+  LG   ISW S K
Sbjct: 1321 DKDDYVSTNGYIVYLGHHPISWSSKK 1345

BLAST of Cmc03g0072951 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 8.5e-44
Identity = 104/266 (39.10%), Positives = 147/266 (55.26%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            M+QP GF    + + VC+LRK++YGLKQAP+ WY +    L+T GF  + SDT ++    
Sbjct: 1064 MSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQR 1123

Query: 61   GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGVIRKN-KTSLSL 120
            G   I + +YVDD+LI G +  L+  T   LS  F +K+  +    LG+  K     L L
Sbjct: 1124 GRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEAKRVPQGLHL 1183

Query: 121  CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQP-EYAKIIGSVMYLMNYTR 180
             Q  Y   +L + +     PV TP  TS  L  + G  +  P EY  I+GS+ YL  +TR
Sbjct: 1184 SQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPTEYRGIVGSLQYLA-FTR 1243

Query: 181  SDIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAV-LEGYCDANWVT 240
             D++Y V+RLS+Y H P   H +AL  +LRYL GT D+ +   K + + L  Y DA+W  
Sbjct: 1244 PDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAG 1303

Query: 241  DNDEVNFTSGYVFLLGGGAISWKSTK 264
            D D+   T+GY+  LG   ISW S K
Sbjct: 1304 DTDDYVSTNGYIVYLGHHPISWSSKK 1328

BLAST of Cmc03g0072951 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 138.7 bits (348), Expect = 9.8e-32
Identity = 93/270 (34.44%), Positives = 137/270 (50.74%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            M  P+G  IS   + VCKL K++YGLKQA + W++ F   L    F  +S D C+Y    
Sbjct: 1018 MRLPQG--ISCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDK 1077

Query: 61   G--ADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSL 120
            G   + I + LYVDD++I   +M  + + K +L   F M DL E    +G+ I   +  +
Sbjct: 1078 GNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKI 1137

Query: 121  SLCQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYT 180
             L QS YV+KIL KF+  + + V TP  +    +    D         +IG +MY+M  T
Sbjct: 1138 YLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDCNTPCRSLIGCLMYIMLCT 1197

Query: 181  RSDIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNK---FSAVLEGYCDAN 240
            R D+   V+ LSRY+   +      L  +LRYLKGTID  L F K   F   + GY D++
Sbjct: 1198 RPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSD 1257

Query: 241  WVTDNDEVNFTSGYVF-LLGGGAISWKSTK 264
            W     +   T+GY+F +     I W + +
Sbjct: 1258 WAGSEIDRKSTTGYLFKMFDFNLICWNTKR 1285

BLAST of Cmc03g0072951 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 131.3 bits (329), Expect = 1.6e-29
Identity = 85/265 (32.08%), Positives = 129/265 (48.68%), Query Frame = 0

Query: 3   QPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMFGA 62
           QP GF      + V +L   +YGLKQAP  W +  NNTL   GF  +  +  +Y +    
Sbjct: 20  QPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKKIGFCRHEGEHGLYFRSTSD 79

Query: 63  DCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILG--VIRKNKTSLSLC 122
             I I +YVDD+L+   + ++    K  L+  + MKDLG+ D  LG  + + +   ++L 
Sbjct: 80  GPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKVDKFLGLNIHQSSNGDITLS 139

Query: 123 QSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQ-PEYAKIIGSVMYLMNYTRS 182
              Y+ K   + +       +TP   SK L +     +     Y  I+G +++  N  R 
Sbjct: 140 LQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDITPYQSIVGQLLFCANTGRP 199

Query: 183 DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAV-LEGYCDANWVTD 242
           DI+Y VS LSR+   P   H ++   +LRYL  T   CL +   S + L  YCDA+    
Sbjct: 200 DISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKYRSGSQLALTVYCDASHGAI 259

Query: 243 NDEVNFTSGYVFLLGGGAISWKSTK 264
           +D  + T GYV LL G  ++W S K
Sbjct: 260 HDLPHSTGGYVTLLAGAPVTWSSKK 284

BLAST of Cmc03g0072951 vs. ExPASy TrEMBL
Match: A0A5D3C5T2 (Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold70G00500 PE=4 SV=1)

HSP 1 Score: 509.6 bits (1311), Expect = 7.8e-141
Identity = 248/264 (93.94%), Positives = 254/264 (96.21%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWY+KFNNTLITNGFKINSSDTCVYSKM 
Sbjct: 961  MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKMV 1020

Query: 61   GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSL 120
            GADCILICLYVDDMLIFGTNMELITDTK+FLSSHFEMKDLGEADVILGV IRKNKTSLSL
Sbjct: 1021 GADCILICLYVDDMLIFGTNMELITDTKYFLSSHFEMKDLGEADVILGVKIRKNKTSLSL 1080

Query: 121  CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRS 180
            CQSHYVEKILKKFDSFDVSPVRTPFD SK+LKKNKGDSVSQPEYAKIIGSVMYLMNYTR 
Sbjct: 1081 CQSHYVEKILKKFDSFDVSPVRTPFDASKHLKKNKGDSVSQPEYAKIIGSVMYLMNYTRP 1140

Query: 181  DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDN 240
            DIAY VSRLSRYTHNP+RYH DAL ++LRYLKGTIDYCLHF KF AVLEGYCDANWVTDN
Sbjct: 1141 DIAYAVSRLSRYTHNPNRYHWDALRHLLRYLKGTIDYCLHFKKFPAVLEGYCDANWVTDN 1200

Query: 241  DEVNFTSGYVFLLGGGAISWKSTK 264
            DEVN TSGYVFLLGGGAISWKSTK
Sbjct: 1201 DEVNSTSGYVFLLGGGAISWKSTK 1224

BLAST of Cmc03g0072951 vs. ExPASy TrEMBL
Match: A0A5N6PGV2 (Reverse transcriptase Ty1/copia-type domain-containing protein OS=Mikania micrantha OX=192012 GN=E3N88_08640 PE=4 SV=1)

HSP 1 Score: 386.7 bits (992), Expect = 7.6e-104
Identity = 184/264 (69.70%), Positives = 221/264 (83.71%), Query Frame = 0

Query: 1   MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
           M QPEGF +SG E+KVCKLRKSLYGLKQAPK+WY+KF+ TL  +G+ +N+SD+CVYSK  
Sbjct: 312 MLQPEGFVVSGLESKVCKLRKSLYGLKQAPKKWYEKFDRTLKQDGYIVNNSDSCVYSKRS 371

Query: 61  GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSL 120
               +LICLYVDDMLIFG +M  I  TK FLSS FEMKDLGEADVILGV I++    +SL
Sbjct: 372 KTGYVLICLYVDDMLIFGADMHDINQTKAFLSSKFEMKDLGEADVILGVKIKRTWNGISL 431

Query: 121 CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRS 180
           CQSHY+E++LKKFD F+++PV+TP+D S  LKKN  +SVSQ EYAKIIGSVM+LMNYTR 
Sbjct: 432 CQSHYIEQMLKKFDCFELNPVKTPYDPSILLKKNNHESVSQSEYAKIIGSVMFLMNYTRP 491

Query: 181 DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDN 240
           DIAY VSRLSRYTHNP + H  A+  ++RYL+GT++ CLH+NKF AVLEGYCDANWVTDN
Sbjct: 492 DIAYTVSRLSRYTHNPSKEHWSAIHRLMRYLRGTMELCLHYNKFPAVLEGYCDANWVTDN 551

Query: 241 DEVNFTSGYVFLLGGGAISWKSTK 264
           DEV+ TSGYVF++GGGAISWKS+K
Sbjct: 552 DEVSSTSGYVFIMGGGAISWKSSK 575

BLAST of Cmc03g0072951 vs. ExPASy TrEMBL
Match: A0A2N9EQT1 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS4851 PE=4 SV=1)

HSP 1 Score: 377.1 bits (967), Expect = 6.0e-101
Identity = 180/264 (68.18%), Positives = 215/264 (81.44%), Query Frame = 0

Query: 1   MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
           M QPEGF + GQENKVCKLRKSLYGLKQAPKQW++KF+ TL++NGF +N SD CVYSK  
Sbjct: 400 MDQPEGFVVQGQENKVCKLRKSLYGLKQAPKQWHEKFDKTLVSNGFAVNESDRCVYSKFS 459

Query: 61  GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSL 120
           GA  ++ICLYVDDMLIFGT+M  + +TK FLSS+F+MKDLGEAD+ILG+ I +N   L+L
Sbjct: 460 GASGVIICLYVDDMLIFGTDMNAVKNTKDFLSSNFDMKDLGEADLILGIRIIRNNEGLTL 519

Query: 121 CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRS 180
            QSHY+EK+LKKF+ +D  PVRTP+D S +LKKN G  VSQ EYAKIIGSVM+LMN TR 
Sbjct: 520 SQSHYIEKVLKKFNHYDYEPVRTPYDPSIHLKKNSGSPVSQSEYAKIIGSVMFLMNCTRP 579

Query: 181 DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDN 240
           DIAY VSRLSRYTHNP   H +A+  +L+YLKGT++  L +    AVLEGYCDANW++DN
Sbjct: 580 DIAYAVSRLSRYTHNPAHEHWNAITRLLKYLKGTMNLGLTYTGHPAVLEGYCDANWISDN 639

Query: 241 DEVNFTSGYVFLLGGGAISWKSTK 264
           DE N TSGYVF LGGGAISWKS+K
Sbjct: 640 DETNSTSGYVFTLGGGAISWKSSK 663

BLAST of Cmc03g0072951 vs. ExPASy TrEMBL
Match: A0A2N9H4B0 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS37208 PE=4 SV=1)

HSP 1 Score: 377.1 bits (967), Expect = 6.0e-101
Identity = 180/264 (68.18%), Positives = 215/264 (81.44%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            M QPEGF + GQENKVCKLRKSLYGLKQAPKQW++KF+ TL++NGF +N SD CVYSK  
Sbjct: 910  MDQPEGFVVQGQENKVCKLRKSLYGLKQAPKQWHEKFDKTLVSNGFAVNESDRCVYSKFS 969

Query: 61   GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSL 120
            GA  ++ICLYVDDMLIFGT+M  + +TK FLSS+F+MKDLGEAD+ILG+ I +N   L+L
Sbjct: 970  GASGVIICLYVDDMLIFGTDMNAVKNTKDFLSSNFDMKDLGEADLILGIRIIRNNEGLTL 1029

Query: 121  CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRS 180
             QSHY+EK+LKKF+ +D  PVRTP+D S +LKKN G  VSQ EYAKIIGSVM+LMN TR 
Sbjct: 1030 SQSHYIEKVLKKFNHYDYEPVRTPYDPSIHLKKNSGSPVSQSEYAKIIGSVMFLMNCTRP 1089

Query: 181  DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDN 240
            DIAY VSRLSRYTHNP   H +A+  +L+YLKGT++  L +    AVLEGYCDANW++DN
Sbjct: 1090 DIAYAVSRLSRYTHNPAHEHWNAITRLLKYLKGTMNLGLTYTGHPAVLEGYCDANWISDN 1149

Query: 241  DEVNFTSGYVFLLGGGAISWKSTK 264
            DE N TSGYVF LGGGAISWKS+K
Sbjct: 1150 DETNSTSGYVFTLGGGAISWKSSK 1173

BLAST of Cmc03g0072951 vs. ExPASy TrEMBL
Match: A0A6A2Y4J7 (Uncharacterized protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00112079pilonHSYRG00011 PE=4 SV=1)

HSP 1 Score: 364.8 bits (935), Expect = 3.1e-97
Identity = 177/264 (67.05%), Positives = 210/264 (79.55%), Query Frame = 0

Query: 1    MTQPEGFKISGQENKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMF 60
            M QP GF+  G E KV +L+KSLYGLKQAPKQWY+KF+ T+++ GF +N SD CVYSKMF
Sbjct: 784  MEQPLGFEAPGMEGKVYRLKKSLYGLKQAPKQWYEKFHKTILSFGFVVNGSDACVYSKMF 843

Query: 61   GADCILICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSL 120
              +C++I LYVDDMLIF +N+E I   K FLS+ FEM  LGE DVILGV + K +   SL
Sbjct: 844  DTECVIISLYVDDMLIFSSNIESINKVKNFLSTKFEMTYLGEVDVILGVEVTKTEKGFSL 903

Query: 121  CQSHYVEKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRS 180
            CQ+HY++K+LKKFDSFDV PVRTP+D S +L KNKG SVSQ EYAK+IGS+M+LMNYTR 
Sbjct: 904  CQAHYIDKVLKKFDSFDVVPVRTPYDPSIHLVKNKGSSVSQTEYAKLIGSLMFLMNYTRP 963

Query: 181  DIAYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAVLEGYCDANWVTDN 240
            DIAY VSRLSRYTHNP   H  AL  +L+YLKGT+D+ L F  F AVLEGYCDANWV+DN
Sbjct: 964  DIAYAVSRLSRYTHNPSGEHWIALKRLLKYLKGTLDWKLEFAGFPAVLEGYCDANWVSDN 1023

Query: 241  DEVNFTSGYVFLLGGGAISWKSTK 264
            DEV+ TSGYVF LGG AISWKS+K
Sbjct: 1024 DEVSSTSGYVFTLGGAAISWKSSK 1047

BLAST of Cmc03g0072951 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 150.2 bits (378), Expect = 2.3e-36
Identity = 88/253 (34.78%), Positives = 140/253 (55.34%), Query Frame = 0

Query: 14  NKVCKLRKSLYGLKQAPKQWYDKFNNTLITNGFKINSSDTCVYSKMFGADCILICLYVDD 73
           N VC L+KS+YGLKQA +QW+ KF+ TLI  GF  + SD   + K+     + + +YVDD
Sbjct: 227 NAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDD 286

Query: 74  MLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSLCQSHYVEKILKKF 133
           ++I   N   + + K  L S F+++DLG     LG+ I ++   +++CQ  Y   +L + 
Sbjct: 287 IIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDET 346

Query: 134 DSFDVSPVRTPFDTS-KYLKKNKGDSVSQPEYAKIIGSVMYLMNYTRSDIAYDVSRLSRY 193
                 P   P D S  +   + GD V    Y ++IG +MYL   TR DI++ V++LS++
Sbjct: 347 GLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYL-QITRLDISFAVNKLSQF 406

Query: 194 THNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAV-LEGYCDANWVTDNDEVNFTSGYVF 253
           +  P   H  A+  +L Y+KGT+   L ++  + + L+ + DA++ +  D    T+GY  
Sbjct: 407 SEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCM 466

Query: 254 LLGGGAISWKSTK 264
            LG   ISWKS K
Sbjct: 467 FLGTSLISWKSKK 478

BLAST of Cmc03g0072951 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 110.2 bits (274), Expect = 2.6e-24
Identity = 70/203 (34.48%), Positives = 109/203 (53.69%), Query Frame = 0

Query: 67  ICLYVDDMLIFGTNMELITDTKFFLSSHFEMKDLGEADVILGV-IRKNKTSLSLCQSHYV 126
           + LYVDD+L+ G++  L+    F LSS F MKDLG     LG+ I+ + + L L Q+ Y 
Sbjct: 3   LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 127 EKILKKFDSFDVSPVRTPFDTSKYLKKNKGDSVSQ----PEYAKIIGSVMYLMNYTRSDI 186
           E+IL      D  P+ TP      LK N   S ++     ++  I+G++ YL   TR DI
Sbjct: 63  EQILNNAGMLDCKPMSTPLP----LKLNSSVSTAKYPDPSDFRSIVGALQYL-TLTRPDI 122

Query: 187 AYDVSRLSRYTHNPDRYHGDALCYMLRYLKGTIDYCLHFNKFSAV-LEGYCDANWVTDND 246
           +Y V+ + +  H P     D L  +LRY+KGTI + L+ +K S + ++ +CD++W     
Sbjct: 123 SYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTS 182

Query: 247 EVNFTSGYVFLLGGGAISWKSTK 264
               T+G+   LG   ISW + +
Sbjct: 183 TRRSTTGFCTFLGCNIISWSAKR 200

The following BLAST results are available for this feature:

NCBI nr
Match Name    E-value    Identity (%)  Description
TYK06518.1    1.6e-140   93.94         ty1-copia retrotransposon protein [Cucumis melo var. makuwa]
KAD6453934.1  1.6e-103   69.70         hypothetical protein E3N88_08640 [Mikania micrantha]
KAE8670806.1  6.4e-97    67.05         hypothetical protein F3Y22_tig00112079pilonHSYRG00011 [Hibiscus syriacus]
KAG7571733.1  1.7e-94    65.91         Integrase catalytic core [Arabidopsis suecica]
KAG7551885.1  2.3e-94    65.91         Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]

ExPASy Swiss-Prot
Match Name    E-value    Identity (%)  Description
P10978        1.4e-54    44.32         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q94HW2        9.1e-46    40.60         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94        8.5e-44    39.10         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P04146        9.8e-32    34.44         Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P25600        1.6e-29    32.08         Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...

ExPASy TrEMBL
Match Name    E-value    Identity (%)  Description
A0A5D3C5T2    7.8e-141   93.94         Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5N6PGV2    7.6e-104   69.70         Reverse transcriptase Ty1/copia-type domain-containing protein OS=Mikania micran...
A0A2N9EQT1    6.0e-101   68.18         Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB...
A0A2N9H4B0    6.0e-101   68.18         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS37208 PE=4 SV=1
A0A6A2Y4J7    3.1e-97    67.05         Uncharacterized protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00112079pilon...

TAIR 10
Match Name    E-value    Identity (%)  Description
AT4G23160.1   2.3e-36    34.78         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1   2.6e-24    34.48         DNA/RNA polymerases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term   IPR Description                                      Source       Source Term  Source Description   Alignment
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM         PF07727      RVT_2                coord: 1..144; e-value: 1.5E-39; score: 136.1
None       No IPR available                                     PANTHER      PTHR45895    FAMILY NOT NAMED     coord: 1..252
IPR043502  DNA/RNA polymerase superfamily                       SUPERFAMILY  56672        DNA/RNA polymerases  coord: 13..261

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Cmc03g0072951.1  Cmc03g0072951.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
biological_process  GO:0045944      positive regulation of transcription by RNA polymerase II
cellular_component  GO:0005634      nucleus
molecular_function  GO:0046983      protein dimerization activity
molecular_function  GO:0000977      RNA polymerase II transcription regulatory region sequence-specific DNA binding
molecular_function  GO:0008270      zinc ion binding