Homology
BLAST of ClCG03G010470 vs. NCBI nr
Match:
RVW64408.1 (LINE-1 retrotransposable element ORF2 protein [Vitis vinifera])
HSP 1 Score: 713.4 bits (1840), Expect = 4.8e-201
Identity = 473/1543 (30.65%), Postives = 744/1543 (48.22%), Query Frame = 0
Query: 259 GWIVIKNLSLEYWSRETFEAIGQHFGGLEEISIETLNLLDVSETKIKVRKNVYGFIPATI 318
GW+ ++ L W I Q +G + +++ ETL L+D+S+ K+ V + +PA +
Sbjct: 262 GWLELRGLPFHLWDEFQLRYILQKWGRVTKVAKETLKLVDLSKVKMWVEMHPKVVLPALL 321
Query: 319 EINN---EFRGSIHLL---FGDIIPMKDSS---SIITEFIGSDFENPIDQVRL-SKVEED 378
E+ + F ++ ++ D + +S+ +T G + P + L + ++
Sbjct: 322 EVEDGAWSFTVAVSVIGEAEEDFLLRPESNRSKDEVTSAEGCVHQRPKNAEGLRATARDN 381
Query: 379 EVRVFSPSKLALLSSKPDQTQPEQEEKSLEISVGSPSVGKWSKSTANGLEGKSINEKAAK 438
E + P + + + E E+ + +G TA ++G + E K
Sbjct: 382 EYHRWRPRHRSRVRYSVSNSDTEVEKGRGKSCLG---------PTAGSVDGLTKPEAFFK 441
Query: 439 LNWCE-ETEEAAVGSSIGVLKSIRPIEVMKKEKRKERVFGRINENIKSIQTPDIPSGDFL 498
++ EE +G S+G P+ E G N P PS L
Sbjct: 442 GHFARAHFEEKNIGPSVG------PVH--------ETEAGSSNGG------PATPSSSKL 501
Query: 499 RSGGPTGFEV--IQKIRKVPNVLLTVAEESEKSPPNSMKSN------------------- 558
+ G + EV I + K+ +T ++ PP+ ++
Sbjct: 502 QRSGTSAKEVMPIAQSAKLKGNSVTARRKARSWPPSLKVTSIVPKRNLDGDGAPEANRGF 561
Query: 559 -WEEVPFQ--TLLPYHQLPRARVNAFPQQSPAHTISSVKSPFFPAVRRSSNLFKPSPKHF 618
W F+ L P + R T + VK P+V S K P
Sbjct: 562 IWGRSVFKKGALSPVSEKSRRFTGGTEGDEGISTCNWVKKVRPPSVALSE---KDLP--- 621
Query: 619 SRGKTSFLNLWSALTNPDILEVSCANSQVPQQIRDRSVLPLSSSQIFHQSEI----LIPG 678
NP+IL S +S +P + + S +PL SS + +S ++
Sbjct: 622 ----------LDGAFNPNILSDSSVSSVIPSRCQAFSPIPLESSLVSQRSPFPKIAVVEP 681
Query: 679 SNILFLRGSPSSYRPKPNKIQDSEEESLISISSDELNEPEE----------DENHLSLEL 738
S ++ + P ++SEE L D L+ PE+ + H SL L
Sbjct: 682 SRLVGVSRPEGVAFPLETSNRNSEERPLF---KDSLSSPEKLCVSGKAQSPNPEHPSLPL 741
Query: 739 D----------ESIEHRSVKVF-------------------SMKIVSWNIRGLGDQSKQL 798
+ + ++ K F MKI+SWN RGLG + K+
Sbjct: 742 EGFQVEGLTPGKMVKKEEEKCFWKPRRGLGLGAGCAGLFLVYMKILSWNTRGLGSKKKRR 801
Query: 799 AVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGLLIMWDESK 858
V+ + N ++V++QETK+E ++ + +W + + W+ + A G SGG++I+WD SK
Sbjct: 802 IVRRFLSTQNPDIVMLQETKRETWDRRFVSSVWKGKRVEWAALPACGASGGIVILWDSSK 861
Query: 859 ISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALAAYCTNAWCLG 918
+ E + G ++++VK + W+T+ YGP + R+ W ELQ L WC+G
Sbjct: 862 LECTEKVLGSFSVTVKFNSGEEGSFWLTSVYGPINPLWRKDFWLELQDLYGLTFPRWCVG 921
Query: 919 GDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNRVSRSLLDR 978
GDFN+ R I E++ RLT M+ F +FI E+ L++ P+ N FTWS LDR
Sbjct: 922 GDFNVIRRISEKLGETRLTLNMRCFDEFIRESGLIDPPLRNAAFTWSNMQADPICKRLDR 981
Query: 979 FLV-----------------RIVSDHFPLLLEAGALEWGPSPFRFCNSWLLNSQCNSIII 1038
FL R SDH P+ LE L+WGP+PFRF N WLL+ +
Sbjct: 982 FLFSSEWDTFFSQSFQEALPRWTSDHSPICLETNPLKWGPTPFRFENMWLLHPEFKEKFR 1041
Query: 1039 RALAAGNHQGWAGFVISAKFR------SEAEIVEKLD-KEEQGAELEDPSSI-LQDPRAS 1098
+GW G K + E I+ D KE + L D S I L + +
Sbjct: 1042 VWWLECTGEGWEGHKFMRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGN 1101
Query: 1099 LKSDLM-----------NIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIA 1158
L SDL+ ++ K+E QKS++ W+ GD N++FFHR ++ + I
Sbjct: 1102 LNSDLVLERTLKRRELEDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGRRSRKFIK 1161
Query: 1159 ELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFSR 1218
L++++G N+ +I ++I+NF+ NLY+K ++W +S E L F+
Sbjct: 1162 SLISERGETLNNIEDISEEIVNFFGNLYSKPVGESWRVEGIDWVPISGESGGWLDRPFTE 1221
Query: 1219 EEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFIC 1278
EE+R A+ + K KAPGPDGFT+ + WD IK+D + +F EFH NG +N FI
Sbjct: 1222 EEVRRAVFQLNKEKAPGPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIA 1281
Query: 1279 LIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKL------------------ILDPI 1338
L+ KK ++ + D+RPISL T YK+IAKVL+ RL+ ILD +
Sbjct: 1282 LVPKKSQSVKISDYRPISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAV 1341
Query: 1339 LIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNP 1398
LIANE+V++ R ++G + K+D EKA+ VDWGFL+ L K F KW WI GC+ +
Sbjct: 1342 LIANEVVDEKRRSGEEGIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSS 1401
Query: 1399 KFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKK 1458
F+I +NG +G V+ASRG+RQGDPLSPFLF LV++VL+ ++ R ++ EGF VG+ +
Sbjct: 1402 SFAILVNGNAKGWVKASRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDR 1461
Query: 1459 VHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKS 1518
V +LQFADDT+ F K ++ L+ L+ + F G K+N +KS + G+N + S
Sbjct: 1462 TRVSLLQFADDTIFFSKASMEHLQNLKIILLVFGQVSGLKINLEKSTISGINTRQELLSS 1521
Query: 1519 TAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCK 1578
A+ +C+ + PL YLGLPLGG+PK + FW P++++I +L WK+ LS GGR+TL +
Sbjct: 1522 LASVFDCRVSEWPLSYLGLPLGGNPKTIGFWDPVVERISRRLDGWKKAYLSLGGRITLIQ 1581
Query: 1579 TVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLG 1638
+ LS++PSY++S+F +P + IE+ RNF W G G K +HL RW V++ + GGLG
Sbjct: 1582 SCLSHIPSYFLSLFKIPASIASKIEKMQRNFLWSGAGEGKKDHLVRWEVVSRPKELGGLG 1641
Query: 1639 LENLKIKNLALLSKWGWRFMQESEALWCKEVASL------------------RSPWISIS 1650
+ ++N+ALL KW WRF +E LW K + S+ R PW +I+
Sbjct: 1642 FGKISLRNIALLGKWLWRFPRERSGLWYKVIGSIYGTHPNGWDANMVVRWSHRCPWKAIA 1701
BLAST of ClCG03G010470 vs. NCBI nr
Match:
RVW16209.1 (Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera])
HSP 1 Score: 695.7 bits (1794), Expect = 1.0e-195
Identity = 366/1004 (36.45%), Postives = 547/1004 (54.48%), Query Frame = 0
Query: 701 FSMKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGW 760
F MKI+SWN+RGLG ++K+ +K + N ++V+IQETKKE + + +W+ R+ W
Sbjct: 49 FPMKIISWNVRGLGSRNKRRMIKDFLRSENPDVVMIQETKKENCDRRFVGSVWTVRNKDW 108
Query: 761 SFVEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERR 820
+ A G SGG+LI+WD +S E + G +++SVK W++ YGP R+
Sbjct: 109 VALPASGASGGILIIWDSKILSREEVVIGSFSVSVKFSLDGCGPLWISAVYGPNSPSLRK 168
Query: 821 RIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMS 880
W EL + WC+GGDFN+ R E++ LT M+ F FI E L++ P+
Sbjct: 169 DFWVELFDIYGLTYPLWCVGGDFNVIRGSSEKMGGSSLTPSMRDFDSFISECELLDPPLR 228
Query: 881 NGRFTWSREGNRVSRSLLDRF-----------------LVRIVSDHFPLLLEAGALEWGP 940
N FTWS LDRF L+R SDH+P++++ WGP
Sbjct: 229 NASFTWSNIQESPVCKRLDRFLYSNEWGLLFPQGLQEALIRRTSDHWPIVMDTNPFMWGP 288
Query: 941 SPFRFCNSWLLNSQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGAELE 1000
+PFRF N WL ++ + GW G KF + V+ KE
Sbjct: 289 TPFRFENMWLQHTNFKENFRDWWSGFQGIGWEGH----KFMRRLQYVKAKLKEWN----- 348
Query: 1001 DPSSILQDPRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLI 1060
S + + K +L + ++E QK+K+ W+ GD N++F+H+ ++ + I
Sbjct: 349 --KSSFGELKEKKKRELEELILREEIHWRQKAKVKWVKEGDCNSKFYHKVANGRRNRKYI 408
Query: 1061 AELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFS 1120
EL N++G + I ++IL++++ LYT L+W +S E RL S F+
Sbjct: 409 KELENERGLVLKNAESITEEILHYFEKLYTNPTGESWGVEGLDWSPISEESALRLESPFT 468
Query: 1121 REEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFI 1180
EEI A+ + ++KAPGPDGFT+ + WD IK+D V +F EFH +G +N +FI
Sbjct: 469 EEEISKAIFQLDRDKAPGPDGFTIAVFQECWDVIKEDLVRVFAEFHRSGIINQSTNASFI 528
Query: 1181 CLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL------------------KLILDP 1240
LI KK + + DFRPISL T YK+IAKVL+ RL + ILD
Sbjct: 529 VLIPKKSLSKRISDFRPISLITSLYKIIAKVLSGRLRGVLHETIHYTQGAFVQGRQILDA 588
Query: 1241 ILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKN 1300
+LIANE+V++ R ++G + K+D EKA+ V W FL+ L K F +W W+ GC+ +
Sbjct: 589 VLIANEIVDERRRSGEEGVVFKIDFEKAYDHVKWDFLDHMLEKKGFSPRWRKWMSGCLSS 648
Query: 1301 PKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKK 1360
F+I +NG +G V+ASRG+RQGDPLSPFLF LV++VL+ ++ R + EGF VG+
Sbjct: 649 VSFAILVNGSAKGWVKASRGLRQGDPLSPFLFTLVADVLSRMLMRAEERNMLEGFRVGRN 708
Query: 1361 KVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVK 1420
+ V LQFADD + F + L+ L+ + F FG KVN +KS++ G+N+D +
Sbjct: 709 RTRVSHLQFADDAIFFSNSREEELQTLKSLLLVFGHIFGLKVNLNKSSIYGINLDQAHLS 768
Query: 1421 STAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLC 1480
A L+CKA P++YLGLPLGG+PK FW P++++I +L W++ LS GGR+TL
Sbjct: 769 RLAEMLDCKASGWPILYLGLPLGGNPKSCGFWDPVVERISSRLDGWQKAYLSFGGRITLI 828
Query: 1481 KTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGL 1540
++ L++LPSY++S+F MP V IER R+F W G G K +HL RW V + GGL
Sbjct: 829 QSCLTHLPSYFLSLFKMPATVAAKIERLQRDFLWSGIGEGKRDHLVRWDIVCRPKTIGGL 888
Query: 1541 GLENLKIKNLALLSKWGWRFMQESEALWCKEVASL------------------RSPWISI 1600
GL N+ +NLALL KW WR+ +E ALW + + S+ R PW +I
Sbjct: 889 GLGNISRRNLALLGKWLWRYPREGSALWHQVILSIYGSHSNGWDANTVVRWSHRCPWKAI 948
Query: 1601 SRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFELALKPNGTDADHWDPCS 1650
++ +Q+ + + G+G RI FW D W DQP ++PRLF + + N + + P
Sbjct: 949 AQVFQEFSLITRYVAGNGDRIRFWEDLWRGDQPLGTQYPRLFRVVVDKNISISSVLGPSR 1008
BLAST of ClCG03G010470 vs. NCBI nr
Match:
CAN68838.1 (hypothetical protein VITISV_030956 [Vitis vinifera])
HSP 1 Score: 689.5 bits (1778), Expect = 7.4e-194
Identity = 373/1052 (35.46%), Postives = 554/1052 (52.66%), Query Frame = 0
Query: 679 EEDENHLSLELDESIEHRSVKVFSMKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQE 738
EE H + L V F MKI+SWN RGLG + K+ VK + ++V+ QE
Sbjct: 806 EEQMLHRIVRLSGFGSEIRVTKFHMKIISWNTRGLGSKKKRRVVKDFLRSEKPDVVMFQE 865
Query: 739 TKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCK 798
TKKE + + +W++R+ W+ + A G SGG+LI+WD K+S E + G +++S+K
Sbjct: 866 TKKEECDRRFVGSVWTARNKDWAALPACGASGGILIIWDTKKLSREEVMLGSFSVSIKFT 925
Query: 799 TLCSKVCWVTNAYGPTDYKERRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRL 858
+ W++ YGP + R+ +W EL +A + WC+GGDFN+ R E++ RL
Sbjct: 926 LNGCESLWLSAVYGPNNSALRKDLWVELSDIAGLASPRWCVGGDFNVIRRSSEKLGGSRL 985
Query: 859 TRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNRVSRSLLDRFLV--------------- 918
T MK F FI + L+++P+ + FTWS LDRFL
Sbjct: 986 TPSMKDFDDFISDCELIDLPLRSASFTWSNMQVNPVCKRLDRFLYSNEWEQTFPQSIQGV 1045
Query: 919 --RIVSDHFPLLLEAGALEWGPSPFRFCNSWLLNSQCNSIIIRALAAGNHQGWAGFVISA 978
R SDH+P++LE +WGP+PFRF N WL + R GW G
Sbjct: 1046 LPRWTSDHWPIVLETNPFKWGPTPFRFENMWLQHPSFKENFGRWWREFQGNGWEGHKFMR 1105
Query: 979 KF---RSEAEIVEKLDKEEQGAELEDPSSILQD----------------PRASLKSDLMN 1038
K +++ ++ K E ED S L + RA K +L
Sbjct: 1106 KLQFVKAKLKVWNKASFGELSKRKEDILSALVNFDSLEQEGGLSHELLAQRAIKKGELEE 1165
Query: 1039 IYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIED 1098
+ ++E QK+++ W+ GD N++FFH+ ++ + I EL N+ G N+ I++
Sbjct: 1166 LILREEIHWRQKARVKWVKEGDCNSKFFHKVANGRRNRKFIKELENENGQMMNNSESIKE 1225
Query: 1099 QILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGP 1158
+IL +++ LYT L+W +S E RL S F+ EEI A+ M ++KAPGP
Sbjct: 1226 EILRYFEKLYTSPSGESWRVEGLDWSPISGESAVRLESPFTEEEICKAIFQMDRDKAPGP 1285
Query: 1159 DGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPIS 1218
DGFT+ W+ IK+D V +F EFH +G +N +FI L+ KK + + DFRPIS
Sbjct: 1286 DGFTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQSTNASFIVLLPKKSMSRRISDFRPIS 1345
Query: 1219 LTTLTYKVIAKVLAERL------------------KLILDPILIANELVEDYRIKKKKGW 1278
L T YK+IAKVLA R+ + ILD +LIANE+V++ R ++G
Sbjct: 1346 LITSLYKIIAKVLAGRIREVLHETIHSTQGAFVQGRQILDAVLIANEIVDEKRRSGEEGV 1405
Query: 1279 ILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASR 1338
+ K+D EKA+ V W FL+ + K F +W W+ GC+ + F++ +NG +G V+ASR
Sbjct: 1406 VFKIDFEKAYDHVSWDFLDHVMEMKGFGIRWRKWMRGCLSSVSFAVLVNGNAKGWVKASR 1465
Query: 1339 GVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKY 1398
G+RQGDPLSPFLF +V++VL+ ++ + + EGF VG+ + V LQFADDT+ F
Sbjct: 1466 GLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLEGFKVGRNRTRVSHLQFADDTIFFSSS 1525
Query: 1399 DLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLG 1458
+ + L+ + F G KVN DKS + G+N++ + A L+CKA P++YLG
Sbjct: 1526 REEDMMTLKNVLLVFGHISGLKVNLDKSNIYGINLEQNHLSRLAEMLDCKASGWPILYLG 1585
Query: 1459 LPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPE 1518
LPLGG+PK FW P+I++I +L W++ LS GGR+TL ++ L+++P Y++S+F +P
Sbjct: 1586 LPLGGNPKTSGFWDPVIERISRRLDGWQKAYLSFGGRITLIQSCLTHMPCYFLSLFKIPA 1645
Query: 1519 KVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWR 1578
V IER R+F W G G K +HL W V K GGLG + I+N+ALL KW WR
Sbjct: 1646 SVAAKIERMQRDFLWSGVGEGKRDHLVNWDVVCKPKSRGGLGFGKISIRNVALLGKWLWR 1705
Query: 1579 FMQESEALWCKEVASL------------------RSPWISISRQWQKIEALAIFKVGDGR 1638
+ +E ALW + + S+ R PW +I+ +Q+ F VG+G
Sbjct: 1706 YPREGSALWHQVILSIYGSHSNGWDVNNIVRWSHRCPWKAIALVYQEFSKFTRFVVGNGD 1765
Query: 1639 RITFWFDPWLEDQPFKVRFPRLFELALKPNGTDADHWDPCSS--------SWDLLFKRRL 1650
RI FW D W +QP V++PRL + N P SS SW+ F+R L
Sbjct: 1766 RIRFWDDLWWGEQPLGVQYPRLLRVVTDKNA-------PISSILGSTRPFSWNFTFRRNL 1825
BLAST of ClCG03G010470 vs. NCBI nr
Match:
RVW70235.1 (LINE-1 retrotransposable element ORF2 protein [Vitis vinifera])
HSP 1 Score: 688.3 bits (1775), Expect = 1.6e-193
Identity = 396/1178 (33.62%), Postives = 594/1178 (50.42%), Query Frame = 0
Query: 564 PFFPAVRRSSNLFKPSPKHFSRGKTSFLNLWSALTNPDILEVSCANSQVPQQIR------ 623
PF P+ SN P H T +PD S + S P + R
Sbjct: 677 PFIPSSSGFSNSLLNPPVHIQCPSTPM--------SPD----SSSQSLAPMENRVKSKFF 736
Query: 624 -----DRSVLPLSSSQIFHQSEILIPGSNILFLRGSPSSYRPKPNKIQDSEEESLISISS 683
D P+ + ++E++ P + S S+ N +E S ++
Sbjct: 737 SKKGNDEGHFPVDIPSLEMETEVIQPADP---YQMSESANSLSANLRLPCKESSKATVHL 796
Query: 684 DELNEPEEDENHLSLELDESIEHRSVKVFSMKIVSWNIRGLGDQSKQLAVKHLIMKTNLE 743
+ + EE H + L V F MKI+SWN RGLG + K+ VK + +
Sbjct: 797 GGIFKEEEQMLHRIVRLSGFGSEIRVTKFHMKIISWNTRGLGSKKKRRVVKDFLRSEKPD 856
Query: 744 LVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGLLIMWDESKISVIETIKGGYT 803
+V+ QETKKE + + +W++R+ W+ + A G SGG+LI+WD K+S E + G ++
Sbjct: 857 VVMFQETKKEECDRRFVGSVWTARNKDWAALPACGASGGILIIWDTKKLSREEVMLGSFS 916
Query: 804 LSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALAAYCTNAWCLGGDFNITRAIHER 863
+S+K + W++ YGP + R+ +W EL +A + WC+GGDFN+ R E+
Sbjct: 917 VSIKFTLNGCESLWLSAVYGPNNSALRKDLWVELSDIAGLASPRWCVGGDFNVIRRSSEK 976
Query: 864 VPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNRVSRSLLDRFLV--------- 923
+ RLT MK F FI + L+++P+ + FTWS LDRFL
Sbjct: 977 LGGSRLTPSMKDFDDFISDCELIDLPLRSASFTWSNMQVNPVCKRLDRFLYSNEWEQTFP 1036
Query: 924 --------RIVSDHFPLLLEAGALEWGPSPFRFCNSWLLNSQCNSIIIRALAAGNHQGWA 983
R SDH+P++LE +WGP+PFRF N WL + R GW
Sbjct: 1037 QSIQGVLPRWTSDHWPIVLETNPFKWGPTPFRFENMWLQHPSFKENFGRWWREFQGNGWE 1096
Query: 984 GFVISAKF---RSEAEIVEKLDKEEQGAELEDPSSILQD----------------PRASL 1043
G K +++ ++ K E ED S L + RA
Sbjct: 1097 GHKFMRKLQFVKAKLKVWNKASFGELSKRKEDILSALVNFDSLEQEGGLSHELLAQRAIK 1156
Query: 1044 KSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNS 1103
K +L + ++E QK+++ W+ GD N++FFH+ ++ + I EL N+ G N+
Sbjct: 1157 KGELEELILREEIHWRQKARVKWVKEGDCNSKFFHKVANGRRNRKFIKELENENGQMMNN 1216
Query: 1104 YCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGK 1163
I+++IL +++ LYT L+W +S E RL S F+ EEI A+ M +
Sbjct: 1217 SESIKEEILRYFEKLYTSPSGESWRVEGLDWSPISGESAVRLESPFTEEEICKAIFQMDR 1276
Query: 1164 NKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLIKKKEDAIMVK 1223
+KAPGPDGFT+ W+ IK+D V +F EFH +G +N +FI L+ KK + +
Sbjct: 1277 DKAPGPDGFTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQSTNASFIVLLPKKSMSRRIS 1336
Query: 1224 DFRPISLTTLTYKVIAKVLAERL------------------KLILDPILIANELVEDYRI 1283
DFRPISL T YK+IAKVLA R+ + ILD +LIANE+V++ R
Sbjct: 1337 DFRPISLITSLYKIIAKVLAGRIREVLHETIHSTQGAFVQGRQILDAVLIANEIVDEKRR 1396
Query: 1284 KKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRG 1343
++G + K+D EKA+ V W FL+ + K F +W W+ GC+ + F++ +NG +G
Sbjct: 1397 SGEEGVVFKIDFEKAYDHVSWDFLDHVMEMKGFGIRWRKWMRGCLSSVSFAVLVNGNAKG 1456
Query: 1344 RVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDT 1403
V+ASRG+RQGDPLSPFLF +V++VL+ ++ + + EGF VG+ + V LQFADDT
Sbjct: 1457 WVKASRGLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLEGFKVGRNRTRVSHLQFADDT 1516
Query: 1404 LLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKL 1463
+ F + + L+ + F G KVN DKS + G+N++ + A L+CKA
Sbjct: 1517 IFFSSSREEDMMTLKNVLLVFGHISGLKVNLDKSNIYGINLEQNHLSRLAEMLDCKASGW 1576
Query: 1464 PLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMS 1523
P++YLGLPLGG+PK FW P+I++I +L W++ LS GGR+TL ++ L+++P Y++S
Sbjct: 1577 PILYLGLPLGGNPKTSGFWDPVIERISRRLDGWQKAYLSFGGRITLIQSCLTHMPCYFLS 1636
Query: 1524 IFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLGLENLKIKNLALL 1583
+F +P V IER R+F W G G K +HL W V K GGLG + I+N+ALL
Sbjct: 1637 LFKIPASVAAKIERMQRDFLWSGVGEGKRDHLVNWDVVCKPKSRGGLGFGKISIRNVALL 1696
Query: 1584 SKWGWRFMQESEALWCKEVASL------------------RSPWISISRQWQKIEALAIF 1643
KW WR+ +E ALW + + S+ R PW +I+ +Q+ F
Sbjct: 1697 GKWLWRYPREGSALWHQVILSIYGSHSNGWDVNNIVRWSHRCPWKAIALVYQEFSKFTRF 1756
Query: 1644 KVGDGRRITFWFDPWLEDQPFKVRFPRLFELALKPNGTDADHWDPCSS--------SWDL 1650
VG+G RI FW D W +QP V++PRL + N P SS SW+
Sbjct: 1757 VVGNGDRIRFWDDLWWGEQPLGVQYPRLLRVVTDKNA-------PISSILGSTRPFSWNF 1816
BLAST of ClCG03G010470 vs. NCBI nr
Match:
RVW65579.1 (Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera])
HSP 1 Score: 688.0 bits (1774), Expect = 2.1e-193
Identity = 367/1021 (35.95%), Postives = 544/1021 (53.28%), Query Frame = 0
Query: 703 MKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSF 762
MKI+SWN RGLG + K+ VK+ + ++V+IQETKKE + + +WS R+ W+
Sbjct: 1 MKIISWNTRGLGSKKKRRVVKNFLSSEKPDVVMIQETKKEECDRRLVGSVWSVRNKDWAA 60
Query: 763 VEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRI 822
+ A G SGG+LI+WD K+ E + G +++S+K + W++ YGP + R+
Sbjct: 61 LPASGASGGILIIWDSIKMRREEVVLGSFSVSIKFAMDGCESLWLSAVYGPNNSALRKDF 120
Query: 823 WRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNG 882
W EL +A WC+GGDFN+ R E++ RLT MK F +FI + L++ P+ +
Sbjct: 121 WVELSDIAGLSHPRWCVGGDFNVIRRSSEKLGGSRLTPCMKDFDEFIRDCELIDSPLRSV 180
Query: 883 RFTWSREGNRVSRSLLDRFLV-----------------RIVSDHFPLLLEAGALEWGPSP 942
+TWS LDRFL R SDH+P++LE +WGP+P
Sbjct: 181 SYTWSNMQENPVCKRLDRFLYSNEWEQVFPQSLQGVLPRWTSDHWPIVLETNPFKWGPTP 240
Query: 943 FRFCNSWLLNSQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGA----- 1002
FRF N WL +S R + GW G K + +++ +K G
Sbjct: 241 FRFENMWLQHSSFKENFGRWWSEFQGNGWEGHKFMRKLQFVKAKLKEWNKTSFGELSKKK 300
Query: 1003 -----------ELEDPSSILQD---PRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDEN 1062
LE + Q+ RA K +L + ++E QK+++ W+ GD N
Sbjct: 301 KDILAVLANFDSLEQEGGLSQELLVQRAFSKGELEELILREEIHWRQKARVKWVKKGDCN 360
Query: 1063 TRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLE 1122
++FFH+ ++ + I EL N+ G N+ I+++IL +++ LY L+
Sbjct: 361 SKFFHKVANGRRNRKFIKELENESGLMLNNPESIKEEILKYFEKLYACPSRESWRVEGLD 420
Query: 1123 WQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFN 1182
W + E SRL S F+ EEI A+ M ++KAPGPDGFT+ WD IK+D V +F
Sbjct: 421 WSPIDGESASRLESPFTEEEIYKAIFQMDRDKAPGPDGFTIAVFQDCWDVIKEDLVRVFA 480
Query: 1183 EFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL------- 1242
EFH +G +N +FI L+ KK + + DFRPISL T YK+IAKVLA RL
Sbjct: 481 EFHRSGIINQSTNASFIVLLPKKSISRRISDFRPISLITSLYKIIAKVLAGRLRGVLHET 540
Query: 1243 -----------KLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHG 1302
+ ILD +LIANE+V++ R ++G + K+D EKA+ V W FL+ L
Sbjct: 541 IHSTQGAFVQGRQILDAVLIANEIVDEKRRTGEEGVVFKIDFEKAYDHVSWDFLDHVLEM 600
Query: 1303 KNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLI 1362
K F +W W+ GC+ + +++ +NG +G V+ASRG+RQGDPLSPFLF +V++VL+ ++
Sbjct: 601 KGFSLRWRKWMRGCLSSVSYAVLVNGNAKGWVKASRGLRQGDPLSPFLFTIVADVLSRML 660
Query: 1363 SRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVN 1422
+ + EGF VG+ + V LQFADDT+ F + L L+ + F G KVN
Sbjct: 661 LKAEERNVLEGFRVGRNRTRVSHLQFADDTIFFSSTREEDLMTLKSVLLVFGHISGLKVN 720
Query: 1423 WDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKL 1482
DKS + G+NI+ + A L+CKA P++YLGLPLGG+PK FW P+I++I +L
Sbjct: 721 LDKSNIYGINIEQNHLSRLAVMLDCKASGWPILYLGLPLGGNPKASGFWDPVIERISRRL 780
Query: 1483 SRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLN 1542
W++ LS GGR+TL ++ L+++P Y++S+F +P V IER R F W G G K +
Sbjct: 781 DGWQKAYLSFGGRITLIQSCLTHMPCYFLSLFRIPASVAAKIERMQREFLWSGVGEGKRD 840
Query: 1543 HLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEVASL-------- 1602
HL W V K GGLG + ++N+ALL KW WR+ +E ALW + + S+
Sbjct: 841 HLVNWDVVCKPKSRGGLGFGKISMRNVALLGKWLWRYPREGSALWHQVILSIYGSHSNGW 900
Query: 1603 ----------RSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFE 1650
R PW +I+ +Q+ F VGDG RI FW D W DQP ++PRL
Sbjct: 901 DVNNNVRWSHRCPWKAIALVFQEFSKFTRFVVGDGDRIRFWDDLWWGDQPLGTQYPRLLS 960
BLAST of ClCG03G010470 vs. ExPASy Swiss-Prot
Match:
O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)
HSP 1 Score: 196.1 bits (497), Expect = 3.4e-48
Identity = 210/906 (23.18%), Postives = 375/906 (41.39%), Query Frame = 0
Query: 705 IVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGW-SFV 764
I++ N+ GL K+ + I + + IQET + +K GW
Sbjct: 10 ILTLNVNGLNSPIKRHRLASWIKSQDPSVCCIQETHLTCRDTHRLKIK------GWRKIY 69
Query: 765 EAYG---RSGGLLIMWDES--KISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKE 824
+A G ++G +++ D++ K + I+ K G+ + VK ++ + + N Y P +
Sbjct: 70 QANGKQKKAGVAILVSDKTDFKPTKIKRDKEGHYIMVK-GSIQQEELTILNIYAP-NTGA 129
Query: 825 RRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIP 884
R I + L L + + GDFN +I +R ++ + ++ + + + L++I
Sbjct: 130 PRFIKQVLSDLQRDLDSHTLIMGDFNTPLSILDRSTRQKVNKDTQELNSALHQTDLIDIY 189
Query: 885 ------------MSNGRFTWSREGNRV-SRSLLDR-----FLVRIVSDHFPLLLE----- 944
S T+S+ + V S++LL + + +SDH + LE
Sbjct: 190 RTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKN 249
Query: 945 ---AGALEWGPSPFRFCNSWLLNSQCNSIIIRALAAGNHQG-----WAGF--VISAKF-- 1004
+ + W + + W+ N I + N W F V KF
Sbjct: 250 LTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIA 309
Query: 1005 ---------RSEAEIVEKLDKEEQGAELEDPSSILQDPRASLKSDLMNIYKKKERDLIQK 1064
RS+ + + KE + E + + ++++L I +K I +
Sbjct: 310 LNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINE 369
Query: 1065 SKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYT- 1124
S+ + ++ R R + K+ KN I + ND+G T EI+ I +YK+LY
Sbjct: 370 SRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYAN 429
Query: 1125 ---KTPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFL 1184
F R++ E+ L+ + EI + + K+PGPDGFT EF
Sbjct: 430 KLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFY 489
Query: 1185 NKFWDRIKDDFVALFNEFHENGRL-NSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYK 1244
++ + + + LF + G L NS + + I + K D ++FRPISL + K
Sbjct: 490 QRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAK 549
Query: 1245 VIAKVLAERLKLILDPIL-------------------IANELVEDYRIKKKKGWILKLDL 1304
++ K+LA R++ + ++ N + R K K I+ +D
Sbjct: 550 ILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHVIISIDA 609
Query: 1305 EKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVRQGD 1364
EKAF ++ F+ K L+ D ++ I P +I +NG+ G RQG
Sbjct: 610 EKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGC 669
Query: 1365 PLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLE 1424
PLSP LF +V EVL I + K+ +G +GK++V + + FADD +++ + + +
Sbjct: 670 PLSPLLFNIVLEVLARAI---RQEKEIKGIQLGKEEVKLSL--FADDMIVYLENPIVSAQ 729
Query: 1425 ALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGH 1484
L K I F G K+N KS N + L + YLG+ L
Sbjct: 730 NLLKLISNFSKVSGYKINVQKSQAFLYNNNRQTESQIMGELPFTIASKRIKYLGIQLTRD 789
Query: 1485 PKKMV--FWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVL 1531
K + ++P++ +I+ ++WK S GR+ + K + LP +P K+ +
Sbjct: 790 VKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAI--LPKVIYRFNAIPIKLPM 849
BLAST of ClCG03G010470 vs. ExPASy Swiss-Prot
Match:
P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)
HSP 1 Score: 192.6 bits (488), Expect = 3.7e-47
Identity = 199/784 (25.38%), Postives = 325/784 (41.45%), Query Frame = 0
Query: 809 NAYGPTDYKERRRIWRELQALAAYCTN--AWCLGGDFNITRAIHERVPIGRLTRGMKKFS 868
N Y PT ER R + L A + A +GGDFN T +R +
Sbjct: 108 NVYAPTTGPERARFFESLSAYMETIDSDEALIIGGDFNYTLDARDRNVPKKRDSSESVLR 167
Query: 869 KFIEEAHLMEIPMSNG----RFTWSR-EGNRVSRSLLDRFLVRIVSDHFPLLLEAGALEW 928
+ I L+++ FT+ R VS+S +DR +S H ++ +
Sbjct: 168 ELIAHFSLVDVWREQNPETVAFTYVRVRDGHVSQSRIDRI---YISSHLMSRAQSSTIRL 227
Query: 929 GPSPFRFCNS--------------WLLNSQCNSII----IRALAAGNHQGWAGF------ 988
P C S W N NS++ +GW F
Sbjct: 228 APFSDHNCVSLRMSIAPSLPKAAYWHFN---NSLLEDEGFAKSVRDTWRGWRAFQDEFAT 287
Query: 989 ---------------------VISAKFRSEAEIV--EKLDKEEQGAELEDPSSILQDPRA 1048
+S + +E E + E LD E++ + ED + LQ
Sbjct: 288 LNQWWDVGKVHLKLLCQEYTKSVSGQRNAEIEALNGEVLDLEQRLSGSEDQA--LQCEYL 347
Query: 1049 SLKSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPT 1108
K L N+ +++ R +S++ L D +RFF+ K + I L + G P
Sbjct: 348 ERKEALRNMEQRQARGAFVRSRMQLLCDMDRGSRFFYALEKKKGNRKQITCLFAEDGTPL 407
Query: 1109 NSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQR---VSVEQNSRLSSKFSREEIRFAL 1168
I D+ +FY+NL++ P + L W VS + RL + + +E+ AL
Sbjct: 408 EDPEAIRDRARSFYQNLFSPDPISPDACEEL-WDGLPVVSERRKERLETPITLDELSQAL 467
Query: 1169 RGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLIKKKED 1228
R M NK+PG DG T+EF FWD + DF + E + G L + + L+ KK D
Sbjct: 468 RLMPHNKSPGLDGLTIEFFQFFWDTLGPDFHRVLTEAFKKGELPLSCRRAVLSLLPKKGD 527
Query: 1229 AIMVKDFRPISLTTLTYKVIAKVLAERLKLIL------------------DPILIANELV 1288
++K++RP+SL + YK++AK ++ RLK +L D + + +L+
Sbjct: 528 LRLIKNWRPVSLLSTDYKIVAKAISLRLKSVLAEVIHPDQSYTVPGRTIFDNVFLIRDLL 587
Query: 1289 EDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFIN 1348
R L LD EKAF RVD +L L +F +++ ++ + + + IN
Sbjct: 588 HFARRTGLSLAFLSLDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKIN 647
Query: 1349 GRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQ 1408
+ RGVRQG PLS L+ L E L+ K+ G ++ + + V +
Sbjct: 648 WSLTAPLAFGRGVRQGCPLSGQLYSLAIEPFLCLL-----RKRLTGLVLKEPDMRVVLSA 707
Query: 1409 FADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVK-STAARLN 1468
+ADD +L + DL LE ++ E + ++NW KS+ GL L+V A +
Sbjct: 708 YADDVILVAQ-DLVDLERAQECQEVYAAASSARINWSKSS--GLLEGSLKVDFLPPAFRD 767
Query: 1469 CKAEKLPLMYLGLPLGG--HPKKMVFWQPIIDKIQGKLSRWK---RNNLSRGGRLTLCKT 1508
E + YLG+ L +P F + + + + +L +WK + RG L + +
Sbjct: 768 ISWESKIIKYLGVYLSAEEYPVSQNFIE-LEECVLTRLGKWKGFAKVLSMRGRALVINQL 827
BLAST of ClCG03G010470 vs. ExPASy Swiss-Prot
Match:
P08548 (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1)
HSP 1 Score: 187.6 bits (475), Expect = 1.2e-45
Identity = 208/909 (22.88%), Postives = 386/909 (42.46%), Query Frame = 0
Query: 703 MKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSF 762
+ I S N+ GL K+ + I K ++ IQE+ +K + + GWS
Sbjct: 7 LSIFSINVNGLNCPLKRHRLADWIQKLKPDICCIQESHL------TLKDKYRLKVKGWSS 66
Query: 763 V-EAYG--RSGGLLIMWDES---KISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDY 822
+ +A G + G+ I++ ++ K + I K G+ + VK T ++ + N Y P ++
Sbjct: 67 IFQANGKQKKAGIAILFADAIGFKPTKIRKDKDGHFIFVKGNTQYDEIS-IINIYAP-NH 126
Query: 823 KERRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLME 882
+ I L ++ ++ + GDFN A+ +R +L++ + + I+ L +
Sbjct: 127 NAPQFIRETLTDMSNLISSTSIVVGDFNTPLAVLDRSSKKKLSKEILDLNSTIQHLDLTD 186
Query: 883 IP------------MSNGRFTWSREGNRVS-RSLLDRF-----LVRIVSDHFPLLLEAG- 942
I S+ T+S+ + + +S L +F + I SDH + +E
Sbjct: 187 IYRTFHPNKTEYTFFSSAHGTYSKIDHILGHKSNLSKFKKIEIIPCIFSDHHGIKVELNN 246
Query: 943 -------ALEWGPSPFRFCNSWLLNSQCNSIIIRALAAGNHQG------W--AGFVISAK 1002
W + ++W+++ + I + L N+Q W A V+ K
Sbjct: 247 NRNLHTHTKTWKLNNLMLKDTWVID-EIKKEITKFLEQNNNQDTNYQNLWDTAKAVLRGK 306
Query: 1003 FRSEAEIVEKLDKEE-----------QGAELEDPSSILQDPRASLKSDLMNIYKKKERDL 1062
F + ++K ++EE + E +P + ++++L I K+
Sbjct: 307 FIALQAFLKKTEREEVNNLMGHLKQLEKEEHSNPKPSRRKEITKIRAELNEIENKRIIQQ 366
Query: 1063 IQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNL 1122
I KSK + ++ + K+ K+LI+ + N T EI+ + +YK L
Sbjct: 367 INKSKSWFFEKINKIDKPLANLTRKKRVKSLISSIRNGNDEITTDPSEIQKILNEYYKKL 426
Query: 1123 YT-KTPSAGCFPANLE---WQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTV 1182
Y+ K + LE R+S ++ L+ S EI ++ + K K+PGPDGFT
Sbjct: 427 YSHKYENLKEIDQYLEACHLPRLSQKEVEMLNRPISSSEIASTIQNLPKKKSPGPDGFTS 486
Query: 1183 EFLNKFWDRIKDDFVALFNEFHENGRL-NSCVKENFICLIKKKEDAIMVKDFRPISLTTL 1242
EF F + + + LF + G L N+ + N + K +D +++RPISL +
Sbjct: 487 EFYQTFKEELVPILLNLFQNIEKEGILPNTFYEANITLIPKPGKDPTRKENYRPISLMNI 546
Query: 1243 TYKVIAKVLAERLKLILDPIL-------------------IANELVEDYRIKKKKGWILK 1302
K++ K+L R++ + I+ N + ++K K IL
Sbjct: 547 DAKILNKILTNRIQQHIKKIIHHDQVGFIPGSQGWFNIRKSINVIQHINKLKNKDHMILS 606
Query: 1303 LDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVR 1362
+D EKAF + F+ + L + ++ I P +I +NG G R
Sbjct: 607 IDAEKAFDNIQHPFMIRTLKKIGIEGTFLKLIEAIYSKPTANIILNGVKLKSFPLRSGTR 666
Query: 1363 QGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLD 1422
QG PLSP LF +V EVL I + K +G +G +++ + + FADD +++ + D
Sbjct: 667 QGCPLSPLLFNIVMEVLAIAI---REEKAIKGIHIGSEEIKLSL--FADDMIVYLENTRD 726
Query: 1423 MLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPL 1482
L + I+ + G K+N KS ++ K+ + + YLG+ L
Sbjct: 727 STTKLLEVIKEYSNVSGYKINTHKSVAFIYTNNNQAEKTVKDSIPFTVVPKKMKYLGVYL 786
Query: 1483 GGHPKKMV--FWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEK 1529
K + ++ + +I +++WK S GR+ + K +S LP + +P K
Sbjct: 787 TKDVKDLYKENYETLRKEIAEDVNKWKNIPCSWLGRINIVK--MSILPKAIYNFNAIPIK 846
BLAST of ClCG03G010470 vs. ExPASy Swiss-Prot
Match:
P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)
HSP 1 Score: 177.2 bits (448), Expect = 1.6e-42
Identity = 212/913 (23.22%), Postives = 381/913 (41.73%), Query Frame = 0
Query: 705 IVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFV- 764
++S NI GL K+ + + K + +QET + + R GW +
Sbjct: 17 LISLNINGLNSPIKRHRLTDWLHKQDPTFCCLQETHLREKDRHYL------RVKGWKTIF 76
Query: 765 EAYG--RSGGLLIMWDES---KISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKE 824
+A G + G+ I+ + + VI+ K G+ + +K K L ++ + N Y P + +
Sbjct: 77 QANGLKKQAGVAILISDKIDFQPKVIKKDKEGHFILIKGKILQEELS-ILNIYAP-NARA 136
Query: 825 RRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEI- 884
I L L AY + GDFN + +R +L R K ++ +++ L +I
Sbjct: 137 ATFIRDTLVKLKAYIAPHTIIVGDFNTPLSSKDRSWKQKLNRDTVKLTEVMKQMDLTDIY 196
Query: 885 ----PMSNGRFTWSREGNRVS--------RSLLDRF-----LVRIVSDHFPL-LLEAGAL 944
P + G +S S ++ L+R+ + I+SDH L L+ +
Sbjct: 197 RTFYPKTKGYTFFSAPHGTFSKIDHIIGHKTGLNRYKNIEIVPCILSDHHGLRLIFNNNI 256
Query: 945 EWGPSPFRFCNSWLLNSQ-CNSIIIRALAAGNHQGWAGF-------------VISAKFRS 1004
G F +W LN+ N +++ + + F + A R
Sbjct: 257 NNGKPTF----TWKLNNTLLNDTLVKEGIKKEIKDFLEFNENEATTYPNLWDTMKAFLRG 316
Query: 1005 EAEIVEKLDKEEQGAELEDPSSILQDPRASLKSDLMNIYKKKERDLIQ-KSKLNWL---- 1064
+ + K+ + A SS+ +A K + + + + +++I+ + ++N +
Sbjct: 317 KLIALSASKKKRETAH---TSSLTTHLKALEKKEANSPKRSRRQEIIKLRGEINQVETRR 376
Query: 1065 ---HLGDENTRFFH----------RFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNF 1124
+ + FF R + K LI ++ N++G T EI++ I +F
Sbjct: 377 TIQRINQTRSWFFEKINKIDKPLARLTKGHRDKILINKIRNEKGDITTDPEEIQNTIRSF 436
Query: 1125 YKNLYT----KTPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPD 1184
YK LY+ F + +++ +Q L+S S +EI + + K+PGPD
Sbjct: 437 YKRLYSTKLENLDEMDKFLDRYQVPKLNQDQVDHLNSPISPKEIEAVINSLPTKKSPGPD 496
Query: 1185 GFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLI-KKKEDAIMVKDFRPIS 1244
GF+ EF F + + LF++ G L + E I LI K ++D +++FRPIS
Sbjct: 497 GFSAEFYQTFKEDLIPILHKLFHKIEVEGTLPNSFYEATITLIPKPQKDPTKIENFRPIS 556
Query: 1245 LTTLTYKVIAKVLA----ERLKLILDPILIA---------------NELVEDYRIKKKKG 1304
L + K++ K+LA E +K I+ P + N + ++K K
Sbjct: 557 LMNIDAKILNKILANRIQEHIKAIIHPDQVGFIPGMQGWFNIRKSINVIHYINKLKDKNH 616
Query: 1305 WILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQAS 1364
I+ LD EKAF ++ F+ K L +++ I P +I +NG +
Sbjct: 617 MIISLDAEKAFDKIQHPFMIKVLERSGIQGPYLNMIKAIYSKPVANIKVNGEKLEAIPLK 676
Query: 1365 RGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCK 1424
G RQG PLSP+LF +V EVL I + K+ +G +GK++V + +L ADD +++
Sbjct: 677 SGTRQGCPLSPYLFNIVLEVLARAI---RQQKEIKGIQIGKEEVKISLL--ADDMIVYIS 736
Query: 1425 YDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYL 1484
+ L I F G K+N +KS + K + YL
Sbjct: 737 DPKNSTRELLNLINSFGEVVGYKINSNKSMAFLYTKNKQAEKEIRETTPFSIVTNNIKYL 796
Query: 1485 GLPLGGHPKKMV--FWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFL 1531
G+ L K + ++ + +I+ L RWK S GR+ + K + LP
Sbjct: 797 GVTLTKEVKDLYDKNFKSLKKEIKEDLRRWKDLPCSWIGRINIVKMAI--LPKAIYRFNA 856
BLAST of ClCG03G010470 vs. ExPASy Swiss-Prot
Match:
P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)
HSP 1 Score: 120.6 bits (301), Expect = 1.8e-25
Identity = 70/202 (34.65%), Postives = 102/202 (50.50%), Query Frame = 0
Query: 1420 IIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFW 1479
I++++ ++S W+ LS GRLTL K VLS++P + MS L+P+ ++ +++ R F W
Sbjct: 16 ILERVSSRMSGWREKTLSFAGRLTLTKAVLSSMPVHSMSTILLPQSILNRLDQLSRTFLW 75
Query: 1480 EGHGGSKLNHLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWC----- 1539
K HL +W V K+GGLG+ K N AL+SK GWR +QE +LW
Sbjct: 76 GSTAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRALISKVGWRLLQEKNSLWTLVLQK 135
Query: 1540 -KEVASLR-SPWI----SISRQWQKIEALAIFKV---------GDGRRITFWFDPWLEDQ 1599
V +R S W+ S S W+ I A+ + V GDG++I FW D W+ +
Sbjct: 136 KYHVGEIRDSRWLIPKGSWSSTWRSI-AIGLRDVVSHGVGWIPGDGQQIRFWTDRWVSGK 195
Query: 1600 PFKVRFPRLFELALKPNGTDAD 1602
P L EL TD D
Sbjct: 196 P-------LLELDNGERPTDCD 209
BLAST of ClCG03G010470 vs. ExPASy TrEMBL
Match:
A0A803P8A0 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 734.9 bits (1896), Expect = 7.4e-208
Identity = 389/1024 (37.99%), Postives = 564/1024 (55.08%), Query Frame = 0
Query: 703 MKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSF 762
MKI++WNIRG GD+ K+ A+K I K N ++V++QE K+ + I +W SR W
Sbjct: 1 MKILTWNIRGSGDKGKRAAIKATICKANPDMVILQEVKRATVDRRFIGSIWRSRFKAWIL 60
Query: 763 VEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRI 822
+ A GRSGG L++WD ISV++++ G +++SV + W + YGP YK R
Sbjct: 61 LPAIGRSGGTLLIWDTRIISVLDSLVGEFSISVLINAEGKEPWWFSGVYGPCSYKIRHVF 120
Query: 823 WRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNG 882
W EL L++ C +WC+GGDFN+TR + E++ TR MK F I E L++ + NG
Sbjct: 121 WDELAGLSSICGESWCVGGDFNVTRRVGEKLNSSSSTRSMKLFDGLIRELQLIDPKLENG 180
Query: 883 RFTWSREGNRVSRSLLDRF-----------------LVRIVSDHFPLLLEAGALEWGPSP 942
FTWS S LDRF LVR+VSDH P+++++ +WGP P
Sbjct: 181 SFTWSNFRAIPICSRLDRFLFLNNWNVVFPFVRQEMLVRLVSDHSPVVIDSKPPKWGPGP 240
Query: 943 FRFCNSWLLN---SQC--------------NSIIIRALAA--GNHQGWAGFVISAKFRSE 1002
FRF N WL + S+C + ++ L G + W+ F ++
Sbjct: 241 FRFDNHWLEHKSFSKCFESWWQEEIIDGWPGTKFMKKLKTLQGKAKEWSRFTYGQNKATK 300
Query: 1003 AEIVEKLDKEEQGAELEDPSSILQDPRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDEN 1062
+ +L ++ + L D R LK + + ++ER + KSK W GD N
Sbjct: 301 NALEGRLGVLDRQEGTPSWNQSLYDERRKLKEEWQRLTFEEERSIWLKSKCKWAKEGDAN 360
Query: 1063 TRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLE 1122
+RFFH L A+K +N I+ + D G +S EI ++++ F+ LYT G +E
Sbjct: 361 SRFFHNLLNARKARNTISRIERDNGDIIDSEKEIVEELIAFFSKLYTSETRMGTGVEGIE 420
Query: 1123 WQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFN 1182
WQ ++ +L F +E+R + +KAPGPDGF++ W+ IK++ + +F
Sbjct: 421 WQHIAEPSARQLECPFEEDEVRNIVFSCEGSKAPGPDGFSLAVFQNNWEVIKNELMEVFR 480
Query: 1183 EFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL------- 1242
FH GR+ + + FICLI K+ ++ VKDFRPISL T YK+IAK LA RL
Sbjct: 481 AFHSEGRIEGSINDTFICLIPKRLNSCKVKDFRPISLITSVYKIIAKTLATRLRGVLGET 540
Query: 1243 -----------KLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHG 1302
+ ILD +L+ANE VEDYR + KKG++LK+D EKA+ RVDWGFL+ L
Sbjct: 541 ISETQSAFVEGRQILDSVLLANEAVEDYRSRGKKGFVLKIDFEKAYDRVDWGFLDLVLRK 600
Query: 1303 KNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLI 1362
K F +W WI GC+ + FSIF+NGR RG+ SRG+RQGDPLSPFLF LV++VL ++
Sbjct: 601 KGFGERWRKWIRGCVSSTSFSIFVNGRVRGKFHGSRGLRQGDPLSPFLFTLVADVLGRMV 660
Query: 1363 SRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVN 1422
+ +++ F GF +GK + + LQFADDTL F K D D L+ L K +E F G KVN
Sbjct: 661 DKAVETEAFSGFQIGKDNIRLSHLQFADDTLFFVK-DEDSLQKLVKIVEAFCGISGLKVN 720
Query: 1423 WDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKL 1482
+KS L G+ + D V A + C+ K P+ YLG+PLGG P+K FW+P++DK ++
Sbjct: 721 LNKSQLLGICLSDEAVAQGANLIGCEVGKWPMTYLGMPLGGSPRKKTFWEPVLDKCAKRM 780
Query: 1483 SRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLN 1542
WK + LSRGGRLTL ++VLS+LP YY+S+F +P+ V+ +E+ MR+FFWEG + +
Sbjct: 781 DGWKCSFLSRGGRLTLIQSVLSSLPIYYLSLFKVPKMVLKELEKMMRDFFWEGGDLAGGD 840
Query: 1543 HLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEVASL-------- 1602
HL W V K +GGL + L+++N LL KW WRF ES +LW K + S
Sbjct: 841 HLVAWDEVCKPRAEGGLAIGRLEMRNKGLLMKWLWRFPLESNSLWHKVIKSRYGKADNFW 900
Query: 1603 ----------RSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFE 1648
R PW+ I+ + + + FKVG+G I FW D W+ + +FP L
Sbjct: 901 DTKQGVRMSPRGPWMDIADLYHEYGKMVKFKVGNGASIRFWEDEWIGGPSLRDQFPTLAV 960
BLAST of ClCG03G010470 vs. ExPASy TrEMBL
Match:
A0A803QI00 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 721.8 bits (1862), Expect = 6.5e-204
Identity = 386/1016 (37.99%), Postives = 549/1016 (54.04%), Query Frame = 0
Query: 711 RGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSG 770
+G GD+ K+ A+K I K N +LV++QE K+ + + I +W SR W + A GRSG
Sbjct: 908 KGSGDKGKRHAIKATICKANPDLVILQEVKRTSVDRRFIGSIWRSRFKAWIIIPAIGRSG 967
Query: 771 GLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALA 830
G L++WD I+V++++ G +++SV K W + YGP YK R W EL L+
Sbjct: 968 GTLLIWDTRTITVLDSLVGEFSISVLIKAEGKDPWWFSGVYGPCSYKLRPAFWDELAGLS 1027
Query: 831 AYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREG 890
A C ++WC+GGDFN+TR E++ TR MK F I E L++ + NGRFTWS
Sbjct: 1028 AICGDSWCVGGDFNVTRRPGEKLNSSSCTRSMKLFDGLIRELRLIDPKLENGRFTWSNFR 1087
Query: 891 NRVSRSLLDRF-----------------LVRIVSDHFPLLLEAGALEWGPSPFRFCNSWL 950
S LDRF LVR+VSDH P+++++ WGP PFRF N WL
Sbjct: 1088 TSPVCSRLDRFLFTNNWNVIYPFVRQEMLVRLVSDHSPVVIDSNPPRWGPGPFRFDNQWL 1147
Query: 951 LNSQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGA------------- 1010
++ R + GW G +K + E V++ G
Sbjct: 1148 EHNSFPKSFGRWWKEASSNGWPGTKFMSKLKKTQEKVKEWSSSTFGQNKATKRALEGRLV 1207
Query: 1011 ---ELEDPSSILQ---DPRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFL 1070
LE +S +Q + R LK + + ++ER + KSK W GD N+RFFH L
Sbjct: 1208 ALDRLEGTNSWVQSLVEERRKLKEEWQQLNFEEERSIWLKSKCKWAKEGDANSRFFHNLL 1267
Query: 1071 AAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQ 1130
A+K +N I+ + + G + EI ++++ F+ LYT G ++EWQR++
Sbjct: 1268 NARKARNTISRIEREDGSIIDKEEEIVEELIGFFSKLYTSEARRGSGIESIEWQRIAYSS 1327
Query: 1131 NSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRL 1190
+L S F EE++ ++ +KAPGPDGF++ W+ IKDD + +F F + GR+
Sbjct: 1328 ACQLESSFEEEEVKRSVFSCEGSKAPGPDGFSLAVFQNNWETIKDDLMEVFRTFEKEGRI 1387
Query: 1191 NSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL--------------- 1250
+ E FICLI K+ ++ VKDFRPISL T YK++AK LA RL
Sbjct: 1388 EGSINETFICLIPKRLNSCKVKDFRPISLITSVYKIVAKTLATRLRGVLGETISETQSAF 1447
Query: 1251 ---KLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWI 1310
+ ILD +LIANE VED+R + KKG++ K+DLEKA+ RVDW FL+ L K F W
Sbjct: 1448 VEGRQILDSVLIANETVEDFRSRGKKGFVFKIDLEKAYDRVDWDFLDLVLKEKGFGEVWR 1507
Query: 1311 SWILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKK 1370
WI GC+ + FS+ INGR RG+ + SRG+RQGDPLSPFLF LV +VL L+ + +S
Sbjct: 1508 KWIRGCVSSTSFSLLINGRVRGKFRGSRGLRQGDPLSPFLFTLVVDVLGRLVDKAAQSDT 1567
Query: 1371 FEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCG 1430
F GF VGK + + LQFADDTL F K D L L + +E F G KVN +KS L G
Sbjct: 1568 FSGFQVGKDNIQISHLQFADDTLFFVK-DEASLRKLVEIVEAFCGISGLKVNLNKSQLLG 1627
Query: 1431 LNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNL 1490
+++++ V A + C+ P+ YLG+PLGG P+K FW+P++DK +L WK + L
Sbjct: 1628 ISLEEEVVAQNAEIIGCEVGTWPMTYLGMPLGGSPRKGTFWEPVLDKCAKRLDGWKCSFL 1687
Query: 1491 SRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTV 1550
SRGGRL L ++VLS+LP YY+S+F P+ V+ IE+ MR+FFWEG + +HL W V
Sbjct: 1688 SRGGRLILIQSVLSSLPIYYLSLFKAPKMVLQAIEKMMRDFFWEGGDLAGGDHLVAWDEV 1747
Query: 1551 TKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEV------------------A 1610
K +GGL + L+++N LL KW WR+ E +LW K + A
Sbjct: 1748 CKPRSEGGLAIGRLEMRNKGLLMKWLWRYPLEPNSLWHKVIKSRYGKADNFWDTKWGARA 1807
Query: 1611 SLRSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFELALKPNGT 1648
S R PW IS + + L FKVG+G I FW D W+ K +FP + ++ N +
Sbjct: 1808 SPRGPWKDISDYYDEYGQLVKFKVGNGANIRFWEDVWIGGSSLKEQFPDVAVISKAKNAS 1867
BLAST of ClCG03G010470 vs. ExPASy TrEMBL
Match:
A0A803QEA6 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 721.1 bits (1860), Expect = 1.1e-203
Identity = 399/1088 (36.67%), Postives = 577/1088 (53.03%), Query Frame = 0
Query: 660 QDSEEESLISISSDELNEPEEDEN---------------------HLSLELDESIEHRSV 719
Q EE L+ S D+L+E E++E + +E+ + E
Sbjct: 289 QGLEEMRLVDGSDDKLDELEKEEGREADEIMIEATSWSNIVESMAEMGMEITQENEDSDQ 348
Query: 720 KVFSMKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDI 779
K + +I++WNIRG GD+ K+ A+K I K N +LV++QE K+ + I +W SR
Sbjct: 349 K--TEEILTWNIRGSGDKGKRTAIKATICKANPDLVILQEVKRATVDRRFIGSIWRSRFK 408
Query: 780 GWSFVEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKE 839
W + A GRSGG L++WD ISV++++ G +++SV + W + YGP YK
Sbjct: 409 AWILIPAIGRSGGTLLIWDTRTISVLDSLVGEFSISVLINAEGKEPWWFSGVYGPCSYKL 468
Query: 840 RRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIP 899
R W EL L++ C +WC+ GDFN+TR + E++ TR MK F I E L++
Sbjct: 469 RPEFWDELAGLSSICGKSWCVAGDFNVTRRVGEKLNSSSFTRSMKLFDGLIRELQLIDPK 528
Query: 900 MSNGRFTWSREGNRVSRSLLDRF-----------------LVRIVSDHFPLLLEAGALEW 959
+ NG FTWS S LDRF LVRIVSDH P+++++ +W
Sbjct: 529 LENGSFTWSNFRASPVCSRLDRFLFTNNWNIIFPFVRQELLVRIVSDHSPVVIDSNPPKW 588
Query: 960 GPSPFRFCNSWLLNSQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGA- 1019
GP PFRF N WL + + R + GW G K + V++ K G
Sbjct: 589 GPGPFRFDNHWLDHKSFSKCFERWWKEEINDGWPGTKFMKKLKILQGKVKEWSKSTFGQN 648
Query: 1020 ---------------ELEDPS---SILQDPRASLKSDLMNIYKKKERDLIQKSKLNWLHL 1079
+LE S L D R LK + + ++ER KSK W
Sbjct: 649 RAKKIALEGRLGVLDKLEGTSFWNQSLLDERRKLKEEWKWLNFEEERGTWLKSKCKWARE 708
Query: 1080 GDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFP 1139
GD N+RFFH L A+K +N I+ + + G ++ EI ++++ F+ LYT G
Sbjct: 709 GDANSRFFHNLLNARKARNTISRIERENGDIIDNEKEIAEELIAFFSKLYTSEARMGSGI 768
Query: 1140 ANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFV 1199
+EWQ+++ +L F EE+R + NKAPGPDGF++ L W+ IK D +
Sbjct: 769 EGIEWQQIAESSAGQLECPFEEEEVRNIVFSCEGNKAPGPDGFSLAVLQHNWETIKHDLM 828
Query: 1200 ALFNEFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL--- 1259
+F FH GR+ + + FICLI K+ ++ VKDFRPISL T YK+IAK LA RL
Sbjct: 829 EVFTAFHREGRIEGSINDTFICLIPKRLNSCKVKDFRPISLITSVYKIIAKTLATRLRGV 888
Query: 1260 ---------------KLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEK 1319
+ ILD +L+ANE VEDYR + +KG++LK+D EKA+ RVDWGFL+
Sbjct: 889 LGETISETQSAFVEGRQILDSVLMANEAVEDYRSRGRKGFVLKIDFEKAYDRVDWGFLDM 948
Query: 1320 ALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVL 1379
L K F +W WI GC+ + FSIFINGR RG+ SRG+RQGDPLSPFLF ++++VL
Sbjct: 949 VLRKKGFGERWRKWIRGCVSSTSFSIFINGRVRGKFNGSRGLRQGDPLSPFLFTMIADVL 1008
Query: 1380 TSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFG 1439
++ + +++ GF +GK + + LQFADDTL F K ++ L+ L K ++ F G
Sbjct: 1009 GRMVDKAIETESLTGFQIGKDDIRLSHLQFADDTLFFVKDEVS-LQKLVKVVKAFCGISG 1068
Query: 1440 QKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKI 1499
KVN +KS L G+ +++ V +A + C+ + P+ YLG+ LGG P+K FW+P++DK
Sbjct: 1069 LKVNLNKSQLLGICMNEEAVAQSAILIGCEVGRWPMTYLGMSLGGSPRKRSFWEPVLDKC 1128
Query: 1500 QGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGG 1559
++ WK + LSRGGRLTL ++VLS+LP YY+S+F P+ V+ +E+ MR FFWEG
Sbjct: 1129 AKRMDGWKCSFLSRGGRLTLIQSVLSSLPIYYLSLFKAPKVVLKELEKMMREFFWEGGDL 1188
Query: 1560 SKLNHLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEVASL---- 1619
+ +HL W V K +GGL + L ++N LL KW WRF E +LW K + S
Sbjct: 1189 AGGDHLVAWDEVCKPRAEGGLAIGKLDMRNKGLLMKWLWRFPLEPNSLWHKVIKSRYGKA 1248
Query: 1620 --------------RSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFP 1648
R PW IS + + L FKVG+G RI FW D W+ + +FP
Sbjct: 1249 DNFWDTKQGVRISPRGPWKDISDLYDEYGKLVKFKVGNGERIRFWEDEWVGGSSLRDQFP 1308
BLAST of ClCG03G010470 vs. ExPASy TrEMBL
Match:
A0A438FWU5 (LINE-1 retrotransposable element ORF2 protein OS=Vitis vinifera OX=29760 GN=LORF2_70 PE=4 SV=1)
HSP 1 Score: 713.4 bits (1840), Expect = 2.3e-201
Identity = 473/1543 (30.65%), Postives = 744/1543 (48.22%), Query Frame = 0
Query: 259 GWIVIKNLSLEYWSRETFEAIGQHFGGLEEISIETLNLLDVSETKIKVRKNVYGFIPATI 318
GW+ ++ L W I Q +G + +++ ETL L+D+S+ K+ V + +PA +
Sbjct: 262 GWLELRGLPFHLWDEFQLRYILQKWGRVTKVAKETLKLVDLSKVKMWVEMHPKVVLPALL 321
Query: 319 EINN---EFRGSIHLL---FGDIIPMKDSS---SIITEFIGSDFENPIDQVRL-SKVEED 378
E+ + F ++ ++ D + +S+ +T G + P + L + ++
Sbjct: 322 EVEDGAWSFTVAVSVIGEAEEDFLLRPESNRSKDEVTSAEGCVHQRPKNAEGLRATARDN 381
Query: 379 EVRVFSPSKLALLSSKPDQTQPEQEEKSLEISVGSPSVGKWSKSTANGLEGKSINEKAAK 438
E + P + + + E E+ + +G TA ++G + E K
Sbjct: 382 EYHRWRPRHRSRVRYSVSNSDTEVEKGRGKSCLG---------PTAGSVDGLTKPEAFFK 441
Query: 439 LNWCE-ETEEAAVGSSIGVLKSIRPIEVMKKEKRKERVFGRINENIKSIQTPDIPSGDFL 498
++ EE +G S+G P+ E G N P PS L
Sbjct: 442 GHFARAHFEEKNIGPSVG------PVH--------ETEAGSSNGG------PATPSSSKL 501
Query: 499 RSGGPTGFEV--IQKIRKVPNVLLTVAEESEKSPPNSMKSN------------------- 558
+ G + EV I + K+ +T ++ PP+ ++
Sbjct: 502 QRSGTSAKEVMPIAQSAKLKGNSVTARRKARSWPPSLKVTSIVPKRNLDGDGAPEANRGF 561
Query: 559 -WEEVPFQ--TLLPYHQLPRARVNAFPQQSPAHTISSVKSPFFPAVRRSSNLFKPSPKHF 618
W F+ L P + R T + VK P+V S K P
Sbjct: 562 IWGRSVFKKGALSPVSEKSRRFTGGTEGDEGISTCNWVKKVRPPSVALSE---KDLP--- 621
Query: 619 SRGKTSFLNLWSALTNPDILEVSCANSQVPQQIRDRSVLPLSSSQIFHQSEI----LIPG 678
NP+IL S +S +P + + S +PL SS + +S ++
Sbjct: 622 ----------LDGAFNPNILSDSSVSSVIPSRCQAFSPIPLESSLVSQRSPFPKIAVVEP 681
Query: 679 SNILFLRGSPSSYRPKPNKIQDSEEESLISISSDELNEPEE----------DENHLSLEL 738
S ++ + P ++SEE L D L+ PE+ + H SL L
Sbjct: 682 SRLVGVSRPEGVAFPLETSNRNSEERPLF---KDSLSSPEKLCVSGKAQSPNPEHPSLPL 741
Query: 739 D----------ESIEHRSVKVF-------------------SMKIVSWNIRGLGDQSKQL 798
+ + ++ K F MKI+SWN RGLG + K+
Sbjct: 742 EGFQVEGLTPGKMVKKEEEKCFWKPRRGLGLGAGCAGLFLVYMKILSWNTRGLGSKKKRR 801
Query: 799 AVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGLLIMWDESK 858
V+ + N ++V++QETK+E ++ + +W + + W+ + A G SGG++I+WD SK
Sbjct: 802 IVRRFLSTQNPDIVMLQETKRETWDRRFVSSVWKGKRVEWAALPACGASGGIVILWDSSK 861
Query: 859 ISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALAAYCTNAWCLG 918
+ E + G ++++VK + W+T+ YGP + R+ W ELQ L WC+G
Sbjct: 862 LECTEKVLGSFSVTVKFNSGEEGSFWLTSVYGPINPLWRKDFWLELQDLYGLTFPRWCVG 921
Query: 919 GDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNRVSRSLLDR 978
GDFN+ R I E++ RLT M+ F +FI E+ L++ P+ N FTWS LDR
Sbjct: 922 GDFNVIRRISEKLGETRLTLNMRCFDEFIRESGLIDPPLRNAAFTWSNMQADPICKRLDR 981
Query: 979 FLV-----------------RIVSDHFPLLLEAGALEWGPSPFRFCNSWLLNSQCNSIII 1038
FL R SDH P+ LE L+WGP+PFRF N WLL+ +
Sbjct: 982 FLFSSEWDTFFSQSFQEALPRWTSDHSPICLETNPLKWGPTPFRFENMWLLHPEFKEKFR 1041
Query: 1039 RALAAGNHQGWAGFVISAKFR------SEAEIVEKLD-KEEQGAELEDPSSI-LQDPRAS 1098
+GW G K + E I+ D KE + L D S I L + +
Sbjct: 1042 VWWLECTGEGWEGHKFMRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGN 1101
Query: 1099 LKSDLM-----------NIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIA 1158
L SDL+ ++ K+E QKS++ W+ GD N++FFHR ++ + I
Sbjct: 1102 LNSDLVLERTLKRRELEDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGRRSRKFIK 1161
Query: 1159 ELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFSR 1218
L++++G N+ +I ++I+NF+ NLY+K ++W +S E L F+
Sbjct: 1162 SLISERGETLNNIEDISEEIVNFFGNLYSKPVGESWRVEGIDWVPISGESGGWLDRPFTE 1221
Query: 1219 EEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFIC 1278
EE+R A+ + K KAPGPDGFT+ + WD IK+D + +F EFH NG +N FI
Sbjct: 1222 EEVRRAVFQLNKEKAPGPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIA 1281
Query: 1279 LIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKL------------------ILDPI 1338
L+ KK ++ + D+RPISL T YK+IAKVL+ RL+ ILD +
Sbjct: 1282 LVPKKSQSVKISDYRPISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAV 1341
Query: 1339 LIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNP 1398
LIANE+V++ R ++G + K+D EKA+ VDWGFL+ L K F KW WI GC+ +
Sbjct: 1342 LIANEVVDEKRRSGEEGIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSS 1401
Query: 1399 KFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKK 1458
F+I +NG +G V+ASRG+RQGDPLSPFLF LV++VL+ ++ R ++ EGF VG+ +
Sbjct: 1402 SFAILVNGNAKGWVKASRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDR 1461
Query: 1459 VHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKS 1518
V +LQFADDT+ F K ++ L+ L+ + F G K+N +KS + G+N + S
Sbjct: 1462 TRVSLLQFADDTIFFSKASMEHLQNLKIILLVFGQVSGLKINLEKSTISGINTRQELLSS 1521
Query: 1519 TAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCK 1578
A+ +C+ + PL YLGLPLGG+PK + FW P++++I +L WK+ LS GGR+TL +
Sbjct: 1522 LASVFDCRVSEWPLSYLGLPLGGNPKTIGFWDPVVERISRRLDGWKKAYLSLGGRITLIQ 1581
Query: 1579 TVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLG 1638
+ LS++PSY++S+F +P + IE+ RNF W G G K +HL RW V++ + GGLG
Sbjct: 1582 SCLSHIPSYFLSLFKIPASIASKIEKMQRNFLWSGAGEGKKDHLVRWEVVSRPKELGGLG 1641
Query: 1639 LENLKIKNLALLSKWGWRFMQESEALWCKEVASL------------------RSPWISIS 1650
+ ++N+ALL KW WRF +E LW K + S+ R PW +I+
Sbjct: 1642 FGKISLRNIALLGKWLWRFPRERSGLWYKVIGSIYGTHPNGWDANMVVRWSHRCPWKAIA 1701
BLAST of ClCG03G010470 vs. ExPASy TrEMBL
Match:
A0A803QQM3 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 708.8 bits (1828), Expect = 5.7e-200
Identity = 382/1014 (37.67%), Postives = 545/1014 (53.75%), Query Frame = 0
Query: 713 LGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGL 772
LGD+ K+ A+K I K N +LV++QE K+ + I +W SR W + A GRSGG
Sbjct: 934 LGDKGKRAAIKATICKANPDLVILQEVKRATVDRRFIGSIWRSRFKAWILLPALGRSGGT 993
Query: 773 LIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALAAY 832
L++WD ISV++++ G +++SV + W + YGP YK R W EL L++
Sbjct: 994 LLIWDTRTISVLDSLVGEFSISVLINAEGKEPWWFSGVYGPCSYKLRPEFWDELAGLSSI 1053
Query: 833 CTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNR 892
C +WC+GGDFN+TR + E++ TR MK F I E L++ + NG FTWS
Sbjct: 1054 CGESWCVGGDFNVTRRVGEKLNSSSCTRSMKLFDGLIRELQLIDPKLENGSFTWSNFRAS 1113
Query: 893 VSRSLLDRF-----------------LVRIVSDHFPLLLEAGALEWGPSPFRFCNSWLLN 952
S LDRF LVR+VSDH P+++++ +WGP PFRF N WL +
Sbjct: 1114 PVCSRLDRFLFSNNWNVIYPFVRQEMLVRLVSDHSPVVIDSNPPKWGPGPFRFDNHWLEH 1173
Query: 953 SQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGA--------------- 1012
+ + GW G K + V++ K G
Sbjct: 1174 KSFSKCFESWWKEEINDGWPGTKFMKKLKLLQGKVKEWSKSTFGQNKATKIALEGRLGVL 1233
Query: 1013 -ELEDPSSILQ---DPRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAA 1072
LE SS Q D R LK + ++ ++ER + KSK W GD N+R FH L A
Sbjct: 1234 DRLEGTSSWNQSVLDERRKLKEEWQQLHFEEERGIWLKSKCKWAREGDANSRLFHNLLNA 1293
Query: 1073 KKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNS 1132
+K KN I+ + D G ++ EI ++++ F+ LYT +G +EW ++
Sbjct: 1294 RKAKNTISRIERDNGDIIDNEKEIVEELIAFFSKLYTSEARSGTGIEGIEWHKIEESSAR 1353
Query: 1133 RLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNS 1192
+L F EE+R + NKAPGPDGF++ L W+ IK D + +F FH GR+
Sbjct: 1354 QLECPFEEEEVRNIVFSCEGNKAPGPDGFSLAALQNNWETIKYDLMEVFRAFHREGRIEG 1413
Query: 1193 CVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL----------------- 1252
+ + FICLI K+ ++ VKD+RPISL T YK+IAK LA RL
Sbjct: 1414 SINDTFICLIPKRLNSCKVKDYRPISLITSVYKIIAKTLATRLRGVLGETISETQSAFVE 1473
Query: 1253 -KLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISW 1312
+ ILD +L+ANE VEDYR + KKG +LK+D EKA+ RVDWGFL+ + K F +W W
Sbjct: 1474 GRQILDSVLMANEAVEDYRSRGKKGIVLKIDFEKAYDRVDWGFLDLVMRKKGFGERWTKW 1533
Query: 1313 ILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFE 1372
I GC+ FSIFINGR RG+ SRG+RQ DPLSPFLF L+++VL ++ + ++
Sbjct: 1534 IRGCVSTTSFSIFINGRVRGKFNGSRGLRQVDPLSPFLFTLIADVLGRMVDKAIDTESLS 1593
Query: 1373 GFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLN 1432
GF +GK + + LQFADDTL F K D L+ L K +E F G KVN +KS L G+
Sbjct: 1594 GFQIGKDDIQLSHLQFADDTLFFVK-DEASLQKLVKIVEAFCGISGLKVNLNKSQLLGVC 1653
Query: 1433 IDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSR 1492
+D+ V +A ++ C+ + P+ YLG+PLGG P+K FW+P++DK ++ WK + LSR
Sbjct: 1654 MDEDAVAQSAIQIGCEVGRWPMTYLGMPLGGSPRKRSFWEPVLDKCATRMDGWKCSFLSR 1713
Query: 1493 GGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTK 1552
GGRLTL ++VLS+LP Y++S+F P+ V+ +E+ MR+FFWEG + +HL W V K
Sbjct: 1714 GGRLTLIQSVLSSLPIYFLSLFKAPKVVLKELEKMMRDFFWEGGDLAGGDHLVAWDEVCK 1773
Query: 1553 NHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEVASL------------------ 1612
+GGL + L+++N LL KW WRF ES +LW K + S
Sbjct: 1774 PRAEGGLAIGRLEMRNKGLLMKWLWRFPLESNSLWHKVIKSRYGRADNFWDTKHGVRLSP 1833
Query: 1613 RSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFELALKPNGT-- 1648
R PW IS + + L FKVG+G I FW D W+ + +F L ++ N +
Sbjct: 1834 RGPWKDISDLYDEYGKLVKFKVGNGACIRFWEDEWIGGSSLRDQFLNLAVISRAKNASIQ 1893
BLAST of ClCG03G010470 vs. TAIR 10
Match:
AT1G43760.1 (DNAse I-like superfamily protein )
HSP 1 Score: 109.8 bits (273), Expect = 2.2e-23
Identity = 66/184 (35.87%), Postives = 94/184 (51.09%), Query Frame = 0
Query: 1013 QKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLY 1072
QKS++ WL GD NTRFFH+ + A + KNLI L D + ++++ I+ +Y +L
Sbjct: 437 QKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHLL 496
Query: 1073 TK-----TPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTV 1132
TP + ++ R + SRLS+ S +EI A+ M +NKAPGPD FT
Sbjct: 497 GSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALPSDKEITAAVFAMPRNKAPGPDSFTA 556
Query: 1133 EFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPISLTTLT 1192
EF + W +KD +A EF G L I LI K + FRP+S T+
Sbjct: 557 EFFWESWFVVKDSTIAAVKEFFRTGHLLKRFNATAITLIPKVTGVDQLSMFRPVSCCTVV 616
BLAST of ClCG03G010470 vs. TAIR 10
Match:
AT3G24255.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )
HSP 1 Score: 80.1 bits (196), Expect = 1.9e-14
Identity = 48/187 (25.67%), Postives = 78/187 (41.71%), Query Frame = 0
Query: 1398 LPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYM 1457
LP+ YLGLPL + P+++KI+ ++ +W +LS GRL L +V+ +L +++M
Sbjct: 23 LPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFWM 82
Query: 1458 SIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLGLENLK------ 1517
S F +P + I+ +F W G + W V +GGLG+ +LK
Sbjct: 83 SAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANKGS 142
Query: 1518 ---IKNLALLSKWGWRFMQESEALWCKEVASLRSPWISISRQWQKIEALAIFKVGDGRRI 1576
I L W W+ + + AL + +G
Sbjct: 143 FWSISGNTTLGSWMWKKILKHRAL---------------------ASGFVKHDIHNGSNT 188
BLAST of ClCG03G010470 vs. TAIR 10
Match:
AT4G29090.1 (Ribonuclease H-like superfamily protein )
HSP 1 Score: 70.5 bits (171), Expect = 1.5e-11
Identity = 43/145 (29.66%), Postives = 67/145 (46.21%), Query Frame = 0
Query: 1452 LPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLGLENLK 1511
LP+Y M+ FL+P+ V I + +F+W +K H W ++ +GG+G ++++
Sbjct: 3 LPTYTMACFLLPKTVCKQIISVLADFWWRNKQEAKGMHWKAWDHLSCYKAEGGIGFKDIE 62
Query: 1512 IKNLALLSKWGWRFMQESEALWCKEVAS--------LRSP--------WISISRQWQKIE 1571
NLALL K WR + E+L K S L +P W SI + +
Sbjct: 63 AFNLALLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFVWKSIHASQEILR 122
Query: 1572 ALAIFKVGDGRRITFWFDPWLEDQP 1581
A VG+G I W WL+ +P
Sbjct: 123 QGARAVVGNGEDIIIWRHKWLDSKP 147
BLAST of ClCG03G010470 vs. TAIR 10
Match:
ATMG01250.1 (RNA-directed DNA polymerase (reverse transcriptase) )
HSP 1 Score: 68.6 bits (166), Expect = 5.7e-11
Identity = 33/67 (49.25%), Postives = 42/67 (62.69%), Query Frame = 0
Query: 1272 INGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPI 1331
ING P+G V SRG+RQGDPLSP+LF+L +EVL+ L R + + G V +
Sbjct: 14 INGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRINH 73
Query: 1332 LQFADDT 1339
L FADDT
Sbjct: 74 LLFADDT 80
BLAST of ClCG03G010470 vs. TAIR 10
Match:
ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )
HSP 1 Score: 53.9 bits (128), Expect = 1.5e-06
Identity = 37/145 (25.52%), Postives = 59/145 (40.69%), Query Frame = 0
Query: 1452 LPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHK-DGGLGLENL 1511
LP Y MS F + + + + AM F+W + W + K+ + DGGLG +L
Sbjct: 3 LPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRDL 62
Query: 1512 KIKNLALLSKWGWRFMQESEALWCKEVASLRSP----------------WISISRQWQKI 1571
N ALL+K +R + + L + + S P W SI + +
Sbjct: 63 GWFNQALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMMECSVGTRPSYAWRSIIHGRELL 122
Query: 1572 EALAIFKVGDGRRITFWFDPWLEDQ 1580
+ +GDG W D W+ D+
Sbjct: 123 SRGLLRTIGDGIHTKVWLDRWIMDE 147
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
RVW64408.1 | 4.8e-201 | 30.65 | LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | [more] |
RVW16209.1 | 1.0e-195 | 36.45 | Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | [more] |
CAN68838.1 | 7.4e-194 | 35.46 | hypothetical protein VITISV_030956 [Vitis vinifera] | [more] |
RVW70235.1 | 1.6e-193 | 33.62 | LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | [more] |
RVW65579.1 | 2.1e-193 | 35.95 | Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | [more] |
Match Name | E-value | Identity | Description | |
O00370 | 3.4e-48 | 23.18 | LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1 | [more] |
P14381 | 3.7e-47 | 25.38 | Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV... | [more] |
P08548 | 1.2e-45 | 22.88 | LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1 | [more] |
P11369 | 1.6e-42 | 23.22 | LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE... | [more] |
P0C2F6 | 1.8e-25 | 34.65 | Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1... | [more] |
Match Name | E-value | Identity | Description | |
A0A803P8A0 | 7.4e-208 | 37.99 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1 | [more] |
A0A803QI00 | 6.5e-204 | 37.99 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1 | [more] |
A0A803QEA6 | 1.1e-203 | 36.67 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1 | [more] |
A0A438FWU5 | 2.3e-201 | 30.65 | LINE-1 retrotransposable element ORF2 protein OS=Vitis vinifera OX=29760 GN=LORF... | [more] |
A0A803QQM3 | 5.7e-200 | 37.67 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT1G43760.1 | 2.2e-23 | 35.87 | DNAse I-like superfamily protein | [more] |
AT3G24255.1 | 1.9e-14 | 25.67 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein | [more] |
AT4G29090.1 | 1.5e-11 | 29.66 | Ribonuclease H-like superfamily protein | [more] |
ATMG01250.1 | 5.7e-11 | 49.25 | RNA-directed DNA polymerase (reverse transcriptase) | [more] |
ATMG00310.1 | 1.5e-06 | 25.52 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein | [more] |