Homology
BLAST of HG10021971 vs. NCBI nr
Match:
XP_038893506.1 (protein ALP1-like [Benincasa hispida])
HSP 1 Score: 706.8 bits (1823), Expect = 9.6e-200
Identity = 352/389 (90.49%), Positives = 359/389 (92.29%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKEEAIDWWDDFSKRTNGLHSASKGLDRFKSTF 60
MGPIRGLRKKKKLERKLDSNGTASDSSEK+EAIDWWDDFSKRTNGLHSASKGLDRFKS F
Sbjct: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKDEAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
Query: 61 KVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
KVSRKTFDYICLLVKDDMTAKSG+FTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61 KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
Query: 121 LNH--------------------YLHWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
LNH +LHWPS EVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLRHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
Query: 181 MCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLC 240
MCLPASD TSYVWLDEEKNHS+VLQVIVDAEMRFRDILTGLPGKMSDWLV QSSNFHKLC
Sbjct: 181 MCLPASDSTSYVWLDEEKNHSMVLQVIVDAEMRFRDILTGLPGKMSDWLVFQSSNFHKLC 240
Query: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRL 300
DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGK+ STSKAEFNKRHKETRL
Sbjct: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKEHSTSKAEFNKRHKETRL 300
Query: 301 VVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDET-EDGDVPLSIE 360
VVQ AL MLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGD+T EDG VPLSIE
Sbjct: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDDTEEDGGVPLSIE 360
Query: 361 HDVDYKQQVCDVFDSKGAYLRDKLSLLFI 369
HDVDYKQQVCDVFD KGAYLRD+LSLLFI
Sbjct: 361 HDVDYKQQVCDVFDPKGAYLRDRLSLLFI 389
BLAST of HG10021971 vs. NCBI nr
Match:
XP_004149039.1 (protein ALP1-like [Cucumis sativus] >KGN65671.1 hypothetical protein Csa_019843 [Cucumis sativus])
HSP 1 Score: 705.7 bits (1820), Expect = 2.1e-199
Identity = 346/388 (89.18%), Positives = 359/388 (92.53%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKEEAIDWWDDFSKRTNGLHSASKGLDRFKSTF 60
MGPIRGLRKKKKLERKLD NGTASDSSEK++AIDWWDDFSKRTNGLHSASKGLDRFKS F
Sbjct: 1 MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
Query: 61 KVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
KVSRKTFDYICLLVKDDMTAKSG+FTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61 KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
Query: 121 LN--------------------HYLHWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
LN H+LHWPS EVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
Query: 181 MCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLC 240
MCLPASDPTSYVWLD++KNHS+VLQVIVDAEMRFRDILTGLPGK+SDWLV QSSNFHKLC
Sbjct: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
Query: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRL 300
DKGERLNGKR EL DRSEIREYIIGDSGYPLLPYLVTPYDGK+LSTSK EFNKRHKETRL
Sbjct: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
Query: 301 VVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
VVQ AL MLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETE+G+VPLSIEH
Sbjct: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360
Query: 361 DVDYKQQVCDVFDSKGAYLRDKLSLLFI 369
DVDYKQQVCDVFDSKGAY+RD+LSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 388
BLAST of HG10021971 vs. NCBI nr
Match:
TYK01291.1 (putative nuclease HARBI1 [Cucumis melo var. makuwa])
HSP 1 Score: 698.7 bits (1802), Expect = 2.6e-197
Identity = 345/388 (88.92%), Positives = 358/388 (92.27%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKEEAIDWWDDFSKRTNGLHSASKGLDRFKSTF 60
MGPIRGLRKKKKLERKLDSNGTASDSSEK++AIDWWDDFSKRTNGLHSASKGLDRFKS F
Sbjct: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
Query: 61 KVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
KVSRKTFDYICLLVKDDMTAKSG+FTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61 KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
Query: 121 LNH--------------------YLHWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
LNH +LHWPS EVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLRHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
Query: 181 MCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLC 240
MCLPASDPTS+VWLD+EKNHS+VLQVIVDAEMRFRDILTGLPGK+SD LV QSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDDEKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240
Query: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRL 300
DKGERLNGKRLEL DRSEI+EYI+GDSGYPLL YLVTPYDGK+LSTSKAEFNKRH TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300
Query: 301 VVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
VVQ AL MLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
Query: 361 DVDYKQQVCDVFDSKGAYLRDKLSLLFI 369
DVDYKQQVCDVFDSKGAYLR++LSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388
BLAST of HG10021971 vs. NCBI nr
Match:
KAA0033909.1 (putative nuclease HARBI1 [Cucumis melo var. makuwa])
HSP 1 Score: 698.7 bits (1802), Expect = 2.6e-197
Identity = 345/388 (88.92%), Positives = 358/388 (92.27%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKEEAIDWWDDFSKRTNGLHSASKGLDRFKSTF 60
MGPIRGLRKKKKLERKLDSNGTASDSSEK++AIDWWDDFSKRTNGLHSASKGLDRFKS F
Sbjct: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
Query: 61 KVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
KVSRKTFDYICLLVKDDMTAKSG+FTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61 KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
Query: 121 LNH--------------------YLHWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
LNH +LHWPS EVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLCHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
Query: 181 MCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLC 240
MCLPASDPTS+VWLD+EKNHS+VLQVIVDAEMRFRDILTGLPGK+SD LV QSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDDEKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240
Query: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRL 300
DKGERLNGKRLEL DRSEI+EYI+GDSGYPLL YLVTPYDGK+LSTSKAEFNKRH TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300
Query: 301 VVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
VVQ AL MLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
Query: 361 DVDYKQQVCDVFDSKGAYLRDKLSLLFI 369
DVDYKQQVCDVFDSKGAYLR++LSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388
BLAST of HG10021971 vs. NCBI nr
Match:
XP_008457540.1 (PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1 [Cucumis melo])
HSP 1 Score: 695.7 bits (1794), Expect = 2.2e-196
Identity = 344/388 (88.66%), Positives = 356/388 (91.75%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKEEAIDWWDDFSKRTNGLHSASKGLDRFKSTF 60
MGPIRGLRKKKKLERKLDSNGTASDSSEK++AIDWWDDFSKRTNGLHSASKGLDRFKS F
Sbjct: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
Query: 61 KVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
KVSRKTFDYICLLVKDDMTAKSG+FTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61 KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
Query: 121 LNH--------------------YLHWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
LNH +LHWPS EVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLCHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
Query: 181 MCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLC 240
MCLPASDPTS+VWLD KNHS+VLQVIVDAEMRFRDILTGLPGK+SD LV QSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDXRKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240
Query: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRL 300
DKGERLNGKRLEL DRSEI+EYI+GDSGYPLL YLVTPYDGK+LSTSKAEFNKRH TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300
Query: 301 VVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
VVQ AL MLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
Query: 361 DVDYKQQVCDVFDSKGAYLRDKLSLLFI 369
DVDYKQQVCDVFDSKGAYLR++LSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388
BLAST of HG10021971 vs. ExPASy Swiss-Prot
Match:
Q9M2U3 (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)
HSP 1 Score: 415.6 bits (1067), Expect = 5.8e-115
Identity = 210/403 (52.11%), Positives = 277/403 (68.73%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSN----GTAS---------------DSSEKEEAIDWWDDFSK 60
MGPI+ ++KKK+ E+K+D N TA+ D +++DWWD FS+
Sbjct: 1 MGPIKTIKKKKRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSR 60
Query: 61 RTNGLHSASKGLDRFKSTFKVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAV 120
R ++ S F+S FK+SRKTFDYIC LVK D TAK NF+ NG PLSL D+VAV
Sbjct: 61 R---IYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAV 120
Query: 121 ALRRLGSGESLVTIGDSLGLN--------------------HYLHWPSTEVEMAQVKSKF 180
ALRRLGSGESL IG++ G+N H+L WPS ++ ++KSKF
Sbjct: 121 ALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSKF 180
Query: 181 EKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGL 240
EKI GLPNCCG+ID THI M LPA +P++ VWLD EKN S+ LQ +VD +MRF D++ G
Sbjct: 181 EKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGW 240
Query: 241 PGKMSDWLVSQSSNFHKLCDKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDG 300
PG ++D +V ++S F+KL +KG+RLNG++L L++R+E+REYI+GDSG+PLLP+L+TPY G
Sbjct: 241 PGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQG 300
Query: 301 KKLSTSKAEFNKRHKETRLVVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNI 360
K S + EFNKRH E Q AL+ LK+RWRII GVMW PD++RLPRII VCCLLHNI
Sbjct: 301 KPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNI 360
Query: 361 IIDIGDETEDGDVPLSIEHDVDYKQQVCDVFDSKGAYLRDKLS 365
IID+ D+T D D PLS +HD++Y+Q+ C + D + LRD+LS
Sbjct: 361 IIDMEDQTLD-DQPLSQQHDMNYRQRSCKLADEASSVLRDELS 396
BLAST of HG10021971 vs. ExPASy Swiss-Prot
Match:
Q94K49 (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)
HSP 1 Score: 296.2 bits (757), Expect = 5.1e-79
Identity = 167/396 (42.17%), Positives = 233/396 (58.84%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKE---------EAI--DWWDDFSKRTNGLHSA 60
M P++ +KKK ++ LD + + EK+ EAI DWWD F R +
Sbjct: 1 MAPVK--QKKKNKKKPLDKAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVP 60
Query: 61 SKGLDRFKSTFKVSRKTFDYICLLVKDDMTAK-SGNFTFLNGRPLSLCDQVAVALRRLGS 120
S FK F+ S+ TF YIC LV++D+ ++ + GR LS+ QVA+ALRRL S
Sbjct: 61 SDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLAS 120
Query: 121 GESLVTIGDSLGL--------------------NHYLHWPSTEVEMAQVKSKFEKIQGLP 180
G+S V++G + G+ H+L WP ++ + ++KSKFE++ GLP
Sbjct: 121 GDSQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSD-RIEEIKSKFEEMYGLP 180
Query: 181 NCCGSIDTTHITMCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDW 240
NCCG+IDTTHI M LPA S W D+EKN+S+ LQ + D EMRF +++TG PG M+
Sbjct: 181 NCCGAIDTTHIIMTLPAVQ-ASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVS 240
Query: 241 LVSQSSNFHKLCDKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSK 300
+ + S F KLC+ + L+G L+ ++IREY++G YPLLP+L+TP+D S S
Sbjct: 241 KLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSM 300
Query: 301 AEFNKRHKETRLVVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDE 360
FN+RH++ R V A LK WRI+ VMWRPD+ +LP IILVCCLLHNIIID GD
Sbjct: 301 VAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDY 360
Query: 361 TEDGDVPLSIEHDVDYKQQVCDVFDSKGAYLRDKLS 365
++ DVPLS HD Y + C + G+ LR L+
Sbjct: 361 LQE-DVPLSGHHDSGYADRYCKQTEPLGSELRGCLT 391
BLAST of HG10021971 vs. ExPASy Swiss-Prot
Match:
B0BN95 (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)
HSP 1 Score: 97.8 bits (242), Expect = 2.7e-19
Identity = 71/299 (23.75%), Positives = 132/299 (44.15%), Query Frame = 0
Query: 91 RPLSLCDQVAVALRRLGSGESLVTIGDSLGL--------------------NHYLHWPST 150
R +S Q+ AL SG +GD++G+ + ++H+P+
Sbjct: 65 RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPAD 124
Query: 151 EVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDEEKNHSVVLQVIVDA 210
E + +K +F + G+P G++D H+ + P ++ SYV + + HS+ V+ D
Sbjct: 125 EAAIQSLKDEFYGLAGMPGVIGAVDCIHVAIKAPNAEDLSYV--NRKGLHSLNCLVVCDI 184
Query: 211 EMRFRDILTGLPGKMSDWLVSQSSNFHKLCDKGERLNGKRLELNDRSEIREYIIGDSGYP 270
+ T PG + D V Q S+ + G + +++GDS +
Sbjct: 185 RGALMTVETSWPGSLQDCAVLQQSSLSSQFETGMPKD-------------SWLLGDSSFF 244
Query: 271 LLPYLVTPYDGKKLSTSKAEFNKRHKETRLVVQPALTMLKERWRIIQG----VMWRPDKH 330
L +L+TP + + ++ +N+ H T V++ L L R+R + G + + P+K
Sbjct: 245 LHTWLLTPLHIPE-TPAEYRYNRAHSATHSVIEKTLRTLCCRFRCLDGSKGALQYSPEKS 304
Query: 331 RLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEHDVDYKQQVCDVFDSKGAYLRDKLSL 366
IIL CC+LHNI ++ G + V IE + + + + D + +R +L L
Sbjct: 305 --SHIILACCVLHNISLEHGMDVWSSPVTGPIEQPPEGEDEQMESLDLEADRIRQELIL 345
BLAST of HG10021971 vs. ExPASy Swiss-Prot
Match:
Q17QR8 (Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1)
HSP 1 Score: 96.3 bits (238), Expect = 7.8e-19
Identity = 68/299 (22.74%), Positives = 129/299 (43.14%), Query Frame = 0
Query: 91 RPLSLCDQVAVALRRLGSGESLVTIGDSLGL--------------------NHYLHWPST 150
R +S Q+ AL SG +GD++G+ + ++H+P+
Sbjct: 65 RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPAD 124
Query: 151 EVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDEEKNHSVVLQVIVDA 210
E + +K +F + G+P G +D H+ + P ++ SYV + + HS+ ++ D
Sbjct: 125 EASVQALKDEFYGLAGIPGVIGVVDCMHVAIKAPNAEDLSYV--NRKGLHSLNCLMVCDI 184
Query: 211 EMRFRDILTGLPGKMSDWLVSQSSNFHKLCDKGERLNGKRLELNDRSEIREYIIGDSGYP 270
+ T PG + D +V Q S+ + G +++GDS +
Sbjct: 185 RGALMTVETSWPGSLQDCVVLQQSSLSSQFEAG-------------MHKESWLLGDSSFF 244
Query: 271 LLPYLVTPYDGKKLSTSKAEFNKRHKETRLVVQPALTMLKERWRIIQG----VMWRPDKH 330
L +L+TP + + ++ +N H T V++ L R+R + G + + P+K
Sbjct: 245 LRTWLMTPLHIPE-TPAEYRYNMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKS 304
Query: 331 RLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEHDVDYKQQVCDVFDSKGAYLRDKLSL 366
IIL CC+LHNI ++ G + V +E + + + + D + +R +L L
Sbjct: 305 --SHIILACCVLHNISLEHGMDVWSSPVTGPVEQPPEEEYEHMESLDLEADRIRQELML 345
BLAST of HG10021971 vs. ExPASy Swiss-Prot
Match:
Q8BR93 (Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1)
HSP 1 Score: 96.3 bits (238), Expect = 7.8e-19
Identity = 72/299 (24.08%), Positives = 130/299 (43.48%), Query Frame = 0
Query: 91 RPLSLCDQVAVALRRLGSGESLVTIGDSLGL--------------------NHYLHWPST 150
R +S Q+ AL SG +GD++G+ + ++H+P
Sbjct: 65 RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPVD 124
Query: 151 EVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDEEKNHSVVLQVIVDA 210
E + +K +F + G+P G D H+ + P ++ SYV + + HS+ V+ D
Sbjct: 125 EAAVQSLKDEFYGLAGMPGVIGVADCIHVAIKAPNAEDLSYV--NRKGLHSLNCLVVCDI 184
Query: 211 EMRFRDILTGLPGKMSDWLVSQSSNFHKLCDKGERLNGKRLELNDRSEIREYIIGDSGYP 270
+ T PG + D V Q S+ + G + +++GDS +
Sbjct: 185 RGALMTVETSWPGSLQDCAVLQRSSLTSQFETGMPKD-------------SWLLGDSSFF 244
Query: 271 LLPYLVTPYDGKKLSTSKAEFNKRHKETRLVVQPALTMLKERWRIIQG----VMWRPDKH 330
L +L+TP + + ++ +N+ H T V++ L L R+R + G + + P+K
Sbjct: 245 LRSWLLTPLPIPE-TAAEYRYNRAHSATHSVIERTLQTLCCRFRCLDGSKGALQYSPEK- 304
Query: 331 RLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEHDVDYKQQVCDVFDSKGAYLRDKLSL 366
IIL CC+LHNI +D G + VP I+ + + + + D + +R +L L
Sbjct: 305 -CSHIILACCVLHNISLDHGMDVWSSPVPGPIDQPPEGEDEHMESLDLEADRIRQELIL 345
BLAST of HG10021971 vs. ExPASy TrEMBL
Match:
A0A0A0M0C2 (DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G481730 PE=3 SV=1)
HSP 1 Score: 705.7 bits (1820), Expect = 1.0e-199
Identity = 346/388 (89.18%), Positives = 359/388 (92.53%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKEEAIDWWDDFSKRTNGLHSASKGLDRFKSTF 60
MGPIRGLRKKKKLERKLD NGTASDSSEK++AIDWWDDFSKRTNGLHSASKGLDRFKS F
Sbjct: 1 MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
Query: 61 KVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
KVSRKTFDYICLLVKDDMTAKSG+FTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61 KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
Query: 121 LN--------------------HYLHWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
LN H+LHWPS EVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
Query: 181 MCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLC 240
MCLPASDPTSYVWLD++KNHS+VLQVIVDAEMRFRDILTGLPGK+SDWLV QSSNFHKLC
Sbjct: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
Query: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRL 300
DKGERLNGKR EL DRSEIREYIIGDSGYPLLPYLVTPYDGK+LSTSK EFNKRHKETRL
Sbjct: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
Query: 301 VVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
VVQ AL MLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETE+G+VPLSIEH
Sbjct: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360
Query: 361 DVDYKQQVCDVFDSKGAYLRDKLSLLFI 369
DVDYKQQVCDVFDSKGAY+RD+LSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 388
BLAST of HG10021971 vs. ExPASy TrEMBL
Match:
A0A5A7SXL0 (Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43059G001000 PE=3 SV=1)
HSP 1 Score: 698.7 bits (1802), Expect = 1.3e-197
Identity = 345/388 (88.92%), Positives = 358/388 (92.27%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKEEAIDWWDDFSKRTNGLHSASKGLDRFKSTF 60
MGPIRGLRKKKKLERKLDSNGTASDSSEK++AIDWWDDFSKRTNGLHSASKGLDRFKS F
Sbjct: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
Query: 61 KVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
KVSRKTFDYICLLVKDDMTAKSG+FTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61 KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
Query: 121 LNH--------------------YLHWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
LNH +LHWPS EVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLCHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
Query: 181 MCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLC 240
MCLPASDPTS+VWLD+EKNHS+VLQVIVDAEMRFRDILTGLPGK+SD LV QSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDDEKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240
Query: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRL 300
DKGERLNGKRLEL DRSEI+EYI+GDSGYPLL YLVTPYDGK+LSTSKAEFNKRH TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300
Query: 301 VVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
VVQ AL MLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
Query: 361 DVDYKQQVCDVFDSKGAYLRDKLSLLFI 369
DVDYKQQVCDVFDSKGAYLR++LSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388
BLAST of HG10021971 vs. ExPASy TrEMBL
Match:
A0A5D3BNB9 (Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold49G00830 PE=3 SV=1)
HSP 1 Score: 698.7 bits (1802), Expect = 1.3e-197
Identity = 345/388 (88.92%), Positives = 358/388 (92.27%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKEEAIDWWDDFSKRTNGLHSASKGLDRFKSTF 60
MGPIRGLRKKKKLERKLDSNGTASDSSEK++AIDWWDDFSKRTNGLHSASKGLDRFKS F
Sbjct: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
Query: 61 KVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
KVSRKTFDYICLLVKDDMTAKSG+FTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61 KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
Query: 121 LNH--------------------YLHWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
LNH +LHWPS EVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLRHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
Query: 181 MCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLC 240
MCLPASDPTS+VWLD+EKNHS+VLQVIVDAEMRFRDILTGLPGK+SD LV QSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDDEKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240
Query: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRL 300
DKGERLNGKRLEL DRSEI+EYI+GDSGYPLL YLVTPYDGK+LSTSKAEFNKRH TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300
Query: 301 VVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
VVQ AL MLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
Query: 361 DVDYKQQVCDVFDSKGAYLRDKLSLLFI 369
DVDYKQQVCDVFDSKGAYLR++LSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388
BLAST of HG10021971 vs. ExPASy TrEMBL
Match:
A0A1S3C6F2 (LOW QUALITY PROTEIN: putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103497206 PE=3 SV=1)
HSP 1 Score: 695.7 bits (1794), Expect = 1.1e-196
Identity = 344/388 (88.66%), Positives = 356/388 (91.75%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKEEAIDWWDDFSKRTNGLHSASKGLDRFKSTF 60
MGPIRGLRKKKKLERKLDSNGTASDSSEK++AIDWWDDFSKRTNGLHSASKGLDRFKS F
Sbjct: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
Query: 61 KVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
KVSRKTFDYICLLVKDDMTAKSG+FTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61 KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
Query: 121 LNH--------------------YLHWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
LNH +LHWPS EVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLCHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
Query: 181 MCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLC 240
MCLPASDPTS+VWLD KNHS+VLQVIVDAEMRFRDILTGLPGK+SD LV QSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDXRKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240
Query: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRL 300
DKGERLNGKRLEL DRSEI+EYI+GDSGYPLL YLVTPYDGK+LSTSKAEFNKRH TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300
Query: 301 VVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
VVQ AL MLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360
Query: 361 DVDYKQQVCDVFDSKGAYLRDKLSLLFI 369
DVDYKQQVCDVFDSKGAYLR++LSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388
BLAST of HG10021971 vs. ExPASy TrEMBL
Match:
A0A6J1GBJ6 (protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111452471 PE=3 SV=1)
HSP 1 Score: 646.4 bits (1666), Expect = 7.4e-182
Identity = 320/389 (82.26%), Positives = 340/389 (87.40%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSN-GTASDSSEKEEAIDWWDDFSKRTNGLHSASKGLDRFKST 60
MGPIRG RKKKKLERKLD+N TASDSSEK++A+DWWDDFS+RT GLHS +GLD FKS
Sbjct: 1 MGPIRGSRKKKKLERKLDANASTASDSSEKDDALDWWDDFSRRTIGLHSELEGLDGFKSI 60
Query: 61 FKVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSL 120
FKVSRKTFDYICLLVKDDMTA+S NFTFLNGRPLSL DQVAVALRRLGSG+SLVTIG S
Sbjct: 61 FKVSRKTFDYICLLVKDDMTAESSNFTFLNGRPLSLYDQVAVALRRLGSGDSLVTIGYSF 120
Query: 121 GLNH--------------------YLHWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHI 180
GLNH +LHWPSTE EMAQVK KFEKIQGLPNCCGSIDTTHI
Sbjct: 121 GLNHSTVSQVTWRFVESMEVRGLRHLHWPSTEEEMAQVKLKFEKIQGLPNCCGSIDTTHI 180
Query: 181 TMCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKL 240
TMCLP DPTS VWLD EKNHS+VLQVIVDAEMRFRDI+TGLPGKMSDWLV QSSNFHKL
Sbjct: 181 TMCLPVLDPTSNVWLDAEKNHSMVLQVIVDAEMRFRDIVTGLPGKMSDWLVFQSSNFHKL 240
Query: 241 CDKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETR 300
C+KGERLNGKRLE +RSEIREYIIGDSGYPLLPYLVTPYDGK+L SKAEFNKRH ETR
Sbjct: 241 CEKGERLNGKRLEFINRSEIREYIIGDSGYPLLPYLVTPYDGKELQPSKAEFNKRHTETR 300
Query: 301 LVVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIE 360
LVVQ AL LKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIID+GDE EDG+VP+S+E
Sbjct: 301 LVVQRALASLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDVGDEMEDGNVPMSME 360
Query: 361 HDVDYKQQVCDVFDSKGAYLRDKLSLLFI 369
HD DYKQQ+CDV+DSKGAYLRDKLSLLFI
Sbjct: 361 HDADYKQQICDVYDSKGAYLRDKLSLLFI 389
BLAST of HG10021971 vs. TAIR 10
Match:
AT3G55350.1 (PIF / Ping-Pong family of plant transposases)
HSP 1 Score: 415.6 bits (1067), Expect = 4.1e-116
Identity = 210/403 (52.11%), Positives = 277/403 (68.73%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSN----GTAS---------------DSSEKEEAIDWWDDFSK 60
MGPI+ ++KKK+ E+K+D N TA+ D +++DWWD FS+
Sbjct: 1 MGPIKTIKKKKRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSR 60
Query: 61 RTNGLHSASKGLDRFKSTFKVSRKTFDYICLLVKDDMTAKSGNFTFLNGRPLSLCDQVAV 120
R ++ S F+S FK+SRKTFDYIC LVK D TAK NF+ NG PLSL D+VAV
Sbjct: 61 R---IYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAV 120
Query: 121 ALRRLGSGESLVTIGDSLGLN--------------------HYLHWPSTEVEMAQVKSKF 180
ALRRLGSGESL IG++ G+N H+L WPS ++ ++KSKF
Sbjct: 121 ALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSKF 180
Query: 181 EKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGL 240
EKI GLPNCCG+ID THI M LPA +P++ VWLD EKN S+ LQ +VD +MRF D++ G
Sbjct: 181 EKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGW 240
Query: 241 PGKMSDWLVSQSSNFHKLCDKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDG 300
PG ++D +V ++S F+KL +KG+RLNG++L L++R+E+REYI+GDSG+PLLP+L+TPY G
Sbjct: 241 PGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQG 300
Query: 301 KKLSTSKAEFNKRHKETRLVVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNI 360
K S + EFNKRH E Q AL+ LK+RWRII GVMW PD++RLPRII VCCLLHNI
Sbjct: 301 KPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNI 360
Query: 361 IIDIGDETEDGDVPLSIEHDVDYKQQVCDVFDSKGAYLRDKLS 365
IID+ D+T D D PLS +HD++Y+Q+ C + D + LRD+LS
Sbjct: 361 IIDMEDQTLD-DQPLSQQHDMNYRQRSCKLADEASSVLRDELS 396
BLAST of HG10021971 vs. TAIR 10
Match:
AT3G63270.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 296.2 bits (757), Expect = 3.6e-80
Identity = 167/396 (42.17%), Positives = 233/396 (58.84%), Query Frame = 0
Query: 1 MGPIRGLRKKKKLERKLDSNGTASDSSEKE---------EAI--DWWDDFSKRTNGLHSA 60
M P++ +KKK ++ LD + + EK+ EAI DWWD F R +
Sbjct: 1 MAPVK--QKKKNKKKPLDKAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVP 60
Query: 61 SKGLDRFKSTFKVSRKTFDYICLLVKDDMTAK-SGNFTFLNGRPLSLCDQVAVALRRLGS 120
S FK F+ S+ TF YIC LV++D+ ++ + GR LS+ QVA+ALRRL S
Sbjct: 61 SDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLAS 120
Query: 121 GESLVTIGDSLGL--------------------NHYLHWPSTEVEMAQVKSKFEKIQGLP 180
G+S V++G + G+ H+L WP ++ + ++KSKFE++ GLP
Sbjct: 121 GDSQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSD-RIEEIKSKFEEMYGLP 180
Query: 181 NCCGSIDTTHITMCLPASDPTSYVWLDEEKNHSVVLQVIVDAEMRFRDILTGLPGKMSDW 240
NCCG+IDTTHI M LPA S W D+EKN+S+ LQ + D EMRF +++TG PG M+
Sbjct: 181 NCCGAIDTTHIIMTLPAVQ-ASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVS 240
Query: 241 LVSQSSNFHKLCDKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKKLSTSK 300
+ + S F KLC+ + L+G L+ ++IREY++G YPLLP+L+TP+D S S
Sbjct: 241 KLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSM 300
Query: 301 AEFNKRHKETRLVVQPALTMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDE 360
FN+RH++ R V A LK WRI+ VMWRPD+ +LP IILVCCLLHNIIID GD
Sbjct: 301 VAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDY 360
Query: 361 TEDGDVPLSIEHDVDYKQQVCDVFDSKGAYLRDKLS 365
++ DVPLS HD Y + C + G+ LR L+
Sbjct: 361 LQE-DVPLSGHHDSGYADRYCKQTEPLGSELRGCLT 391
BLAST of HG10021971 vs. TAIR 10
Match:
AT5G12010.1 (unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, plasma membrane, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G29780.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 125.9 bits (315), Expect = 6.5e-29
Identity = 83/319 (26.02%), Positives = 143/319 (44.83%), Query Frame = 0
Query: 29 KEEAIDWWDDFSKRTNGLHSASKGLDRFKSTFKVSRKTFDYICLLVKDDMTAKSGNFTFL 88
K+ + WW++ S+ + FK F++S+ TF+ IC D++ +
Sbjct: 155 KDRSRAWWEECSR-------LDYPEEDFKKAFRMSKSTFELIC----DELNSAVAKEDTA 214
Query: 89 NGRPLSLCDQVAVALRRLGSGESLVTIGDSLGLN---------------------HYLHW 148
+ + +VAV + RL +GE L + GL YL W
Sbjct: 215 LRNAIPVRQRVAVCIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQW 274
Query: 149 PSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSY-----VWLDEEKNHSV 208
P E + ++ +FE + G+PN GS+ TTHI + P SY +++ ++S+
Sbjct: 275 PDDE-SLRNIRERFESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSI 334
Query: 209 VLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLCDKGERLNGKRLELNDRSEIREY 268
+Q +V+ + F D+ G PG M D V + S ++ + G L G +
Sbjct: 335 TIQAVVNPKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGM------------W 394
Query: 269 IIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRLVVQPALTMLKERWRIIQGVMWR 322
+ G G+PLL +++ PY + L+ ++ FN++ E + V + A LK RW +Q
Sbjct: 395 VAGGPGHPLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQ-KRTE 448
BLAST of HG10021971 vs. TAIR 10
Match:
AT4G29780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G12010.1); Has 945 Blast hits to 944 proteins in 87 species: Archae - 0; Bacteria - 0; Metazoa - 519; Fungi - 43; Plants - 365; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )
HSP 1 Score: 123.6 bits (309), Expect = 3.2e-28
Identity = 91/319 (28.53%), Positives = 145/319 (45.45%), Query Frame = 0
Query: 29 KEEAIDWWDDFSKRTNGLHSASKGLDRFKSTFKVSRKTFDYICLLVKDDMTAKSGNFTFL 88
KE DWWD S+ D F+ F++S+ TF+ IC + +T K+ T L
Sbjct: 193 KERTTDWWDRVSR-------PDFPEDEFRREFRMSKSTFNLICEELDTTVTKKN---TML 252
Query: 89 NGRPLSLCDQVAVALRRLGSGESLVTIGDSLGLN---------------------HYLHW 148
+ +V V + RL +G L + + GL YL W
Sbjct: 253 RD-AIPAPKRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLW 312
Query: 149 PSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSY-----VWLDEEKNHSV 208
PS + E+ K+KFE + +PN GSI TTHI + P +Y +++ ++S+
Sbjct: 313 PS-DSEINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSI 372
Query: 209 VLQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLCDKGERLNGKRLELNDRSEIREY 268
+Q +V+A+ F D+ G PG ++D + + S+ R R L D +
Sbjct: 373 TVQGVVNADGIFTDVCIGNPGSLTDDQILEKSSL-------SRQRAARGMLRD-----SW 432
Query: 269 IIGDSGYPLLPYLVTPYDGKKLSTSKAEFNKRHKETRLVVQPALTMLKERWRIIQGVMWR 322
I+G+SG+PL YL+ PY + L+ ++ FN+ E + + A LK RW +Q
Sbjct: 433 IVGNSGFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQ-KRTE 486
BLAST of HG10021971 vs. TAIR 10
Match:
AT3G19120.1 (PIF / Ping-Pong family of plant transposases)
HSP 1 Score: 95.5 bits (236), Expect = 9.4e-20
Identity = 72/259 (27.80%), Positives = 115/259 (44.40%), Query Frame = 0
Query: 87 FLNGRPLSLCDQVAVA--LRRLGSGESLVTIGDSLGLNHYL------------------- 146
F+ LSL AVA L RL G S T+ L+ YL
Sbjct: 138 FITASNLSLPADYAVAMVLSRLAHGCSAKTLASRYSLDPYLISKITNMVTRLLATKLYPE 197
Query: 147 --HWPSTEVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDEEKNHSVV 206
P + + + FE++ LPN CG+ID+T + + ++ + +V+
Sbjct: 198 FIKIPVGKRRLIETTQGFEELTSLPNICGAIDSTPVKLRRRTKLNPRNIYGCKYGYDAVL 257
Query: 207 LQVIVDAEMRFRDILTGLPGKMSDWLVSQSSNFHKLCDKGERLNGKRLELNDRSEIREYI 266
LQV+ D + F D+ PG D + S +K G+ + K + + +R YI
Sbjct: 258 LQVVADHKKIFWDVCVKAPGGEDDSSHFRDSLLYKRLTSGDIVWEKVINIRGH-HVRPYI 317
Query: 267 IGDSGYPLLPYLVTPYDGKKLSTSKAE-FNKRHKETRLVVQPALTMLKERWRIIQGVMWR 322
+GD YPLL +L+TP+ T F+ + R VV A+ +LK RW+I+Q +
Sbjct: 318 VGDWCYPLLSFLMTPFSPNGSGTPPENLFDGMLMKGRSVVVEAIGLLKARWKILQSL--N 377
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038893506.1 | 9.6e-200 | 90.49 | protein ALP1-like [Benincasa hispida]
XP_004149039.1 | 2.1e-199 | 89.18 | protein ALP1-like [Cucumis sativus] >KGN65671.1 hypothetical protein Csa_019843 ...
TYK01291.1 | 2.6e-197 | 88.92 | putative nuclease HARBI1 [Cucumis melo var. makuwa]
KAA0033909.1 | 2.6e-197 | 88.92 | putative nuclease HARBI1 [Cucumis melo var. makuwa]
XP_008457540.1 | 2.2e-196 | 88.66 | PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1 [Cucumis melo]
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9M2U3 | 5.8e-115 | 52.11 | Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1
Q94K49 | 5.1e-79 | 42.17 | Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=...
B0BN95 | 2.7e-19 | 23.75 | Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1
Q17QR8 | 7.8e-19 | 22.74 | Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1
Q8BR93 | 7.8e-19 | 24.08 | Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0M0C2 | 1.0e-199 | 89.18 | DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G481730 PE...
A0A5A7SXL0 | 1.3e-197 | 88.92 | Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffol...
A0A5D3BNB9 | 1.3e-197 | 88.92 | Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...
A0A1S3C6F2 | 1.1e-196 | 88.66 | LOW QUALITY PROTEIN: putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC1034...
A0A6J1GBJ6 | 7.4e-182 | 82.26 | protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111452471 PE=3 SV=1
TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G55350.1 | 4.1e-116 | 52.11 | PIF / Ping-Pong family of plant transposases
AT3G63270.1 | 3.6e-80 | 42.17 | CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G12010.1 | 6.5e-29 | 26.02 | unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, ...
AT4G29780.1 | 3.2e-28 | 28.53 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G19120.1 | 9.4e-20 | 27.80 | PIF / Ping-Pong family of plant transposases
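
The percentages in each HSP header follow directly from the reported fractions (for example, 352/389 gives 90.49% identity for the top NCBI nr hit). The Python sketch below shows one way to read the two header lines of an HSP and recompute those values. It assumes the text layout used on this page. The function name `parse_hsp` and the returned field names are hypothetical and not part of any BLAST tool.

```python
import re

# Patterns written against the header layout shown on this page (an assumption,
# not an official BLAST output format). "Posi?tives" accepts both spellings.
HSP_SCORE = re.compile(
    r"HSP\s+(\d+)\s+Score:\s+([\d.]+)\s+bits\s+\((\d+)\),\s+Expect\s+=\s+(\S+)"
)
HSP_IDENT = re.compile(
    r"Identity\s+=\s+(\d+)/(\d+)\s+\(([\d.]+)%\),\s+Posi?tives\s+=\s+(\d+)/(\d+)"
)


def parse_hsp(score_line, identity_line):
    """Parse the two HSP header lines and recompute the percentages."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    identical, length = int(i.group(1)), int(i.group(2))
    positives = int(i.group(4))
    return {
        "hsp": int(s.group(1)),
        "bit_score": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "evalue": float(s.group(4)),
        # Recomputed from the fractions, rounded to two decimals.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
        # Percentage as printed in the header, kept for cross-checking.
        "reported_identity_pct": float(i.group(3)),
    }


if __name__ == "__main__":
    hit = parse_hsp(
        "HSP 1 Score: 706.8 bits (1823), Expect = 9.6e-200",
        "Identity = 352/389 (90.49%), Positives = 359/389 (92.29%), Query Frame = 0",
    )
    print(hit)
```

Comparing `identity_pct` with `reported_identity_pct` is a quick consistency check when these records are copied between the detailed alignments and the summary tables.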