Tan0011004 (gene) Snake gourd v1

Overview
NameTan0011004
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
LocationLG05: 23593733 .. 23599467 (+)
RNA-Seq ExpressionTan0011004
SyntenyTan0011004
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGGCCAAGCATCTCAGACTAATGGTCGAACGTCTAAATCTTACAGAGGAGGAAGGCAGAGCTATCGTGGTTGAGGACGATGACGTCGATGAGAGCGCCCGCCTCCTGTCGATATCCCTGATCTGCAAAAGTCTATCCTCGAAACCAGTTCATATTGATGTTTTCCGGCAAAAAATTCCAAAAATTTGGAAAACCACTGCCCCGATTGGTATCGATAAAGTTGGGGAAAATCTATTCCTCTGTAGCTTTGGGAATACAGGAGATTTACGAAGAGTCTTGGAAAACGAACCATGGTACTTTGATAAGGCGATTCTCCTATTTGATGAACCAAGAGGCAACTGTCGGTTCATAGATCTGGAATTCAGGTTCGTTAATTTTTGGATCCACATACATAATCTGCCTCCTGCTGAACAATCGCTGAAAATGGCACAAATCTTTGGAAATCGCCTCGGTGTGTTCCAGAAGGTGGACCTGAACGATACAGAAACATGTTGGGGAAGCTCCTTGTGCCTAAAGGTTCGGATCAACGTCACCATTCCTCTAAAACGGGGATTAAAGGTCAAAATAGGAACGATGGCAGAGGAGTTATGGTGCCCGGTCACCTACGAGAAACTTCCTGATTTTTGCTACAGCTGCGGGCGAACCGGGCACCTGGACAGGATATGCAATGAGGTTGATTGGTCAACTTCGAGTAAAAAGCAATTTGGTTCTGGACTAAGACATCCATCCTCAAATTCTGGACAACAGAATTGGGCACGATCGTTTTTATCAGATAGCAGAGGAAGAGGAAGGGGAGTTCATGGCAGAGGATCGAGAGACACAAAGAAGACAGTGGAGGACGATATCGAAGAAGACTCGTCGATGGATTCGAGTTCAGGGATAAATTTACAGACCGGAAGCAAACAGAAGACATCCTTGGACACAAATAATTTGGAAAAAAATTGTCGAAGATCGGAGTCGGAGGCCGGAGACAGAGCGACGGAGAGGCAAGGAAGCGAGGAGTTTCAAAAATTAATTAATGCCATTAATTTCTCCGTTGAATCGGTGTCAAAAGAGAGGTCGTTAGAAGTAACTGAAGGACATTCAAGGAAGGATCGAAATGGAAAGCTAAAAGGTCTACTGGAGGTTTCAAGAAAGCTGGTTTTCGACATTGAAGGAGAAGAAGACGTTTCGAAAAAGGTTCTTAATTCTTCTTCTGCAGTTGAGCCACGTGGGGTTGAGAAAGGAGAAAAATTAGCACAGGTACCAACAAAAGGGCTCTCAATTTTGCACTCCTCAACCGGGCCTAAAATAATGGGCTTTAAGGACAACGACATGGAGACTCGGATAATTGGGCTAGCCGAAGAATTTTTGCAGACTAATGGGCTAAATTTTCAAATAGGCTCCTCGGCTAAGTCCAACCAAAACTACAATGGGCTACTTAAGATGCAGAAAACGGGGAAGGTAAAAATGGGAAAAATCGAGTTAAATGGCAAACTAAATCCACTCTCCCTTATGGTTTCAGGAACAAAGCGTGGTATCCAACAGAATCTGAATGAGGTCTCGACTAAGAAAGCTCGCACAGATGAAGGAAAGGACCTCACTCCAGAAGCACCCATGACTTTGGCGGTGGTTGTGACCCAGCCCCGCCTCGAACCATGATTCTGTTATGTTGGAATGTGCGAGGACTGGGGAACCCTCGCACATTCCAAGCCTTATGTCGTCTGGTACAAAGGCATAAACCCCAAATCATTTTCCTTTCTGAAATTATGTCGTCTTCAATTTTCTTAGATCAAATAAAAGTTCAATTGCATTTTGATTATTGTTTTTCTGTCTCAAGCAATGGAAAAAGTGGTGGGTTAAGTTTGTTCTGGAACAAAAGTACCGATCTGTCAGTTATGTCTTATTCGTCCGGGCACATCGACTCCTTGATTAAAAGTGTTGACGGGGATTGGTGTTTTACTGGCTTTTATGGCAATCCACGCATAGACCAGAGAAAACACTCTTGGGAGCTTCTAAGACGCCTGCATGGACAACTAACACTTCCTTGGATAGTGGGCGGAGATTTTAATGAAATATTATCGAACTCAGAAAAACAAGGAGGGGAAGCTAGGAGAGAAGCTCCAATGGTTAATTTTAGGGAGCTACTAGATGACTGCAAGTTGGTTGACCCAGGGTATAAAGGCCATGATTACACCTGGTTCAGAAGACGTCGGGATAGGGACTCAATCTGGGAACGTCTAGACCGGTTTTTAATTAATAAAGAGATAATGGATATGTGGTTTGATTTCGAAACCATTCATCTAGGGACCTACTCTTCTGATCATCGACCAATCTTGGGGGTTACAGGTGAACGAGCGCATTTTCAAAGGCAACATGATAGGGGGCTTCTAAGATTCGAACCAAATTGGGGTACATATCCAGACTGTTGGGAGATTGTAGCAAATGTGTGGCAACGACATCCTCATAATACCATTGGGAACATGCAGGTTAGACTAAATGCTTGTTTGATGGAATTAAAAAGATGGAACCGAACTCGTTTGGAGGGATTTTTAAAGGGGGCCATTTCAAAAAAAGAAAAAGAAATCCAATCCTTAGAACAATACTTGACCCCAGACACAGAGGAAACTTGGTTTCAAAAGAGAAGGGAGTTAGACAACCTACTTCAAGAGGATGAGATATATTGGAGACAAAGGTCAAGGGTGGATTGGTTAAAATGGGGGGATCGAAACACGAAATGGTTTCACATCAAGGCCACACAGCGGAAAAAACAAAACAAGATCCGGACACTTCAAACTCTGAACGAATCGTGGATTTCTGATGAAAAAGAAATAGGAGAGTTTGCAACCTCGTACTTCCAACATCTTTTTTCTTCTGATGACCCGACTTCTGATATGATTGAAGGCGTAACAGATTGTATCTTACCATCGGTCACAGATGATACTAATAGAATGCTTCTATCGGATTTCACACGTGAAGAGATTGAGCTAGTCATGAAAAATATGCACCCTTCTAAAGCTCCAAGTCCGGATGGGATTCACGCGACTTTCTACCAAGTTTACTGGGATATTGTTGGAACAGAAATTACTGACATCTGCCTTGGCTGCCTAAACGACACCTTAGAGTTAGGCCCTCTGAACCATACCTTAATTGCACTTATCCCAAAGAAAAATAATCCCTCTCAGATGACTGACTTTCGACCTATTAGTCTCTGTAATGTGGTCTACAAAATTATCTCTAAGACCTTGGCCAACAGGTTAAAAAAGGTATTAAACCAAATCATCTCTCCTGAACAGGCTGTTTTCATTCCAGACCGCTCAATATCGGATAATGCCTTAACAGGTTTTGAATGCCTCCATGCAATAAGAAATAAGAGGAAAGGGAATATTGGATCGGTGGCTATGAAATTAGATATCAGTAAGGCTTATGATAGTAGAATGGGATTTTCTTAGAAAGGTAATGGACAAAATGGGCTTTCATTCTGACTGGGTGGACAAAATAATGCGGTGTGTCGAATCCGTTACCTTCTCTGTTCTTTTGAATGGGAATCTGCAGTCACGTTTTACTCTGACGAGAGGTCTCAGACAGGGAGATCCTCTGTCACCTTATCTATTTCTCCTTTGTGCGGAGGGGCTATCTAGCCTCCTTCACAAGAAACAGAGCGAAAAAAACTTTACTGGTTTAAAAATTAATCGTTATTGCCCGACCATTTCTCATTTATTCTATGTGGATGATTGTCTCCTTTTCTTTCAGGCAAAACAACAGGACATCAAAACTATAAAGGACATTCTCAATGTTTATGAAATTGCTTCGGGCCAATCGATTAATTTTGCTAAATCATCTTTTCATGTTAGTAAGAACACACATGTTGGAATCGCGAAACAAATTCAGGATACTTTCAATATCCCCGAAGTGGACTCCTTGGGATGCTATTTGGGTCTTCCTTCTCAAATGTCAAGAAACAAATCACAGATTTTTAGTTCTCTAAAGGAACGGGTTTGTAAGACTTTACAAGGTTGGAAAGGGAACTTATTCTCGGTAGGGGGTAGAGAAGTGTTTATTAAAGCAGTTGCTCAGACCATTCCTATTTACACAATGAGTTGTTTCAAGCTCCCCGAACGGTTGTGTACTGATATAAACATGATGTGTGCAAAATTCTGGTGGGGCTCAAATGAAATAGAAAGGAAAATTCACTGGATGAGTTGGAAGCGCATGTGCGCCCCAAAGGCTTGTGGTGGAATGGGCTTTAGAGATTTAACTCTGTTCAACCAAGCCATGTTAGCAAAACAAAGCTGGCGACTGCTTCGGGATACAGGAAGTCTGTTGTACAAAGTGCTCCGGGGAAGGTATTTCCCGTCGGGTACCTTTTTACATGCGAATATTGGCTCCAATCTCTCATATATTTGGCGTAGTATTCTTTGGGGTCGAGAGCTTTTCAACAAGGGATACAGGTGGAAAATAGGTAATGGTTTCCAGGTAAGCATTAAAAATGACCCCTGGCTATCGGTAGAAGGAAGAGACAAGCCCTTAGTTGTTGACGCCCTTCTTGCTGGAAGAACGGTTAGCTACATTCTTAAAGAGGATGGGACATGGGATGCTGGAAAATACAACGACTTTTTTTATCTGAAGATGCAGAAAATATTCTTAAGTTACCCCGGACTGGGACGATGGGTTGTGACGAGATTATATGGAAATGCCACCCTCGGGGTGTCTTCACAGTTAAAAGTGCCTACCAACTAAGACTTCGTATCCAAGATTCACAGGAGGCTTCTAATTCGACCAACAGGAGAGACTCTATTTGGATGGCATTATGGAACGCTAATACTCCATCCAAGATCAAGATTTGTTGTTGGAGAATTCTTCACAATATCCTCCCCACAAAGACAAATCTGATCCAAAAGGGCCTCGACATTCAACCATGGTGCCCTTTCTGCATGAAACAACCGGAGACGAGCTGCCATATCCTATGGGGATGCAAGGTAACAAGGGTGCTTTGGAACCATTTTCTACCTTCTTACACGAATTTGTTTTATGATTTCAGGGAAGATTGGAATGCGGGAACTTATTTTCAGTGGATGTTGGAAGACAACAATCGAAAAGACTTTAACGTCTTTCTGATCATTTTATGGAAGATCTGGACCTGGAGGAATTTAGCGATTAGGGATAAACAAATTTGGAACCAGGAAGAACTCATCAGAATCACACGGTGTCATGTCACGGAGTTCATTTCCTCTCCAGTTGTGCCTCCCCCACAAGCATCCTTACCTTCGAACAATGGAGGAAACTAGAATCCGCCAAGACAAGGAACTTGGTATCTAAACACAGATGCTTCATGGAGTACAGACCGCGATTGTGGCGGTTTAGGTTGGATATTTCGAGAGTGGGATGGTCGGCTGGTTCGTGCTGGACATCATTTCATCCGCACAAACTGGTCGATCCTGATTCTGGAACTCAGGGGCATTATCGAGGGTTTGAAGGCAATCCCAAACAAAACTATCCCACTCGTAGTGGAATCAGACTCTTTAGAAGCCATTCAACAAATTAATGGTTCGTCAGTTGACTACACTGAGACCAGTGAGTTTATTAATGAAATCAAAACGATGGCAAGCATGTGGTCACAGATAGCTTTTAAACATATTCCTAGATCGGCAAACCAGACGACCCACAAACTAGCACAAAGGGCCTCACGGCTACAAACAAATGAATCTTGGTTGGATGGTCCCTCTTCGAACCTTGATACTTTTCTATCTTAA

mRNA sequence

ATGGAGGCCAAGCATCTCAGACTAATGGTCGAACGTCTAAATCTTACAGAGGAGGAAGGCAGAGCTATCGTGGTTGAGGACGATGACGTCGATGAGAGCGCCCGCCTCCTGTCGATATCCCTGATCTGCAAAAGTCTATCCTCGAAACCAGTTCATATTGATGTTTTCCGGCAAAAAATTCCAAAAATTTGGAAAACCACTGCCCCGATTGGTATCGATAAAGTTGGGGAAAATCTATTCCTCTGTAGCTTTGGGAATACAGGAGATTTACGAAGAGTCTTGGAAAACGAACCATGGTACTTTGATAAGGCGATTCTCCTATTTGATGAACCAAGAGGCAACTGTCGGTTCATAGATCTGGAATTCAGGTTCGTTAATTTTTGGATCCACATACATAATCTGCCTCCTGCTGAACAATCGCTGAAAATGGCACAAATCTTTGGAAATCGCCTCGGTGTGTTCCAGAAGGTGGACCTGAACGATACAGAAACATGTTGGGGAAGCTCCTTGTGCCTAAAGGTTCGGATCAACGTCACCATTCCTCTAAAACGGGGATTAAAGGTCAAAATAGGAACGATGGCAGAGGAGTTATGGTGCCCGGTCACCTACGAGAAACTTCCTGATTTTTGCTACAGCTGCGGGCGAACCGGGCACCTGGACAGGATATGCAATGAGGTTGATTGGTCAACTTCGAGTAAAAAGCAATTTGGTTCTGGACTAAGACATCCATCCTCAAATTCTGGACAACAGAATTGGGCACGATCGGGAGTTCATGGCAGAGGATCGAGAGACACAAAGAAGACAGTGGAGGACGATATCGAAGAAGACTCGTCGATGGATTCGAGTTCAGGGATAAATTTACAGACCGGAAGCAAACAGAAGACATCCTTGGACACAAATAATTTGGAAAAAAATTGTCGAAGATCGGAGTCGGAGGCCGGAGACAGAGCGACGGAGAGGCAAGGAAGCGAGGAGTTTCAAAAATTAATTAATGCCATTAATTTCTCCGTTGAATCGGTGTCAAAAGAGAGGTCGTTAGAAGTAACTGAAGGACATTCAAGGAAGGATCGAAATGGAAAGCTAAAAGGTCTACTGGAGGTTTCAAGAAAGCTGGTTTTCGACATTGAAGGAGAAGAAGACGTTTCGAAAAAGGTTCTTAATTCTTCTTCTGCAGTTGAGCCACGTGGGGTTGAGAAAGGAGAAAAATTAGCACAGGTACCAACAAAAGGGCTCTCAATTTTGCACTCCTCAACCGGGCCTAAAATAATGGGCTTTAAGGACAACGACATGGAGACTCGGATAATTGGGCTAGCCGAAGAATTTTTGCAGACTAATGGGCTAAATTTTCAAATAGGCTCCTCGATAATGGATATGTGGTTTGATTTCGAAACCATTCATCTAGGGACCTACTCTTCTGATCATCGACCAATCTTGGGGGTTACAGGTGAACGAGCGCATTTTCAAAGGCAACATGATAGGGGGCTTCTAAGATTCGAACCAAATTGGGGTACATATCCAGACTGTTGGGAGATTGTAGCAAATGTGTGGCAACGACATCCTCATAATACCATTGGGAACATGCAGGTTAGACTAAATGCTTGTTTGATGGAATTAAAAAGATGGAACCGAACTCGTTTGGAGGGATTTTTAAAGGGGGCCATTTCAAAAAAAGAAAAAGAAATCCAATCCTTAGAACAATACTTGACCCCAGACACAGAGGAAACTTGGTTTCAAAAGAGAAGGGAGTTAGACAACCTACTTCAAGAGGATGAGATATATTGGAGACAAAGGTCAAGGGTGGATTGGTTAAAATGGGGGGATCGAAACACGAAATGGTTTCACATCAAGGCCACACAGCGGAAAAAACAAAACAAGATCCGGACACTTCAAACTCTGAACGAATCGTGGATTTCTGATGAAAAAGAAATAGGAGAGTTTGCAACCTCGTACTTCCAACATCTTTTTTCTTCTGATGACCCGACTTCTGATATGATTGAAGGCGTAACAGATTGTATCTTACCATCGGTCACAGATGATACTAATAGAATGCTTCTATCGGATTTCACACATGCAGAAAATATTCTTAAGTTACCCCGGACTGGGACGATGGGTTGTGACGAGATTATATGGAAATGCCACCCTCGGGGTGTCTTCACAGTTAAAAGTGCCTACCAACTAAGACTTCGTATCCAAGATTCACAGGAGGCTTCTAATTCGACCAACAGGAGAGACTCTATTTGGATGGCATTATGGAACGCTAATACTCCATCCAAGATCAAGATTTGTTGTTGGAGAATTCTTCACAATATCCTCCCCACAAAGACAAATCTGATCCAAAAGGGCCTCGACATTCAACCATGGTGCCCTTTCTGCATGAAACAACCGGAGACGAGCTGCCATATCCTATGGGGATGCAAGGTAACAAGGGTGCTTTGGAACCATTTTCTACCTTCTTACACGAATTTGTTTTATGATTTCAGGGAAGATTGGAATGCGGGAACTTATTTTCAGTGGATGTTGGAAGACAACAATCGAAAAGACTTTAACGTCTTTCTGATCATTTTATGGAAGATCTGGACCTGGAGGAATTTAGCGATTAGGGATAAACAAATTTGGAACCAGGAAGAACTCATCAGAATCACACGGTGTCATGTCACGGAGTTCATTTCCTCTCCAGTTAATCCGCCAAGACAAGGAACTTGGTATCTAAACACAGATGCTTCATGGAGTACAGACCGCGATTGTGGCGGTTTAGGTTGGATATTTCGAGAGTGGGATGGTCGGCTGGTTCGTGCTGGACATCATTTCATCCGCACAAACTGGTCGATCCTGATTCTGGAACTCAGGGGCATTATCGAGGGTTTGAAGGCAATCCCAAACAAAACTATCCCACTCGTAGTGGAATCAGACTCTTTAGAAGCCATTCAACAAATTAATGGTTCGTCAGTTGACTACACTGAGACCAGTGAGTTTATTAATGAAATCAAAACGATGGCAAGCATGTGGTCACAGATAGCTTTTAAACATATTCCTAGATCGGCAAACCAGACGACCCACAAACTAGCACAAAGGGCCTCACGGCTACAAACAAATGAATCTTGGTTGGATGGTCCCTCTTCGAACCTTGATACTTTTCTATCTTAA

Coding sequence (CDS)

ATGGAGGCCAAGCATCTCAGACTAATGGTCGAACGTCTAAATCTTACAGAGGAGGAAGGCAGAGCTATCGTGGTTGAGGACGATGACGTCGATGAGAGCGCCCGCCTCCTGTCGATATCCCTGATCTGCAAAAGTCTATCCTCGAAACCAGTTCATATTGATGTTTTCCGGCAAAAAATTCCAAAAATTTGGAAAACCACTGCCCCGATTGGTATCGATAAAGTTGGGGAAAATCTATTCCTCTGTAGCTTTGGGAATACAGGAGATTTACGAAGAGTCTTGGAAAACGAACCATGGTACTTTGATAAGGCGATTCTCCTATTTGATGAACCAAGAGGCAACTGTCGGTTCATAGATCTGGAATTCAGGTTCGTTAATTTTTGGATCCACATACATAATCTGCCTCCTGCTGAACAATCGCTGAAAATGGCACAAATCTTTGGAAATCGCCTCGGTGTGTTCCAGAAGGTGGACCTGAACGATACAGAAACATGTTGGGGAAGCTCCTTGTGCCTAAAGGTTCGGATCAACGTCACCATTCCTCTAAAACGGGGATTAAAGGTCAAAATAGGAACGATGGCAGAGGAGTTATGGTGCCCGGTCACCTACGAGAAACTTCCTGATTTTTGCTACAGCTGCGGGCGAACCGGGCACCTGGACAGGATATGCAATGAGGTTGATTGGTCAACTTCGAGTAAAAAGCAATTTGGTTCTGGACTAAGACATCCATCCTCAAATTCTGGACAACAGAATTGGGCACGATCGGGAGTTCATGGCAGAGGATCGAGAGACACAAAGAAGACAGTGGAGGACGATATCGAAGAAGACTCGTCGATGGATTCGAGTTCAGGGATAAATTTACAGACCGGAAGCAAACAGAAGACATCCTTGGACACAAATAATTTGGAAAAAAATTGTCGAAGATCGGAGTCGGAGGCCGGAGACAGAGCGACGGAGAGGCAAGGAAGCGAGGAGTTTCAAAAATTAATTAATGCCATTAATTTCTCCGTTGAATCGGTGTCAAAAGAGAGGTCGTTAGAAGTAACTGAAGGACATTCAAGGAAGGATCGAAATGGAAAGCTAAAAGGTCTACTGGAGGTTTCAAGAAAGCTGGTTTTCGACATTGAAGGAGAAGAAGACGTTTCGAAAAAGGTTCTTAATTCTTCTTCTGCAGTTGAGCCACGTGGGGTTGAGAAAGGAGAAAAATTAGCACAGGTACCAACAAAAGGGCTCTCAATTTTGCACTCCTCAACCGGGCCTAAAATAATGGGCTTTAAGGACAACGACATGGAGACTCGGATAATTGGGCTAGCCGAAGAATTTTTGCAGACTAATGGGCTAAATTTTCAAATAGGCTCCTCGATAATGGATATGTGGTTTGATTTCGAAACCATTCATCTAGGGACCTACTCTTCTGATCATCGACCAATCTTGGGGGTTACAGGTGAACGAGCGCATTTTCAAAGGCAACATGATAGGGGGCTTCTAAGATTCGAACCAAATTGGGGTACATATCCAGACTGTTGGGAGATTGTAGCAAATGTGTGGCAACGACATCCTCATAATACCATTGGGAACATGCAGGTTAGACTAAATGCTTGTTTGATGGAATTAAAAAGATGGAACCGAACTCGTTTGGAGGGATTTTTAAAGGGGGCCATTTCAAAAAAAGAAAAAGAAATCCAATCCTTAGAACAATACTTGACCCCAGACACAGAGGAAACTTGGTTTCAAAAGAGAAGGGAGTTAGACAACCTACTTCAAGAGGATGAGATATATTGGAGACAAAGGTCAAGGGTGGATTGGTTAAAATGGGGGGATCGAAACACGAAATGGTTTCACATCAAGGCCACACAGCGGAAAAAACAAAACAAGATCCGGACACTTCAAACTCTGAACGAATCGTGGATTTCTGATGAAAAAGAAATAGGAGAGTTTGCAACCTCGTACTTCCAACATCTTTTTTCTTCTGATGACCCGACTTCTGATATGATTGAAGGCGTAACAGATTGTATCTTACCATCGGTCACAGATGATACTAATAGAATGCTTCTATCGGATTTCACACATGCAGAAAATATTCTTAAGTTACCCCGGACTGGGACGATGGGTTGTGACGAGATTATATGGAAATGCCACCCTCGGGGTGTCTTCACAGTTAAAAGTGCCTACCAACTAAGACTTCGTATCCAAGATTCACAGGAGGCTTCTAATTCGACCAACAGGAGAGACTCTATTTGGATGGCATTATGGAACGCTAATACTCCATCCAAGATCAAGATTTGTTGTTGGAGAATTCTTCACAATATCCTCCCCACAAAGACAAATCTGATCCAAAAGGGCCTCGACATTCAACCATGGTGCCCTTTCTGCATGAAACAACCGGAGACGAGCTGCCATATCCTATGGGGATGCAAGGTAACAAGGGTGCTTTGGAACCATTTTCTACCTTCTTACACGAATTTGTTTTATGATTTCAGGGAAGATTGGAATGCGGGAACTTATTTTCAGTGGATGTTGGAAGACAACAATCGAAAAGACTTTAACGTCTTTCTGATCATTTTATGGAAGATCTGGACCTGGAGGAATTTAGCGATTAGGGATAAACAAATTTGGAACCAGGAAGAACTCATCAGAATCACACGGTGTCATGTCACGGAGTTCATTTCCTCTCCAGTTAATCCGCCAAGACAAGGAACTTGGTATCTAAACACAGATGCTTCATGGAGTACAGACCGCGATTGTGGCGGTTTAGGTTGGATATTTCGAGAGTGGGATGGTCGGCTGGTTCGTGCTGGACATCATTTCATCCGCACAAACTGGTCGATCCTGATTCTGGAACTCAGGGGCATTATCGAGGGTTTGAAGGCAATCCCAAACAAAACTATCCCACTCGTAGTGGAATCAGACTCTTTAGAAGCCATTCAACAAATTAATGGTTCGTCAGTTGACTACACTGAGACCAGTGAGTTTATTAATGAAATCAAAACGATGGCAAGCATGTGGTCACAGATAGCTTTTAAACATATTCCTAGATCGGCAAACCAGACGACCCACAAACTAGCACAAAGGGCCTCACGGCTACAAACAAATGAATCTTGGTTGGATGGTCCCTCTTCGAACCTTGATACTTTTCTATCTTAA

Protein sequence

MEAKHLRLMVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRICNEVDWSTSSKKQFGSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTGSKQKTSLDTNNLEKNCRRSESEAGDRATERQGSEEFQKLINAINFSVESVSKERSLEVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGVEKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETRIIGLAEEFLQTNGLNFQIGSSIMDMWFDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFISSPVNPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLVVESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRASRLQTNESWLDGPSSNLDTFLS
Homology
BLAST of Tan0011004 vs. NCBI nr
Match: KAE8800683.1 (retrotransposon unclassified [Hordeum vulgare])

HSP 1 Score: 234.6 bits (597), Expect = 4.0e-57
Identity = 251/1083 (23.18%), Postives = 426/1083 (39.34%), Query Frame = 0

Query: 9    MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTA 68
            M+  L L+EEE + + +     D+  + +    + K LS K  H D     + K+W    
Sbjct: 1    MLRGLKLSEEEKKGVKIRGTAKDK-GKSMGAKAVGKILSEKLAHPDAISLSLGKVWCPIK 60

Query: 69   PIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFW 128
             I   ++GEN F+ +F      RR +E+ PW FD  +++ +E     R  + EF  +  W
Sbjct: 61   GINCKEMGENRFVFTFMQDSGKRRAIEDGPWMFDNDLVVVEEFDAQKRLEEYEFNNIPIW 120

Query: 129  IHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGL-- 188
            + ++NLP    + + A+  GN +G F + D     +  G  L +K+R+ +  PL RG   
Sbjct: 121  VRVYNLPLGMMNEESAEDIGNIIGQFVEADTGVDGSAIGMYLRIKIRMRIDKPLMRGFTL 180

Query: 189  -------KVKIGTMAEEL----WCPVTYEKLPDFCYSCGRTGHLDRICNEVDWSTSSKKQ 248
                   K K   M +E     WC   YE LPDFCY+CG  GH ++ CN        K+Q
Sbjct: 181  DDDDERKKHKGKNMGKEEDGSGWCRFEYEFLPDFCYTCGLLGHGEKDCN-TKLQKGEKQQ 240

Query: 249  FG------SGLRHPSSNSG-----------QQNWARSGVHG----------------RGS 308
            FG       G R  + ++G           Q+N+  S  +G                R S
Sbjct: 241  FGRWIKADMGHRRAAGDAGGWRKGGRENGTQRNYGYSRSNGRTGSGSDSLSWRKDGFRAS 300

Query: 309  RDTKKTVEDDIEEDSSMDSSSGINLQTGSKQKTSLDTNNLEKNCRRSES-EAGDRATERQ 368
             D  +  E   E  S + ++ G  +Q+G  +K  L  N  EK+    E   AG +     
Sbjct: 301  GDGPENTEKGEEVTSPVKNTQG-RVQSGVPKKLLLGENVKEKDGNDGEELVAGGKGVRED 360

Query: 369  G--SEEFQKLINAINFSVESVSKERSLEVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEE 428
            G   EE   L      +    +++ +    +G+  K  +    G     R  +  + G  
Sbjct: 361  GHSREEQDLLTKQAELTGIPQAQQDTQRTGDGNKSKTNDKNPNGKKFRRRDRMEHMSGPN 420

Query: 429  DVSKKVLNSSSAVEPRGVEKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETRIIGLAE 488
               + +L             G+K  Q  T+G     +  G  ++  +  ++   I     
Sbjct: 421  PQRESIL-------------GQKRTQTATEGEEEKDAKKGRLVVVMERGELNYYIC---- 480

Query: 489  EFLQTNGLNFQIGSSIMDMWFDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFE 548
            E L     N    +   + + +FE I+     SDHRPI+  T      +     G  RFE
Sbjct: 481  ERLDRATAN----ARWCEAFPEFEVINSIPRHSDHRPIIVNTKGEGRRRSGKGDGSFRFE 540

Query: 549  PNWGTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEK 608
              W     C E +   W+       G +   +      +++W++  + G L+G + K   
Sbjct: 541  AWWLEEEGCTEEIQGAWEESWMTGEGGVAGAMRRVAGRMRKWHK-GVVGELEGRVKKARA 600

Query: 609  EIQSLEQYLTPDTEETWFQKRR---ELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIK 668
            E++   +  +P +E    ++ R    L  L ++  I  +QRS + WL+ G+RNT++F   
Sbjct: 601  ELERCMR--SPPSEHKVREEARLRCVLRELEEKKSIKAKQRSHITWLRKGNRNTRYFMSV 660

Query: 669  ATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTSDMIEGVTDCILPSV 728
               RKKQN+++ L+  + S + +  E+  +  SYFQ LF+++                  
Sbjct: 661  VAARKKQNRLKMLRKEDGSDVKEGTELTNYVRSYFQELFTTN------------------ 720

Query: 729  TDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQDSQEA 788
                             + ++ + G +G                            S +A
Sbjct: 721  ---------------VEMQRMNKNGGIG---------------------------GSSDA 780

Query: 789  SNSTNR-RDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPW-CPFCMKQ 848
                N+  +  W  +W    P  +++  WRI H  L   TN+ ++G  +Q   C FC   
Sbjct: 781  VGDLNKCTEDSWKRIWKLACPRNVQMFAWRIKHESLALLTNMQRRGFQLQTTRCFFCGLA 840

Query: 849  PETSCHILWGCKVTRVLWNHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVFLII 908
             E   H+   CK+ + +W         +  +   D +A   + W L++  R      L  
Sbjct: 841  DEDGAHLFVKCKMVKEVWRELALEKERINLEAITDVHAMMDYLWGLDECKRLH---VLTF 900

Query: 909  LWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEF--ISSP---------VNPPRQGTWYL 968
             W  W++RN   + +      E+ R TR  V E+  I +P           PP +G    
Sbjct: 901  WWLWWSYRNKVRQGELPSTPAEVARRTRSSVLEYRQIYAPGTKKISADEWKPPGEGVVKF 960

Query: 969  NTDASWSTDRDCGGLGWIFREWDGRLV--RAG-HHFIRTNWSILILELRGIIEGLKAIPN 1023
            N D S++      G G   R  DG LV  RAG   +I   +   ++ +   +    A   
Sbjct: 961  NLDGSFTPGDQHVGWGVAARTSDGDLVAARAGRQEYISDPFGAEVIAMANAV--ALAADL 991

BLAST of Tan0011004 vs. NCBI nr
Match: VFQ81500.1 (unnamed protein product [Cuscuta campestris])

HSP 1 Score: 204.9 bits (520), Expect = 3.4e-48
Identity = 265/1209 (21.92%), Postives = 448/1209 (37.06%), Query Frame = 0

Query: 10   VERLNLTEEEGRAIVVEDDDVDESAR--LLSISLICKSLSSKPVHIDVFRQKIPKIWKTT 69
            +  ++L  E+   +V  D+  D S +       L+ + L+ +PV+    +  +  +W+  
Sbjct: 10   IAAISLDNEDEDGLVFGDEIGDSSIQQPAYDFCLVGRFLTERPVNFVAMKNTMASLWRPE 69

Query: 70   APIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNF 129
              + + +VG  L++  FG   ++ RV+E  PW F+   LL +    +    ++    +  
Sbjct: 70   EGMVVKEVGAGLYIFQFGALAEMERVMEMCPWSFNNQALLLERLGRSKDPSEVSLHHLYV 129

Query: 130  WIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLK 189
            W+ ++ L     S ++AQ  GN +G F + D  +    W S L ++V+++V  PLK+G +
Sbjct: 130  WVRVYGLKRGFFSERVAQRLGNEIGQFVEADPKNFSNPWSSYLRIRVKLDVQKPLKKGTR 189

Query: 190  VKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRICNEV--DWSTSSKKQFGSGLR---H 249
            +K     E       YE+LP FC+ CGR GH +R C E    W    +++FG  LR    
Sbjct: 190  LK-REGKEWFHVDFAYERLPTFCFVCGRLGHGERFCPETLRQWGRQVERKFGPELRASTR 249

Query: 250  PSSNSGQQNWARSGVHGRG--SRDTKKTVEDDIEEDSSMDS--SSGINLQTGSKQKTSLD 309
             +SN+    W R  +  RG  ++  +K  + D    +   +  SS +  Q G    T L 
Sbjct: 250  RASNNIGARWLREELPPRGDEAQQQEKIGKGDSRRSTPAYTGLSSVLCKQDGVPPSTELA 309

Query: 310  TNNLEKNCRRSESEAGDRATERQGSEEFQKLINAINFSVESVSKERSLEVTEGHSRKDRN 369
                ++N   +          R       K  +     + +  K+R   V E +     +
Sbjct: 310  V--WQRNREATHMMESSNLQNRFDQNLMDKQPDVEMLLIPTDPKKRKKSVDEKNGESSGS 369

Query: 370  GKLK---------GLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGVEKGEKLAQVPTK 429
            G L           LL +SR    D+E E+    +  ++     P    + E    +   
Sbjct: 370  GGLALLWKSPLNVTLLSLSR-FHIDVEVEDLALGRWRHTGFYGHPDQSRRSESWDLL--- 429

Query: 430  GLSILHSSTGPKIMGFKDNDMETR------------IIGLAEEFLQTNGLNFQIGSSIMD 489
               +  +ST P I G   N +  +            +I    E ++  GL          
Sbjct: 430  -RDLSQASTLPWICGGDFNAIMEQHEKQGGPPKPRYLIQAFREAVEDAGL---------- 489

Query: 490  MWFDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNWGTYP-------DCWE 549
               DF             P+ G    + +  R       RFE  W + P       DCWE
Sbjct: 490  --MDF-------------PLGGAWVPKPYAHR------FRFENAWRSDPTLRPLLLDCWE 549

Query: 550  IVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTP 609
            +          +   +++ +LN C   L  W     + F   +     + IQ   +    
Sbjct: 550  V----------SHAASLEEKLNFCSTRLHVWGMEWKDQF--ASEINIMRTIQRRTRGRRD 609

Query: 610  DTEETWFQK-RRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKI--- 669
                  FQ+ ++ L  L    E +WRQ+++  WL  GDRNTK+FH +A++R++ N+I   
Sbjct: 610  HQSRIQFQRAKKRLFELYALKEAFWRQQAKQFWLTQGDRNTKFFHAQASERQRLNRIDRL 669

Query: 670  ---------------------------------RTLQTLNESWISD-------------- 729
                                               +Q+L  S +S               
Sbjct: 670  TDSNGQLRTWSNGLEDTIKEYFEELYHAQGNPGHIIQSLIPSLVSREDNAMLRQPYSYEE 729

Query: 730  ---------------------------------------------------EKEIGE--- 789
                                                               E +IG    
Sbjct: 730  VRQAVFSMHPDKSPGGDGFNPGKMAWKFLTQPDLLVSKVFKAKYFRDCSFLEAQIGSNSS 789

Query: 790  ------FATSYFQH---------------------------LFSSDDPTSDMIEGVTDCI 849
                  FAT    H                             ++++P S  +  V+D  
Sbjct: 790  FVWKSIFATKDLLHTGIRWKVGPGTNISIWEDPWLMDNENPFITTENPGSIALTRVSDLR 849

Query: 850  LPSVTDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQD 909
             P+  D      + +    + I  +P TG+   D +IW     G++TVKSAY  +    D
Sbjct: 850  SPNGWDWPTIDAIFNARDRQCIEAIPLTGSSSNDMLIWAHDKSGIYTVKSAY--KALTWD 909

Query: 910  SQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCM 969
            +    +  +R  ++W  LW      K++   WR  +NILP   NL+ K + +Q  CP C 
Sbjct: 910  ATVLGHGMDR--ALWKKLWAVRVLPKVRNLIWRAANNILPCLNNLVTKRVTVQDVCPLCH 969

Query: 970  KQPETSCHILWGCKVTRVLWN-HFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVF 1029
               ET  HI   C   R +W   FL  Y      F  DW        +L   N  D  + 
Sbjct: 970  TSEETVLHIFVHCPFARQVWGASFLGWYAPAVGSF-HDWLLA-----VLHLFNEYDKGLA 1029

Query: 1030 LIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFISSPVNPPRQGTWYLNTDASWST 1039
              +LW IW+ RN      ++W       + R  +  +   PVN  +     +N DAS   
Sbjct: 1030 FHLLWMIWSARN-----AKVWK-----GVRRSPLEGWERPPVNYLK-----VNVDASSGL 1089

BLAST of Tan0011004 vs. NCBI nr
Match: PWA36168.1 (hypothetical protein CTI12_AA602590 [Artemisia annua])

HSP 1 Score: 203.0 bits (515), Expect = 1.3e-47
Identity = 245/1118 (21.91%), Postives = 444/1118 (39.71%), Query Frame = 0

Query: 79   LFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLPPAE 138
            +FL   G+  DLRRVLE+ PW F++ +++      + +  + +   V FW+ + N+P   
Sbjct: 1    MFLVQLGHDVDLRRVLEDGPWSFERNLVVLKLIENDEQPTETDMTKVPFWVRLINMPLGR 60

Query: 139  QSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELW 198
            +     +    ++G   +VD  D     GS     +R+  T       KV++        
Sbjct: 61   RDESSVRRVAAKIGDVLEVD--DAYFTKGSK---HIRVKNT-------KVRVN------- 120

Query: 199  CPVTYEKLPDFCYSCGRTGHLDRIC----NEVDWSTSSKKQFGSGLR------------- 258
              + YE+LP+FCY CG  GH ++ C     E++  T     F   LR             
Sbjct: 121  --IQYERLPNFCYWCGLLGHTEKECLTKPFEINGKTFKDWPFHENLRASNSRDSVSLSVA 180

Query: 259  ----HPSSNSGQQN-----------WARSGVHGRGSRDTKK---------------TVED 318
                HP+ N+ QQ            +  S ++   ++D  +               T++ 
Sbjct: 181  SPLTHPAFNNFQQTTNQRLLIQDNIFNESQINESSNQDLDERKCLSGGTSNYCPTSTLKI 240

Query: 319  DIEEDSSMDSSSGINLQTG----------SKQKTSLDTNNLEKNC-RRSESEAGDRATER 378
              +++      SGI L  G          +K +  L+  NL K   RR+  E   + T  
Sbjct: 241  TGQKEPKNSMGSGIELDMGRKIEPHLRGSTKAQQDLENTNLPKLWKRRTREETNTKTTTG 300

Query: 379  QGSEEF---QKLINAINFSVE------------SVSKERSLEVTEGHSRKDRNGKLKG-- 438
              S       + ++ ++++V+            S+ KE S  +      +    ++ G  
Sbjct: 301  TNSISIAAPPRPMSILSWNVQGLGNPWTVQHIRSLVKELSPSIIFLMETRLHGSEVTGFR 360

Query: 439  --------LLEVSRKLVFDIEGEEDVS-----------KKVLNSSSAVEPRGVEKGEKLA 498
                    L+  S +   D   +ED +           ++     +    R +   ++ A
Sbjct: 361  YIFPQYNLLVVDSIRRAGDFVVKEDGNFWRGTGIYGWPRRQEKHRTWALLRSLRTNQEQA 420

Query: 499  QVPTKGLSIL---------HSSTGPKIMGFKDN----DMETRIIGLAEEFLQTNGL---- 558
             V     + +           S   ++  F++     ++E R   +  +   +NG     
Sbjct: 421  WVCFGDFNEIMYAFEKEGQRGSNNTEMSAFREACSFCNLEDR-SAMGVKLTWSNGRRGNE 480

Query: 559  -------NFQIGSSIMDMWFDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEP 618
                    F   S   D++ D    +L   +SDH PI+     R     +    + RFE 
Sbjct: 481  NVRKRLDRFLTNSHWFDLYPDASFENLPRIASDHSPIIC----RLSPMVKKKNRMFRFES 540

Query: 619  NW-------GTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGA 678
             W       G   D W        +H    I      ++ C   L  WN+ R  G ++ +
Sbjct: 541  MWLRDKSIHGVVRDGWAYGLAAGMQHDPCGI------VSECANRLSDWNK-RSFGHVQRS 600

Query: 679  ISKKEKEIQSLEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWF 738
            I  K++ +Q+L+      T       R ++  LL  +E+ W+QRSR++WL+ GD+NT++F
Sbjct: 601  IKSKQRSLQTLQSRFDGSTRAEQQALREQIKELLTREELMWKQRSRIEWLREGDKNTRFF 660

Query: 739  HIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTSDMIEGVTDCIL 798
            H +A+ R+++N I  L+  +  W+ +  E+ +  +SYF  LFSS  P     E V   I 
Sbjct: 661  HTRASNRQRRNSILRLKGPDGRWVEEHNEVCKLVSSYFSDLFSSSSPQG--CESVVRDID 720

Query: 799  PSVTDDTNRMLLSDFTHAE--NILKLPRTG----------------TMGC--------DE 858
              +T++  + L    T +E  ++L     G                 +GC        D 
Sbjct: 721  RRLTENERQALERPVTSSEVRDLLNTEGDGWNHELMYSLFPHNIASKIGCCFISKSRNDI 780

Query: 859  IIWKCHPRGVFTVKSAYQLRLRI-QDSQEASNSTNRRDSIWMALWNANTPSKIKICCWRI 918
            + W  +P G F+ KSAY L L   +D    +  +N     W  +W A  PSK+K+  WR 
Sbjct: 781  LYWHNNPGGRFSCKSAYLLALEADEDMVRTTTISNSLIDFWRVVWKARVPSKVKLFMWRA 840

Query: 919  LHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDF 978
             +N +PT  NL  +GL+    C  C +  E   H+L+ C V + +WN         FYD 
Sbjct: 841  WNNYVPTIDNLKSRGLNPTSSCTHCGQTSENLVHVLFKCSVAKDVWNR---CNFGCFYDT 900

Query: 979  REDWNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVT 1026
            +       + Q +LE     ++  F++ILW +WT RN     +    +  +  I +  ++
Sbjct: 901  QGAITFQDFCQVILE-KFLAEWETFMMILWGLWTRRNRHFHGQLNGREGNVEVIAKSVLS 960

BLAST of Tan0011004 vs. NCBI nr
Match: TXG53380.1 (hypothetical protein EZV62_022549 [Acer yangbiense])

HSP 1 Score: 198.7 bits (504), Expect = 2.4e-46
Identity = 195/762 (25.59%), Postives = 344/762 (45.14%), Query Frame = 0

Query: 9   MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTA 68
           + E L   E +   ++++ +   E  + +S  L+ K L  K ++ + F+  I ++W +T 
Sbjct: 9   LCEMLATLELDRPEVLIKGEAHREGIKEVSHCLMRKVLVGKRINREAFKGVIEQLWSSTD 68

Query: 69  PIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFW 128
            + ++K+ +NLF+  F        V    PW+FD  I++ ++  G      +EF+ V+FW
Sbjct: 69  TVEVEKMDDNLFVFYFPRKEVRGLVWARGPWHFDNHIIVLEKLEGPRDMASMEFKMVDFW 128

Query: 129 IHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKV 188
           + IH +P    + ++ +I   ++G+  ++   D++  WG  L ++VRI+++ PLKR LKV
Sbjct: 129 VQIHQVPMLCMNSRITKILAKQIGIVVEIPA-DSKESWGKFLRVRVRIDISKPLKRCLKV 188

Query: 189 KIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRIC-------NEVDWSTS-----SKKQF 248
           ++    + +   + YE+L + C++ G+ GH+ R C         +D  T+      K   
Sbjct: 189 RLEGFEKAIVALIHYERLLELCFAYGKIGHVMRDCCDEEAKKEALDGKTTRYGVWMKAAA 248

Query: 249 GSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTG------ 308
             GL+   S +G++     G+  RG  +      D + + SS  S SG+ ++ G      
Sbjct: 249 PKGLKRGVS-TGREREVGEGLITRGVENIDVGFADIVSQLSSEVSGSGMGIKEGRIVSKA 308

Query: 309 ---------SKQKT------SLDTNNLEKN--CRR-----SESEAGDRATERQ------- 368
                      QK        +  + LE+N  C        ESE  DR  E+        
Sbjct: 309 DIRNEIIEEMNQKVLAVAYEEMPMDRLEENDGCLGGNQCVEESEWVDRVREKHMVLSPRK 368

Query: 369 --------------------GSEEFQKLINAINFSVESVSKERSL----EVTEGHSRKDR 428
                               G     K I   +   +S ++++SL    +  +    K  
Sbjct: 369 VSFRKWRRAARKWHAPKGMLGVTSPIKRILEAHHIAKSKTRDKSLFPKGKKQQIRVLKKN 428

Query: 429 NGKLKGLLEVSRKLVFDIEGEEDVSKK-------VLNSSSAVEPRGVEKGEK-------- 488
           + K +    V RK++  ++ E  V KK       +L+  +A     + +GE         
Sbjct: 429 SPKKRAEGVVKRKIILPVKEEGAVEKKLKLSPDDLLSKETAEHDVQLHRGEADKFRCLLG 488

Query: 489 ---LAQVPTKGLS--ILHSSTGPKIMGF--------KDNDMETRIIGLAEEFLQT----N 548
              + QV ++G S     SST   I           + +++     G   E L++     
Sbjct: 489 FEGVLQVDSEGKSGGFYGSSTQDNIAASWELLKWLRRVDNLPWVCGGDFNEILRSEEKHR 548

Query: 549 GLNFQI-GSSIMDMWFDFETIHLGTYSSDHRP-ILGVTGERAHFQRQHDRGLLRFEPNWG 608
           G + Q+ G +I     D    HLG  +SDHRP IL   G   + ++  D G  + EP W 
Sbjct: 549 GSDRQVLGMAIFRQAIDDYVKHLGYNNSDHRPIILNTRGSLKNPKKGSDFG-FKCEPFWL 608

Query: 609 TYPDCWEIVANVW-QRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQ 664
           T   C E+V + W      + +  ++ +L++C  +L+ W++ +  G L   IS K +E++
Sbjct: 609 TEEKCAEVVTSAWGDSEVSSLVDYLRRKLSSCARKLEAWSKEKF-GSLGKLISLKSEELE 668

BLAST of Tan0011004 vs. NCBI nr
Match: GAU41525.1 (hypothetical protein TSUD_140560 [Trifolium subterraneum])

HSP 1 Score: 196.4 bits (498), Expect = 1.2e-45
Identity = 202/747 (27.04%), Postives = 335/747 (44.85%), Query Frame = 0

Query: 16  TEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKV 75
           TE+E   I VE +++ E       +L+ K  +  P ++  F+Q I + W+    I +  +
Sbjct: 11  TEDEEDCITVEAEEICEEEETFKRTLVGKLWTENPYNVRAFKQTIAQAWRLKNSIEVQDL 70

Query: 76  GENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLP 135
            ENLFL  F    D   VL N PW FD+ +L+     G  +  DL+   VNFW+ +++LP
Sbjct: 71  EENLFLFRFTTKKDADTVLRNGPWSFDRNLLILHRVSGEEQPSDLDMHNVNFWVRVYDLP 130

Query: 136 PAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAE 195
              +S  MA+  GN +G F++VD  D     G  L LK  +++  PLKRG K+K     +
Sbjct: 131 FKLRSEAMAKKLGNIMGNFEEVDPKDANRT-GRFLRLKASLDLRKPLKRGTKIKF--QDK 190

Query: 196 ELWCPVTYEKLPDFCYSCGRTGHLDRICNEVD------WSTSSKKQ--FGSGLR------ 255
            +W    YE+LP+FC++CG+ GH  + C +V+      +S  ++K   +G  LR      
Sbjct: 191 NMWVDFKYERLPNFCFACGKIGHQMKECEDVEDVDENNYSDIAEKSQAYGPWLRASPLPR 250

Query: 256 -------HPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQ-TGSKQ 315
                    SS S  +N   S    +G +   K    D E++       GI  Q   +K 
Sbjct: 251 IFEEPRKDASSGSCSKNLFPSSSQSKGGQSEGK---KDKEQEVDQQPVVGIEKQLVPTKA 310

Query: 316 KTSLDTNNLEKNCRR---SESEAGDRATERQGSEEFQKLINAINFSVESVSKERSLEVTE 375
             +LD  ++ ++      S S    + T+ QG  + +K +   N S + V  + + ++ +
Sbjct: 311 TNNLDVEDVAESLGAVAISTSFVVGKTTKSQGGSKGRKWVR--NKSSKPVKAQTARKLAK 370

Query: 376 GHSRKD----RNGKLKGLLEVSR----KLVFDIEGEEDVS-------KKVLNSSSAVEPR 435
              +++       +++ L  + R    ++VF +E    V        K   +S  A++ +
Sbjct: 371 ELGKRNLVDVAISEVRALSRLIRLENPQVVFLMETRLKVPEIDRLKFKLGFSSGLAIDCK 430

Query: 436 GV--EKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETR----IIGL-AEE----FLQT 495
           GV  E+   LA      + I   S     +  +  D+ET     + G+ AEE     ++T
Sbjct: 431 GVGRERAGGLALFWKDHMDITIKSYSLNHIHGQCVDVETNEPWDLTGIYAEEKKGGLVRT 490

Query: 496 NGLNFQIG------SSIMDMWFD-----------------------------------FE 555
            G   +IG      S + D+ F+                                     
Sbjct: 491 QG-QLEIGRQAILESRLNDLGFEGYPLTWSNGRNSDDNKQCRQDRAMSSDEFINRFSPIH 550

Query: 556 TIHLGTYSSDHRPILGVTGE----RAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVWQRH 615
            +HL  Y SDH  +L +T E    R   +R+  + L RFE +W +   C  ++   W + 
Sbjct: 551 VLHLPRYGSDH-VVLVITLEAPTHRDQRRRRRRKRLFRFEESWTSNARCEPLIQACWSQ- 610

Query: 616 PHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTPDTEETWFQK 663
           P  +  +   RL     EL         G +   I + EK IQ+ + +   D  ET  Q+
Sbjct: 611 PCLSFSDKLGRLRDMGNEL----GDHSVGSIHKEIVRIEKLIQNHDMW---DESETSIQR 670

BLAST of Tan0011004 vs. ExPASy TrEMBL
Match: A0A2N9FNT0 (RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16695 PE=4 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 1.7e-50
Identity = 190/706 (26.91%), Postives = 323/706 (45.75%), Query Frame = 0

Query: 9   MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTA 68
           M +  +L+++EG      D D+  +++     L  K L+S+ +++D   +    +WKT  
Sbjct: 7   MWKNFSLSDKEGL-----DVDLANTSQQSENILAAKFLTSRVLNMDAVARTFKPLWKTRQ 66

Query: 69  PIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFW 128
              +  +G N     F +  DL RVL NEPW +DK +++F   +G+    D  F   +FW
Sbjct: 67  SFTVQDIGGNKVAFVFEDAMDLERVLMNEPWTYDKFLVVFQRVQGDEPIQDSMFSHTSFW 126

Query: 129 IHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTI--PLKRGL 188
           + +HNLP   ++ + A+  G  +G+ +KV  ++ E   G   C++VRI + I  PL RG 
Sbjct: 127 VQLHNLPIRRRTEEAAESIGRSIGLVEKVAASEDER--GGENCMRVRIRLEINRPLCRGR 186

Query: 189 KVKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRICN----EVDWSTSSKKQFGSGLRH 248
            VK     +  W    YE+LP+FCY CG   H ++ C+    +   S   + QFG+ LR 
Sbjct: 187 LVKFEEGIKG-WVAFRYERLPNFCYWCGCLDHGEKDCDVGIQQRQTSNKQEYQFGAWLRA 246

Query: 249 PSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTGSKQKTSLDTNNL 308
            S  +  +       +   SRD     +    +D               + + +++  +L
Sbjct: 247 TSDRAPHKTVVIVPGNQPKSRDKSCRQDPPCRKDP-------------PQHQPTVEAEDL 306

Query: 309 EKNCRRSESEAGDRATERQGSEEFQK-----LINAINFSVESVSKERSLEVTEGHSRKDR 368
            +N R +     +   + +   E ++     + + I+ S +           E H R + 
Sbjct: 307 TRNQRENGKTTDNTEEDPENEMEIEQNPGFPIPDRIHNSDQPWRLTCFYGAPETHLR-EH 366

Query: 369 NGKLKGLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGVEKGEK---LAQVPTKGLSIL 428
           +  L   L     L +   G+     +++ SS     R   + +     A +   G   L
Sbjct: 367 SWNLLRTLNGQHALPWCCFGD---FNEIVRSSEKSGRRNQSESQMQRFRAVIDECGFIDL 426

Query: 429 HSSTGPKIMGFKDNDMETRIIGLAEEFLQTNGLNFQIGSSIMDMWFDFETIHLGTYSSDH 488
                P           T  + L + F+ TN    +  S+ +D        H+   +SDH
Sbjct: 427 GFRGLPFTWCNNRRGNATTWLRL-DRFMATNEWVLRFSSAAVD--------HMECTTSDH 486

Query: 489 RPILGVTGERAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVW-QRHPHNTIGNMQVRLNA 548
           +PI  +  +     R   + L RFE  W  +PDC  +V   W  +   + I  ++ ++  
Sbjct: 487 KPIC-LNTQPVQVPRPRQK-LFRFEDMWRMHPDCEPVVTQAWVPKTRGSPIAQVKTKIQR 546

Query: 549 CLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTPDT-----EETWFQKRRELDNLLQ 608
           C  EL RW+RT+      G I+K  KE   L +    D+      +T    R+E+++LL 
Sbjct: 547 CGDELTRWSRTQF-----GNITKLLKEKTELLRQAEVDSTLGFGHDTVISIRKEVNDLLI 606

Query: 609 EDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFAT 668
           ++E  W+QRSR  WLK GDRNTK+FH +A+ R+++N I TL   +   +++ + IG   T
Sbjct: 607 KEEKMWKQRSRDSWLKEGDRNTKYFHSRASHRRRRNSILTLIRDDGEIVTNPEAIGTQFT 666

Query: 669 SYFQHLFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAENI 695
            Y+Q LF++ +P  D +E V D I P VT + N+ L+S FT AE I
Sbjct: 667 DYYQALFTA-NPLED-VEVVLDGIQPCVTQEMNQNLISQFTEAEVI 669

BLAST of Tan0011004 vs. ExPASy TrEMBL
Match: A0A2N9FNT0 (RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16695 PE=4 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.3e-21
Identity = 90/360 (25.00%), Postives = 148/360 (41.11%), Query Frame = 0

Query: 691  AENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTN-RRDSIWMA 750
            AE ILK+P +  +  D+I W  +  G ++V+S Y+L L+ +   +A +S     D +W  
Sbjct: 1043 AEAILKIPLSERVQEDKIFWFDNRDGKYSVRSGYKLLLKDEQVLQAESSRQWDPDPLWKR 1102

Query: 751  LWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTR 810
            +W A  P+KIK   WR  H+ LPT + L Q+ +   P C  C  Q E S H LW C    
Sbjct: 1103 IWGARVPAKIKSFLWRACHDSLPTNSGLFQRKVLPNPLCGLCHSQREDSFHALWACPNVN 1162

Query: 811  VLWNHFLPSYTNLFYDFRED---------WNAGTYFQWMLEDNNRKDFNVFLIILWKIWT 870
             +W     S  + F  FR           WN   +      D N      +     +IWT
Sbjct: 1163 QVW-----SGASEFSVFRNSDPSNLLALIWNKRNH------DRNHPPSEQYS----QIWT 1222

Query: 871  WRNLAIRDKQIWNQEELIRITRCHVTEFISSPVNPPRQGTWYLNTDASWSTDRDCGGLGW 930
                 + +      EE         T +       P    + +N D +   + + GG+G 
Sbjct: 1223 RAQTVLHEYLAVTTEEKAEKQTPPQTRW-----RLPVTNYYKMNFDGAIFKESNSGGIGV 1282

Query: 931  IFREWDGRLVRAGHHFIRTNWSILILEL----RGIIEGLKAIPNKTIPLVVESDSLEAIQ 990
            + R+  G  +      +    ++ ++E     R II   +      + + VE D+   I+
Sbjct: 1283 VIRDHTGMAIATLSQKVHGTHTVEMIEALAARRAIIFAKEV---GIVDVEVEGDAENIIK 1342

Query: 991  QINGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRASRLQTNESWLD 1037
             +N +   +T     I + K +   + + +  H  RS N   H LA+RAS   +   WL+
Sbjct: 1343 DLNSNHPIHTPYGLVIEDAKALIQDFQRFSLSHTRRSGNSVAHALARRASGCNSFSVWLE 1379


HSP 2 Score: 204.9 bits (520), Expect = 1.6e-48
Identity = 265/1209 (21.92%), Postives = 448/1209 (37.06%), Query Frame = 0

Query: 10   VERLNLTEEEGRAIVVEDDDVDESAR--LLSISLICKSLSSKPVHIDVFRQKIPKIWKTT 69
            +  ++L  E+   +V  D+  D S +       L+ + L+ +PV+    +  +  +W+  
Sbjct: 10   IAAISLDNEDEDGLVFGDEIGDSSIQQPAYDFCLVGRFLTERPVNFVAMKNTMASLWRPE 69

Query: 70   APIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNF 129
              + + +VG  L++  FG   ++ RV+E  PW F+   LL +    +    ++    +  
Sbjct: 70   EGMVVKEVGAGLYIFQFGALAEMERVMEMCPWSFNNQALLLERLGRSKDPSEVSLHHLYV 129

Query: 130  WIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLK 189
            W+ ++ L     S ++AQ  GN +G F + D  +    W S L ++V+++V  PLK+G +
Sbjct: 130  WVRVYGLKRGFFSERVAQRLGNEIGQFVEADPKNFSNPWSSYLRIRVKLDVQKPLKKGTR 189

Query: 190  VKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRICNEV--DWSTSSKKQFGSGLR---H 249
            +K     E       YE+LP FC+ CGR GH +R C E    W    +++FG  LR    
Sbjct: 190  LK-REGKEWFHVDFAYERLPTFCFVCGRLGHGERFCPETLRQWGRQVERKFGPELRASTR 249

Query: 250  PSSNSGQQNWARSGVHGRG--SRDTKKTVEDDIEEDSSMDS--SSGINLQTGSKQKTSLD 309
             +SN+    W R  +  RG  ++  +K  + D    +   +  SS +  Q G    T L 
Sbjct: 250  RASNNIGARWLREELPPRGDEAQQQEKIGKGDSRRSTPAYTGLSSVLCKQDGVPPSTELA 309

Query: 310  TNNLEKNCRRSESEAGDRATERQGSEEFQKLINAINFSVESVSKERSLEVTEGHSRKDRN 369
                ++N   +          R       K  +     + +  K+R   V E +     +
Sbjct: 310  V--WQRNREATHMMESSNLQNRFDQNLMDKQPDVEMLLIPTDPKKRKKSVDEKNGESSGS 369

Query: 370  GKLK---------GLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGVEKGEKLAQVPTK 429
            G L           LL +SR    D+E E+    +  ++     P    + E    +   
Sbjct: 370  GGLALLWKSPLNVTLLSLSR-FHIDVEVEDLALGRWRHTGFYGHPDQSRRSESWDLL--- 429

Query: 430  GLSILHSSTGPKIMGFKDNDMETR------------IIGLAEEFLQTNGLNFQIGSSIMD 489
               +  +ST P I G   N +  +            +I    E ++  GL          
Sbjct: 430  -RDLSQASTLPWICGGDFNAIMEQHEKQGGPPKPRYLIQAFREAVEDAGL---------- 489

Query: 490  MWFDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNWGTYP-------DCWE 549
               DF             P+ G    + +  R       RFE  W + P       DCWE
Sbjct: 490  --MDF-------------PLGGAWVPKPYAHR------FRFENAWRSDPTLRPLLLDCWE 549

Query: 550  IVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTP 609
            +          +   +++ +LN C   L  W     + F   +     + IQ   +    
Sbjct: 550  V----------SHAASLEEKLNFCSTRLHVWGMEWKDQF--ASEINIMRTIQRRTRGRRD 609

Query: 610  DTEETWFQK-RRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKI--- 669
                  FQ+ ++ L  L    E +WRQ+++  WL  GDRNTK+FH +A++R++ N+I   
Sbjct: 610  HQSRIQFQRAKKRLFELYALKEAFWRQQAKQFWLTQGDRNTKFFHAQASERQRLNRIDRL 669

Query: 670  ---------------------------------RTLQTLNESWISD-------------- 729
                                               +Q+L  S +S               
Sbjct: 670  TDSNGQLRTWSNGLEDTIKEYFEELYHAQGNPGHIIQSLIPSLVSREDNAMLRQPYSYEE 729

Query: 730  ---------------------------------------------------EKEIGE--- 789
                                                               E +IG    
Sbjct: 730  VRQAVFSMHPDKSPGGDGFNPGKMAWKFLTQPDLLVSKVFKAKYFRDCSFLEAQIGSNSS 789

Query: 790  ------FATSYFQH---------------------------LFSSDDPTSDMIEGVTDCI 849
                  FAT    H                             ++++P S  +  V+D  
Sbjct: 790  FVWKSIFATKDLLHTGIRWKVGPGTNISIWEDPWLMDNENPFITTENPGSIALTRVSDLR 849

Query: 850  LPSVTDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQD 909
             P+  D      + +    + I  +P TG+   D +IW     G++TVKSAY  +    D
Sbjct: 850  SPNGWDWPTIDAIFNARDRQCIEAIPLTGSSSNDMLIWAHDKSGIYTVKSAY--KALTWD 909

Query: 910  SQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCM 969
            +    +  +R  ++W  LW      K++   WR  +NILP   NL+ K + +Q  CP C 
Sbjct: 910  ATVLGHGMDR--ALWKKLWAVRVLPKVRNLIWRAANNILPCLNNLVTKRVTVQDVCPLCH 969

Query: 970  KQPETSCHILWGCKVTRVLWN-HFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVF 1029
               ET  HI   C   R +W   FL  Y      F  DW        +L   N  D  + 
Sbjct: 970  TSEETVLHIFVHCPFARQVWGASFLGWYAPAVGSF-HDWLLA-----VLHLFNEYDKGLA 1029

Query: 1030 LIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFISSPVNPPRQGTWYLNTDASWST 1039
              +LW IW+ RN      ++W       + R  +  +   PVN  +     +N DAS   
Sbjct: 1030 FHLLWMIWSARN-----AKVWK-----GVRRSPLEGWERPPVNYLK-----VNVDASSGL 1089

BLAST of Tan0011004 vs. ExPASy TrEMBL
Match: A0A2U1KHJ0 (CCHC-type domain-containing protein OS=Artemisia annua OX=35608 GN=CTI12_AA602590 PE=4 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 6.2e-48
Identity = 245/1118 (21.91%), Postives = 444/1118 (39.71%), Query Frame = 0

Query: 79   LFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLPPAE 138
            +FL   G+  DLRRVLE+ PW F++ +++      + +  + +   V FW+ + N+P   
Sbjct: 1    MFLVQLGHDVDLRRVLEDGPWSFERNLVVLKLIENDEQPTETDMTKVPFWVRLINMPLGR 60

Query: 139  QSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELW 198
            +     +    ++G   +VD  D     GS     +R+  T       KV++        
Sbjct: 61   RDESSVRRVAAKIGDVLEVD--DAYFTKGSK---HIRVKNT-------KVRVN------- 120

Query: 199  CPVTYEKLPDFCYSCGRTGHLDRIC----NEVDWSTSSKKQFGSGLR------------- 258
              + YE+LP+FCY CG  GH ++ C     E++  T     F   LR             
Sbjct: 121  --IQYERLPNFCYWCGLLGHTEKECLTKPFEINGKTFKDWPFHENLRASNSRDSVSLSVA 180

Query: 259  ----HPSSNSGQQN-----------WARSGVHGRGSRDTKK---------------TVED 318
                HP+ N+ QQ            +  S ++   ++D  +               T++ 
Sbjct: 181  SPLTHPAFNNFQQTTNQRLLIQDNIFNESQINESSNQDLDERKCLSGGTSNYCPTSTLKI 240

Query: 319  DIEEDSSMDSSSGINLQTG----------SKQKTSLDTNNLEKNC-RRSESEAGDRATER 378
              +++      SGI L  G          +K +  L+  NL K   RR+  E   + T  
Sbjct: 241  TGQKEPKNSMGSGIELDMGRKIEPHLRGSTKAQQDLENTNLPKLWKRRTREETNTKTTTG 300

Query: 379  QGSEEF---QKLINAINFSVE------------SVSKERSLEVTEGHSRKDRNGKLKG-- 438
              S       + ++ ++++V+            S+ KE S  +      +    ++ G  
Sbjct: 301  TNSISIAAPPRPMSILSWNVQGLGNPWTVQHIRSLVKELSPSIIFLMETRLHGSEVTGFR 360

Query: 439  --------LLEVSRKLVFDIEGEEDVS-----------KKVLNSSSAVEPRGVEKGEKLA 498
                    L+  S +   D   +ED +           ++     +    R +   ++ A
Sbjct: 361  YIFPQYNLLVVDSIRRAGDFVVKEDGNFWRGTGIYGWPRRQEKHRTWALLRSLRTNQEQA 420

Query: 499  QVPTKGLSIL---------HSSTGPKIMGFKDN----DMETRIIGLAEEFLQTNGL---- 558
             V     + +           S   ++  F++     ++E R   +  +   +NG     
Sbjct: 421  WVCFGDFNEIMYAFEKEGQRGSNNTEMSAFREACSFCNLEDR-SAMGVKLTWSNGRRGNE 480

Query: 559  -------NFQIGSSIMDMWFDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEP 618
                    F   S   D++ D    +L   +SDH PI+     R     +    + RFE 
Sbjct: 481  NVRKRLDRFLTNSHWFDLYPDASFENLPRIASDHSPIIC----RLSPMVKKKNRMFRFES 540

Query: 619  NW-------GTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGA 678
             W       G   D W        +H    I      ++ C   L  WN+ R  G ++ +
Sbjct: 541  MWLRDKSIHGVVRDGWAYGLAAGMQHDPCGI------VSECANRLSDWNK-RSFGHVQRS 600

Query: 679  ISKKEKEIQSLEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWF 738
            I  K++ +Q+L+      T       R ++  LL  +E+ W+QRSR++WL+ GD+NT++F
Sbjct: 601  IKSKQRSLQTLQSRFDGSTRAEQQALREQIKELLTREELMWKQRSRIEWLREGDKNTRFF 660

Query: 739  HIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTSDMIEGVTDCIL 798
            H +A+ R+++N I  L+  +  W+ +  E+ +  +SYF  LFSS  P     E V   I 
Sbjct: 661  HTRASNRQRRNSILRLKGPDGRWVEEHNEVCKLVSSYFSDLFSSSSPQG--CESVVRDID 720

Query: 799  PSVTDDTNRMLLSDFTHAE--NILKLPRTG----------------TMGC--------DE 858
              +T++  + L    T +E  ++L     G                 +GC        D 
Sbjct: 721  RRLTENERQALERPVTSSEVRDLLNTEGDGWNHELMYSLFPHNIASKIGCCFISKSRNDI 780

Query: 859  IIWKCHPRGVFTVKSAYQLRLRI-QDSQEASNSTNRRDSIWMALWNANTPSKIKICCWRI 918
            + W  +P G F+ KSAY L L   +D    +  +N     W  +W A  PSK+K+  WR 
Sbjct: 781  LYWHNNPGGRFSCKSAYLLALEADEDMVRTTTISNSLIDFWRVVWKARVPSKVKLFMWRA 840

Query: 919  LHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDF 978
             +N +PT  NL  +GL+    C  C +  E   H+L+ C V + +WN         FYD 
Sbjct: 841  WNNYVPTIDNLKSRGLNPTSSCTHCGQTSENLVHVLFKCSVAKDVWNR---CNFGCFYDT 900

Query: 979  REDWNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVT 1026
            +       + Q +LE     ++  F++ILW +WT RN     +    +  +  I +  ++
Sbjct: 901  QGAITFQDFCQVILE-KFLAEWETFMMILWGLWTRRNRHFHGQLNGREGNVEVIAKSVLS 960

BLAST of Tan0011004 vs. ExPASy TrEMBL
Match: A0A5C7H8M7 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_022549 PE=4 SV=1)

HSP 1 Score: 198.7 bits (504), Expect = 1.2e-46
Identity = 195/762 (25.59%), Postives = 344/762 (45.14%), Query Frame = 0

Query: 9   MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTA 68
           + E L   E +   ++++ +   E  + +S  L+ K L  K ++ + F+  I ++W +T 
Sbjct: 9   LCEMLATLELDRPEVLIKGEAHREGIKEVSHCLMRKVLVGKRINREAFKGVIEQLWSSTD 68

Query: 69  PIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFW 128
            + ++K+ +NLF+  F        V    PW+FD  I++ ++  G      +EF+ V+FW
Sbjct: 69  TVEVEKMDDNLFVFYFPRKEVRGLVWARGPWHFDNHIIVLEKLEGPRDMASMEFKMVDFW 128

Query: 129 IHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKV 188
           + IH +P    + ++ +I   ++G+  ++   D++  WG  L ++VRI+++ PLKR LKV
Sbjct: 129 VQIHQVPMLCMNSRITKILAKQIGIVVEIPA-DSKESWGKFLRVRVRIDISKPLKRCLKV 188

Query: 189 KIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRIC-------NEVDWSTS-----SKKQF 248
           ++    + +   + YE+L + C++ G+ GH+ R C         +D  T+      K   
Sbjct: 189 RLEGFEKAIVALIHYERLLELCFAYGKIGHVMRDCCDEEAKKEALDGKTTRYGVWMKAAA 248

Query: 249 GSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTG------ 308
             GL+   S +G++     G+  RG  +      D + + SS  S SG+ ++ G      
Sbjct: 249 PKGLKRGVS-TGREREVGEGLITRGVENIDVGFADIVSQLSSEVSGSGMGIKEGRIVSKA 308

Query: 309 ---------SKQKT------SLDTNNLEKN--CRR-----SESEAGDRATERQ------- 368
                      QK        +  + LE+N  C        ESE  DR  E+        
Sbjct: 309 DIRNEIIEEMNQKVLAVAYEEMPMDRLEENDGCLGGNQCVEESEWVDRVREKHMVLSPRK 368

Query: 369 --------------------GSEEFQKLINAINFSVESVSKERSL----EVTEGHSRKDR 428
                               G     K I   +   +S ++++SL    +  +    K  
Sbjct: 369 VSFRKWRRAARKWHAPKGMLGVTSPIKRILEAHHIAKSKTRDKSLFPKGKKQQIRVLKKN 428

Query: 429 NGKLKGLLEVSRKLVFDIEGEEDVSKK-------VLNSSSAVEPRGVEKGEK-------- 488
           + K +    V RK++  ++ E  V KK       +L+  +A     + +GE         
Sbjct: 429 SPKKRAEGVVKRKIILPVKEEGAVEKKLKLSPDDLLSKETAEHDVQLHRGEADKFRCLLG 488

Query: 489 ---LAQVPTKGLS--ILHSSTGPKIMGF--------KDNDMETRIIGLAEEFLQT----N 548
              + QV ++G S     SST   I           + +++     G   E L++     
Sbjct: 489 FEGVLQVDSEGKSGGFYGSSTQDNIAASWELLKWLRRVDNLPWVCGGDFNEILRSEEKHR 548

Query: 549 GLNFQI-GSSIMDMWFDFETIHLGTYSSDHRP-ILGVTGERAHFQRQHDRGLLRFEPNWG 608
           G + Q+ G +I     D    HLG  +SDHRP IL   G   + ++  D G  + EP W 
Sbjct: 549 GSDRQVLGMAIFRQAIDDYVKHLGYNNSDHRPIILNTRGSLKNPKKGSDFG-FKCEPFWL 608

Query: 609 TYPDCWEIVANVW-QRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQ 664
           T   C E+V + W      + +  ++ +L++C  +L+ W++ +  G L   IS K +E++
Sbjct: 609 TEEKCAEVVTSAWGDSEVSSLVDYLRRKLSSCARKLEAWSKEKF-GSLGKLISLKSEELE 668

BLAST of Tan0011004 vs. ExPASy TrEMBL
Match: A0A2N9IXK4 (RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS57484 PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 4.4e-46
Identity = 187/751 (24.90%), Postives = 298/751 (39.68%), Query Frame = 0

Query: 41  LICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWY 100
           L  + L+ +PV++D   +    +W+T     +  +G+N+ +  F +  DL RV+ + PW 
Sbjct: 35  LAARFLTRRPVNLDAVARTFRPLWRTDNDFQLQDMGDNIVIIRFSDPADLERVIASGPWT 94

Query: 101 FDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLN 160
           +DK+++LF           + F   + W+ IH LPP       A   G  +G   +    
Sbjct: 95  YDKSLILFQRSEEGVPATSMVFDKADLWVQIHGLPPHLLDSATAHQIGGTIGKVCQESDE 154

Query: 161 DTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLD 220
           + E  WG  + ++V I+V  PL RG K+ IG   +E+     YEKLP+FCY CG   H D
Sbjct: 155 EQEMGWGELVRVRVTIDVHKPLCRGRKIGIGD-NKEILVSFKYEKLPNFCYWCGLITHCD 214

Query: 221 RICN----EVDWSTSSKKQFGSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEED 280
           + C+      D   +SK+Q+G+ LR P               G   R    +V+  +   
Sbjct: 215 KDCSFWLRNRDTLEASKQQYGAWLRAPP--------------GLPHRRKTVSVKGQVFRS 274

Query: 281 SSMDSSSGINLQTGSKQKTSLDTNNLEKNCRRSESEAGDRATERQ---GSEEFQKLI--- 340
               +SS  N  TG          +  K+C + E++ G           S  F  LI   
Sbjct: 275 QRASTSSADNFPTGKP--------DAAKSCPQKETDLGGDHEPHDLMPNSSPFDPLIPQK 334

Query: 341 ---NAINFSVESVSKERSLEVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKKVLN 400
              NAI    +  S +   E+   + RK  + +      ++       E          +
Sbjct: 335 NPPNAIITEADFTS-DLHAEIHVPNQRKKASMRPMNRRWLASSTASPNEFVNVELSWAWD 394

Query: 401 SSSAVEPRGVEKG------EKLAQVPTKGLS----------------------------- 460
            S  V PR  + G      ++ A V  K  S                             
Sbjct: 395 PSKLVVPRREQGGGLAMFWKQEAHVSIKSFSHHHIDAIIDEGEPNSWRFTGFYGAPETHR 454

Query: 461 ---------ILHSST---------------------GPKIMGFKDNDMETRIIGLAEEFL 520
                    +LHS +                     GP     +  D    I     E L
Sbjct: 455 RHESWSLLRLLHSQSSLPWCCMGDFNELLSYEEKQGGPIRSHRQMQDFRDAIDHCGFEDL 514

Query: 521 QTNG-----LNFQIGSSIM----------DMWFDF----ETIHLGTYSSDHRPILGVTGE 580
             NG      N ++GS  +            W       +  HL   SSDH PI      
Sbjct: 515 GFNGPPFTWCNNRLGSHTVWERLDRVLATTSWISLFPLAQVQHLHAVSSDHNPISNQFSP 574

Query: 581 RAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVWQRHPHNT-IGNMQVRLNACLMELKRWN 640
               + + +R + RFE  W ++P C E + + WQ   H T +  +  +L  C   L++W+
Sbjct: 575 SPSSRPRSNR-IFRFEEMWLSHPGCKETITSAWQTQKHGTAMFQVHDKLRTCRNSLRQWS 634

Query: 641 RTRLEGFLKGAISKKEKEIQSLE-QYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVD 693
           R    G +   + KK + ++  E + +           +RE++ LL  +E  WRQRSR  
Sbjct: 635 RDSF-GNVTSELKKKTQMLREAESESMKGKGHAKAHALKREVNTLLNREECMWRQRSRKK 694

BLAST of Tan0011004 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 100.1 bits (248), Expect = 1.1e-20
Identity = 79/348 (22.70%), Postives = 144/348 (41.38%), Query Frame = 0

Query: 706  DEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTNR--RDSIWMALWNANTPSKIKICC 765
            D   W     G +TVKS Y +  +I + + +    +    + I+  +W + T  KI+   
Sbjct: 211  DSYTWDYTSSGDYTVKSGYWVLTQIINKRSSPQEVSEPSLNPIYQKIWKSQTSPKIQHFL 270

Query: 766  WRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLF 825
            W+ L N LP    L  + L  +  C  C    ET  H+L+ C   R+ W     + +++ 
Sbjct: 271  WKCLSNSLPVAGALAYRHLSKESACIRCPSCKETVNHLLFKCTFARLTW-----AISSIP 330

Query: 826  YDFREDWNAGTY--FQWMLEDNN-----RKDFNVFLIILWKIWTWRNLAIRDKQIWNQEE 885
                 +W    Y    W+    N      K   +   +LW++W  RN  +   + +N +E
Sbjct: 331  IPLGGEWADSIYVNLYWVFNLGNGNPQWEKASQLVPWLLWRLWKNRNELVFRGREFNAQE 390

Query: 886  LIRITRCHVTEF----------ISSPVNPPRQGTW--------YLNTDASWSTDRDCGGL 945
            ++R     + E+              VN    G W          NTDA+W+ D +  G+
Sbjct: 391  VLRRAEDDLEEWRIRTEAESCGTKPQVNRSSCGRWRPPPHQWVKCNTDATWNRDNERCGI 450

Query: 946  GWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLVV-ESDSLEAIQQ 1005
            GW+ R   G +   G   +    S+L  EL  +   + ++       V+ ESDS   I+ 
Sbjct: 451  GWVLRNEKGEVKWMGARALPKLKSVLEAELEAMRWAVLSLSRFQYNYVIFESDSQVLIEI 510

Query: 1006 INGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRA 1026
            +N   + +      I +++ + S ++++ F  IPR  N    ++A+ +
Sbjct: 511  LNNDEI-WPSLKPTIQDLQRLLSQFTEVKFVFIPREGNTLAERVARES 552

BLAST of Tan0011004 vs. TAIR 10
Match: AT3G09510.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 87.4 bits (215), Expect = 7.3e-17
Identity = 88/365 (24.11%), Postives = 145/365 (39.73%), Query Frame = 0

Query: 706  DEIIWKCHPRGVFTVKSAYQLRLRIQDSQ-EASNSTNRRDSIWMALWNANTPSKIKICCW 765
            D+IIW  +  G +TV+S Y L      +   A N  +    +   +WN     K+K   W
Sbjct: 117  DKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTRIWNLPIMPKLKHFLW 176

Query: 766  RILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLW---NHFLPSYTN 825
            R L   L T   L  +G+ I P CP C ++ E+  H L+ C    + W   +  L     
Sbjct: 177  RALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSSLIRNQL 236

Query: 826  LFYDFREDWNAGTYFQWMLEDNNRKDFNVFLII--LWKIWTWRNLAIRDKQIWNQEELIR 885
            +  DF E+ +    F   ++D    DF+  L +  +W+IW  RN  + +K   +  + + 
Sbjct: 237  MSNDFEENISNILNF---VQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVL 296

Query: 886  ITRCHVTEFIS---------SPV-----------NPPRQGTWYLNTDASWSTDRDCGGLG 945
              +    ++++         SP            NPP       N DA +   +     G
Sbjct: 297  SAKAETHDWLNATQSHKKTPSPTRQIAENKIEWRNPPATYV-KCNFDAGFDVQKLEATGG 356

Query: 946  WIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLV-VESDSLEAIQQI 1005
            WI R   G  +  G   +    + L  E + ++  L+    +    V +E D    I  I
Sbjct: 357  WIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLI 416

Query: 1006 NGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRASRLQTNES----- 1037
            NG S  ++  +  + +I   A+ ++ I F  I R  N+  H LA+      T  S     
Sbjct: 417  NGISF-HSSLANHLEDISFWANKFASIQFGFIRRKGNKLAHVLAKYGCTYSTFYSGSGSL 476

BLAST of Tan0011004 vs. TAIR 10
Match: AT2G34320.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 58.2 bits (139), Expect = 4.7e-08
Identity = 55/262 (20.99%), Postives = 106/262 (40.46%), Query Frame = 0

Query: 788  CPFCMKQPETSCHILWGCKVTRVLWN-HFLPSYTNLFYDFREDWNAGTYFQWMLE---DN 847
            C  C    ET  H+L+ C   R++W    +P+Y     ++ +   A  Y+   LE     
Sbjct: 12   CVRCPDSRETVNHLLFKCCFARLVWAISPIPAYPE--GEWTDSLYANLYWVLNLEVEIPK 71

Query: 848  NRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIR---------ITRCHVTEFISSP-- 907
              K  N+   +LW++W  RN  +   + ++  E++R          TR  +    S P  
Sbjct: 72   LGKIGNLVPWLLWRLWKSRNELMFKGKEYDAPEVLRRAMEDFEEWSTRRELEGKASGPQV 131

Query: 908  --------VNPPRQGTWY-LNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSI 967
                      PP Q  W   NTDA+W  +    G+GWI R   G ++  G   +    ++
Sbjct: 132  ERNLSVQWKAPPYQ--WVKCNTDATWQLENPRCGIGWILRNESGGVLWMGARALPRTKNV 191

Query: 968  LILELRGIIEGLKAIPNKTIPLVVESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWS 1026
            L  EL  +   +  +       ++     +A+  +  S   +      + +I+ +   + 
Sbjct: 192  LEAELEALRWAVLTMSRFNYKRIIFESDAQALVNLLNSDDFWPTLQPALEDIQQLLHHFE 251

BLAST of Tan0011004 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein )

HSP 1 Score: 56.2 bits (134), Expect = 1.8e-07
Identity = 51/191 (26.70%), Postives = 92/191 (48.17%), Query Frame = 0

Query: 547 EGFLKGAISKKEKE----IQSLEQYLTPDTEETWFQ----KRRELDNLLQEDEIYWRQRS 606
           +GF  G I  K KE    ++S++  L  +  ++ F+     R++ +      E ++RQ+S
Sbjct: 382 QGF--GNIQHKTKEALDSLESIQSQLLTNPSDSLFRVEHVARKKWNFFAAALESFYRQKS 441

Query: 607 RVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSD 666
           R+ WL+ GD NT++FH      + +N I+ L+  ++  + +  ++ E   +Y+ HL  SD
Sbjct: 442 RIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHLLGSD 501

Query: 667 DP--TSDMIEGVTDCILPSVTDDT--NRM--LLSDFTHAENILKLPRTGTMGCDEIIWKC 724
               T D ++ + D I P   +DT  +R+  L SD      +  +PR    G D    + 
Sbjct: 502 SDILTPDSVQRIKD-IHPFRCNDTLASRLSALPSDKEITAAVFAMPRNKAPGPDSFTAEF 561

BLAST of Tan0011004 vs. TAIR 10
Match: AT3G26855.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 49.7 bits (117), Expect = 1.7e-05
Identity = 20/56 (35.71%), Postives = 32/56 (57.14%), Query Frame = 0

Query: 750 LWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGC 806
           +W+     KIK+  W+ L+N LP    L+ + + I+P+C  C +  ET  HIL+ C
Sbjct: 9   IWSLKISPKIKLLIWKALNNALPVGAQLLSRNISIEPFCTRC-RDFETITHILFNC 63

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAE8800683.14.0e-5723.18retrotransposon unclassified [Hordeum vulgare][more]
VFQ81500.13.4e-4821.92unnamed protein product [Cuscuta campestris][more]
PWA36168.11.3e-4721.91hypothetical protein CTI12_AA602590 [Artemisia annua][more]
TXG53380.12.4e-4625.59hypothetical protein EZV62_022549 [Acer yangbiense][more]
GAU41525.11.2e-4527.04hypothetical protein TSUD_140560 [Trifolium subterraneum][more]

Pages

Match NameE-valueIdentityDescription
A0A2N9FNT01.7e-5026.91RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16695 ... [more]
A0A2N9FNT01.3e-2125.00RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16695 ... [more]
A0A2U1KHJ06.2e-4821.91CCHC-type domain-containing protein OS=Artemisia annua OX=35608 GN=CTI12_AA60259... [more]
A0A5C7H8M71.2e-4625.59Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_022549 PE=4 SV=1[more]
A0A2N9IXK44.4e-4624.90RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS57484 ... [more]

Pages

Match NameE-valueIdentityDescription
AT4G29090.11.1e-2022.70Ribonuclease H-like superfamily protein [more]
AT3G09510.17.3e-1724.11Ribonuclease H-like superfamily protein [more]
AT2G34320.14.7e-0820.99Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
AT1G43760.11.8e-0726.70DNAse I-like superfamily protein [more]
AT3G26855.11.7e-0535.71RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 615..635
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 301..323
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 259..273
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 234..324
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 234..256
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 279..300
NoneNo IPR availablePANTHERPTHR31286:SF84SUBFAMILY NOT NAMEDcoord: 10..274
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 901..1032
e-value: 7.6E-16
score: 60.3
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 718..812
e-value: 8.2E-16
score: 58.5
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 904..1025
e-value: 8.5E-23
score: 80.5
IPR025558Domain of unknown function DUF4283PFAMPF14111DUF4283coord: 30..168
e-value: 6.2E-17
score: 61.5
IPR025836Zinc knuckle CX2CX4HX4CPFAMPF14392zf-CCHC_4coord: 177..224
e-value: 4.7E-11
score: 42.2
IPR040256Uncharacterized protein At4g02000-likePANTHERPTHR31286GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.8-LIKEcoord: 10..274
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 210..225
score: 9.438442
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 903..1022
e-value: 2.21576E-18
score: 80.0508
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 900..1030

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0011004.1Tan0011004.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity
molecular_function GO:0008270 zinc ion binding