Tan0019688 (gene) Snake gourd v1

Overview
NameTan0019688
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
LocationLG07: 12279122 .. 12281442 (-)
RNA-Seq ExpressionTan0019688
SyntenyTan0019688
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCATTGGGTGGATCGTCAGAGATTGGCAAGGTGAGCTCGTTTTGGGTGGGATGAAGTTTGTTGATCGGAGTTGGCCAATCAAGATACTGGAAGCTAAAGTCGTGGTTGAAGGGCAGTCTCAAGTTTTTATCTGTGGGAGCCAATTCCTCCCCTCATTGTGGAATCCGACTCCCTTGAGGTTATTAGCTTGCTCAACCGCGACAGTGTGGACTTTTCTGAGGTTAGTCTCTTCGTTGATGAAGTTATTGGGTTAGCTCAAGGATTAACTTACTGTATTTTTGTAAAGTTCCCAGAGGGGATAATAAAGATCTTGTGGCTTTGGCATACTCTCCAGGTGATTCTTTGATCTAGAAAGAGTATTTTCCGGTAGACATTTATCCTCACCTTGATGAGAGGAGTTGCTTGGGGTTTGTTTTGTTTCTTGGGTGTTTGTTGTTGTGCTTTAGTTTTAAAAAAAATTGAAAAGGTATATAAATTATAAAAAAAAATTGAAAAACATATATCATATTGTCGAGAATGTTAAGTAATTTGAATGTATAAATGATATAACATTCTAAAAAAAATAAGTAGTACATGTTGATGTTTATATAATATATTTCTATGTATATATTAATTATGAGTACAATATATACTGTACTGCATAATATATCATGATCATCGTATATTTTTAAGAATGTTCTATATAAATAATGACATTTGTAAATAATATATTTGCATAATGTATATTTATTATTCGGTAGTTTAACATATATATATCATGTATAATTTATCATAATCATCGTATATTTGGGTTTTCGTGATCTTTCAATTTTCAATCAGGCAATGCCAGCGAAACAATGCTGGCGTTTACTTCAGAATCCGAATTCTCTGCTTTACAAAGTTTATCGTGGACGTTATTTCAAGACAGGGAACTTTCTAAAGGCAACCTTAGGGACAAATCCATCATATACCTGGCGTAGTATTTTGTGGGGGCGCAACCTTTTTAAGCATGGCTATCGTTGGAAGGTTGGCAGTGGACACCAGATCAACATCAGGGAGGATCCCTGGCTGTTGGCAGAAGGACGGGATACACCCCTCTGGGTGGATCCAAACCTGACAGGAGTGAATGTGTGCAACTTGCTACGGAACGATGGTTTTTGGGACGAAGATAAGATACGAGAGCATTTTAACCAGGATGATGCAGATCACATCCTCTCCATCCTGCGAACTGGAGATCTGATTACTGACGAAATTATTTGGAAGTGTACCAAAAATGGGGTCTTTTCGGTCAAAAGCGCCTACCATTTAGGTATGAGCATTAGAGCATATAATGAAGCTTCAAGTTCAAACAACTCCTTAACAAAACAAATGTGGAAGTCAATATGGAGTACGCCTATTCCAAACAAGATCAAGATTTATTGTTGGAAGATCATTCACGACATTCTCCCGACCCGAGCTAACCTGTTACGAAAGGGCATCATCCTGAATCCAATCTGTCCATTCTGTCTAAAAAAGATGGAGACAAGGAATCATCTATTTTGGGGTTGCAAGGTATCTAGTAAATTTTGAGATCTTTTTTACCTGCTACCTCTATTTTGTTTTATGATTGCAGGGATGCTTGGAGCGCAGTAGATTATTTTTGTTGGTTGTTAGATAGGCATAGCCGTATGGACCAAGCGGTGTTCATGATAATTCTTTGGAAGATTTAGTCATGTCGGAATGTTTTGCTACAGAAACAAGGTAGTATTAACTGGAAAAGGATGTTCCTTAACACACAACTCCAAATCCAAGAGTTCACTCAGTCTGTGGCAAACAGGATCCAAGTTCCCAATCATTTGACTATAGCGACGGAAACGTGGCAGCTGCCAAAAGAGGGTTGGTGGAAATTAAATATTGATGCCTCTTGGTGTACTACCACTAATCATGGGGGTGTAGGTTGGATCTTACGGGATTGGACATGGAGGATAGTGAGGGCAGGGCACACCCATATTACAGACAGATGGCCAATCACCATTTTGGAATTTTATGGTATTCTCAAGGGTTTGGATTTCATTCATGAGTACAACATACCCCTCCTGGTGGAATCTGACTCTTGGGAGGCTATACGACTCATCAATGCTGTTGACAATGATCGAATAGAGGCGAGAGACTTTGCAAGGAAGATCAGACAACGAACAACTTCTTGGACCAACATTTCTTTTCATCACAACAGGCGAGAGACAAATATGGTCGCTCACAAACTGGCGCAACGAGGGGAACACCTTCTTGGAGAAGAACTTTGGCAAGATGGGCCCACTATAGGCGTTTTGGAATTTCTCTTAATTTGTTTTCATGTTTAG

mRNA sequence

ATGGCATTGGGTGGATCGTCAGAGATTGGCAAGCCAATTCCTCCCCTCATTGTGGAATCCGACTCCCTTGAGGTTATTAGCTTGCTCAACCGCGACAGTGTGGACTTTTCTGAGGTTAGTCTCTTCGTTGATGAAGTTATTGGGTTAGCTCAAGGATTAACTTACTGTATTTTTGTAAAGTTCCCAGAGGGGATAATAAAGATCTTGTGGCTTTGGCATACTCTCCAGGCAATGCCAGCGAAACAATGCTGGCGTTTACTTCAGAATCCGAATTCTCTGCTTTACAAAGTTTATCGTGGACGTTATTTCAAGACAGGGAACTTTCTAAAGGCAACCTTAGGGACAAATCCATCATATACCTGGCGTAGTATTTTGTGGGGGCGCAACCTTTTTAAGCATGGCTATCGTTGGAAGGTTGGCAGTGGACACCAGATCAACATCAGGGAGGATCCCTGGCTGTTGGCAGAAGGACGGGATACACCCCTCTGGGTGGATCCAAACCTGACAGGAGTGAATGTGTGCAACTTGCTACGGAACGATGGTTTTTGGGACGAAGATAAGATACGAGAGCATTTTAACCAGGATGATGCAGATCACATCCTCTCCATCCTGCGAACTGGAGATCTGATTACTGACGAAATTATTTGGAAGTGTACCAAAAATGGGGTCTTTTCGGTCAAAAGCGCCTACCATTTAGGTATGAGCATTAGAGCATATAATGAAGCTTCAAGTTCAAACAACTCCTTAACAAAACAAATGTGGAAGTCAATATGGAGTACGCCTATTCCAAACAAGATCAAGATTTATTGTTGGAAGATCATTCACGACATTCTCCCGACCCGAGCTAACCTGTTACGAAAGGGCATCATCCTGAATCCAATCTGTCCATTCTGTCTAAAAAAGATGGAGACAAGGAATCATCTATTTTGGGGTTGCAAGTCATGTCGGAATGTTTTGCTACAGAAACAAGGTAGTATTAACTGGAAAAGGATGTTCCTTAACACACAACTCCAAATCCAAGAGTTCACTCAGTCTGTGGCAAACAGGATCCAAGTTCCCAATCATTTGACTATAGCGACGGAAACGTGGCAGCTGCCAAAAGAGGGTTGGTGGAAATTAAATATTGATGCCTCTTGGTGTACTACCACTAATCATGGGGGTGTAGGTTGGATCTTACGGGATTGGACATGGAGGATAGTGAGGGCAGGGCACACCCATATTACAGACAGATGGCCAATCACCATTTTGGAATTTTATGGTATTCTCAAGGGTTTGGATTTCATTCATGAGTACAACATACCCCTCCTGGTGGAATCTGACTCTTGGGAGGCTATACGACTCATCAATGCTGTTGACAATGATCGAATAGAGGCGAGAGACTTTGCAAGGAAGATCAGACAACGAACAACTTCTTGGACCAACATTTCTTTTCATCACAACAGGCGAGAGACAAATATGGTCGCTCACAAACTGGCGCAACGAGGGGAACACCTTCTTGGAGAAGAACTTTGGCAAGATGGGCCCACTATAGGCGTTTTGGAATTTCTCTTAATTTGTTTTCATGTTTAG

Coding sequence (CDS)

ATGGCATTGGGTGGATCGTCAGAGATTGGCAAGCCAATTCCTCCCCTCATTGTGGAATCCGACTCCCTTGAGGTTATTAGCTTGCTCAACCGCGACAGTGTGGACTTTTCTGAGGTTAGTCTCTTCGTTGATGAAGTTATTGGGTTAGCTCAAGGATTAACTTACTGTATTTTTGTAAAGTTCCCAGAGGGGATAATAAAGATCTTGTGGCTTTGGCATACTCTCCAGGCAATGCCAGCGAAACAATGCTGGCGTTTACTTCAGAATCCGAATTCTCTGCTTTACAAAGTTTATCGTGGACGTTATTTCAAGACAGGGAACTTTCTAAAGGCAACCTTAGGGACAAATCCATCATATACCTGGCGTAGTATTTTGTGGGGGCGCAACCTTTTTAAGCATGGCTATCGTTGGAAGGTTGGCAGTGGACACCAGATCAACATCAGGGAGGATCCCTGGCTGTTGGCAGAAGGACGGGATACACCCCTCTGGGTGGATCCAAACCTGACAGGAGTGAATGTGTGCAACTTGCTACGGAACGATGGTTTTTGGGACGAAGATAAGATACGAGAGCATTTTAACCAGGATGATGCAGATCACATCCTCTCCATCCTGCGAACTGGAGATCTGATTACTGACGAAATTATTTGGAAGTGTACCAAAAATGGGGTCTTTTCGGTCAAAAGCGCCTACCATTTAGGTATGAGCATTAGAGCATATAATGAAGCTTCAAGTTCAAACAACTCCTTAACAAAACAAATGTGGAAGTCAATATGGAGTACGCCTATTCCAAACAAGATCAAGATTTATTGTTGGAAGATCATTCACGACATTCTCCCGACCCGAGCTAACCTGTTACGAAAGGGCATCATCCTGAATCCAATCTGTCCATTCTGTCTAAAAAAGATGGAGACAAGGAATCATCTATTTTGGGGTTGCAAGTCATGTCGGAATGTTTTGCTACAGAAACAAGGTAGTATTAACTGGAAAAGGATGTTCCTTAACACACAACTCCAAATCCAAGAGTTCACTCAGTCTGTGGCAAACAGGATCCAAGTTCCCAATCATTTGACTATAGCGACGGAAACGTGGCAGCTGCCAAAAGAGGGTTGGTGGAAATTAAATATTGATGCCTCTTGGTGTACTACCACTAATCATGGGGGTGTAGGTTGGATCTTACGGGATTGGACATGGAGGATAGTGAGGGCAGGGCACACCCATATTACAGACAGATGGCCAATCACCATTTTGGAATTTTATGGTATTCTCAAGGGTTTGGATTTCATTCATGAGTACAACATACCCCTCCTGGTGGAATCTGACTCTTGGGAGGCTATACGACTCATCAATGCTGTTGACAATGATCGAATAGAGGCGAGAGACTTTGCAAGGAAGATCAGACAACGAACAACTTCTTGGACCAACATTTCTTTTCATCACAACAGGCGAGAGACAAATATGGTCGCTCACAAACTGGCGCAACGAGGGGAACACCTTCTTGGAGAAGAACTTTGGCAAGATGGGCCCACTATAGGCGTTTTGGAATTTCTCTTAATTTGTTTTCATGTTTAG

Protein sequence

MALGGSSEIGKPIPPLIVESDSLEVISLLNRDSVDFSEVSLFVDEVIGLAQGLTYCIFVKFPEGIIKILWLWHTLQAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGYRWKVGSGHQINIREDPWLLAEGRDTPLWVDPNLTGVNVCNLLRNDGFWDEDKIREHFNQDDADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNEASSSNNSLTKQMWKSIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWGCKSCRNVLLQKQGSINWKRMFLNTQLQIQEFTQSVANRIQVPNHLTIATETWQLPKEGWWKLNIDASWCTTTNHGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGILKGLDFIHEYNIPLLVESDSWEAIRLINAVDNDRIEARDFARKIRQRTTSWTNISFHHNRRETNMVAHKLAQRGEHLLGEELWQDGPTIGVLEFLLICFHV
Homology
BLAST of Tan0019688 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 77.4 bits (189), Expect = 5.3e-13
Identity = 64/250 (25.60%), Postives = 110/250 (44.00%), Query Frame = 0

Query: 76  QAMPAKQCWRLLQNPNSL----LYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWG-RNL 135
           +A+ +K  WRLLQ  NSL    L K Y     +   +L      + S TWRSI  G R++
Sbjct: 111 RALISKVGWRLLQEKNSLWTLVLQKKYHVGEIRDSRWLIPK--GSWSSTWRSIAIGLRDV 170

Query: 136 FKHGYRWKVGSGHQINIREDPWLLAEGRDTPLWVDPNLTGVNVCNLLRNDGF-----WDE 195
             HG  W  G G QI    D W+  +     L +D      +   ++  D +     WD 
Sbjct: 171 VSHGVGWIPGDGQQIRFWTDRWVSGK---PLLELDNGERPTDCDTVVAKDLWIPGRGWDF 230

Query: 196 DKIREHFNQDDADHILSILRTGDLIT---DEIIWKCTKNGVFSVKSAYHLGMSIRAYNEA 255
            KI  +   +    + +++   DL+T   D + WK +++G FSV+SAY +       +E 
Sbjct: 231 AKIDPYTTNNTRLELRAVVL--DLVTGARDRLSWKFSQDGQFSVRSAYEM----LTVDEV 290

Query: 256 SSSNNSLTKQMWKSIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKM 313
              N +     +  +W   +P ++K + W + +  + T     R+ +  + +C  C   +
Sbjct: 291 PRPNMA---SFFNCLWKVRVPERVKTFLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGV 346

BLAST of Tan0019688 vs. ExPASy Swiss-Prot
Match: P93295 (Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 GN=AtMg00310 PE=4 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 4.5e-12
Identity = 34/87 (39.08%), Postives = 52/87 (59.77%), Query Frame = 0

Query: 76  QAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGY 135
           QA+ AKQ +R++  P++LL ++ R RYF   + ++ ++GT PSY WRSI+ GR L   G 
Sbjct: 67  QALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMMECSVGTRPSYAWRSIIHGRELLSRGL 126

Query: 136 RWKVGSGHQINIREDPWLLAEGRDTPL 163
              +G G    +  D W++ E   TPL
Sbjct: 127 LRTIGDGIHTKVWLDRWIMDE---TPL 150

BLAST of Tan0019688 vs. NCBI nr
Match: XP_024950112.1 (uncharacterized protein LOC112496847 [Citrus sinensis])

HSP 1 Score: 229.2 bits (583), Expect = 8.3e-56
Identity = 142/495 (28.69%), Postives = 234/495 (47.27%), Query Frame = 0

Query: 76   QAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGY 135
            QA+ AKQ WRLLQ PNSL+ +V + RYF+  +FL A  G N SY WRSI+WGR + K G 
Sbjct: 898  QALVAKQAWRLLQYPNSLVSRVLQARYFRNSSFLCAKAGANASYIWRSIMWGRQVIKKGM 957

Query: 136  RWKVGSGHQINIREDPWLLAEGRDTPLWVDPNLTGVNVCNLLRNDGFWDEDKIREHFNQD 195
            RW++G+G +I I  D WL       P++         V +L++ D  WDE K+R+HF   
Sbjct: 958  RWRIGNGKKIAIFSDNWLPRPETFRPIFPLSLPVSSVVADLIKADNQWDEIKLRQHFLDV 1017

Query: 196  DADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNEASSSNNSLTKQMWK 255
            D   IL I    +   DE++W   K G +SVKS Y L  ++R+    S+S    + + W 
Sbjct: 1018 DTAEILKIPLPAEKAEDEVLWHYDKRGNYSVKSGYQL--ALRSKFPDSTSCTEASHKYWS 1077

Query: 256  SIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWGCKSC 315
            ++W+  +P K+KI+ W+  +++LP+  NL ++ ++  P C  C   +ET +H    CK+ 
Sbjct: 1078 ALWTLELPEKLKIFMWRASNNLLPSAENLWKRKVVEEPTCKRCKLSVETISHALLECKAA 1137

Query: 316  RNVLLQKQGS-----INWKRMFLNTQLQIQEFTQS------------------------- 375
            R + LQ   S      N + +F   Q   +E  +S                         
Sbjct: 1138 RKIWLQSPFSAPRLEANSQDIFSTLQNMAKELRKSDLELMVALCWSAWYARNKCIFDGRE 1197

Query: 376  ---------------VANRIQVP--NHLTIA----TETWQLPKEGWWKLNIDASWCTTTN 435
                              R++ P  +H++I+     + W  P +  +K+N+DA++ +   
Sbjct: 1198 LNPIISAAKAESVLTAFQRVRKPQQSHISISIKEKQQEWLPPPQNVFKVNVDAAFNSKNL 1257

Query: 436  HGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGILKGLDFIHEYNI-PLLVESDSWE 495
              GVG ++RD   +IV AG      +   ++ E   +L GL      ++  L++ESD  E
Sbjct: 1258 SAGVGAVIRDSNGKIVAAGVNQNLLKGSASLAEAEAVLWGLQLARNADVSSLIIESDCLE 1317

Query: 496  AIRLINAVDNDRIEARDFARKIRQRTTSWTNISFHHNRRETNMVAHKLAQRGEHLLGEEL 519
             ++L+N     R E       I+ +   +  +  +H  R  N  AH LA+     +   +
Sbjct: 1318 VVQLVNNTKGSRSEIFWTILAIQNQMKIFQKVVVNHIPRHCNACAHYLAKIALGKISPCM 1377

BLAST of Tan0019688 vs. NCBI nr
Match: XP_006491472.1 (uncharacterized protein LOC102626455 [Citrus sinensis])

HSP 1 Score: 224.2 bits (570), Expect = 2.7e-54
Identity = 140/481 (29.11%), Postives = 226/481 (46.99%), Query Frame = 0

Query: 76   QAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGY 135
            QA+ AKQ WRL++ PNSL+ +V + RY+K   F  A +G+NPS+ WRSILWG  + K G 
Sbjct: 957  QALVAKQGWRLVRYPNSLMARVMKARYYKNSTFWNAKVGSNPSFIWRSILWGSQVIKKGV 1016

Query: 136  RWKVGSGHQINIREDPWLLAEGRDTPLWVDPNLTGVNVCNLLRNDGFWDEDKIREHFNQD 195
            RW++G G ++ + +D W+       P+          V +L+ ++  W  D++ +HF ++
Sbjct: 1017 RWRIGDGKKVLVYKDKWIPRPATFQPISPKTLPHETVVADLIDSENKWRVDRLEQHFMKE 1076

Query: 196  DADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNEASSSNNSLTKQMWK 255
            D + IL IL       DE++W   K G +SVKS Y L ++    NE  SSN+S   ++WK
Sbjct: 1077 DIEAILKILLPSGKEEDEVLWHFDKKGEYSVKSGYQLALNQNFPNEPESSNSS--SRLWK 1136

Query: 256  SIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWGCKSC 315
              W   +P K+KI+ W+ + +ILPT  NL ++  +  PIC  C  ++ET +H+   CK+ 
Sbjct: 1137 IPWMLDLPEKVKIFMWRALKNILPTAENLWKRRSLQEPICQRCKLQVETVSHVLIECKAA 1196

Query: 316  RNV------LLQKQGSIN----------WKR-MFLNTQLQI------------------- 375
            R +      ++Q     N          W R      +L I                   
Sbjct: 1197 RKIWDLAPLIVQPSKDHNQDFFSAIQEMWSRSSTAEAELMIVYCWVIWSARNKFIFEGKK 1256

Query: 376  --QEFTQSVAN-------RIQVPNHL------TIATETWQLPKEGWWKLNIDASWCTTTN 435
                F  + A+       R+  P ++       I  + W+ P +   KLN+DA+  T   
Sbjct: 1257 SDSRFLAAKADSVLKAYQRVSKPGNVHGAKDRGIDQQKWKPPSQNVLKLNVDAAVSTKDQ 1316

Query: 436  HGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGILKGLDFIHEY-NIPLLVESDSWE 495
              G+G I+RD   +I+  G      R  +++ E   I  GL   ++  +  L+VESD  E
Sbjct: 1317 KVGLGAIVRDAEGKILAVGIKQAQFRERVSLAEAEAIHWGLQVANQISSSSLIVESDCKE 1376

Query: 496  AIRLINAVDNDRIEARDFARKIRQRTTSWTNISFHHNRRETNMVAHKLAQRGEHLLGEEL 505
             + L+N     R E       +R+ +  +  + F    R  N  AH LA+        ++
Sbjct: 1377 VVELLNNTKGSRTEIHWILSDVRRESKEFKQVQFSFIPRTCNTYAHALAKFALRNSSTDV 1435

BLAST of Tan0019688 vs. NCBI nr
Match: XP_022150918.1 (uncharacterized protein LOC111018954 [Momordica charantia])

HSP 1 Score: 217.2 bits (552), Expect = 3.3e-52
Identity = 145/483 (30.02%), Postives = 225/483 (46.58%), Query Frame = 0

Query: 76   QAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGY 135
            +A+ AKQCWR+L +PNS+L +V +GRYFK  +F++A +  NPSY WRSILWGR+L K G 
Sbjct: 656  KALLAKQCWRILNHPNSMLSRVLKGRYFKDCSFMEAKISGNPSYIWRSILWGRDLLKKGL 715

Query: 136  RWKVGSGHQINIREDPWLLAEGRDTPLWVDPNLTGVNVCNLL--RNDGFWDEDKIREHFN 195
            RW++G+G  + I  D W +       +   P L  V+  + L    +G W  D +R+ F 
Sbjct: 716  RWRIGNGDSVFIYGDNW-VPNQPTLKILSSPRLPLVSRVSSLVDHEEGGWQGDVVRDEFT 775

Query: 196  QDDADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNEA-SSSNNSLTKQ 255
             D+A  ILSI        D +IW   K GV+SV+S Y + +      +A SSS++   + 
Sbjct: 776  PDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYKVALLNNPCVQAPSSSSSEEVRC 835

Query: 256  MWKSIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWGC 315
             W   W   IPNKIK++ W++  D LPT  NL ++G+ +   C FC +  E   HLFW C
Sbjct: 836  WWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNCCYFCGRNGEDSIHLFWIC 895

Query: 316  KSCRN--------------VLLQKQGSIN--------------WK----RMFLNTQLQI- 375
            K                  +L +   S++              W     R F ++   + 
Sbjct: 896  KFAEALWINSKFGKLSPFLILRESHESLSKADFEELCVVIWGLWNQRNARAFNDSTKTVF 955

Query: 376  --------------QEFTQSVANRI--QVPNHLTIATETWQLPKEGWWKLNIDASWCTTT 435
                           EF ++ +N I  +V N   I    WQ P EG +K+N DAS+  + 
Sbjct: 956  KIGMELVEWANKYAMEFREAKSNPITGRVTNTAEI---LWQPPDEGIYKINTDASFLASD 1015

Query: 436  NHGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGILKGLDFIHEYNIPLLVESDSWE 495
             H G+G I+ +   +++ A   ++ +   + + E    ++GL    E  +   +E     
Sbjct: 1016 QHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEGLQLASEIGMHPALE----- 1075

Query: 496  AIRLINAVDNDRIEARDFARKIRQRTTSWTNISFHHNRRETNMVAHKLAQRGEHLLGEEL 507
                      D  E  +   K +   T   + SF+  +RE N  AH LA+R   L    +
Sbjct: 1076 ----------DLSETGEIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLHEFSI 1119

BLAST of Tan0019688 vs. NCBI nr
Match: XP_023917061.1 (uncharacterized protein LOC112028598 [Quercus suber])

HSP 1 Score: 214.2 bits (544), Expect = 2.8e-51
Identity = 133/459 (28.98%), Postives = 222/459 (48.37%), Query Frame = 0

Query: 77   AMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGYR 136
            A  AKQ WR+L N NSL  +VY+ +YF   NF +AT+G +PSY WRS++  +++ + G R
Sbjct: 806  AFLAKQGWRILTNSNSLFSRVYKAKYFPHCNFAEATMGRSPSYAWRSLMAAQSIVQRGMR 865

Query: 137  WKVGSGHQINIREDPWLLAEGRDTPLWVD-PNLTGVNVCNLL-RNDGFWDEDKIREHFNQ 196
            W+V +G++I +  D W+        +  + PN     VC L+ +  G W+ DK+   F  
Sbjct: 866  WQVRNGNKIRVWHDKWIPRPCTYKVISKEKPNSANALVCELINKGTGEWNIDKLNSWFLP 925

Query: 197  DDADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNE-ASSSNNSLTKQM 256
            DD D I+ IL +     D ++W   ++G F++KSAY L +  +  N  A  SN S  +++
Sbjct: 926  DDKDTIMGILLSSSNANDRLVWAKNRSGKFTIKSAYALALEEKTQNTMAGCSNESARRKI 985

Query: 257  WKSIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWGCK 316
            WK+IW   IP KIK + W+   DIL T+ANL ++ I  N +C  C    ET  HL WGC 
Sbjct: 986  WKTIWQMRIPQKIKHFAWRAGRDILATKANLAKQRITPNGLCDLCGNYEETVYHLLWGCD 1045

Query: 317  SCRNV----------------LLQKQGSINWK-----------------RMFLNTQLQ-I 376
              R V                 ++K  S+ W                   M L +    +
Sbjct: 1046 HAREVWKNSKFALPFEHSRPGQMEKFISVCWSIWKDINVLRTSGNGKAGSMILRSATHLV 1105

Query: 377  QEFTQSVANRIQVPNHLTIATETWQLPKEGWWKLNIDASWCTTTNHGGVGWILRDWTWRI 436
            +EF  +   + +    + +   +WQ P++G +K+N+D     ++   G G I+RD    +
Sbjct: 1106 EEFWLANEEKTEY-QAVLVHLASWQPPRQGCYKVNMDGMVFRSSKQAGAGVIIRDGADEV 1165

Query: 437  VRAGHTHITDRWPITI----LEFYGILKGLDFIHEYNI-PLLVESDSWEAIRLINAVDND 494
            + A    ++ +W   +     E   +  G++F+ E  I     E DS      ++ +++ 
Sbjct: 1166 IAA----LSKKWKCPLGAIEAEAKALEAGINFVWEVGIREAEFEMDSLMICNALHGLESP 1225

BLAST of Tan0019688 vs. NCBI nr
Match: XP_024037590.1 (uncharacterized protein LOC112097210 [Citrus clementina])

HSP 1 Score: 213.8 bits (543), Expect = 3.6e-51
Identity = 136/488 (27.87%), Postives = 212/488 (43.44%), Query Frame = 0

Query: 76   QAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGY 135
            QA+ AKQ WR++Q P+SL+ +V + RYFK   F+ A LG+ PS+ WRSI+WGR +   G 
Sbjct: 662  QALVAKQGWRIMQFPSSLVARVLKARYFKHTGFMNAGLGSKPSFVWRSIVWGRQVLHKGA 721

Query: 136  RWKVGSGHQINIREDPWLLAEGRDTPLWVDPNLTGVNVCNLLRNDGFWDEDKIREHFNQD 195
            RW++G+G  + +  + W+       P+      T   V  L+     W ED I +HF  +
Sbjct: 722  RWRIGNGQNVLVYGNNWIPRPTTFKPISAPSMGTDTTVAELIDEKQQWREDLILQHFRPE 781

Query: 196  DADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNEASSSNNSLTKQMWK 255
            DA+ I+ I        D++IW   K G +SVKS Y + M I+   + S SN+   + +W+
Sbjct: 782  DAEAIMQIPLPKRPKEDQLIWHYDKKGYYSVKSGYQVAMRIKFPEDPSCSNHD--QNLWR 841

Query: 256  SIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNH-------- 315
             IW   IP K+KI+ W+  HD+LPT  NL +K ++  P+C  C   +ET +H        
Sbjct: 842  FIWKLAIPEKVKIFLWRAAHDLLPTAENLWKKKVLQEPMCQSCHCHVETVSHALVECNRA 901

Query: 316  ------------------------------------------LFWGCKSCRNVLLQKQGS 375
                                                      L W     RN  L +   
Sbjct: 902  RKIWRYSNLAEELRGVYRCDIVWMLQFWPRQHAKVEGAEVAALLWAIWKARNKWLFEGKK 961

Query: 376  INWKRMFLNTQLQIQEFTQSVANRIQVPNHL------TIATETWQLPKEGWWKLNIDASW 435
             N  R+  N +  ++ F      +I+ P  +          + W  P  GW K+N+DA+ 
Sbjct: 962  ENPLRVVANAEAIVESF-----KKIRQPEMVYKTKGNAERQKQWSPPPNGWQKVNVDAAV 1021

Query: 436  CTTTNHGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGILKGLDFIHEYNIPL-LVE 495
                   G+G ++RD       A    +     + + E   +  GL    + +I   + E
Sbjct: 1022 DVENQMAGLGVVVRDSDGNCRAAAIKSLRLPGSVAMAEATAMEWGLKVAEKAHITFGIFE 1081

Query: 496  SDSWEAIRLINAVDNDRIEARDFARKIRQRTTSWTNISFHHNRRETNMVAHKLAQRGEHL 507
            SDS E I LIN   +   E       I++   ++ N    H+ R+ N  AH LA+     
Sbjct: 1082 SDSLEVIDLINKKSSSLTEIGWLISDIQENLQNFQNFKAQHSPRDCNYAAHSLAKLALQK 1141

BLAST of Tan0019688 vs. ExPASy TrEMBL
Match: A0A803PV88 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 4.0e-56
Identity = 152/488 (31.15%), Postives = 220/488 (45.08%), Query Frame = 0

Query: 76   QAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGY 135
            QAM AKQ WR+L NP SLL  V + +YF   +FLKA LG +PSYTW S+LWGR+L KHG 
Sbjct: 670  QAMLAKQAWRVLSNPTSLLATVLKAKYFHHNDFLKAKLGHSPSYTWSSLLWGRDLLKHGL 729

Query: 136  RWKVGSGHQINIREDPWLLAEGRDTPLWVDPNLTGVNVCN-LLRNDGFWDEDKIREHFNQ 195
             WK+G+G  +   EDPW+      +P     +L  V   +  + + GFW+ DK+  +F+ 
Sbjct: 730  VWKIGNGCSVRTFEDPWI--PDMKSPCLGSNDLPPVETVDFFIDHQGFWNRDKLNYYFDN 789

Query: 196  DDADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNEASSSNNSLTKQMW 255
                 IL +   G    D ++WK   +GVF+VKSAYHL  S       SSSN S     W
Sbjct: 790  LSVSSILRVPIGGLHRDDTLVWKHDPSGVFTVKSAYHLANSTSL--PPSSSNPSFFMSWW 849

Query: 256  KSIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWGCKS 315
            K+ W+  +P K+K + W++ H ILP   NL R+  I  P C FC   +ET  H    C  
Sbjct: 850  KTFWNLNLPPKVKNFSWRVYHHILPVALNLFRRKTIPQPHCSFCKNPVETVTHALLDCSR 909

Query: 316  CRNV---------------------------LLQKQG-----SINW-------KRMFLNT 375
               +                            L K       SI W       K++F N+
Sbjct: 910  AAKIWKASPFRTFYLSNRYVDVKEFMLNGFDQLNKDSLSLLLSIMWAIWNSRNKKLFANS 969

Query: 376  QLQ-----------IQEFTQSVAN-------RIQVPNHLTIATETWQLPKEGWWKLNIDA 435
             +            I ++ ++++N       RI   N + +            ++LN DA
Sbjct: 970  DMAPTDIVAWTHTFISDYQEALSNVNRYKSARILQRNDIDVTVPV------NSYRLNTDA 1029

Query: 436  SWCTTTNHGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGILKGLDFIHEYNIPL-L 495
            +  T  +  G+G +++DW   +V             T+ E   +  GL++     IPL L
Sbjct: 1030 ALSTERSKLGIGAVVKDWKGCVVANLAIPAAGALQSTLAEALALKAGLNWCQHIKIPLAL 1089

Query: 496  VESDSWEAIRLINAVDNDRIEARDFARKIRQRTTSWTNISFHHNRRETNMVAHKLAQRGE 505
            VE+DS   +  +    ND     D    IR   + + N+   H RR+ N  AH LA+R  
Sbjct: 1090 VETDSKLLVDKVLGGKNDLSALSDIVTDIRNSLSFFPNVILRHTRRQFNGDAHNLARRAL 1147

BLAST of Tan0019688 vs. ExPASy TrEMBL
Match: A0A6J1DAR4 (uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018954 PE=4 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 1.6e-52
Identity = 145/483 (30.02%), Postives = 225/483 (46.58%), Query Frame = 0

Query: 76   QAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGY 135
            +A+ AKQCWR+L +PNS+L +V +GRYFK  +F++A +  NPSY WRSILWGR+L K G 
Sbjct: 656  KALLAKQCWRILNHPNSMLSRVLKGRYFKDCSFMEAKISGNPSYIWRSILWGRDLLKKGL 715

Query: 136  RWKVGSGHQINIREDPWLLAEGRDTPLWVDPNLTGVNVCNLL--RNDGFWDEDKIREHFN 195
            RW++G+G  + I  D W +       +   P L  V+  + L    +G W  D +R+ F 
Sbjct: 716  RWRIGNGDSVFIYGDNW-VPNQPTLKILSSPRLPLVSRVSSLVDHEEGGWQGDVVRDEFT 775

Query: 196  QDDADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNEA-SSSNNSLTKQ 255
             D+A  ILSI        D +IW   K GV+SV+S Y + +      +A SSS++   + 
Sbjct: 776  PDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYKVALLNNPCVQAPSSSSSEEVRC 835

Query: 256  MWKSIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWGC 315
             W   W   IPNKIK++ W++  D LPT  NL ++G+ +   C FC +  E   HLFW C
Sbjct: 836  WWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNCCYFCGRNGEDSIHLFWIC 895

Query: 316  KSCRN--------------VLLQKQGSIN--------------WK----RMFLNTQLQI- 375
            K                  +L +   S++              W     R F ++   + 
Sbjct: 896  KFAEALWINSKFGKLSPFLILRESHESLSKADFEELCVVIWGLWNQRNARAFNDSTKTVF 955

Query: 376  --------------QEFTQSVANRI--QVPNHLTIATETWQLPKEGWWKLNIDASWCTTT 435
                           EF ++ +N I  +V N   I    WQ P EG +K+N DAS+  + 
Sbjct: 956  KIGMELVEWANKYAMEFREAKSNPITGRVTNTAEI---LWQPPDEGIYKINTDASFLASD 1015

Query: 436  NHGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGILKGLDFIHEYNIPLLVESDSWE 495
             H G+G I+ +   +++ A   ++ +   + + E    ++GL    E  +   +E     
Sbjct: 1016 QHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEGLQLASEIGMHPALE----- 1075

Query: 496  AIRLINAVDNDRIEARDFARKIRQRTTSWTNISFHHNRRETNMVAHKLAQRGEHLLGEEL 507
                      D  E  +   K +   T   + SF+  +RE N  AH LA+R   L    +
Sbjct: 1076 ----------DLSETGEIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLHEFSI 1119

BLAST of Tan0019688 vs. ExPASy TrEMBL
Match: A0A803P9P5 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 213.8 bits (543), Expect = 1.7e-51
Identity = 144/479 (30.06%), Postives = 219/479 (45.72%), Query Frame = 0

Query: 73  HTLQAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFK 132
           H  QA+ AKQ W +  NP+SLL+K+ + RYFK   FL+A +G+ PS TWRSI+WG+ L  
Sbjct: 339 HYNQALLAKQSWFIFDNPSSLLHKILKARYFKYNTFLEAEIGSYPSLTWRSIIWGKELLT 398

Query: 133 HGYRWKVGSGHQINIREDPWLLAEGRDTPL-WVDPNLTGVNVCNLLRNDGFWDEDKIREH 192
            G RWKVG+G+QI    DPWL       PL + +P+L+ + V  L+ N   W+   +++ 
Sbjct: 399 KGLRWKVGNGNQILCASDPWLPGITSFKPLIFKNPSLS-MKVSELITNQRQWNHSLLQQC 458

Query: 193 FNQDDADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNEASSSNNSLTK 252
           F + D   I SI  T    +D++IW    NG++SVKS Y L   +     A+SSNNSL  
Sbjct: 459 FLESDVAKIQSIPLTLCDQSDQLIWNFENNGMYSVKSGYTLATRLEEQLPAASSNNSL-- 518

Query: 253 QMWKSIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWG 312
           Q WK  WS  +P+KIKI+ W+ +HD LP    L  + I  + IC  C + +E+  H  + 
Sbjct: 519 QWWKKFWSLTLPSKIKIFLWRAMHDCLPVADILHHRHISDSAICTLCHQAIESTTHALFW 578

Query: 313 CKSCRNVLLQKQGSI---------------NWKRMFLNTQLQ------------------ 372
           CK  R +      SI               N  +++ + Q++                  
Sbjct: 579 CKRPRKIWQLSSFSISDFVTHNMSLMDVLQNMSQIWSSKQIEQFACILWSIWNERNKERH 638

Query: 373 -----------------IQEFTQSVANR--IQVPNHLTIATE----TWQLPKEGWWKLNI 432
                            I+EF  +  N        H T  ++     W  P  G  KLN 
Sbjct: 639 GSKTKPPEMTLFFAMDYIEEFQSARLNTSFADSATHATTRSQQQELPWMNPPSGRLKLNT 698

Query: 433 DASWCTTTNHGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGILKGLDFIHEYNIPL 492
           DA+     N  G G ILR+ T  I+ A        +   I+E   ++  L ++ +  +P+
Sbjct: 699 DAAVNAVENTSGFGAILRNDTGDIIAAMAMPFKGCFKPEIMEALALIYSLQWLKDSQLPV 758

Query: 493 -LVESDSWEAIRLINAVDNDRIEARDFARKIRQRTTSWTNISFHHNRRETNMVAHKLAQ 494
             +E+DS   ++ + A      +       I    +++      H  R  N  AH LA+
Sbjct: 759 HFIETDSLLVVKGLQATQRHISDFHCLLNNISLLVSNFPEAQISHTYRSANNAAHLLAK 814

BLAST of Tan0019688 vs. ExPASy TrEMBL
Match: A0A803QI38 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 2.3e-51
Identity = 143/473 (30.23%), Postives = 217/473 (45.88%), Query Frame = 0

Query: 76   QAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGY 135
            QAM AKQ WR+L NP SLL  + + +YFK  NFL+A LG  PSYTW S++WGR+L   G 
Sbjct: 1279 QAMLAKQAWRVLSNPTSLLACILKAKYFKLNNFLEAKLGHTPSYTWSSLIWGRDLLVRGL 1338

Query: 136  RWKVGSGHQINIREDPWLLAEGRDTP-LWVDPNLTGVNVCNLLRNDGFWDEDKIREHFNQ 195
             WKVG+G  I   +DPW+   G   P L          V   + N GFWD +K++++F++
Sbjct: 1339 LWKVGNGCSIRTFQDPWI--PGMKYPSLRSSDQPPDDKVSFFIDNSGFWDREKLQQYFDE 1398

Query: 196  DDADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNEASSSNNSLTKQMW 255
                 IL I   G   TD +IW    +G+FSVKSAYH+  +       SSS++SL+K  W
Sbjct: 1399 YSVSTILKIPIGGPQKTDSLIWTQDNSGIFSVKSAYHIANNTSM--PPSSSDSSLSKTWW 1458

Query: 256  KSIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWGCK- 315
            K++W+  +  KIK + W++ H ILP   NL  +  I  P C FC    ET  H    C  
Sbjct: 1459 KTLWNLNVQPKIKNFAWRVYHHILPVGLNLFVRKTISQPTCSFCPNPTETVTHALLDCPR 1518

Query: 316  ---------------SCRNVLLQKQGSINW-----------------------KRMFLNT 375
                           S R+V +++   I +                       K++F N 
Sbjct: 1519 ASKIWKASPLRSFYLSNRHVDVKEFMIIGFEQLHKDQLVLLVTTLWAIWYSRNKKLFANL 1578

Query: 376  QL-----------QIQEFTQSVA--NRIQVPNHLTIATETWQLPKEGWWKLNIDASWCTT 435
             L            I ++  ++A  NR     +   +  T ++  +  + L  DA+   T
Sbjct: 1579 DLTPNDTIAWIDSYISDYNAAMAMKNRYSGVLNFPASDRTPKVVPQNQYLLQTDAAINQT 1638

Query: 436  TNHGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGILKGLDFIHEYNIPL-LVESDS 495
             +  G G +L DW  ++V      +       I E   +   L++     +PL ++E+DS
Sbjct: 1639 QSKMGFGAVLLDWQGKVVAGLSAPVAGNLQPLIAEALSLRASLEWCINIQMPLAVIETDS 1698

BLAST of Tan0019688 vs. ExPASy TrEMBL
Match: A0A1S8ACU2 (Ribonuclease H-like superfamily protein OS=Citrus limon OX=2708 PE=4 SV=1)

HSP 1 Score: 213.0 bits (541), Expect = 3.0e-51
Identity = 136/472 (28.81%), Postives = 221/472 (46.82%), Query Frame = 0

Query: 76  QAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGY 135
           QA+ AKQ WR+++ P SL+ K+ + +YFK  +FL+A LG+ PS+ WRSI+WGR +  +G 
Sbjct: 108 QALVAKQSWRIIKYPESLMAKILQAKYFKGADFLQAKLGSKPSFVWRSIIWGRQVIVNGM 167

Query: 136 RWKVGSGHQINIREDPWLLAEGRDTPLWVDPNLTGVNVCNLLRNDGFWDEDKIREHFNQD 195
           RW++G+G ++ I +  W+       P+          V  L+  +  W E  I++HFN +
Sbjct: 168 RWRIGTGDRVKIYKSHWIPRPQAFKPISAPTLGMDCTVAELIDENQKWKESLIQQHFNFE 227

Query: 196 DADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYHLGMSIRAYNEASSSNNSLTKQMWK 255
           DA+ I  I        D+I+W   K G +SVKS Y + M I+     SSS+++L +  W 
Sbjct: 228 DAELISRIQLPISPKPDQILWHYDKKGNYSVKSGYQIAMRIKHPARPSSSSSNLGQ--WN 287

Query: 256 SIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWGCKSC 315
            IWS  +P KIKI+ WK   + LPT  NL R+ ++  PICP C  K E  NH    CK+ 
Sbjct: 288 VIWSLELPEKIKIFMWKAARNFLPTSENLWRRKMVQEPICPRCKMKKEDINHAIMVCKAA 347

Query: 316 RNV-----------LLQKQGSINWKRMFLNTQLQIQEFTQSVA---------------NR 375
           + +           LL  Q  ++  +  +N + +  E    +A               N+
Sbjct: 348 KKMWKLTPFVEEMQLLDNQDLLSMLQELVNRRSK-DELRLIIALCWTAWHTRNIFVFENK 407

Query: 376 IQVPNHLTIATET--------------------------WQLPKEGWWKLNIDASWCTTT 435
            Q P       E                           W  P +G +K N+DA+     
Sbjct: 408 RQDPQISVAKAEAVVESYARVRMTKVQAAAKAIPAKGAKWIPPPQGHFKANVDAAVNKEK 467

Query: 436 NHGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGILKGLDFIHEYNI-PLLVESDSW 494
           N  G+G ++RD +  I+ A   H      +   E   +  GL  + E ++ PL++E+D  
Sbjct: 468 NQVGLGVVIRDDSGAIIIAAVNHTKYHGDVAQAEAAAVNFGLQVVMEASLSPLILETDCQ 527

BLAST of Tan0019688 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 123.2 bits (308), Expect = 6.0e-28
Identity = 112/490 (22.86%), Postives = 189/490 (38.57%), Query Frame = 0

Query: 77  AMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGYR 136
           A+  KQ WR+L  P SL+ KV++ RYF   + L A LG+ PS+ W+SI   + + + G R
Sbjct: 67  ALLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFVWKSIHASQEILRQGAR 126

Query: 137 WKVGSGHQINIREDPWLLAEGRDTPLWVD--PNLTGVNVCNLLRNDGF-------WDEDK 196
             VG+G  I I    WL ++     L +   P     +V ++L+           W +D 
Sbjct: 127 AVVGNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYASVSSILKVSDLIDESGREWRKDV 186

Query: 197 IREHFNQDDADHILSILRTGDLITDEIIWKCTKNGVFSVKSAYH-LGMSIRAYNEASSSN 256
           I   F + +   I  +   G  I D   W  T +G ++VKS Y  L   I   +     +
Sbjct: 187 IEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSGYWVLTQIINKRSSPQEVS 246

Query: 257 NSLTKQMWKSIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRN 316
                 +++ IW +    KI+ + WK + + LP    L  + +     C  C    ET N
Sbjct: 247 EPSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYRHLSKESACIRCPSCKETVN 306

Query: 317 HLFWGCKSCRN-------------------------VLLQKQGSINWKR----------- 376
           HL + C   R                          V     G+  W++           
Sbjct: 307 HLLFKCTFARLTWAISSIPIPLGGEWADSIYVNLYWVFNLGNGNPQWEKASQLVPWLLWR 366

Query: 377 --------MFLNTQLQIQEFTQSVANRIQ-------------VPNHLTIATETWQLPKEG 436
                   +F   +   QE  +   + ++              P     +   W+ P   
Sbjct: 367 LWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVNRSSCGRWRPPPHQ 426

Query: 437 WWKLNIDASWCTTTNHGGVGWILRDWTWRIVRAGHTHITDRWPITILEF----YGILKGL 494
           W K N DA+W       G+GW+LR+    +   G   +     +   E     + +L   
Sbjct: 427 WVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKLKSVLEAELEAMRWAVLSLS 486

BLAST of Tan0019688 vs. TAIR 10
Match: AT3G09510.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 109.0 bits (271), Expect = 1.2e-23
Identity = 77/283 (27.21%), Postives = 122/283 (43.11%), Query Frame = 0

Query: 99  RGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGYRWKVGSGHQINIREDPWLLAEGR 158
           + RYFK  + L A +    SY W S+L G  L K G R  +G G  I I  D  ++    
Sbjct: 2   KARYFKDVSILDAKVRKQQSYGWASLLDGIALLKKGTRHLIGDGQNIRIGLDN-IVDSHP 61

Query: 159 DTPLWVDPNLTGVNVCNLLRNDG---FWDEDKIREHFNQDDADHILSILRTGDLITDEII 218
             PL  +     + + NL    G   FWD+ KI +  +Q D   I  I        D+II
Sbjct: 62  PRPLNTEETYKEMTINNLFERKGSYYFWDDSKISQFVDQSDHGFIHRIYLAKSKKPDKII 121

Query: 219 WKCTKNGVFSVKSAYHL-----GMSIRAYNEASSSNNSLTKQMWKSIWSTPIPNKIKIYC 278
           W     G ++V+S Y L       +I A N    S +  T+     IW+ PI  K+K + 
Sbjct: 122 WNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTR-----IWNLPIMPKLKHFL 181

Query: 279 WKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHLFWGCKSCRNVLLQKQGSINWKR 338
           W+ +   L T   L  +G+ ++P CP C ++ E+ NH  + C            S+    
Sbjct: 182 WRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSSL---- 241

Query: 339 MFLNTQLQIQEFTQSVANRIQVPNHLTIATETWQLPKEGWWKL 374
             +  QL   +F ++++N +      T++     LP    W++
Sbjct: 242 --IRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRI 272

BLAST of Tan0019688 vs. TAIR 10
Match: ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 74.3 bits (181), Expect = 3.2e-13
Identity = 34/87 (39.08%), Postives = 52/87 (59.77%), Query Frame = 0

Query: 76  QAMPAKQCWRLLQNPNSLLYKVYRGRYFKTGNFLKATLGTNPSYTWRSILWGRNLFKHGY 135
           QA+ AKQ +R++  P++LL ++ R RYF   + ++ ++GT PSY WRSI+ GR L   G 
Sbjct: 67  QALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMMECSVGTRPSYAWRSIIHGRELLSRGL 126

Query: 136 RWKVGSGHQINIREDPWLLAEGRDTPL 163
              +G G    +  D W++ E   TPL
Sbjct: 127 LRTIGDGIHTKVWLDRWIMDE---TPL 150

BLAST of Tan0019688 vs. TAIR 10
Match: AT3G26855.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 50.4 bits (119), Expect = 4.9e-06
Identity = 24/74 (32.43%), Postives = 37/74 (50.00%), Query Frame = 0

Query: 249 LTKQMWKSIWSTPIPNKIKIYCWKIIHDILPTRANLLRKGIILNPICPFCLKKMETRNHL 308
           +T      IWS  I  KIK+  WK +++ LP  A LL + I + P C  C +  ET  H+
Sbjct: 1   MTNNWIGDIWSLKISPKIKLLIWKALNNALPVGAQLLSRNISIEPFCTRC-RDFETITHI 60

Query: 309 FWGCKSCRNVLLQK 323
            + C   +  ++ K
Sbjct: 61  LFNCPFAQREVIMK 73

BLAST of Tan0019688 vs. TAIR 10
Match: AT2G34320.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 46.2 bits (108), Expect = 9.3e-05
Identity = 42/192 (21.88%), Postives = 80/192 (41.67%), Query Frame = 0

Query: 308 LFWGCKSCRNVLLQKQGSINWKRMFLNTQLQIQEFT-----QSVANRIQVPNHLTIATET 367
           L W     RN L+ K    +   +        +E++     +  A+  QV  +L++    
Sbjct: 80  LLWRLWKSRNELMFKGKEYDAPEVLRRAMEDFEEWSTRRELEGKASGPQVERNLSV---Q 139

Query: 368 WQLPKEGWWKLNIDASWCTTTNHGGVGWILRDWTWRIVRAGHTHITDRWPITILEFYGIL 427
           W+ P   W K N DA+W       G+GWILR+ +  ++  G   +     +   E   + 
Sbjct: 140 WKAPPYQWVKCNTDATWQLENPRCGIGWILRNESGGVLWMGARALPRTKNVLEAELEALR 199

Query: 428 KGLDFIHEYNIP-LLVESDSWEAIRLINAVDNDRIEARDFARKIRQRTTSWTNISFHHNR 487
             +  +  +N   ++ ESD+   + L+N+ D+     +     I+Q    +  + F    
Sbjct: 200 WAVLTMSRFNYKRIIFESDAQALVNLLNS-DDFWPTLQPALEDIQQLLHHFEEVKFEFTP 259

Query: 488 RETNMVAHKLAQ 494
           R  N VA ++A+
Sbjct: 260 RGGNKVADRIAR 267

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P0C2F65.3e-1325.60Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1... [more]
P932954.5e-1239.08Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
XP_024950112.18.3e-5628.69uncharacterized protein LOC112496847 [Citrus sinensis][more]
XP_006491472.12.7e-5429.11uncharacterized protein LOC102626455 [Citrus sinensis][more]
XP_022150918.13.3e-5230.02uncharacterized protein LOC111018954 [Momordica charantia][more]
XP_023917061.12.8e-5128.98uncharacterized protein LOC112028598 [Quercus suber][more]
XP_024037590.13.6e-5127.87uncharacterized protein LOC112097210 [Citrus clementina][more]
Match NameE-valueIdentityDescription
A0A803PV884.0e-5631.15Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A6J1DAR41.6e-5230.02uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018... [more]
A0A803P9P51.7e-5130.06Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A803QI382.3e-5130.23Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A1S8ACU23.0e-5128.81Ribonuclease H-like superfamily protein OS=Citrus limon OX=2708 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G29090.16.0e-2822.86Ribonuclease H-like superfamily protein [more]
AT3G09510.11.2e-2327.21Ribonuclease H-like superfamily protein [more]
ATMG00310.13.2e-1339.08RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
AT3G26855.14.9e-0632.43RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
AT2G34320.19.3e-0521.88Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 369..496
e-value: 5.3E-11
score: 44.7
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 224..315
e-value: 8.0E-17
score: 61.7
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 374..494
e-value: 1.7E-17
score: 63.4
NoneNo IPR availablePANTHERPTHR46736:SF6SUBFAMILY NOT NAMEDcoord: 100..436
NoneNo IPR availablePANTHERPTHR46736FAMILY NOT NAMEDcoord: 100..436
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 373..492
e-value: 1.10283E-14
score: 68.4948
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 369..495

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0019688.1Tan0019688.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity