Cp4.1LG11g07390 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG11g07390
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: RNase H family protein
Location: Cp4.1LG11 : 5909146 .. 5914427 (+)
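The location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG11 : 5909146 .. 5914427 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the feature spans 5282 bases on the plus strand of Cp4.1LG11.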
The following sequences are available for this feature:

Gene sequence (with intron)

TGGAGCGACTGGAGCGAGAAACTTAGTATGGAGAAACCTATTTGGGTGGATGGATGACATTGATTTGATTTGGAAACCCTGAAATCCCCATAGTTCCTGGAAAATTCCCCTCTCATGAACTGCTTCTCCCAATTCTCTACCTACACTCGCGCCATTTTCAGAGCCACCAATCTTGCTTTTGCAGCTTCCACCTCCATTCATGGCTGCCGCTTTAATCCCTACTGGACCTCGAGCTTTCACAGCGTCACTCTTAAACCTACTCCTTTAGACTCCTTGTGTTCCAGATTTCGTCTACGTTGCTACTCCTCTCGAAAACTCCGCAAGGGCGCTTCTCCTTCTCCCAACTTAGATTCTAAACCTCCCATGGAACCAGACATGGGCGACTTCTTTGTCGTTCGGAAGGGGGACGTTGTTGGAGTCTATAAAAGTTTTACTGACTGTCAGGCGCAAATTGGATCTTCGGTATTCATTTCTTTTACTTTTGATTGGTGATTTAACACATTTTGAATTCATGTTCTCTCTGCGAACTTAGATTAGGTTTTCCTACTGGTTACATTTAATTGTTTTTGAAGTATTGGCTCAGTGATCGATCAGTTCATCATACTTCTTATTGCTTAAGGATTTTGTCTTACTTGACAAGCATAGTTCTTTGTTAATTTTACTTATGGTTGCCGGTTTGTTGTAACTGGTGCATTTCGTTCAGTGGATTAAGCTCGGGTGTTATCGTTTCTTTTCTGTGGCAGATATGCGATCTTCCTGTTAGCGTGTATAAGGGACACTCATTGCCGAAAGACACTGGGGAATATCTTGCTTCCGTCGGGCTTAAGAATGCTCTGTACACTATTAAAGCTGCAGATATGAGACCTGATCTTTTCAGTTCGCTCGTTCCTTGCACTTTTCATGTATGATGCTTCATTAGTTGAATGATTATCGAGTTAAAAGTCTAAGAGAACATTACTGGCATGAATTATATAAGAAAAAAGTAGGAAAAATATAGTAATTCTGCTCTTATACTAGACTTATTCAAGCATCACAAAGATTAATATTTCCCAAATGTTAGATCATCCATGTTTGGTCATTATTGAAGAGGGTGAATTTGGCCCTATCATGGGATGCTCTACAGATGTTTGCGAGAAAGGAGCATATTGTTTTAGTCTCCCTAGGTCACAATTTTGAGTTTGTACTCCATGCCTCCCTTGCCTTGTAATTTTCTATTTTGTAGAGAACACGGTTTCATGTTTTATAGGAAGAATAGTATAACTAGCACAGTTTGCACTTTAAAGTCAGACGATGAAGGCTGTAAACTACAACAACATGAAGCATTTTATACATATTGGTTTTTTACAATAGCAAATTAGGATGGTAGGTTTTCAATTTTGTTTTTATGTAGATTCAATCTCCTTCCTGGACTATCTTTATGCACTCTAGCATAAGTTTGTTATATTTATAAAAGGTTTCTGTTTTCCAGGATGAAGCTACTTCTCTTAAAGGTGAGGCTTCTGGCCAGGATGCCATAAAGAAGAGATCAAGAGAGACCATTGTATCAGAAAATATTGTAAGTTGATTTTCATGTCATCTTCTAAGCTAAAATTTTAATTGAAGCGATAAAGCTCTTTACAAAGGAACTTGCAACACTGAATTTGATATGCAACTTATGGTTTTTGTTGTCAATGCTCGTTTACTTGGGGTTAATGCAGATTTAAATGTTTGCAGGGGTCAACTGTTTTAACTCCTACTTCAAAAGATCCCTTGAGGAAACATGTCAAGTTGGAAGATTCCGTTGTGTCCCAAGCACCCTCTAACCATGTAAGTCAAAGTCTCGGTGATGTCATAGTGATTGGACCAATACTATGAACTGCATACTCTTGTCTGTAGCTAAATATTTGTGTACATGTGATTCTGTATTTTTTGCCTAGTGTTGCTTAAGTGGTTGATTAAAAAAAGAAATAAATTACTGCATTTGTTTGTTGTCCTCTGAGTTGTATCAATTTTTATTTTAG
GCTATTAATTAGGACGTAAAGTTCTGTTTCTAAACTAATTATGTATTTGATTGTGTCCTTGTGTTGTTAACATCATTCAAAAGGTGCCAGCTCGTTAAACCTTCAGAATTTTCACATCTGTCAACTGATATCAAGTCATCAATTTCGTTATGATCATAATAAGGTTGGTACAGGGTAGTTCTTATGCAAAGATATAATTAGGGTCTGAAATATTACTTTTAATTTAAAAGGGCCAAAATGAAACCAGCAAATGGTTATGGTTTCCTGATGGTAATCGTATTTTATAATGAACTAGCGAGGATGCAAACTAGTGGAGTCTCTTTTTAGTTTTGTTTTGACGCTGATATGCTTTCATTTCATTGTCCAGATTCCAAAATGATATTTCCATAATACAATGGAATAGTCCCTTAGTTTATTTTTCAACTTACTCGACAGATTTTATAGCCATTATAATATTTCATGTACAGGAATCTTGCTTTCTAGAATTCGATGGTGCCTCAAAGGGAAACCCTGGGCAAGCTGGTGCAGGAGCTGTTCTACGAGCTCATGATGGGAGTGTGGTATACATTCTTCCGTTTAGATCCCACATTACTTATGTATTCTTATTTATATATACCTAATCTATCTCTCCCCCCCTTGGAATAGATATGTAGACTGCGTGAAGGCCTAGGTATAGCAACCAATAATGTTGCTGAATATCGAGCAATTCTTTTAGGGTTGAAGTATGCACTTGAGAAAGGGTTTACTAGGATCCATGTCCAAGGCGACTCCAAACTTGTCTGTATGCAGGTTGGTTAAAATATTTTCCCCTTACAAGTATTGTTGGGTTTTTGTATTTATGTTTATGTTTATGTTTTTTTTAATTTATTTGTCAACTTAATTCTTGTTGTGGCCTCACATTGTAAGCTTGTTAAGTAGGTCTCATTTTGTCTTTTATATTTAGGAAAAATTCGGAGGTATGGTTTTAATGGTTGTCTTAGGACTAAGAAGTTCAAAGCTGAAAACTTTGTATTTAAAGATAATCCTTGTAAATTATTATTACACTGTAAGAAACATTTGTACTGAAGGGAAATAATTAGTCCAAACTGCATGAAACTAATGTTGTCATACCCGATTCCGGGAACATCCCAATAGTTTTCCTGACCCATGGTTTGCAGTCTCTGTGAAAATACTATTTCATCGAGAGTTGTACTTCTATTTTCTTCAAAGGGAAGGAGGCTAACGTCCTTTTGTCGATATGCCTGCTCTGTGAAGATTCCCTTGACACTACTTTCACCCTAACCCAAGGGGGCAGGAACGTTAGAAGAAAACACACCAGTTGACAAATTGAAAAAGTTGTAGAACATTTGTTTAAGGCATTCTGATTCCTGAAACTTTTTCTTGCTATCGATTGCCCCTTTTATCAACACATGGGATGAAGTTTTAATGCAGTGGAACAATTTTCTGGTTAACAACATTGACGAAAAACCTTTCCTCTATTGAGATGTTAACAAACCTTACCTTCAAGTTGAAAAGTATATCATCATCAATGTAGCTCACTTCCATATGAGTTCTTTTGAATATCTAGCATTGGTTTTTGTGGCTCAGCATTGTGGAGGTTCATTAATAGAAAGTTGTAGGAAAGATCCAGAATTTGAACCAAACACAGTTCTTATATGGAGTTAGAAATTTACTAATTACTATGCATAGCAATTAAAGAATTAAAATGTATCTGGCCACGTCTTTTGGTTAGTTGCATTTTCAAGTTCGCATCCCCAGTTTCTCTCGCTAGTAGTATGGTTTTGCTGAAATACAGTGTTTATGTAAATATTTAATATTAGGGACATTTAAAGCTGCAACTGTTTTGTTATTGTCAACCTCCGTATATCAAAACCAACCTTATCTCTTAGTCCTACCTCTCTGTCTGGATCCATCTGTTTTTTGGTAAAAAATAAAATTCATGGCATCTTCGTTTGTTCCCCTTTTTCTTATGTATTCTAAATCAATCATACTTCATTGAA
CAGGTTCAAGGATTATGGAAGGTAAAAAATGAGAATATCGCTGAGTTATGTAATGAAGTTATGAAGCTGAAGGATAAATTTCTCTCCTTCGAGATCAGTCATGTACTAAGGGTCAGTACCATTAATTCCTTTCTATTTGCATCTCACGATGTTGTGTAGTTGGTTTCTGCATATTATTGTTTCTTATGCTCTACGCACTTTACTATTTAAGATCTGAAGGTGGTGAAAAAAAAAAATATCCTTAAGGGTCAGTACCCTACAAACATAGGCTTAAACTGTAGCCTTATTTTAGTTCAGTTTTTCGATAGAACAGTGAACGTAAGTCAAGTTTGAGTTTGGACGTTGCTTTCTCCACATTATTTGTCCGACAACCGAACGAGACTTGCATACTCGAAAAAATAATAAATACGTACTCATAAAACAGTCTGGTACTTATTCCTTATCAAGGTAATGTCCTTAATGGAATGGATACCAATTCTAGCGACCATTGAATCCATTTTCTTATTTGTTGATATCTGTACTTGTTCATTCTTTGAAATTCTGACATTCTGTCGTTCATTCTGGAAAGAATCTAAACTCTGAAGCCGATGCTGAAGCAAACTTGGCTGTCACTCTACCTGGTAATGCTTATTTGAAATAATTCTAATAACTACTTTGAGTTTTCCTTAAAGTTTAAGAGTAACCAAGGTGGTCCTTCCTATTGCAGACGGCGAAGCCCAAGAGTTTGAAGAATAATAGTTAGAATGCACGGCAGGATACATCTTACAGCATAGAATTATTTTCTGAGGAATGCATTTATAGTCCAATGCTTTGTATGCTACTATTCTTTCCCCAAGGTTGACACGAGCTTTGGGCGTTGTTTCGGTTGCTCGCTTATGCTCCATGACCCTGACATAGGGAACTGAAGTTCTAGTGGCAATGGACTGGACAGTGCACAAGTGCATTTGTCCTGCATATATGAGATAAGAGTCCTAATTCAATTTATAACTGGAATGTTGCATTTATTTTCTTATTCATGTTACCAGTTGATTAGGCAATGATGCAGGCTATAATCGAGAATTAAAGAACACAGAATTTTTGGATTGTTCAATGAAGATCTTTGAGAAAGACGTTTTTTTTTTTCTGCTTCAGCCATCGAGGATTGTCAAACTTTGTAGCTTTTTCTCTAGTTTAAGTCTTGATTAGATCTTCTATTCCTTCAGTTGGCTACTGGCGTTTATAGTCTTCTTCTTTTCTAATTTCAGGATTGGTTAGATCAAATAAGACTTGGGACAATCTCAAA

mRNA sequence

TGGAGCGACTGGAGCGAGAAACTTAGTATGGAGAAACCTATTTGGGTGGATGGATGACATTGATTTGATTTGGAAACCCTGAAATCCCCATAGTTCCTGGAAAATTCCCCTCTCATGAACTGCTTCTCCCAATTCTCTACCTACACTCGCGCCATTTTCAGAGCCACCAATCTTGCTTTTGCAGCTTCCACCTCCATTCATGGCTGCCGCTTTAATCCCTACTGGACCTCGAGCTTTCACAGCGTCACTCTTAAACCTACTCCTTTAGACTCCTTGTGTTCCAGATTTCGTCTACGTTGCTACTCCTCTCGAAAACTCCGCAAGGGCGCTTCTCCTTCTCCCAACTTAGATTCTAAACCTCCCATGGAACCAGACATGGGCGACTTCTTTGTCGTTCGGAAGGGGGACGTTGTTGGAGTCTATAAAAGTTTTACTGACTGTCAGGCGCAAATTGGATCTTCGATATGCGATCTTCCTGTTAGCGTGTATAAGGGACACTCATTGCCGAAAGACACTGGGGAATATCTTGCTTCCGTCGGGCTTAAGAATGCTCTGTACACTATTAAAGCTGCAGATATGAGACCTGATCTTTTCAGTTCGCTCGTTCCTTGCACTTTTCATGATGAAGCTACTTCTCTTAAAGGTGAGGCTTCTGGCCAGGATGCCATAAAGAAGAGATCAAGAGAGACCATTGTATCAGAAAATATTGGGTCAACTGTTTTAACTCCTACTTCAAAAGATCCCTTGAGGAAACATGTCAAGTTGGAAGATTCCGTTGTGTCCCAAGCACCCTCTAACCATGAATCTTGCTTTCTAGAATTCGATGGTGCCTCAAAGGGAAACCCTGGGCAAGCTGGTGCAGGAGCTGTTCTACGAGCTCATGATGGGAGTGTGATATGTAGACTGCGTGAAGGCCTAGGTATAGCAACCAATAATGTTGCTGAATATCGAGCAATTCTTTTAGGGTTGAAGTATGCACTTGAGAAAGGGTTTACTAGGATCCATGTCCAAGGCGACTCCAAACTTGTCTGTATGCAGGTTCAAGGATTATGGAAGGTAAAAAATGAGAATATCGCTGAGTTATGTAATGAAGTTATGAAGCTGAAGGATAAATTTCTCTCCTTCGAGATCAGTCATGTACTAAGGAATCTAAACTCTGAAGCCGATGCTGAAGCAAACTTGGCTGTCACTCTACCTGACGGCGAAGCCCAAGAGTTTGAAGAATAATAGTTAGAATGCACGGCAGGATACATCTTACAGCATAGAATTATTTTCTGAGGAATGCATTTATAGTCCAATGCTTTGTATGCTACTATTCTTTCCCCAAGGTTGACACGAGCTTTGGGCGTTGTTTCGGTTGCTCGCTTATGCTCCATGACCCTGACATAGGGAACTGAAGTTCTAGTGGCAATGGACTGGACAGTGCACAAGTGCATTTGTCCTGCATATATGAGATAAGAGTCCTAATTCAATTTATAACTGGAATGTTGCATTTATTTTCTTATTCATGTTACCAGTTGATTAGGCAATGATGCAGGCTATAATCGAGAATTAAAGAACACAGAATTTTTGGATTGTTCAATGAAGATCTTTGAGAAAGACGTTTTTTTTTTTCTGCTTCAGCCATCGAGGATTGTCAAACTTTGTAGCTTTTTCTCTAGTTTAAGTCTTGATTAGATCTTCTATTCCTTCAGTTGGCTACTGGCGTTTATAGTCTTCTTCTTTTCTAATTTCAGGATTGGTTAGATCAAATAAGACTTGGGACAATCTCAAA

Coding sequence (CDS)

ATGAACTGCTTCTCCCAATTCTCTACCTACACTCGCGCCATTTTCAGAGCCACCAATCTTGCTTTTGCAGCTTCCACCTCCATTCATGGCTGCCGCTTTAATCCCTACTGGACCTCGAGCTTTCACAGCGTCACTCTTAAACCTACTCCTTTAGACTCCTTGTGTTCCAGATTTCGTCTACGTTGCTACTCCTCTCGAAAACTCCGCAAGGGCGCTTCTCCTTCTCCCAACTTAGATTCTAAACCTCCCATGGAACCAGACATGGGCGACTTCTTTGTCGTTCGGAAGGGGGACGTTGTTGGAGTCTATAAAAGTTTTACTGACTGTCAGGCGCAAATTGGATCTTCGATATGCGATCTTCCTGTTAGCGTGTATAAGGGACACTCATTGCCGAAAGACACTGGGGAATATCTTGCTTCCGTCGGGCTTAAGAATGCTCTGTACACTATTAAAGCTGCAGATATGAGACCTGATCTTTTCAGTTCGCTCGTTCCTTGCACTTTTCATGATGAAGCTACTTCTCTTAAAGGTGAGGCTTCTGGCCAGGATGCCATAAAGAAGAGATCAAGAGAGACCATTGTATCAGAAAATATTGGGTCAACTGTTTTAACTCCTACTTCAAAAGATCCCTTGAGGAAACATGTCAAGTTGGAAGATTCCGTTGTGTCCCAAGCACCCTCTAACCATGAATCTTGCTTTCTAGAATTCGATGGTGCCTCAAAGGGAAACCCTGGGCAAGCTGGTGCAGGAGCTGTTCTACGAGCTCATGATGGGAGTGTGATATGTAGACTGCGTGAAGGCCTAGGTATAGCAACCAATAATGTTGCTGAATATCGAGCAATTCTTTTAGGGTTGAAGTATGCACTTGAGAAAGGGTTTACTAGGATCCATGTCCAAGGCGACTCCAAACTTGTCTGTATGCAGGTTCAAGGATTATGGAAGGTAAAAAATGAGAATATCGCTGAGTTATGTAATGAAGTTATGAAGCTGAAGGATAAATTTCTCTCCTTCGAGATCAGTCATGTACTAAGGAATCTAAACTCTGAAGCCGATGCTGAAGCAAACTTGGCTGTCACTCTACCTGACGGCGAAGCCCAAGAGTTTGAAGAATAA

Protein sequence

MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSLKGEASGQDAIKKRSRETIVSENIGSTVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAEANLAVTLPDGEAQEFEE
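The relationship between the coding sequence and the protein sequence listed above can be checked with a small translation routine. This is an illustrative sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last five codons of the CDS shown above.
print(translate("ATGAACTGCTTCTCCCAATTCTCT"))
print(translate("GAGTTTGAAGAATAA"))
```

The two fragments translate to MNCFSQFS and EFEE, which match the start and end of the listed protein sequence.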
BLAST of Cp4.1LG11g07390 vs. Swiss-Prot
Match: Y2253_MYCBO (Uncharacterized protein Mb2253c OS=Mycobacterium bovis (strain ATCC BAA-935 / AF2122/97) GN=Mb2253c PE=3 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 5.4e-17
Identity = 54/126 (42.86%), Positives = 74/126 (58.73%), Query Frame = 1

Query: 234 LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALEKG 293
           +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A++ G
Sbjct: 5   IEADGGSRGNPGPAGYGAVVWTADHSTVLAESKQAIGRATNNVAEYRGLIAGLDDAVKLG 64

Query: 294 FTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADA 353
            T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  +F       V R  N+ AD 
Sbjct: 65  ATEAAVLMDSKLVVEQMSGRWKVKHPDLLKLYVQAQALASQFRRINYEWVPRARNTYADR 124

Query: 354 EANLAV 359
            AN A+
Sbjct: 125 LANDAM 130
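The identity and positives percentages in an HSP header are the match counts divided by the alignment length. The sketch below reproduces the figures for the hit above and shows how the counts relate to the alignment midline, where a letter marks an identical residue and `+` marks a conservative substitution. The helper names `percent` and `count_midline` are hypothetical, and the midline string is the one from the last alignment block (query `EANLAV`, subject `LANDAM`).

```python
def percent(matches, aligned_length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

def count_midline(midline):
    """Return (identities, positives) counted from a BLAST midline string."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")  # positives include identities
    return identities, positives

print(percent(54, 126), percent(74, 126))
print(count_midline(" AN A+"))
```

For this hit the header values 54/126 and 74/126 give 42.86 and 58.73, and the final six-column block contributes 3 identities and 4 positives.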

BLAST of Cp4.1LG11g07390 vs. Swiss-Prot
Match: Y2228_MYCTU (Uncharacterized protein Rv2228c OS=Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) GN=Rv2228c PE=1 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 5.4e-17
Identity = 54/126 (42.86%), Positives = 74/126 (58.73%), Query Frame = 1

Query: 234 LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALEKG 293
           +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A++ G
Sbjct: 5   IEADGGSRGNPGPAGYGAVVWTADHSTVLAESKQAIGRATNNVAEYRGLIAGLDDAVKLG 64

Query: 294 FTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADA 353
            T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  +F       V R  N+ AD 
Sbjct: 65  ATEAAVLMDSKLVVEQMSGRWKVKHPDLLKLYVQAQALASQFRRINYEWVPRARNTYADR 124

Query: 354 EANLAV 359
            AN A+
Sbjct: 125 LANDAM 130

BLAST of Cp4.1LG11g07390 vs. Swiss-Prot
Match: Y2228_MYCTO (Uncharacterized protein MT2287 OS=Mycobacterium tuberculosis (strain CDC 1551 / Oshkosh) GN=MT2287 PE=3 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 5.4e-17
Identity = 54/126 (42.86%), Positives = 74/126 (58.73%), Query Frame = 1

Query: 234 LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALEKG 293
           +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A++ G
Sbjct: 5   IEADGGSRGNPGPAGYGAVVWTADHSTVLAESKQAIGRATNNVAEYRGLIAGLDDAVKLG 64

Query: 294 FTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADA 353
            T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  +F       V R  N+ AD 
Sbjct: 65  ATEAAVLMDSKLVVEQMSGRWKVKHPDLLKLYVQAQALASQFRRINYEWVPRARNTYADR 124

Query: 354 EANLAV 359
            AN A+
Sbjct: 125 LANDAM 130

BLAST of Cp4.1LG11g07390 vs. Swiss-Prot
Match: RNH_HALSA (Ribonuclease HI OS=Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) GN=rnhA PE=1 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 1.6e-16
Identity = 50/123 (40.65%), Positives = 71/123 (57.72%), Query Frame = 1

Query: 236 FDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTR 295
           FDGAS+GNPG A  G VL + DG ++    + +G ATNN AEY A++  L+ A + GF  
Sbjct: 74  FDGASRGNPGPAAVGWVLVSGDGGIVAEGGDTIGRATNNQAEYDALIAALEAAADFGFDD 133

Query: 296 IHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAEAN 355
           I ++GDS+LV  Q+ G W   + ++        +L   F  + I+HV R  N  ADA AN
Sbjct: 134 IELRGDSQLVEKQLTGAWDTNDPDLRRKRVRARELLTGFDDWSITHVPRATNERADALAN 193

Query: 356 LAV 359
            A+
Sbjct: 194 EAL 196

BLAST of Cp4.1LG11g07390 vs. Swiss-Prot
Match: RNHL_BACSU (14.7 kDa ribonuclease H-like protein OS=Bacillus subtilis (strain 168) GN=rnhA PE=1 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 2.3e-07
Identity = 40/124 (32.26%), Positives = 62/124 (50.00%), Query Frame = 1

Query: 237 DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRI 296
           DGAS GNPG +G G  ++ H+G         +G+ TN  AE+ A++ G+K    +G+  +
Sbjct: 8   DGASAGNPGPSGIGIFIK-HEGKAES-FSIPIGVHTNQEAEFLALIEGMKLCATRGYQSV 67

Query: 297 HVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAEANL 356
             + DS +V  +   L  VKN        E+++LK  F  F I  +    N +AD  A  
Sbjct: 68  SFRTDSDIV-ERATELEMVKNITFQPFVEEIIRLKAAFPLFFIKWIPGKQNQKADLLAKE 127

Query: 357 AVTL 361
           A+ L
Sbjct: 128 AIRL 128

BLAST of Cp4.1LG11g07390 vs. TrEMBL
Match: A0A0A0KTZ9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G489310 PE=4 SV=1)

HSP 1 Score: 612.8 bits (1579), Expect = 2.7e-172
Identity = 310/374 (82.89%), Positives = 334/374 (89.30%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNCFSQ STYTR IFR TNL FAASTSIHGC  N YWTSSFH+V +K T LDSLCSRF L
Sbjct: 1   MNCFSQVSTYTRVIFRRTNLVFAASTSIHGCS-NAYWTSSFHNVAVKTTALDSLCSRFGL 60

Query: 61  RCYSSRKLRKG---ASPSPNLDSKPPMEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSI 120
           RCYS+RK RK     SPSP LDS+PP+E +MGDFFVVRKGDVVGVYKSF+DCQAQIGSSI
Sbjct: 61  RCYSTRKPRKPRKPTSPSPKLDSEPPVESEMGDFFVVRKGDVVGVYKSFSDCQAQIGSSI 120

Query: 121 CDLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSLKG 180
           CDLPVSV+KGHSLPKDT EYLASVGLKNALYTIKAADMRPDLF SL PCTFH   TSL G
Sbjct: 121 CDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLAPCTFHGGDTSLTG 180

Query: 181 EASGQDAIKKRSRETIVSENIGSTVLTPTSKDPLRKHVKLEDSVVSQA-PSNHESCFLEF 240
           E SGQDAIKKRSRE IV EN+GSTVLTPT KDP RKH+KLEDS+VS +  SN ESCFLEF
Sbjct: 181 ETSGQDAIKKRSREAIVPENVGSTVLTPTLKDPTRKHIKLEDSIVSHSVSSNRESCFLEF 240

Query: 241 DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRI 300
           DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK AL+KGFTRI
Sbjct: 241 DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRI 300

Query: 301 HVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAEANL 360
           HVQGDSKLVCMQVQGLWK K+EN++ELCNEV KLK+KFLSFE++HVLR+LNSEADA+ANL
Sbjct: 301 HVQGDSKLVCMQVQGLWKAKHENMSELCNEVTKLKNKFLSFEVNHVLRHLNSEADAQANL 360

Query: 361 AVTLPDGEAQEFEE 371
           A+TL +GE QEFE+
Sbjct: 361 ALTLAEGEVQEFED 373

BLAST of Cp4.1LG11g07390 vs. TrEMBL
Match: A0A061E5R3_THECC (RNase H family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_010210 PE=4 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 1.8e-107
Identity = 212/370 (57.30%), Positives = 267/370 (72.16%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNC S    Y  AIFR T   F  +++ + CRF P W  +F    +K   L+ L +RF  
Sbjct: 1   MNCLSHVRAYGSAIFRKTG-HFIETSTCNQCRF-PSWKRNFQHAGVKTVDLEFLLTRFHA 60

Query: 61  RCYSSRKLRKG--ASPSPNLDSKPPMEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSIC 120
           +CYS+RK   G  A  +  +D +P ME +   F+VVRKGDVVGVYKSF DC+AQ+G SIC
Sbjct: 61  QCYSARKSSSGKKAPRTKKVDPEPVMENEKDAFYVVRKGDVVGVYKSFADCRAQVGPSIC 120

Query: 121 DLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSLKGE 180
           D PVSVYKG+SL KDT EYL S GLKNALYT++AAD++ DLF  L+PC+F + A+S KGE
Sbjct: 121 DPPVSVYKGYSLTKDTKEYLVSCGLKNALYTVRAADVKEDLFGLLMPCSFQEPASS-KGE 180

Query: 181 ASGQDAIKKRSRETIVSENIGSTVLTPTS-KDPLRKHVKLEDSVVSQAPSNHESCFLEFD 240
            S  DA KKRS++ + SE  G   L   +  DP+ KH+KL+     Q  S++ SC LEFD
Sbjct: 181 TSHMDAAKKRSQDMLKSEYGGLGALGSIAVADPVSKHIKLDPYAEVQIASSNCSCILEFD 240

Query: 241 GASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIH 300
           GASKGNPG AGA AVLR   G VIC+LREGLGIAT N AEYRA++LGLK+AL KG++ I 
Sbjct: 241 GASKGNPGPAGAAAVLRTDTGKVICKLREGLGIATCNAAEYRAVILGLKHALRKGYSSIC 300

Query: 301 VQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAEANLA 360
           V+GDSKLVCMQ+QGLWKVK+E+++EL  +  KLK+KFLSF+I+HVLR LN+EADA+ANLA
Sbjct: 301 VRGDSKLVCMQMQGLWKVKHEHMSELYEQAKKLKNKFLSFQINHVLRELNAEADAQANLA 360

Query: 361 VTLPDGEAQE 368
           V L +G+ QE
Sbjct: 361 VNLAEGQIQE 367

BLAST of Cp4.1LG11g07390 vs. TrEMBL
Match: A0A0B0MTD0_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_25913 PE=4 SV=1)

HSP 1 Score: 394.4 bits (1012), Expect = 1.5e-106
Identity = 216/374 (57.75%), Positives = 268/374 (71.66%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNC     TY  A+FR        STS++ C   P W  +F   ++K   L+ L +RFR 
Sbjct: 1   MNCPPHVQTYGSALFRKAGNFI--STSLNQCHV-PLWKRNFEHASVKIVDLEFLLTRFRT 60

Query: 61  RCYSSRKLR---KGASPSPNLDSKPP--MEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGS 120
           +CYSSRK     K  S +  +D + P  ME +   FFVVRKGDVVGV+KSF DCQ Q+GS
Sbjct: 61  QCYSSRKSSSSTKKTSRTKKVDPEQPKVMENEKDAFFVVRKGDVVGVFKSFADCQTQVGS 120

Query: 121 SICDLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSL 180
           SICD PVSVYKG+SL K+T  YL+S GLKNA YTI+AAD++ D+F +L+PC F + A+S 
Sbjct: 121 SICDPPVSVYKGYSLTKETEIYLSSYGLKNARYTIRAADVKEDIFGALMPCPFQEPASS- 180

Query: 181 KGEASGQDAIKKRSRETIVSENIGSTVLTPTS-KDPLRKHVKLEDSVVSQAPSN-HESCF 240
           KGE S  DA KKR ++ + SE  G   L   +  DP+RKH KL+    +Q  S+ H+SC 
Sbjct: 181 KGETSHNDATKKRPQDMLQSEYGGLGSLGSIAVADPVRKHFKLDPHAEAQITSSGHQSCI 240

Query: 241 LEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGF 300
           LEFDGASKGNPG AGA AVL+   G+VIC+LREGLGIATNN AEYRAI+LGLK+AL KG+
Sbjct: 241 LEFDGASKGNPGPAGAAAVLKTDAGNVICKLREGLGIATNNAAEYRAIILGLKHALRKGY 300

Query: 301 TRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAE 360
           T I V+GDSKLVCMQ+QGLWKVK+E+++EL  + MKLKDKFLSF+I+HVLR LN EADAE
Sbjct: 301 TNIRVRGDSKLVCMQLQGLWKVKHEHMSELYEQAMKLKDKFLSFQINHVLRELNGEADAE 360

Query: 361 ANLAVTLPDGEAQE 368
           ANLAV L +G+ QE
Sbjct: 361 ANLAVKLAEGQIQE 370

BLAST of Cp4.1LG11g07390 vs. TrEMBL
Match: A0A0D2RNX8_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G185700 PE=4 SV=1)

HSP 1 Score: 394.4 bits (1012), Expect = 1.5e-106
Identity = 214/369 (57.99%), Positives = 262/369 (71.00%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNC      Y  A+FR     F A+TS++ C   P W  +F    +K   L+ L +RFR 
Sbjct: 1   MNCLPHVRAYGSALFRKAG-NFIATTSLNQCHV-PLWRRNFEHAGVKTVDLEFLLTRFRT 60

Query: 61  RCYSSRKLRKGASPSPNLDSK-------PPMEPDMGDFFVVRKGDVVGVYKSFTDCQAQI 120
           +CYSSRK +  +S      +K       P ME +   FFVVRKGD VGV+KSF DCQAQ+
Sbjct: 61  QCYSSRKSKSSSSTKKASRTKKVHPEQPPVMENEKDAFFVVRKGDTVGVFKSFADCQAQV 120

Query: 121 GSSICDLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEAT 180
           GSSICD PVSVYKG+SL K+T  YL+S GLKNALYTI+AAD++ DLF +L+PC F + A+
Sbjct: 121 GSSICDPPVSVYKGYSLTKETEIYLSSCGLKNALYTIRAADVKEDLFGALMPCPFQEPAS 180

Query: 181 SLKGEASGQDAIKKRSRETIVSENIGSTVLTPTS-KDPLRKHVKLEDSVVSQAPSN-HES 240
           S KGE S  DA KKR ++ + SE  G   L   +  DP+RKH KL+    +Q  S+ H+S
Sbjct: 181 S-KGETSHNDATKKRPQDMLQSEYGGLGSLGSIAVADPVRKHFKLDPHAEAQITSSGHQS 240

Query: 241 CFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEK 300
           C LEFDGASKGNPG AGA AVL+   G+VIC+LREGLGIATNN AEYRAI+LGLK AL K
Sbjct: 241 CILEFDGASKGNPGPAGAAAVLKTDSGNVICKLREGLGIATNNAAEYRAIILGLKQALRK 300

Query: 301 GFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEAD 360
           G+T I V+GDSKLVCMQ+QGLWKVK+E+++EL  + MKLKDKFLSF+I+HVLR LN EAD
Sbjct: 301 GYTNIRVRGDSKLVCMQLQGLWKVKHEHMSELYEQAMKLKDKFLSFQINHVLRELNGEAD 360

BLAST of Cp4.1LG11g07390 vs. TrEMBL
Match: A0A0D2SDN8_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G350100 PE=4 SV=1)

HSP 1 Score: 394.0 bits (1011), Expect = 2.0e-106
Identity = 214/379 (56.46%), Positives = 269/379 (70.98%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNC      Y  A+FR T      STS++ C   P W  +F   ++K   L+ L +RFR 
Sbjct: 1   MNCLPHMHAYGSALFRKTGNFI--STSLNQCHV-PLWKRNFEDASVKTVDLEFLLTRFRT 60

Query: 61  RCYSSRKLR---KGASPSPNLDSKPP--MEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGS 120
           +CYSSRK     K  S +  +D + P  ME +   FFVVRKGDVVGV+KSF DCQ Q+GS
Sbjct: 61  QCYSSRKSSSSTKKTSGTKKVDPEQPQVMENEKDAFFVVRKGDVVGVFKSFADCQTQVGS 120

Query: 121 SICDLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSL 180
           SICD PVSVYKG++L K+T  YL+S GLKNA YTI+AAD++ D+F +L+PC F + A+S 
Sbjct: 121 SICDPPVSVYKGYALTKETEIYLSSYGLKNARYTIRAADVKEDIFGALMPCPFQEPASS- 180

Query: 181 KGEASGQDAIKKRSRETIVSE------NIGSTVLTPTSKDPLRKHVKLEDSVVSQAPSN- 240
           KGE S  DA KKR ++ +  E      ++GS  +     D  RKHVKL+    +Q  S+ 
Sbjct: 181 KGETSHYDATKKRPQDMLQLEYGVGLGSLGSIAVA----DLARKHVKLDPHAEAQITSSG 240

Query: 241 HESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYA 300
           H+SC LEFDGASKGNPG AGA AVL+   G+VIC+LREGLGIATNN AEYRA++LGLK+A
Sbjct: 241 HQSCTLEFDGASKGNPGPAGAAAVLKTDAGNVICKLREGLGIATNNAAEYRALILGLKHA 300

Query: 301 LEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNS 360
           L KG+T IHV+GDSKLVCMQ+QGLWKVK+E+++ELC + MKLKDKFLSF+I+HVLR LN 
Sbjct: 301 LRKGYTNIHVRGDSKLVCMQLQGLWKVKHEHMSELCEQAMKLKDKFLSFQINHVLRELNG 360

Query: 361 EADAEANLAVTLPDGEAQE 368
            ADAEANLAV L +G+ QE
Sbjct: 361 AADAEANLAVKLAEGQIQE 371

BLAST of Cp4.1LG11g07390 vs. TAIR10
Match: AT1G24090.1 (AT1G24090.1 RNase H family protein)

HSP 1 Score: 318.9 bits (816), Expect = 4.0e-87
Identity = 180/367 (49.05%), Positives = 242/367 (65.94%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNC S   +Y  A+      ++ +S   + C F  Y  S      LKP  + S+     +
Sbjct: 1   MNCLSHARSYI-ALGLLKRSSYVSSIPWNECFF--YMPSKS---CLKPVAVSSVFGICSV 60

Query: 61  RCYSSR-KLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSICD 120
             YSSR K  K    S  + S    E D   FFVVRKGDV+G+YK  +DCQAQ+GSS+ D
Sbjct: 61  HSYSSRSKAVKSKMLSSTVVSAVDKEKDA--FFVVRKGDVIGIYKDLSDCQAQVGSSVFD 120

Query: 121 LPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSLKGEA 180
           LPVSVYKG+SLPKDT EYL+SVGLK  LY+++A+D++ D+F +L PC F + A      +
Sbjct: 121 LPVSVYKGYSLPKDTEEYLSSVGLKKPLYSLRASDLKDDMFGALTPCLFQEPAPCTVKVS 180

Query: 181 SGQDAIKKRSRETIVSENIGSTVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGA 240
             +   + +S++    +   +++    S DPL K  K+E S    A  + E+CF+EFDGA
Sbjct: 181 EDETTSETKSKDDKKDQLPSASI----SYDPLEKLSKVEPS----AYISDETCFIEFDGA 240

Query: 241 SKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQ 300
           SKGNPG +GA AVL+  DGS+ICR+R+GLGIATNN AEY A++LGLKYA+EKG+  I V+
Sbjct: 241 SKGNPGLSGAAAVLKTEDGSLICRVRQGLGIATNNAAEYHALILGLKYAIEKGYKNIKVK 300

Query: 301 GDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAEANLAVT 360
           GDSKLVCMQ++G WKV +E +A+L  E   L +K +SFEISHVLRNLN++AD +ANLAV 
Sbjct: 301 GDSKLVCMQIKGQWKVNHEVLAKLHKEAKLLCNKCVSFEISHVLRNLNADADEQANLAVR 351

Query: 361 LPDGEAQ 367
           LP+GE +
Sbjct: 361 LPEGEVE 351

BLAST of Cp4.1LG11g07390 vs. TAIR10
Match: AT3G01410.1 (AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein)

HSP 1 Score: 272.7 bits (696), Expect = 3.3e-73
Identity = 138/292 (47.26%), Positives = 197/292 (67.47%), Query Frame = 1

Query: 84  MEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLASVGL 143
           ME +   F++VRKGD++GVY+S ++CQ Q GSS+    +SVYKG+  PK   + L+S G+
Sbjct: 5   MEDEKDAFYIVRKGDIIGVYRSLSECQGQAGSSVSHPAMSVYKGYGWPKGAEDLLSSFGI 64

Query: 144 KNALYTIKAADMRPDLFSSLVPCTFHDEATSLKGEASGQDAIKKRSRETIVSENIGSTVL 203
           KNAL+++ A+ ++ D F  L+PC     ++S +GE+  + +  KR ++      +GS   
Sbjct: 65  KNALFSVNASHVKDDAFGKLIPCPVQQPSSS-QGESLNKSSPSKRLQD------MGSGES 124

Query: 204 TPTSKDPLRKHVKLEDSVVSQAPSN---------HESCFLEFDGASKGNPGQAGAGAVLR 263
              S  P +K +K+E+ ++ + PS+         ++SC +EFDGASKGNPG+AGAGAVLR
Sbjct: 125 GSFSPSPPQKQLKIENDMLRRIPSSLLTRTPIRQNDSCTIEFDGASKGNPGKAGAGAVLR 184

Query: 264 AHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWK 323
           A D SV+  LREG+G ATNNVAEYRA+LLGL+ AL+KGF  +HV GDS LVCMQVQG WK
Sbjct: 185 ASDNSVLFYLREGVGNATNNVAEYRALLLGLRSALDKGFKNVHVLGDSMLVCMQVQGAWK 244

Query: 324 VKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAEANLAVTLPDGEAQ 367
             +  +AELC +  +L + F +F+I H+ R  NSEAD +AN A+ L DG+ Q
Sbjct: 245 TNHPKMAELCKQAKELMNSFKTFDIKHIAREKNSEADKQANSAIFLADGQTQ 289

BLAST of Cp4.1LG11g07390 vs. TAIR10
Match: AT5G51080.1 (AT5G51080.1 RNase H family protein)

HSP 1 Score: 182.6 bits (462), Expect = 4.5e-46
Identity = 90/162 (55.56%), Positives = 125/162 (77.16%), Query Frame = 1

Query: 205 PTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRL 264
           P++   + K  +LE S    A +++E+C +EFDGASKGNPG +GA AVL+  DGS+I ++
Sbjct: 163 PSASMSVEKLAELEPS----ADTSYETCIIEFDGASKGNPGLSGAAAVLKTEDGSLIFKM 222

Query: 265 REGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELC 324
           R+GLGIATNN AEY  ++LGLK+A+EKG+T+I V+ DSKLVCMQ++G WKV +E +++L 
Sbjct: 223 RQGLGIATNNAAEYHGLILGLKHAIEKGYTKIKVKTDSKLVCMQMKGQWKVNHEVLSKLH 282

Query: 325 NEVMKLKDKFLSFEISHVLRNLNSEADAEANLAVTLPDGEAQ 367
            E  +L DK LSFEISHVLR+LNS+AD +AN+A  L +GE +
Sbjct: 283 KEAKQLSDKCLSFEISHVLRSLNSDADEQANMAARLSEGEVE 320

BLAST of Cp4.1LG11g07390 vs. NCBI nr
Match: gi|449452100|ref|XP_004143798.1| (PREDICTED: uncharacterized protein LOC101210930 isoform X1 [Cucumis sativus])

HSP 1 Score: 612.8 bits (1579), Expect = 3.8e-172
Identity = 310/374 (82.89%), Positives = 334/374 (89.30%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNCFSQ STYTR IFR TNL FAASTSIHGC  N YWTSSFH+V +K T LDSLCSRF L
Sbjct: 1   MNCFSQVSTYTRVIFRRTNLVFAASTSIHGCS-NAYWTSSFHNVAVKTTALDSLCSRFGL 60

Query: 61  RCYSSRKLRKG---ASPSPNLDSKPPMEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSI 120
           RCYS+RK RK     SPSP LDS+PP+E +MGDFFVVRKGDVVGVYKSF+DCQAQIGSSI
Sbjct: 61  RCYSTRKPRKPRKPTSPSPKLDSEPPVESEMGDFFVVRKGDVVGVYKSFSDCQAQIGSSI 120

Query: 121 CDLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSLKG 180
           CDLPVSV+KGHSLPKDT EYLASVGLKNALYTIKAADMRPDLF SL PCTFH   TSL G
Sbjct: 121 CDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLAPCTFHGGDTSLTG 180

Query: 181 EASGQDAIKKRSRETIVSENIGSTVLTPTSKDPLRKHVKLEDSVVSQA-PSNHESCFLEF 240
           E SGQDAIKKRSRE IV EN+GSTVLTPT KDP RKH+KLEDS+VS +  SN ESCFLEF
Sbjct: 181 ETSGQDAIKKRSREAIVPENVGSTVLTPTLKDPTRKHIKLEDSIVSHSVSSNRESCFLEF 240

Query: 241 DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRI 300
           DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK AL+KGFTRI
Sbjct: 241 DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRI 300

Query: 301 HVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAEANL 360
           HVQGDSKLVCMQVQGLWK K+EN++ELCNEV KLK+KFLSFE++HVLR+LNSEADA+ANL
Sbjct: 301 HVQGDSKLVCMQVQGLWKAKHENMSELCNEVTKLKNKFLSFEVNHVLRHLNSEADAQANL 360

Query: 361 AVTLPDGEAQEFEE 371
           A+TL +GE QEFE+
Sbjct: 361 ALTLAEGEVQEFED 373

BLAST of Cp4.1LG11g07390 vs. NCBI nr
Match: gi|659131431|ref|XP_008465682.1| (PREDICTED: uncharacterized protein LOC103503315 isoform X1 [Cucumis melo])

HSP 1 Score: 612.8 bits (1579), Expect = 3.8e-172
Identity = 310/378 (82.01%), Positives = 338/378 (89.42%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNC SQ STYTR IFR TNL FAASTSIHGC  NPYW+S+FH+V +K T LDSLCSRF L
Sbjct: 1   MNCLSQVSTYTRVIFRRTNLVFAASTSIHGCS-NPYWSSTFHNVAVKATALDSLCSRFGL 60

Query: 61  RCYSSRKLRKG---ASPSPNLDSKPPMEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSI 120
           RCYS+RK RK     SPSP LDS+PPME +MGDFFVVRKGDVVGVYKSF+DC AQIGSSI
Sbjct: 61  RCYSTRKPRKPRKPTSPSPKLDSEPPMESEMGDFFVVRKGDVVGVYKSFSDCHAQIGSSI 120

Query: 121 CDLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSLKG 180
           CDLPVSV+KGHSLPKD+ EYLAS+GLKNALYTIKAADMRPDLF SLVPCTFHD   SL G
Sbjct: 121 CDLPVSVFKGHSLPKDSEEYLASIGLKNALYTIKAADMRPDLFGSLVPCTFHDGDASLTG 180

Query: 181 EASGQDAIKKRSRETIVSENIGSTVLTPTS-----KDPLRKHVKLEDSVVSQAPSNHESC 240
           E SGQDAIKKRSRE IVSEN+GS+VLTPTS     +DP RKH+KLEDS+VS + SNHESC
Sbjct: 181 ETSGQDAIKKRSREAIVSENVGSSVLTPTSVTPTSEDPTRKHIKLEDSIVSLS-SNHESC 240

Query: 241 FLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKG 300
           FLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK+AL+KG
Sbjct: 241 FLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKFALKKG 300

Query: 301 FTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADA 360
           FTRIHVQGDSKLVCMQVQGLWK KNENI+ELCNEV+KLK+KFLSFE++HVLR+LNSEADA
Sbjct: 301 FTRIHVQGDSKLVCMQVQGLWKAKNENISELCNEVVKLKNKFLSFEVNHVLRHLNSEADA 360

Query: 361 EANLAVTLPDGEAQEFEE 371
           +ANLA+TL DGE QE E+
Sbjct: 361 QANLALTLADGEIQESED 376

BLAST of Cp4.1LG11g07390 vs. NCBI nr
Match: gi|778703066|ref|XP_011655308.1| (PREDICTED: uncharacterized protein LOC101210930 isoform X2 [Cucumis sativus])

HSP 1 Score: 578.2 bits (1489), Expect = 1.0e-161
Identity = 292/350 (83.43%), Positives = 311/350 (88.86%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNCFSQ STYTR IFR TNL FAASTSIHGC  N YWTSSFH+V +K T LDSLCSRF L
Sbjct: 1   MNCFSQVSTYTRVIFRRTNLVFAASTSIHGCS-NAYWTSSFHNVAVKTTALDSLCSRFGL 60

Query: 61  RCYSSRKLRKG---ASPSPNLDSKPPMEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSI 120
           RCYS+RK RK     SPSP LDS+PP+E +MGDFFVVRKGDVVGVYKSF+DCQAQIGSSI
Sbjct: 61  RCYSTRKPRKPRKPTSPSPKLDSEPPVESEMGDFFVVRKGDVVGVYKSFSDCQAQIGSSI 120

Query: 121 CDLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSLKG 180
           CDLPVSV+KGHSLPKDT EYLASVGLKNALYTIKAADMRPDLF SL PCTFH   TSL G
Sbjct: 121 CDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLAPCTFHGGDTSLTG 180

Query: 181 EASGQDAIKKRSRETIVSENIGSTVLTPTSKDPLRKHVKLEDSVVSQA-PSNHESCFLEF 240
           E SGQDAIKKRSRE IV EN+GSTVLTPT KDP RKH+KLEDS+VS +  SN ESCFLEF
Sbjct: 181 ETSGQDAIKKRSREAIVPENVGSTVLTPTLKDPTRKHIKLEDSIVSHSVSSNRESCFLEF 240

Query: 241 DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRI 300
           DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK AL+KGFTRI
Sbjct: 241 DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRI 300

Query: 301 HVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNL 347
           HVQGDSKLVCMQVQGLWK K+EN++ELCNEV KLK+KFLSFE++HVLR L
Sbjct: 301 HVQGDSKLVCMQVQGLWKAKHENMSELCNEVTKLKNKFLSFEVNHVLRKL 349

BLAST of Cp4.1LG11g07390 vs. NCBI nr
Match: gi|659131433|ref|XP_008465683.1| (PREDICTED: uncharacterized protein LOC103503315 isoform X2 [Cucumis melo])

HSP 1 Score: 578.2 bits (1489), Expect = 1.0e-161
Identity = 291/352 (82.67%), Positives = 315/352 (89.49%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNC SQ STYTR IFR TNL FAASTSIHGC  NPYW+S+FH+V +K T LDSLCSRF L
Sbjct: 1   MNCLSQVSTYTRVIFRRTNLVFAASTSIHGCS-NPYWSSTFHNVAVKATALDSLCSRFGL 60

Query: 61  RCYSSRKLRKG---ASPSPNLDSKPPMEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSI 120
           RCYS+RK RK     SPSP LDS+PPME +MGDFFVVRKGDVVGVYKSF+DC AQIGSSI
Sbjct: 61  RCYSTRKPRKPRKPTSPSPKLDSEPPMESEMGDFFVVRKGDVVGVYKSFSDCHAQIGSSI 120

Query: 121 CDLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSLKG 180
           CDLPVSV+KGHSLPKD+ EYLAS+GLKNALYTIKAADMRPDLF SLVPCTFHD   SL G
Sbjct: 121 CDLPVSVFKGHSLPKDSEEYLASIGLKNALYTIKAADMRPDLFGSLVPCTFHDGDASLTG 180

Query: 181 EASGQDAIKKRSRETIVSENIGSTVL-----TPTSKDPLRKHVKLEDSVVSQAPSNHESC 240
           E SGQDAIKKRSRE IVSEN+GS+VL     TPTS+DP RKH+KLEDS+VS + SNHESC
Sbjct: 181 ETSGQDAIKKRSREAIVSENVGSSVLTPTSVTPTSEDPTRKHIKLEDSIVSLS-SNHESC 240

Query: 241 FLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKG 300
           FLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK+AL+KG
Sbjct: 241 FLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKFALKKG 300

Query: 301 FTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLR 345
           FTRIHVQGDSKLVCMQVQGLWK KNENI+ELCNEV+KLK+KFLSFE++HVLR
Sbjct: 301 FTRIHVQGDSKLVCMQVQGLWKAKNENISELCNEVVKLKNKFLSFEVNHVLR 350

BLAST of Cp4.1LG11g07390 vs. NCBI nr
Match: gi|659131435|ref|XP_008465684.1| (PREDICTED: uncharacterized protein LOC103503315 isoform X3 [Cucumis melo])

HSP 1 Score: 578.2 bits (1489), Expect = 1.0e-161
Identity = 291/352 (82.67%), Positives = 315/352 (89.49%), Query Frame = 1

Query: 1   MNCFSQFSTYTRAIFRATNLAFAASTSIHGCRFNPYWTSSFHSVTLKPTPLDSLCSRFRL 60
           MNC SQ STYTR IFR TNL FAASTSIHGC  NPYW+S+FH+V +K T LDSLCSRF L
Sbjct: 1   MNCLSQVSTYTRVIFRRTNLVFAASTSIHGCS-NPYWSSTFHNVAVKATALDSLCSRFGL 60

Query: 61  RCYSSRKLRKG---ASPSPNLDSKPPMEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSI 120
           RCYS+RK RK     SPSP LDS+PPME +MGDFFVVRKGDVVGVYKSF+DC AQIGSSI
Sbjct: 61  RCYSTRKPRKPRKPTSPSPKLDSEPPMESEMGDFFVVRKGDVVGVYKSFSDCHAQIGSSI 120

Query: 121 CDLPVSVYKGHSLPKDTGEYLASVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATSLKG 180
           CDLPVSV+KGHSLPKD+ EYLAS+GLKNALYTIKAADMRPDLF SLVPCTFHD   SL G
Sbjct: 121 CDLPVSVFKGHSLPKDSEEYLASIGLKNALYTIKAADMRPDLFGSLVPCTFHDGDASLTG 180

Query: 181 EASGQDAIKKRSRETIVSENIGSTVL-----TPTSKDPLRKHVKLEDSVVSQAPSNHESC 240
           E SGQDAIKKRSRE IVSEN+GS+VL     TPTS+DP RKH+KLEDS+VS + SNHESC
Sbjct: 181 ETSGQDAIKKRSREAIVSENVGSSVLTPTSVTPTSEDPTRKHIKLEDSIVSLS-SNHESC 240

Query: 241 FLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKG 300
           FLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK+AL+KG
Sbjct: 241 FLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKFALKKG 300

Query: 301 FTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLR 345
           FTRIHVQGDSKLVCMQVQGLWK KNENI+ELCNEV+KLK+KFLSFE++HVLR
Sbjct: 301 FTRIHVQGDSKLVCMQVQGLWKAKNENISELCNEVVKLKNKFLSFEVNHVLR 350
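
The percentages on each HSP statistics line follow directly from the counts (matching or similar residues divided by alignment length). A minimal sketch for re-deriving them is given below; the helper name parse_hsp_stats and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool, and the pattern only covers the line layout shown on this page.

```python
import re

# Hypothetical helper, written for the statistics line layout shown above.
# "Posi?tives" also tolerates the misspelling "Postives" seen in some dumps.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and recompute the percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len)),
        "positives": (int(pos), int(pos_len)),
        # Values as printed on the page
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Values recomputed from the counts
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


line = "Identity = 291/352 (82.67%), Positives = 315/352 (89.49%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Cucumis melo isoform X2 and X3 hits, the recomputed values agree with the reported ones: 291/352 rounds to 82.67% and 315/352 to 89.49%.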

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
Y2253_MYCBO  5.4e-17  42.86     Uncharacterized protein Mb2253c OS=Mycobacterium bovis (strain ATCC BAA-935 / AF...
Y2228_MYCTU  5.4e-17  42.86     Uncharacterized protein Rv2228c OS=Mycobacterium tuberculosis (strain ATCC 25618...
Y2228_MYCTO  5.4e-17  42.86     Uncharacterized protein MT2287 OS=Mycobacterium tuberculosis (strain CDC 1551 / ...
RNH_HALSA    1.6e-16  40.65     Ribonuclease HI OS=Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC...
RNHL_BACSU   2.3e-07  32.26     14.7 kDa ribonuclease H-like protein OS=Bacillus subtilis (strain 168) GN=rnhA P...
Match Name        E-value   Identity  Description
A0A0A0KTZ9_CUCSA  2.7e-172  82.89     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G489310 PE=4 SV=1
A0A061E5R3_THECC  1.8e-107  57.30     RNase H family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_010210 PE=4...
A0A0B0MTD0_GOSAR  1.5e-106  57.75     Uncharacterized protein OS=Gossypium arboreum GN=F383_25913 PE=4 SV=1
A0A0D2RNX8_GOSRA  1.5e-106  57.99     Uncharacterized protein OS=Gossypium raimondii GN=B456_005G185700 PE=4 SV=1
A0A0D2SDN8_GOSRA  2.0e-106  56.46     Uncharacterized protein OS=Gossypium raimondii GN=B456_009G350100 PE=4 SV=1
Match Name   E-value  Identity  Description
AT1G24090.1  4.0e-87  49.05     RNase H family protein
AT3G01410.1  3.3e-73  47.26     Polynucleotidyl transferase, ribonuclease H-like superfamily protein
AT5G51080.1  4.5e-46  55.56     RNase H family protein
Match Name                        E-value   Identity  Description
gi|449452100|ref|XP_004143798.1|  3.8e-172  82.89     PREDICTED: uncharacterized protein LOC101210930 isoform X1 [Cucumis sativus]
gi|659131431|ref|XP_008465682.1|  3.8e-172  82.01     PREDICTED: uncharacterized protein LOC103503315 isoform X1 [Cucumis melo]
gi|778703066|ref|XP_011655308.1|  1.0e-161  83.43     PREDICTED: uncharacterized protein LOC101210930 isoform X2 [Cucumis sativus]
gi|659131433|ref|XP_008465683.1|  1.0e-161  82.67     PREDICTED: uncharacterized protein LOC103503315 isoform X2 [Cucumis melo]
gi|659131435|ref|XP_008465684.1|  1.0e-161  82.67     PREDICTED: uncharacterized protein LOC103503315 isoform X3 [Cucumis melo]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0004523  RNA-DNA hybrid ribonuclease activity
GO:0003676  nucleic acid binding
Vocabulary: INTERPRO
Term        Definition
IPR012337   RNaseH-like_sf
IPR011320   RNase_H1_N
IPR002156   RNaseH_domain
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0051252      regulation of RNA metabolic process
biological_process  GO:0090502      RNA phosphodiester bond hydrolysis, endonucleolytic
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0004523      RNA-DNA hybrid ribonuclease activity
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG11g07390.1  Cp4.1LG11g07390.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description              Source   Source Term        Source Description                 Alignment
IPR002156  Ribonuclease H domain        PROFILE  PS50879            RNASE_H                            coord: 228..359, score: 16
IPR011320  Ribonuclease H1, N-terminal  GENE3D   G3DSA:3.40.970.10                                     coord: 89..133, score: 3.
IPR012337  Ribonuclease H-like domain   GENE3D   G3DSA:3.30.420.10                                     coord: 235..362, score: 5.7
IPR012337  Ribonuclease H-like domain   unknown  SSF53098           Ribonuclease H-like                coord: 231..355, score: 2.91
None       No IPR available             PANTHER  PTHR33033          FAMILY NOT NAMED                   coord: 84..210, score: 4.1E-110; coord: 232..370, score: 4.1E
None       No IPR available             PANTHER  PTHR33033:SF11     RNASE H DOMAIN-CONTAINING PROTEIN  coord: 232..370, score: 4.1E-110; coord: 84..210, score: 4.1E
None       No IPR available             PFAM     PF13456            RVT_3                              coord: 236..357, score: 1.3
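
The Alignment column gives each domain hit as a "coord: start..end" range on the protein. A minimal sketch for turning those ranges into span lengths is shown below; the function name domain_spans is hypothetical, and the code assumes the coordinates are 1-based, inclusive residue positions, which the page itself does not state.

```python
import re
from typing import List, Tuple

# Matches ranges written as "coord: 228..359" (layout taken from the table above).
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(alignment: str) -> List[Tuple[int, int, int]]:
    """Return (start, end, length) for every 'coord: a..b' found in the text."""
    spans = []
    for start, end in COORD_RE.findall(alignment):
        s, e = int(start), int(end)
        # Assumption: both endpoints are included in the span.
        spans.append((s, e, e - s + 1))
    return spans


# PROSITE profile hit for the Ribonuclease H domain (IPR002156)
print(domain_spans("coord: 228..359"))
# PANTHER PTHR33033 reports two separate ranges
print(domain_spans("coord: 84..210 coord: 232..370"))
```

Under that assumption, the PS50879 hit at 228..359 covers 132 residues, and the two PANTHER ranges cover 127 and 139 residues.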

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                      Block
Cp4.1LG11g07390  Silver-seed gourd             carcpeB0191
Cp4.1LG11g07390  Cucurbita pepo (Zucchini)     cpecpeB149
Cp4.1LG11g07390  Cucurbita maxima (Rimu)       cmacpeB181
Cp4.1LG11g07390  Cucurbita moschata (Rifu)     cmocpeB155
Cp4.1LG11g07390  Cucurbita moschata (Rifu)     cmocpeB721
Cp4.1LG11g07390  Watermelon (Charleston Gray)  cpewcgB123