ClCG02G004040.1 (mRNA) Watermelon (Charleston Gray)

Name: ClCG02G004040.1
Type: mRNA
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Acyl-CoA N-acyltransferases (NAT) superfamily protein LENGTH=274
Location: CG_Chr02 : 4130778 .. 4135960 (-)
Sequence length: 1008
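
As a rough illustration of how the Location field can be read, the sketch below parses the string and computes the genomic span. It assumes the coordinates are 1-based and inclusive, as is common for GFF-style annotation; the record itself does not state this. The function name `parse_location` is hypothetical. Under that assumption the span covers the gene including introns, which would explain why it is larger than the listed sequence length of 1008.

```python
import re

# Pattern for location strings of the form "seqid : start .. end (strand)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, hence the +1.
        "length": end - start + 1,
    }


loc = parse_location("CG_Chr02 : 4130778 .. 4135960 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # CG_Chr02 - 5183
```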
The following sequences are available for this feature:

Gene sequence (with intron)

GGAAGAAACAAATGATGACAGTCCTCTCATCTGCTCCTTCTTCTTCCTCCGTTTCTTGGCTTTCATTCTTCTCTTCAAACTCTTTCGCTTACTTCAGAACCCAACCCTCTGCCCCTTCAACCTCCTCTTGCTTTCTCAATCGTCCACCCATTAGAATCTCAAATGTTTTCACCAACCAACAACCTACCTTCCCACTTCACAAATCTAAGTTTAGGGTTTCAGAAGGTGCTTCTGAGGATGAGTTTTGGGCTGCTGCTTGTCTCCGCATTCGCACCTTCAATCACTTCCCCCCCGATTCCTTTGGCATCGATGTGAGTTGTCTAAATTTTATGTTGTATGCTTTTCCGCAATCAATTTATTCAACTGTTTGGTGGAACGTGGATGTTTGATTCTTGAGACCCAATTCTCATTTCTCTGCTTTATTTTAGGTTTAATTATATATAAAAAAATAAAAGAAAAAAACCATCAACTAGGAGGTTGGTTTCAATTATGCCTTCGAATTCTGAAAAGTTTCATTTATTAAAACCCTTAAGTTTGAAAAGTTTTTTCTTTTCAAAACAACCCTTGAAAGCTTTTGTCCTTAAATTCATTGGTGGAAAAGGTAAGTTTGTGTATGTTCATATGTGGTTTAAGTGAACAAATCTAAACACATACGTGAGGTAGAGTTAGCGAAAAAGTCACGAACGAAATAACTGAAGATTTGGGACTTGGTTGTAAGGAGTAAGACGGAAAAAGGTATGGTTTACAGATGCTCATGTGTCTTTATTATTAATATTATTATTTCCATGTATGTATTTAGATAAATACTTTTTGTCTATGAATTTAACCACAAAAGCAAGTTAAAATGAAACTTTAAAAAGTTCAGGTATAATTGAAACCAACTCGATAGTTTAACCATATATATATATATATATATTTTAATTTAACCTTTACTAGCTCATTATCCATAACTACACGAACCTGGTTTCATATTTTCATTTTTAATGTTCTGGAAAGTGGGAACTAATACAAATTTCTTGAAGTCTAATTTAATTTTTAGTTACCATTGTTTTGTAAATTCTCTAAATCCAAATATCGTTCACTTTGAATGGATCTAAAGTTGTGCATGCTTCTATTAGTAGAGTGGATCCAATTTTGTTGGTATATATCTTCGAATGGAATTGAATGGAGAGTGTTGAGGATACCACATCGAAAGGATGAAAACATCTCACAATCTTTATAAGATACATCAGTTGCTCCTTTTGTTGCTAAATGGTTTAGAGATAGAACTTCATGTTTATTGAATATGATATTAGAGCCTCTGAAACCCAAATAGGTAACAAAAGAGGTACAACATTGAGGATTCTACATTGGAAAGAATGAAGAGACCTTACCATTTAGATAAGATATATGTGAAATCTTGTTGTAGAAAGATGAAGGGAATAGTAAATAAAGAAAGATAGAGATGGAAAAATGAACTGTTAGCCCGTCCTACTTAAAGCTTCCATTTCTTATCTTTCGTGACTAAAACTTCATTTTGTATGTGTAATTTTTCTTGCTCTCTATATGATATACTGATGTCGTTCCTTTTTTGTCAAGTTCTATTAGGATCATAAGAAGTACTTGGCTGAGCATGAATTTGAAGCAATGAAAGAACGTATTGCTGGAAAAAGGGTTGGGTTTAAAAGAGTATCTTGCATAAATGCTACTCTTCCATTATCAGAAATATCAACCTTAGCTGAAGATTTATGTTCAACATGTAAGGTCTGTTTATTTCCTTATTGTTTCGATATGAATGTGGTATGTCATGGTCTCCATTCCTAGATTAACTTGGAACCCATGTTTCAATCCGTTCAAATGGTTATGATGTTTGAAGAATAGGAAATTAGTTAAAATACAAACCTATAAATTCTCAAGTTTTCATATCTGAAAAATGATGAAAGAAAAATTGAAGTGGTTGTTTAAAGAAATTAGGACTTTTTGTCCTTTGTTTAAAATGAAAATGTGAATGTTGATATGCA
GGAAAAGCAAGGTTCGTGCGTTAATGAGGCTGCCTCACCTCACATGAAGTGAGTTGCCTCTGTTGCGCTAGACGTGTTAAAACAGTGTATTTGTTTGTCGTTTGCAACAATGTTATGCATACCAAAGTTATGGAAAAAAGATCTTATGAAGATGGGGAAAGAACTAAACATATTCTGGCTATTTTTTCCACATTATGTAAATGTAGGTTATTATAGTGCAAAAAAGCTAGAATGTGATTGTAATGTGATAGGCCCCCGATTTACAAAGGTGTACATGCAAGGGAGGAAAGTGAGCGAGTGAGTGAGGGGGAATTACGTGTACAAGTAGGGGCCGCACTTTGTGTTTGGGCTGTTTGGTTGGGTATATAGCTGGTTGGGTATCTTCTTGCTTCATTTTGGTATGCTCTGTGAGAATTGAGAGAGGTAGGAAGCTGTTTAATCTTTCTTGTTACCTTGCTATCCGACATAAATAAAATTGTTGTTGGGCAAGGCCTATGATAATCCTCTGGCCATGTTTTTTCCTTTCTTGCTGCTGTTGGTTTTGCGTTGAAGCAGATTATGCAACCTTATATATGTCTACTCTTTGAGCTTAGACTGTATTAATTCTACAAACTACAAAGTTGTCCTTTGTTCTGATCCACAATTTGACAATCATTTTCAATTTTAGTTTTCTGATAATGGAGAAGACAGAGTTGTAGTTGGGTCACTTGACCTTAATCAATGTGTAAGGCTTCCAGATGAAATAACAGGACTGAAACCTGAGGTAAATGATTAATTTCTTTATTTTTATAATATTTTTATTTCTGTTTACAGTCTCCTGATTAAAAATCATGACCATGTTTCATTATTATTTTTGAACATCAGAGTTACTCTTTGGCTCAATCTCTGTTCAACCTGGCAAATAATTGACTATCGTTTGGAAAATATTCTTCTGATAATGAACAAGTCTCATTATTATTTATTAGCTCTTGATTTTTGAGTTCTGAATTTGATAGCCAACATGTGTTTTCCAAGATTCAGATAATTGAAATAAAAGTATAGTATTGCTGTTTTTATTGAGAAAAAAAATGAAGGAATACAAGGACATACAAAAAAACCAAGACCACCAAAATCTCCACTAAAGCTACCAATTTAGCTGGAAGAAAAATTACAAAAAGTTTTCAAAATTGAAGTCCAAAGGGATACATGAAACTTCACCACGAACCAAACATCATTAGGGTCTCACTCTACCCTTCCATTCCACCCAAGGCTCCATAACAGATCACACACCCTAGCAAACTACGAAAAATGACCCTTTTTTTTTTTTTAAGGAGGATAGAGGAGGAACTATCTGATCGAATTACTTGTCATCCATATAATAATTGTTTAAGTTCTTTTATACAATAGAAACTAGAAAGGTCGTTCTTGAAATTAGAATGAAGGAAGCCTCTCCATGAAGAAATATCTGAATAAGATGAAGAGTGATGCTGATTTGCTAACCTCATCTTTAAATACAATTGCAGAGATCTGATTCTTGTATTAACCCATATTGAAGAAAGTGTCCATAACAATGTCAATATCAATAAAATGTCAACATGGATGGAAATTTTTGAAAAATAATTGAAAAAAAAACATTTAATAGTGAATTAAAATATTTATTATTTATATTATATTTACATTTGTGACATTTTGTTGCTTATTTTTTATGTATAACAATGGAAATGTCCACCCGACTCACCCCTCATATCAAATATTGAACCCATAAACATGTGGAAATATCAGTAAAATGTTGACTTTTATAAGAGATTTAGATAATAAGTCAAATTTTCATGATATCTAAAATGTTTTTTTTTTCCATCAGGGAATTGGGGCTGATTTTGCAAGGGCATACCTGAGTAATGTATGTGTTGCCAAGGAACTGCAAAGGAATGGGTTGGGTTATGCACTTGTTGCCAAGGCAAAAACAATTGCACAAGATTGGGGTAATTATAACCATCTTTTCACACTTATGTTTTACTCAG
TTTCATTCATGTCTAAAATGAGAGAAACTGTCATGACCCTTTTTTGCTTTATGAATGATTCCCATCTCCTTGTTTATGGCTTCTACTTCACAATGTGATCTCACCATCGCAAACTCATGGTCCGTTTTGGCTTAGGCCCTGAATTTTTAAGTTTGTGTCTATTTGTTTTTTAAATTTTGTAAGTGTCTAATAAATACCTAAGAATTCAAGTGTTTCTTTAACAAGAATACGATCACATTATGGTTAAATTCAACTGTTTAATAAGTTTCTACACTTCAAATTTTGCATTTAATAGAGGTTCCTAAATTTTCAATTTTGTATCTAATAAATCTGTTTTGTTTTATATATTTTTTTTTTAAAAAAATATTTCATTATTGATCTATTAAATGAAAGATTGAATTTTACGTTTAGTATAATCTTAAAATTTCTATTTTATGTTTGGTTGATACATATTTTTTTAAGAAAATTGTTGAATGAGTCAAAAACAATAAATAAATATCTATGTATCTTATTTCTGTTTGTTCTCCCAATTTTTCAAGTATGGTTTTATCATTATTAAGAAAACGACTGAATTTCTAGTAAAATCCTAAAAACAGAAACAATTTTATGAAAACTACATTTTTAAGTTTTTAAAACTTGACTTGTTTTTGAAAACATTAGTCGAAAATAGCTAATAGAACATGTGCTCTTAATTTTTAGAAACCAAATAGTTAATAGAAGCCTTAAAAATCTGTTGGACACAAAATTGGAAGGGGGAGGATCTATTAAAGTTAAGGAGCTTAATGAAGATTAGACTCGTTTTAAAGTCTTGGTACTTAATGATCAATATTTTGACAAGTTTATTAATCAAATAGACACGAACAAAGGTGAGAGTTAGAAAGATTGAAGTTTATTTATGATATAAATAAATGGATGTGATGTGAGGTGAAATGTTTATGTGTTTATGTTATAGGGATAAGCGATCTATACGTTCATGTAGCATTCGACAACGAAGGTGGAAAGAAGCTTTATATGAAAAGTGGTTTTGTTTATGAAAGCGACGAACCAAGTTGGCAAGCCAGGTTTCTTGATCGTCCTCGCAGGATTCTCTTCTGGACTCCTCTCTCTCAATCTCTTCTCTGATTTTTGTTTTTTTCCTAATTTACTAAAATTTTATTCATTGTAGAATTGTATGAATACGGAG

mRNA sequence

GGAAGAAACAAATGATGACAGTCCTCTCATCTGCTCCTTCTTCTTCCTCCGTTTCTTGGCTTTCATTCTTCTCTTCAAACTCTTTCGCTTACTTCAGAACCCAACCCTCTGCCCCTTCAACCTCCTCTTGCTTTCTCAATCGTCCACCCATTAGAATCTCAAATGTTTTCACCAACCAACAACCTACCTTCCCACTTCACAAATCTAAGTTTAGGGTTTCAGAAGGTGCTTCTGAGGATGAGTTTTGGGCTGCTGCTTGTCTCCGCATTCGCACCTTCAATCACTTCCCCCCCGATTCCTTTGGCATCGATGATCATAAGAAGTACTTGGCTGAGCATGAATTTGAAGCAATGAAAGAACGTATTGCTGGAAAAAGGGTTGGGTTTAAAAGAGTATCTTGCATAAATGCTACTCTTCCATTATCAGAAATATCAACCTTAGCTGAAGATTTATGTTCAACATGTAAGGAAAAGCAAGGTTCGTGCGTTAATGAGGCTGCCTCACCTCACATGAAGCCCCCGATTTACAAAGGTGTACATGCAAGGGAGGAAAGTGAGCGATTTTCTGATAATGGAGAAGACAGAGTTGTAGTTGGGTCACTTGACCTTAATCAATGTGTAAGGCTTCCAGATGAAATAACAGGACTGAAACCTGAGGGAATTGGGGCTGATTTTGCAAGGGCATACCTGAGTAATGTATGTGTTGCCAAGGAACTGCAAAGGAATGGGTTGGGTTATGCACTTGTTGCCAAGGCAAAAACAATTGCACAAGATTGGGGGATAAGCGATCTATACGTTCATGTAGCATTCGACAACGAAGGTGGAAAGAAGCTTTATATGAAAAGTGGTTTTGTTTATGAAAGCGACGAACCAAGTTGGCAAGCCAGGTTTCTTGATCGTCCTCGCAGGATTCTCTTCTGGACTCCTCTCTCTCAATCTCTTCTCTGATTTTTGTTTTTTTCCTAATTTACTAAAATTTTATTCATTGTAGAATTGTATGAATACGGAG

Coding sequence (CDS)

ATGATGACAGTCCTCTCATCTGCTCCTTCTTCTTCCTCCGTTTCTTGGCTTTCATTCTTCTCTTCAAACTCTTTCGCTTACTTCAGAACCCAACCCTCTGCCCCTTCAACCTCCTCTTGCTTTCTCAATCGTCCACCCATTAGAATCTCAAATGTTTTCACCAACCAACAACCTACCTTCCCACTTCACAAATCTAAGTTTAGGGTTTCAGAAGGTGCTTCTGAGGATGAGTTTTGGGCTGCTGCTTGTCTCCGCATTCGCACCTTCAATCACTTCCCCCCCGATTCCTTTGGCATCGATGATCATAAGAAGTACTTGGCTGAGCATGAATTTGAAGCAATGAAAGAACGTATTGCTGGAAAAAGGGTTGGGTTTAAAAGAGTATCTTGCATAAATGCTACTCTTCCATTATCAGAAATATCAACCTTAGCTGAAGATTTATGTTCAACATGTAAGGAAAAGCAAGGTTCGTGCGTTAATGAGGCTGCCTCACCTCACATGAAGCCCCCGATTTACAAAGGTGTACATGCAAGGGAGGAAAGTGAGCGATTTTCTGATAATGGAGAAGACAGAGTTGTAGTTGGGTCACTTGACCTTAATCAATGTGTAAGGCTTCCAGATGAAATAACAGGACTGAAACCTGAGGGAATTGGGGCTGATTTTGCAAGGGCATACCTGAGTAATGTATGTGTTGCCAAGGAACTGCAAAGGAATGGGTTGGGTTATGCACTTGTTGCCAAGGCAAAAACAATTGCACAAGATTGGGGGATAAGCGATCTATACGTTCATGTAGCATTCGACAACGAAGGTGGAAAGAAGCTTTATATGAAAAGTGGTTTTGTTTATGAAAGCGACGAACCAAGTTGGCAAGCCAGGTTTCTTGATCGTCCTCGCAGGATTCTCTTCTGGACTCCTCTCTCTCAATCTCTTCTCTGA

Protein sequence

MMTVLSSAPSSSSVSWLSFFSSNSFAYFRTQPSAPSTSSCFLNRPPIRISNVFTNQQPTFPLHKSKFRVSEGASEDEFWAAACLRIRTFNHFPPDSFGIDDHKKYLAEHEFEAMKERIAGKRVGFKRVSCINATLPLSEISTLAEDLCSTCKEKQGSCVNEAASPHMKPPIYKGVHAREESERFSDNGEDRVVVGSLDLNQCVRLPDEITGLKPEGIGADFARAYLSNVCVAKELQRNGLGYALVAKAKTIAQDWGISDLYVHVAFDNEGGKKLYMKSGFVYESDEPSWQARFLDRPRRILFWTPLSQSLL
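
The relationship between the CDS and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard genetic code (the record does not say which table was used) and uses only the first ten and last five codons of the CDS shown above; the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGATGACAGTCCTCTCATCTGCTCCTTCT"))  # MMTVLSSAPS
# Last five codons of the CDS above, ending in the TGA stop codon.
print(translate("CAATCTCTTCTCTGA"))  # QSLL
```

Both fragments agree with the start (MMTVLSSAPS) and end (QSLL) of the protein sequence listed above.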
BLAST of ClCG02G004040.1 vs. TrEMBL
Match: A0A0A0K1B5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G013950 PE=4 SV=1)

HSP 1 Score: 470.3 bits (1209), Expect = 1.8e-129
Identity = 238/304 (78.29%), Positives = 252/304 (82.89%), Query Frame = 1

Query: 5   LSSAPSSSSVSWLSFFSSNSFAYFRTQPSAPSTSSCFLNRPPIRISNVFTNQQPTFPLHK 64
           +SS+ SSS +S LSFFSS+SF+  +T+PS PSTSSCFLN   I+ISN+FTNQQ T  LH 
Sbjct: 13  ISSSSSSSFISKLSFFSSDSFSCLKTKPSVPSTSSCFLNPSSIKISNLFTNQQQTITLHN 72

Query: 65  SKFRVSEGASEDEFWAAACLRIRTFNHFPPDSFGIDDHKKYLAEHEFEAMKERIAGKRVG 124
           SKFRVSEG S DE WAAA LR+RTFN  PPDSFGI DHKKYLAEHEFEAMKERIAGKRVG
Sbjct: 73  SKFRVSEGTSHDELWAAASLRVRTFNQLPPDSFGIHDHKKYLAEHEFEAMKERIAGKRVG 132

Query: 125 FKRVSCINATLPLSEISTLAEDLCSTCKEKQGSCVNEAASPHMKPPIYKGVHAREESERF 184
           FKRVSCINATLPLSEISTLAEDLCSTCK                               F
Sbjct: 133 FKRVSCINATLPLSEISTLAEDLCSTCK-------------------------------F 192

Query: 185 SDNGEDRVVVGSLDLNQCVRLPDEITGLKPEGIGADFARAYLSNVCVAKELQRNGLGYAL 244
           SDNGEDRVVVGSLD+NQCVRLPDEITG+KPEGIGADFARAYLSNVCVAKELQRNGLGYAL
Sbjct: 193 SDNGEDRVVVGSLDINQCVRLPDEITGMKPEGIGADFARAYLSNVCVAKELQRNGLGYAL 252

Query: 245 VAKAKTIAQDWGISDLYVHVAFDNEGGKKLYMKSGFVYESDEPSWQARFLDRPRRILFWT 304
           +AKAKTIA DWGISDLYVHVAF+NEGGKKLYMKSGFVYESDEPSWQARFLDRPRRILFWT
Sbjct: 253 IAKAKTIALDWGISDLYVHVAFNNEGGKKLYMKSGFVYESDEPSWQARFLDRPRRILFWT 285

Query: 305 PLSQ 309
           PLSQ
Sbjct: 313 PLSQ 285
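
The percentages on each HSP summary line are the match counts divided by the alignment length. The sketch below parses one such line and recomputes them. The regular expression and the name `parse_hsp_stats` are illustrative assumptions, not part of any BLAST tool; the pattern also tolerates the "Postives" spelling found in some report exports.

```python
import re

# Matches e.g. "Identity = 238/304 (78.29%), Positives = 252/304 (82.89%)".
HSP_STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed identity and positive percentages (hypothetical helper)."""
    match = HSP_STATS_RE.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, length, _, positive, pos_length, _ = match.groups()
    return {
        "identity_pct": round(100 * int(identical) / int(length), 2),
        "positives_pct": round(100 * int(positive) / int(pos_length), 2),
    }


stats = parse_hsp_stats(
    "Identity = 238/304 (78.29%), Positives = 252/304 (82.89%), Query Frame = 1"
)
print(stats)  # {'identity_pct': 78.29, 'positives_pct': 82.89}
```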

BLAST of ClCG02G004040.1 vs. TrEMBL
Match: M5XG35_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010119mg PE=4 SV=1)

HSP 1 Score: 334.0 bits (855), Expect = 2.0e-88
Identity = 167/270 (61.85%), Positives = 198/270 (73.33%), Query Frame = 1

Query: 36  STSSCFLNRPPIRISNVFTNQQPTFPLHKSKFRVSEGASEDEFWAAACLRIRTFNHFPPD 95
           S +S  + RPPI    + T++Q      +S   V+EG+SE E WAAACLR+R+F HF P 
Sbjct: 17  SPASLRVRRPPITACQLCTHKQSVRQFDQSILTVAEGSSESELWAAACLRVRSFYHFKPS 76

Query: 96  SFGIDDHKKYLAEHEFEAMKERIAGKRVGFKRVSCINATLPLSEIST--LAEDLCSTCKE 155
            FG+ DH++YLAE E EAMKER+ GKR GF++VSCINAT+PLS+IS+  +++D CS+CK 
Sbjct: 77  MFGLQDHRRYLAERELEAMKERVGGKRKGFRKVSCINATVPLSQISSPSVSDDFCSSCK- 136

Query: 156 KQGSCVNEAASPHMKPPIYKGVHAREESERFSDNGEDRVVVGSLDLNQCVRLPDEITGLK 215
                                         F++NGEDRVVVG+LDLNQCV LPDEITG +
Sbjct: 137 ------------------------------FNNNGEDRVVVGTLDLNQCVSLPDEITGNR 196

Query: 216 PEGIGADFARAYLSNVCVAKELQRNGLGYALVAKAKTIAQDWGISDLYVHVAFDNEGGKK 275
           PEGIGADFARAYLSNVCVAKEL RNGLGYALVAK+K +AQ+WGISDLYVHVA DNE  KK
Sbjct: 197 PEGIGADFARAYLSNVCVAKELHRNGLGYALVAKSKLVAQEWGISDLYVHVAVDNEPAKK 255

Query: 276 LYMKSGFVYESDEPSWQARFLDRPRRILFW 304
           LYMKSGFVYE DEP+WQARFLDRPRRIL W
Sbjct: 257 LYMKSGFVYEKDEPAWQARFLDRPRRILLW 255

BLAST of ClCG02G004040.1 vs. TrEMBL
Match: A0A067F4Q5_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g023633mg PE=4 SV=1)

HSP 1 Score: 334.0 bits (855), Expect = 2.0e-88
Identity = 178/307 (57.98%), Positives = 218/307 (71.01%), Query Frame = 1

Query: 2   MTVLSSAPSSSSVSWLSFFSSNSFAYFRTQPSAPSTSSCFLNRPPIRISNVFT--NQQPT 61
           + VLSS+ S S+ S +S    ++    R++ SA + S  F  RP I + +V T  +Q+ +
Sbjct: 3   VAVLSSSISCSTTSIISLNHKHN----RSKFSAFTLSLRFPLRP-INLLHVCTPPHQEDS 62

Query: 62  FPLHKSKFRVSEGASEDEFWAAACLRIRTFNHFPPDSFGIDDHKKYLAEHEFEAMKERIA 121
             + KS   V E  +ED+ WAAACLR+R+F+ F PDSFG+ DHKK+LAE EFEAMKERIA
Sbjct: 63  LSIDKSSLVVDETTAEDQLWAAACLRVRSFHQFDPDSFGVQDHKKHLAEREFEAMKERIA 122

Query: 122 GKRVGFKRVSCINATLPLSEISTLAEDLCSTCKEKQGSCVNEAASPHMKPPIYKGVHARE 181
           GKR  F+ V+CINATLPLS+IS+++E+LC+ CK                           
Sbjct: 123 GKRKEFRTVACINATLPLSQISSVSEELCAECK--------------------------- 182

Query: 182 ESERFSDNGEDRVVVGSLDLNQCVRLPDEITGLKPEGIGADFARAYLSNVCVAKELQRNG 241
               F+D+GEDRVVVG+LDLNQC RLPDEITG KPEGIG DFARAYLSNVCVAKEL RNG
Sbjct: 183 ----FTDDGEDRVVVGTLDLNQCYRLPDEITGKKPEGIGGDFARAYLSNVCVAKELHRNG 242

Query: 242 LGYALVAKAKTIAQDWGISDLYVHVAFDNEGGKKLYMKSGFVYESDEPSWQARFLDRPRR 301
           LGY +VAK+K +AQ WGISDLYVHVAFDNE  KKLYMKSGF++E+DEP+W ARFLDRPRR
Sbjct: 243 LGYEIVAKSKLVAQGWGISDLYVHVAFDNEPAKKLYMKSGFIFENDEPAWHARFLDRPRR 273

Query: 302 ILFWTPL 307
           IL W  L
Sbjct: 303 ILLWIGL 273

BLAST of ClCG02G004040.1 vs. TrEMBL
Match: W9RVU4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_023026 PE=4 SV=1)

HSP 1 Score: 332.0 bits (850), Expect = 7.7e-88
Identity = 175/304 (57.57%), Positives = 210/304 (69.08%), Query Frame = 1

Query: 6   SSAPSSSSVSWLSFFSSNSFAYFRTQPSAPSTSSCFLNRPPIRISNVFTNQQ---PTFPL 65
           SS P  SS  W    SS+S +   T  +A     C     PI  S++ T+ Q   P+  +
Sbjct: 11  SSHPFPSSDRWNCRLSSSSSSPLHTSMAASRRLRC----RPIAASSLCTDNQTIPPSSAI 70

Query: 66  HKSKFRVSEGASEDEFWAAACLRIRTFNHFPPDSFGIDDHKKYLAEHEFEAMKERIAGKR 125
            +S   V+E  SED+ WAAA LR+R+F  F P S+ I+DHKKYL E EFEA+KERIAG+R
Sbjct: 71  DRSAVSVAEAFSEDQLWAAASLRVRSFYEFNPSSYRIEDHKKYLTEREFEALKERIAGRR 130

Query: 126 VGFKRVSCINATLPLSEISTLAEDLCSTCKEKQGSCVNEAASPHMKPPIYKGVHAREESE 185
             F+RVSCINAT+PLS+IS L++DLC++CK                              
Sbjct: 131 EEFRRVSCINATVPLSQISKLSDDLCASCK------------------------------ 190

Query: 186 RFSDNGEDRVVVGSLDLNQCVRLPDEITGLKPEGIGADFARAYLSNVCVAKELQRNGLGY 245
            FS NGEDRVVVG+LDLNQC+RLPDEI G KP+GIGADFARAYLSNVCVA EL R+GLGY
Sbjct: 191 -FSSNGEDRVVVGTLDLNQCIRLPDEIVGKKPQGIGADFARAYLSNVCVAAELHRHGLGY 250

Query: 246 ALVAKAKTIAQDWGISDLYVHVAFDNEGGKKLYMKSGFVYESDEPSWQARFLDRPRRILF 305
           A++AK+K +AQ+WGISDLYVHVA DNE  KKLY+KSGFVYESDEP+WQARFLDRPRRIL 
Sbjct: 251 AVIAKSKLVAQEWGISDLYVHVAVDNEPAKKLYLKSGFVYESDEPAWQARFLDRPRRILL 279

Query: 306 WTPL 307
           WT L
Sbjct: 311 WTGL 279

BLAST of ClCG02G004040.1 vs. TrEMBL
Match: D7U1K2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0037g01280 PE=4 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 2.9e-87
Identity = 176/302 (58.28%), Positives = 213/302 (70.53%), Query Frame = 1

Query: 5   LSSAPSSSSVSWLSFFSSNSFAYFRTQPSAPSTSSCFLNRPPIRISNVFTNQQPTFPLHK 64
           L ++ SSS++     +SS+S  + R +          L RP I  S +   Q  TF + K
Sbjct: 17  LPTSTSSSNLCCPRIYSSSSSYHLRRR----------LLRPLIA-SQLCAPQ--TFKIDK 76

Query: 65  SKFRVSEGASEDEFWAAACLRIRTFNHFPPDSFGIDDHKKYLAEHEFEAMKERIAGKRVG 124
           S   V+E  SED+ WAAACLRIR+F  F P S+GIDDHK+YLAE EFEA+KER+AGKR G
Sbjct: 77  SSLVVAETVSEDQLWAAACLRIRSFYQFGP-SYGIDDHKRYLAEREFEALKERVAGKREG 136

Query: 125 FKRVSCINATLPLSEISTLAEDLCSTCKEKQGSCVNEAASPHMKPPIYKGVHAREESERF 184
           F+RVSCINAT+PLSEIS+ ++DLC+ CK                               F
Sbjct: 137 FRRVSCINATIPLSEISSFSDDLCAACK-------------------------------F 196

Query: 185 SDNGEDRVVVGSLDLNQCVRLPDEITGLKPEGIGADFARAYLSNVCVAKELQRNGLGYAL 244
           + NGEDRVV+G+LDLNQCV LPDEITG+KP+GIGADF RAYLSNVCVAKEL RNGLGYAL
Sbjct: 197 THNGEDRVVIGTLDLNQCVSLPDEITGMKPQGIGADFLRAYLSNVCVAKELHRNGLGYAL 256

Query: 245 VAKAKTIAQDWGISDLYVHVAFDNEGGKKLYMKSGFVYESDEPSWQARFLDRPRRILFWT 304
           VAK+K +AQ+WGI+DLYVH A DNE  K+LYMKSGF+YE+DEP+W+ARFLDRPRRIL WT
Sbjct: 257 VAKSKMVAQEWGITDLYVHFAVDNEPAKQLYMKSGFIYENDEPAWKARFLDRPRRILLWT 273

Query: 305 PL 307
            L
Sbjct: 317 GL 273

BLAST of ClCG02G004040.1 vs. TAIR10
Match: AT4G28030.1 (AT4G28030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 303.5 bits (776), Expect = 1.5e-82
Identity = 165/309 (53.40%), Positives = 196/309 (63.43%), Query Frame = 1

Query: 1   MMTVLSSAPSSSSVSWLSFFSSNSFAYFRTQPSAPSTSSCFLNRPPIRISNVFTNQQPTF 60
           M  + SS PSSSS++   F   N+    R+  S PS    F  RP    S++        
Sbjct: 1   MAFLCSSLPSSSSIA--IFGDPNTDGSSRSYLSIPSLKLRF--RPVAASSHICAPA---- 60

Query: 61  PLHKSKFRVSEGASEDEFWAAACLRIRTFNHFPPDSFGIDDHKKYLAEHEFEAMKERIAG 120
            + KS F +SE  SEDE WAAACLR+RTFN   P ++ I DH++YLAE EFEA+KER +G
Sbjct: 61  -IDKSTFVISESVSEDELWAAACLRVRTFNELNPSAYNIQDHRRYLAEREFEALKERTSG 120

Query: 121 KRVGFKRVSCINATLPLSEISTLAEDLCSTCKEKQGSCVNEAASPHMKPPIYKGVHAREE 180
           KR GF RV+CINATLPLS++S+  EDLCS CK                            
Sbjct: 121 KREGFTRVACINATLPLSQLSSSFEDLCSACK---------------------------- 180

Query: 181 SERFSDNGEDRVVVGSLDLNQCVRLPDEITGLKPEGIGADFARAYLSNVCVAKELQRNGL 240
              FSD  EDRVVVGSLDLNQC  LPDEI G KPEGIG DFARAYLSNVCVAKEL RNG+
Sbjct: 181 ---FSDGIEDRVVVGSLDLNQCRWLPDEIAGTKPEGIGVDFARAYLSNVCVAKELHRNGV 240

Query: 241 GYALVAKAKTIAQDWGISDLYVHVAFDNEGGKKLYMKSGFVYESDEPSWQARFLDRPRRI 300
           GY L+ K+K +A +WGI+D+YVHV  DNE  K LYMKSGF  E+ EP+WQAR+L+RP+R+
Sbjct: 241 GYKLIDKSKRVAGEWGITDMYVHVTVDNEAAKSLYMKSGFEQETAEPAWQARYLNRPQRL 269

Query: 301 LFWTPLSQS 310
           L W  L  S
Sbjct: 301 LLWLALPTS 269

BLAST of ClCG02G004040.1 vs. TAIR10
Match: AT2G39000.1 (AT2G39000.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 49.3 bits (116), Expect = 5.0e-06
Identity = 22/57 (38.60%), Positives = 32/57 (56.14%), Query Frame = 1

Query: 224 AYLSNVCVAKELQRNGLGYALVAKAKTIAQDWGISDLYVHVAFDNEGGKKLYMKSGF 281
           AY+SNV V +  +R G+   L+ KA+ +A++WG   + +H   +N G  KLY   GF
Sbjct: 194 AYVSNVAVRENFRRKGIAKRLIWKAEALAKNWGCRAIGLHCDLNNLGATKLYKDQGF 250

BLAST of ClCG02G004040.1 vs. NCBI nr
Match: gi|659094224|ref|XP_008447948.1| (PREDICTED: uncharacterized protein LOC103490280 isoform X1 [Cucumis melo])

HSP 1 Score: 476.1 bits (1224), Expect = 4.7e-131
Identity = 246/315 (78.10%), Positives = 256/315 (81.27%), Query Frame = 1

Query: 2   MTVLSSAP--------SSSSVSWLSFFSSNSFAYFRTQPSAPSTSSCFLNRPPIRISNVF 61
           MTVLSS P        SSSS+S LSFFSSNSF+  RT+PS PSTSSCFLNR  IRISN+F
Sbjct: 1   MTVLSSPPPPLCTSSSSSSSISKLSFFSSNSFSCLRTKPSVPSTSSCFLNRSSIRISNLF 60

Query: 62  TNQQPTFPLHKSKFRVSEGASEDEFWAAACLRIRTFNHFPPDSFGIDDHKKYLAEHEFEA 121
           TNQQ T  LH S FRVSEG S DE WAAA LR+RTFN FPPDSF I DHKKYLAEHEFEA
Sbjct: 61  TNQQQTITLHNSNFRVSEGTSHDELWAAASLRVRTFNQFPPDSFAIHDHKKYLAEHEFEA 120

Query: 122 MKERIAGKRVGFKRVSCINATLPLSEISTLAEDLCSTCKEKQGSCVNEAASPHMKPPIYK 181
           MKERIAGKRVGFKRVSCINATLPLSEISTLAEDLCSTCK                     
Sbjct: 121 MKERIAGKRVGFKRVSCINATLPLSEISTLAEDLCSTCK--------------------- 180

Query: 182 GVHAREESERFSDNGEDRVVVGSLDLNQCVRLPDEITGLKPEGIGADFARAYLSNVCVAK 241
                     FSD+G DRVVVGSLDLNQCVRLPDEITG+KPEGIGADFARAYLSNVCVAK
Sbjct: 181 ----------FSDSGGDRVVVGSLDLNQCVRLPDEITGMKPEGIGADFARAYLSNVCVAK 240

Query: 242 ELQRNGLGYALVAKAKTIAQDWGISDLYVHVAFDNEGGKKLYMKSGFVYESDEPSWQARF 301
           ELQRNGLGYAL+A+AKTIAQDWGISDLYVHVAF+NEGGKKLYMKSGFVYESDEPSWQARF
Sbjct: 241 ELQRNGLGYALIAEAKTIAQDWGISDLYVHVAFNNEGGKKLYMKSGFVYESDEPSWQARF 284

Query: 302 LDRPRRILFWTPLSQ 309
           LDRPRRILFWTPLSQ
Sbjct: 301 LDRPRRILFWTPLSQ 284

BLAST of ClCG02G004040.1 vs. NCBI nr
Match: gi|778722997|ref|XP_011658599.1| (PREDICTED: uncharacterized protein LOC101206225 [Cucumis sativus])

HSP 1 Score: 470.3 bits (1209), Expect = 2.6e-129
Identity = 238/304 (78.29%), Positives = 252/304 (82.89%), Query Frame = 1

Query: 5   LSSAPSSSSVSWLSFFSSNSFAYFRTQPSAPSTSSCFLNRPPIRISNVFTNQQPTFPLHK 64
           +SS+ SSS +S LSFFSS+SF+  +T+PS PSTSSCFLN   I+ISN+FTNQQ T  LH 
Sbjct: 13  ISSSSSSSFISKLSFFSSDSFSCLKTKPSVPSTSSCFLNPSSIKISNLFTNQQQTITLHN 72

Query: 65  SKFRVSEGASEDEFWAAACLRIRTFNHFPPDSFGIDDHKKYLAEHEFEAMKERIAGKRVG 124
           SKFRVSEG S DE WAAA LR+RTFN  PPDSFGI DHKKYLAEHEFEAMKERIAGKRVG
Sbjct: 73  SKFRVSEGTSHDELWAAASLRVRTFNQLPPDSFGIHDHKKYLAEHEFEAMKERIAGKRVG 132

Query: 125 FKRVSCINATLPLSEISTLAEDLCSTCKEKQGSCVNEAASPHMKPPIYKGVHAREESERF 184
           FKRVSCINATLPLSEISTLAEDLCSTCK                               F
Sbjct: 133 FKRVSCINATLPLSEISTLAEDLCSTCK-------------------------------F 192

Query: 185 SDNGEDRVVVGSLDLNQCVRLPDEITGLKPEGIGADFARAYLSNVCVAKELQRNGLGYAL 244
           SDNGEDRVVVGSLD+NQCVRLPDEITG+KPEGIGADFARAYLSNVCVAKELQRNGLGYAL
Sbjct: 193 SDNGEDRVVVGSLDINQCVRLPDEITGMKPEGIGADFARAYLSNVCVAKELQRNGLGYAL 252

Query: 245 VAKAKTIAQDWGISDLYVHVAFDNEGGKKLYMKSGFVYESDEPSWQARFLDRPRRILFWT 304
           +AKAKTIA DWGISDLYVHVAF+NEGGKKLYMKSGFVYESDEPSWQARFLDRPRRILFWT
Sbjct: 253 IAKAKTIALDWGISDLYVHVAFNNEGGKKLYMKSGFVYESDEPSWQARFLDRPRRILFWT 285

Query: 305 PLSQ 309
           PLSQ
Sbjct: 313 PLSQ 285

BLAST of ClCG02G004040.1 vs. NCBI nr
Match: gi|641843474|gb|KDO62374.1| (hypothetical protein CISIN_1g023633mg [Citrus sinensis])

HSP 1 Score: 334.0 bits (855), Expect = 2.9e-88
Identity = 178/307 (57.98%), Positives = 218/307 (71.01%), Query Frame = 1

Query: 2   MTVLSSAPSSSSVSWLSFFSSNSFAYFRTQPSAPSTSSCFLNRPPIRISNVFT--NQQPT 61
           + VLSS+ S S+ S +S    ++    R++ SA + S  F  RP I + +V T  +Q+ +
Sbjct: 3   VAVLSSSISCSTTSIISLNHKHN----RSKFSAFTLSLRFPLRP-INLLHVCTPPHQEDS 62

Query: 62  FPLHKSKFRVSEGASEDEFWAAACLRIRTFNHFPPDSFGIDDHKKYLAEHEFEAMKERIA 121
             + KS   V E  +ED+ WAAACLR+R+F+ F PDSFG+ DHKK+LAE EFEAMKERIA
Sbjct: 63  LSIDKSSLVVDETTAEDQLWAAACLRVRSFHQFDPDSFGVQDHKKHLAEREFEAMKERIA 122

Query: 122 GKRVGFKRVSCINATLPLSEISTLAEDLCSTCKEKQGSCVNEAASPHMKPPIYKGVHARE 181
           GKR  F+ V+CINATLPLS+IS+++E+LC+ CK                           
Sbjct: 123 GKRKEFRTVACINATLPLSQISSVSEELCAECK--------------------------- 182

Query: 182 ESERFSDNGEDRVVVGSLDLNQCVRLPDEITGLKPEGIGADFARAYLSNVCVAKELQRNG 241
               F+D+GEDRVVVG+LDLNQC RLPDEITG KPEGIG DFARAYLSNVCVAKEL RNG
Sbjct: 183 ----FTDDGEDRVVVGTLDLNQCYRLPDEITGKKPEGIGGDFARAYLSNVCVAKELHRNG 242

Query: 242 LGYALVAKAKTIAQDWGISDLYVHVAFDNEGGKKLYMKSGFVYESDEPSWQARFLDRPRR 301
           LGY +VAK+K +AQ WGISDLYVHVAFDNE  KKLYMKSGF++E+DEP+W ARFLDRPRR
Sbjct: 243 LGYEIVAKSKLVAQGWGISDLYVHVAFDNEPAKKLYMKSGFIFENDEPAWHARFLDRPRR 273

Query: 302 ILFWTPL 307
           IL W  L
Sbjct: 303 ILLWIGL 273

BLAST of ClCG02G004040.1 vs. NCBI nr
Match: gi|596202432|ref|XP_007223715.1| (hypothetical protein PRUPE_ppa010119mg [Prunus persica])

HSP 1 Score: 334.0 bits (855), Expect = 2.9e-88
Identity = 167/270 (61.85%), Positives = 198/270 (73.33%), Query Frame = 1

Query: 36  STSSCFLNRPPIRISNVFTNQQPTFPLHKSKFRVSEGASEDEFWAAACLRIRTFNHFPPD 95
           S +S  + RPPI    + T++Q      +S   V+EG+SE E WAAACLR+R+F HF P 
Sbjct: 17  SPASLRVRRPPITACQLCTHKQSVRQFDQSILTVAEGSSESELWAAACLRVRSFYHFKPS 76

Query: 96  SFGIDDHKKYLAEHEFEAMKERIAGKRVGFKRVSCINATLPLSEIST--LAEDLCSTCKE 155
            FG+ DH++YLAE E EAMKER+ GKR GF++VSCINAT+PLS+IS+  +++D CS+CK 
Sbjct: 77  MFGLQDHRRYLAERELEAMKERVGGKRKGFRKVSCINATVPLSQISSPSVSDDFCSSCK- 136

Query: 156 KQGSCVNEAASPHMKPPIYKGVHAREESERFSDNGEDRVVVGSLDLNQCVRLPDEITGLK 215
                                         F++NGEDRVVVG+LDLNQCV LPDEITG +
Sbjct: 137 ------------------------------FNNNGEDRVVVGTLDLNQCVSLPDEITGNR 196

Query: 216 PEGIGADFARAYLSNVCVAKELQRNGLGYALVAKAKTIAQDWGISDLYVHVAFDNEGGKK 275
           PEGIGADFARAYLSNVCVAKEL RNGLGYALVAK+K +AQ+WGISDLYVHVA DNE  KK
Sbjct: 197 PEGIGADFARAYLSNVCVAKELHRNGLGYALVAKSKLVAQEWGISDLYVHVAVDNEPAKK 255

Query: 276 LYMKSGFVYESDEPSWQARFLDRPRRILFW 304
           LYMKSGFVYE DEP+WQARFLDRPRRIL W
Sbjct: 257 LYMKSGFVYEKDEPAWQARFLDRPRRILLW 255

BLAST of ClCG02G004040.1 vs. NCBI nr
Match: gi|645233887|ref|XP_008223557.1| (PREDICTED: uncharacterized protein LOC103323347 [Prunus mume])

HSP 1 Score: 332.8 bits (852), Expect = 6.4e-88
Identity = 169/276 (61.23%), Positives = 200/276 (72.46%), Query Frame = 1

Query: 30  TQPSAPSTSSCFLNRPPIRISNVFTNQQPTFPLHKSKFRVSEGASEDEFWAAACLRIRTF 89
           TQ SA   S C + R PI    + T++Q      +S   V+EG+SE E WAAACLR+R+F
Sbjct: 12  TQFSASPASLC-VRRRPITACQLCTHKQSVRHFDQSTLTVAEGSSESELWAAACLRVRSF 71

Query: 90  NHFPPDSFGIDDHKKYLAEHEFEAMKERIAGKRVGFKRVSCINATLPLSEIST--LAEDL 149
            HF P  FG+ DH++YLAE E EAMKER+ GKR GF++VSCINAT+PLS+IS+  +++D 
Sbjct: 72  YHFKPSMFGLQDHRRYLAERELEAMKERVGGKRKGFRKVSCINATIPLSQISSPSVSDDF 131

Query: 150 CSTCKEKQGSCVNEAASPHMKPPIYKGVHAREESERFSDNGEDRVVVGSLDLNQCVRLPD 209
           CS+CK                               F++NGEDRVVVG+LDLNQCV LPD
Sbjct: 132 CSSCK-------------------------------FNNNGEDRVVVGTLDLNQCVSLPD 191

Query: 210 EITGLKPEGIGADFARAYLSNVCVAKELQRNGLGYALVAKAKTIAQDWGISDLYVHVAFD 269
           EITG +PEGIGADFARAYLSNVCVAKEL RNGLGYA+VAK+K +AQ+WGISDLYVHVA D
Sbjct: 192 EITGNRPEGIGADFARAYLSNVCVAKELHRNGLGYAVVAKSKLVAQEWGISDLYVHVAVD 251

Query: 270 NEGGKKLYMKSGFVYESDEPSWQARFLDRPRRILFW 304
           NE  KKLYMKSGFVYE DEP+WQARFLDRPRRIL W
Sbjct: 252 NEPAKKLYMKSGFVYEKDEPAWQARFLDRPRRILLW 255

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
A0A0A0K1B5_CUCSA  1.8e-129  78.29         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G013950 PE=4 SV=1
M5XG35_PRUPE      2.0e-88   61.85         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010119mg PE=4 SV=1
A0A067F4Q5_CITSI  2.0e-88   57.98         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g023633mg PE=4 SV=1
W9RVU4_9ROSA      7.7e-88   57.57         Uncharacterized protein OS=Morus notabilis GN=L484_023026 PE=4 SV=1
D7U1K2_VITVI      2.9e-87   58.28         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0037g01280 PE=4 SV=...

Match Name   E-value  Identity (%)  Description
AT4G28030.1  1.5e-82  53.40         Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G39000.1  5.0e-06  38.60         Acyl-CoA N-acyltransferases (NAT) superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|659094224|ref|XP_008447948.1|  4.7e-131  78.10         PREDICTED: uncharacterized protein LOC103490280 isoform X1 [Cucumis melo]
gi|778722997|ref|XP_011658599.1|  2.6e-129  78.29         PREDICTED: uncharacterized protein LOC101206225 [Cucumis sativus]
gi|641843474|gb|KDO62374.1|       2.9e-88   57.98         hypothetical protein CISIN_1g023633mg [Citrus sinensis]
gi|596202432|ref|XP_007223715.1|  2.9e-88   61.85         hypothetical protein PRUPE_ppa010119mg [Prunus persica]
gi|645233887|ref|XP_008223557.1|  6.4e-88   61.23         PREDICTED: uncharacterized protein LOC103323347 [Prunus mume]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR000182  GNAT_dom
IPR016181  Acyl_CoA_acyltransferase
Vocabulary: Molecular Function
Term        Definition
GO:0008080  N-acetyltransferase activity
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0042967      acyl-carrier-protein biosynthetic process
biological_process  GO:0019752      carboxylic acid metabolic process
biological_process  GO:0042430      indole-containing compound metabolic process
biological_process  GO:0050896      response to stimulus
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0008080      N-acetyltransferase activity

This mRNA is a part of the following gene feature(s):

Feature Name   Unique Name    Type
ClCG02G004040  ClCG02G004040  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name     Unique Name              Type
ClCG02G004040.1  ClCG02G004040.1-protein  polypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                      Unique Name                       Type
ClCG02G004040.1.three_prime_UTR1  ClCG02G004040.1.three_prime_UTR1  three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
ClCG02G004040.1.cds7  ClCG02G004040.1.cds7  CDS
ClCG02G004040.1.cds6  ClCG02G004040.1.cds6  CDS
ClCG02G004040.1.cds5  ClCG02G004040.1.cds5  CDS
ClCG02G004040.1.cds4  ClCG02G004040.1.cds4  CDS
ClCG02G004040.1.cds3  ClCG02G004040.1.cds3  CDS
ClCG02G004040.1.cds2  ClCG02G004040.1.cds2  CDS
ClCG02G004040.1.cds1  ClCG02G004040.1.cds1  CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                     Unique Name                      Type
ClCG02G004040.1.five_prime_UTR1  ClCG02G004040.1.five_prime_UTR1  five_prime_UTR


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description             Source   Source Term        Source Description                 Alignment
IPR000182  GNAT domain                 PFAM     PF00583            Acetyltransf_1                     coord: 191..280; score: 1.3
IPR000182  GNAT domain                 PROFILE  PS51186            GNAT                               coord: 138..299; score: 11
IPR016181  Acyl-CoA N-acyltransferase  GENE3D   G3DSA:3.40.630.30  -                                  coord: 221..287; score: 4.6
IPR016181  Acyl-CoA N-acyltransferase  unknown  SSF55729           Acyl-CoA N-acyltransferases (Nat)  coord: 218..291; score: 6.49
None       No IPR available            PANTHER  PTHR23091          N-TERMINAL ACETYLTRANSFERASE       coord: 38..133; score: 8.7E-95
                                                                                                      coord: 188..309; score: 8.7
None       No IPR available            PANTHER  PTHR23091:SF234    SUBFAMILY NOT NAMED                coord: 188..309; score: 8.7E-95
                                                                                                      coord: 38..133; score: 8.7