Cla003970 (gene) Watermelon (97103) v1

Name: Cla003970
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Unknown Protein (AHRD V1)
Location: Chr7 : 3062322 .. 3063787 (+)
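The location field can be read as chromosome, start, end, and strand. The following is a minimal sketch for parsing it, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr7 : 3062322 .. 3063787 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: both coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Chr7 : 3062322 .. 3063787 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the feature spans 1,466 bp on the plus strand of Chr7.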
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCAGATTCTAACTACTGGCTTCCCCCTCACTTTCTTTCCGACCACGACAACCTCACCGGGAAACCCACCTCCGTCGTCGCCTTCCCCACCGACTTTCCCTATGACTTCAACTCCTCCGCCGTTCATTCCCCTGTCGAATCTGTTCTTGGCGACGACGATAACGACGACGAGGAGGACTTTCTCGCCGCGTTGACGCAACGGCTTACTCAGTCCACGCTTCGTGATTCTCCAAAACTGCCGTCTGTCAACAAATCCCAAGTAGCCATCTTTTGATTTTTCTTTAATTTACTTGATTTTTGAGTCTTCAAATCGGTTTTTTGACTTCCGGCGACTCTGTTTTTTTTTTTTTTTCTTCTCCAGGCGAAGACGGCTATGGCTGGATCTCCTCAGTCTACTCTTACTGGGGTTGGAAGTTGGTCGGCTTGGAGTTCCGTGTCGAGCGACGGAAGCCCAAACGGACCGTCTCAGGCGCCATCTCCTCCGACCACCCCTTTCGGCGGCGATAATAACACTTGGGATCTTATCTATGCTGCTGCAGGGCAAGTTGCGAGGCTCAAAATGAATACTAATAGAGATGGGATTATTGGTCCTTCTCAAAGCTCTTCAAATCTTGTTTCTTCCATGAAAAACGTTGGATTTTACTCTCACCCTTCACAGGTGATTTTTACTCCCTTACTCTCTTTTTTGCTCTATTTTTCGATCTATAATTCTTAAACTATGTTTTATTTCCGTCTGTTTTCAGTTTGGAACAGAGCCTCCGATTTATAAACCGGAGAACTGTTTTAACTGGGGAAGACAAGTGAAGGTTGAGAATCAGCAGATCCATTGTCGAGGAGGAAATTTTCACCATGAAAACGAGAGATTTCTTCGTCCTGTGGATTTTCCTCAATCCGCTTGGCCTTCTCTCCAACCCCATCACCGGAGATACTCTTCTCAGCCTAGTACTCCCACTATACCCGCCGCCTACCACGGCGTCGGATCTGCCCCTAAAAGGGAGTGCGCCGGTACCGGCGTCTTCTTACCTCGGCGATGCGACAACAACCCACCACAATCTCGTAAAAGAGCAGGTTACTTCCTTTACTCCTTTAACAAATTGGAAATCACGGATTCTCAAATTCGAACTTGATTCTTAATTTTTGCTCTGTTTCTTCAGATTGTGCCTCAATCGCTCTGCTTCCAGGGAAGAACATTCAGGACTTGAACCGATCTGTTCCCCAAATGACGTCAAATCGCCGCCTTCTGCCGAGCTACGGTGAGCTTGTAGAAATGATTCACAGTTCTTCAGATCTGTTAAATAGTAATATGCTAATGAGTTGTTGTTCACTTTGTTCTAAATATCTCCAGAAGTTTTAATGTCTCAAAGAAACGCCATTTTCACACAACAGAGGCTGAGTTATCCCCGGCCGGCGGAGAGAGGCAAAAGCCATGAGTTTCTTCTTCCTCAGGAGTGGACATACTAA

mRNA sequence

ATGGCTTCAGATTCTAACTACTGGCTTCCCCCTCACTTTCTTTCCGACCACGACAACCTCACCGGGAAACCCACCTCCGTCGTCGCCTTCCCCACCGACTTTCCCTATGACTTCAACTCCTCCGCCGTTCATTCCCCTGTCGAATCTGTTCTTGGCGACGACGATAACGACGACGAGGAGGACTTTCTCGCCGCGTTGACGCAACGGCTTACTCAGTCCACGCTTCGTGATTCTCCAAAACTGCCGTCTGTCAACAAATCCCAAGCGAAGACGGCTATGGCTGGATCTCCTCAGTCTACTCTTACTGGGGTTGGAAGTTGGTCGGCTTGGAGTTCCGTGTCGAGCGACGGAAGCCCAAACGGACCGTCTCAGGCGCCATCTCCTCCGACCACCCCTTTCGGCGGCGATAATAACACTTGGGATCTTATCTATGCTGCTGCAGGGCAAGTTGCGAGGCTCAAAATGAATACTAATAGAGATGGGATTATTGGTCCTTCTCAAAGCTCTTCAAATCTTGTTTCTTCCATGAAAAACGTTGGATTTTACTCTCACCCTTCACAGTTTGGAACAGAGCCTCCGATTTATAAACCGGAGAACTGTTTTAACTGGGGAAGACAAGTGAAGGTTGAGAATCAGCAGATCCATTGTCGAGGAGGAAATTTTCACCATGAAAACGAGAGATTTCTTCGTCCTGTGGATTTTCCTCAATCCGCTTGGCCTTCTCTCCAACCCCATCACCGGAGATACTCTTCTCAGCCTAGTACTCCCACTATACCCGCCGCCTACCACGGCGTCGGATCTGCCCCTAAAAGGGAGTGCGCCGGTACCGGCGTCTTCTTACCTCGGCGATGCGACAACAACCCACCACAATCTCGTAAAAGAGCAGATTGTGCCTCAATCGCTCTGCTTCCAGGGAAGAACATTCAGGACTTGAACCGATCTGTTCCCCAAATGACGTCAAATCGCCGCCTTCTGCCGAGCTACGAAGTTTTAATGTCTCAAAGAAACGCCATTTTCACACAACAGAGGCTGAGTTATCCCCGGCCGGCGGAGAGAGGCAAAAGCCATGAGTTTCTTCTTCCTCAGGAGTGGACATACTAA

Coding sequence (CDS)

ATGGCTTCAGATTCTAACTACTGGCTTCCCCCTCACTTTCTTTCCGACCACGACAACCTCACCGGGAAACCCACCTCCGTCGTCGCCTTCCCCACCGACTTTCCCTATGACTTCAACTCCTCCGCCGTTCATTCCCCTGTCGAATCTGTTCTTGGCGACGACGATAACGACGACGAGGAGGACTTTCTCGCCGCGTTGACGCAACGGCTTACTCAGTCCACGCTTCGTGATTCTCCAAAACTGCCGTCTGTCAACAAATCCCAAGCGAAGACGGCTATGGCTGGATCTCCTCAGTCTACTCTTACTGGGGTTGGAAGTTGGTCGGCTTGGAGTTCCGTGTCGAGCGACGGAAGCCCAAACGGACCGTCTCAGGCGCCATCTCCTCCGACCACCCCTTTCGGCGGCGATAATAACACTTGGGATCTTATCTATGCTGCTGCAGGGCAAGTTGCGAGGCTCAAAATGAATACTAATAGAGATGGGATTATTGGTCCTTCTCAAAGCTCTTCAAATCTTGTTTCTTCCATGAAAAACGTTGGATTTTACTCTCACCCTTCACAGTTTGGAACAGAGCCTCCGATTTATAAACCGGAGAACTGTTTTAACTGGGGAAGACAAGTGAAGGTTGAGAATCAGCAGATCCATTGTCGAGGAGGAAATTTTCACCATGAAAACGAGAGATTTCTTCGTCCTGTGGATTTTCCTCAATCCGCTTGGCCTTCTCTCCAACCCCATCACCGGAGATACTCTTCTCAGCCTAGTACTCCCACTATACCCGCCGCCTACCACGGCGTCGGATCTGCCCCTAAAAGGGAGTGCGCCGGTACCGGCGTCTTCTTACCTCGGCGATGCGACAACAACCCACCACAATCTCGTAAAAGAGCAGATTGTGCCTCAATCGCTCTGCTTCCAGGGAAGAACATTCAGGACTTGAACCGATCTGTTCCCCAAATGACGTCAAATCGCCGCCTTCTGCCGAGCTACGAAGTTTTAATGTCTCAAAGAAACGCCATTTTCACACAACAGAGGCTGAGTTATCCCCGGCCGGCGGAGAGAGGCAAAAGCCATGAGTTTCTTCTTCCTCAGGAGTGGACATACTAA

Protein sequence

MASDSNYWLPPHFLSDHDNLTGKPTSVVAFPTDFPYDFNSSAVHSPVESVLGDDDNDDEEDFLAALTQRLTQSTLRDSPKLPSVNKSQAKTAMAGSPQSTLTGVGSWSAWSSVSSDGSPNGPSQAPSPPTTPFGGDNNTWDLIYAAAGQVARLKMNTNRDGIIGPSQSSSNLVSSMKNVGFYSHPSQFGTEPPIYKPENCFNWGRQVKVENQQIHCRGGNFHHENERFLRPVDFPQSAWPSLQPHHRRYSSQPSTPTIPAAYHGVGSAPKRECAGTGVFLPRRCDNNPPQSRKRADCASIALLPGKNIQDLNRSVPQMTSNRRLLPSYEVLMSQRNAIFTQQRLSYPRPAERGKSHEFLLPQEWTY
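The protein sequence corresponds to the CDS read codon by codon. As a hedged illustration, the sketch below translates the first nine codons of the CDS using the standard genetic code; that the annotation used the standard table is an assumption, and `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 nucleotides of the CDS listed above.
prefix = translate("ATGGCTTCAGATTCTAACTACTGGCTT")
print(prefix)  # MASDSNYWL
```

The output matches the first nine residues of the protein sequence above; passing the full CDS string to the same function is one way to check the remainder.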
BLAST of Cla003970 vs. TrEMBL
Match: A0A0A0KQD2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189920 PE=4 SV=1)

HSP 1 Score: 612.5 bits (1578), Expect = 3.5e-172
Identity = 299/367 (81.47%), Positives = 322/367 (87.74%), Query Frame = 1

Query: 1   MASDSNYWLPPHFLSDHDNLTGKPTSVVAFPTDFPYDFNSSAVHSPVESVLGDDDNDDEE 60
           MASDS ++LPPHFLSDHDNL  KPTS   FPTDFPYDF SS+VHSPV+SVLGDDDNDDE+
Sbjct: 1   MASDSTFYLPPHFLSDHDNLPPKPTSSALFPTDFPYDFTSSSVHSPVDSVLGDDDNDDEQ 60

Query: 61  DFLAALTQRLTQSTLRDSPKLPSVNKSQAKTAMAGSPQSTLTGVGSWSAWSSVSSDGSPN 120
           DFLAALTQRLTQSTLRDS KLPSV+KSQAK AMAGSPQSTL+GVGSWSAWSSVSSDGSPN
Sbjct: 61  DFLAALTQRLTQSTLRDSQKLPSVHKSQAKMAMAGSPQSTLSGVGSWSAWSSVSSDGSPN 120

Query: 121 GPSQAPSPPTTPFGGDNNTWDLIYAAAGQVARLKMNTNRDGIIGPSQSSSNLVSSMKNVG 180
           GPS APSPPTTPFGG+NNTWDLIYAAAGQVARLKMNT RDGIIGPSQSSSNLVS   N G
Sbjct: 121 GPSLAPSPPTTPFGGENNTWDLIYAAAGQVARLKMNTYRDGIIGPSQSSSNLVSPTNNAG 180

Query: 181 FYSHPSQFGTEPPIYKPENCFNWG-RQVKVENQQIHCRGGNFHHENERFLRPVDFPQSAW 240
           F+SHPSQFGT+PPIYKP+N  +W  RQVKVENQQIH RG   + ENERFLRP+D  QSAW
Sbjct: 181 FHSHPSQFGTDPPIYKPDNSSHWARRQVKVENQQIHYRGQEVYPENERFLRPLDITQSAW 240

Query: 241 PSLQPHHRRYSSQPSTPTIPAAYHGVGSAPKRECAGTGVFLPRRCDNNPPQSRKRADCAS 300
           PSL PHHRRY S PSTP  PAAYHGVGSAPK+ECAGTGVFLPRR D+N PQSRKRAD  S
Sbjct: 241 PSLHPHHRRYPSHPSTPAAPAAYHGVGSAPKKECAGTGVFLPRRYDSNTPQSRKRADSPS 300

Query: 301 IALLPGKNIQDLNRSVPQMTSNRRLLPSYEVLMSQRNAIFTQQRLSYPRPAERGKSHEFL 360
           +AL+P KNIQ+LN S+P   SNRRL PSYE L++QRNAIF QQRLSYPR AER K+HEFL
Sbjct: 301 VALVPAKNIQELNGSIP--PSNRRLQPSYEALIAQRNAIFAQQRLSYPRLAERSKTHEFL 360

Query: 361 LPQEWTY 367
           LPQEWTY
Sbjct: 361 LPQEWTY 365
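The percentages in an HSP header are the matched counts divided by the alignment length. The sketch below recomputes them from the header line of the alignment above; `hsp_percentages` is a hypothetical helper, and the pattern accepts both "Positives" and the "Postives" spelling that appears in some exports.

```python
import re

def hsp_percentages(header):
    """Return (identity %, positives %) recomputed from an HSP header line."""
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", header)
    posit = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", header)
    if ident is None or posit is None:
        raise ValueError("header lacks Identity/Positives fields")

    def pct(match):
        # matched positions / alignment length, as a percentage
        return round(100 * int(match.group(1)) / int(match.group(2)), 2)

    return pct(ident), pct(posit)

line = "Identity = 299/367 (81.47%), Positives = 322/367 (87.74%), Query Frame = 1"
result = hsp_percentages(line)
print(result)  # (81.47, 87.74)
```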

BLAST of Cla003970 vs. TrEMBL
Match: W9S3V8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_005867 PE=4 SV=1)

HSP 1 Score: 294.7 bits (753), Expect = 1.6e-76
Identity = 191/424 (45.05%), Positives = 245/424 (57.78%), Query Frame = 1

Query: 4   DSNYWLPPHFLSDHDNL-----------------TGKPTSVVAFPTDFPYDFNS----SA 63
           D+ +WLPP  L++ D +                 T    S +AFPT+FPY+F+S    SA
Sbjct: 7   DAEFWLPPQILAEDDVVFVDKENFQFKNGATATSTALGASNMAFPTEFPYEFDSFGSNSA 66

Query: 64  VHSPVESVLG--DDDNDDEEDFLAALTQRLTQSTLRDSPKLPSVNKSQAKTAMAGSPQST 123
           + SPVESV+   + D+ DEEDF A LT+R  QSTLRDS KL      + +  ++GSPQST
Sbjct: 67  LSSPVESVVSSTETDSSDEEDFFAGLTRRFAQSTLRDSQKL------KPEWVLSGSPQST 126

Query: 124 LTGVGSWSAWSSVSSDGSPNGPSQAPSPPTTPFGGDNNTWDLIYAAAGQVARLKMNTNRD 183
           L+G+GSWS  S++S +GSPNGPSQ  SPPTTPFG  N+TWDLIYAAAGQVARLK+N    
Sbjct: 127 LSGIGSWSFRSTISRNGSPNGPSQVASPPTTPFGAKNDTWDLIYAAAGQVARLKVNGEEH 186

Query: 184 ---------GIIGPSQSSSNLVSSMKNVGFYSHPS--QFGTEPPIYKPENCFN-WGRQVK 243
                    G++ P   + N   S    GFYS+ S  Q  T+     P+ C + WGRQVK
Sbjct: 187 PKLSHHHGRGLLVPPARNPNNTGSC-GAGFYSNQSLAQNLTQFQGVIPQQCGSAWGRQVK 246

Query: 244 V---------------ENQQIHCRGGNFHHENERFLRPVDFPQSAWPSLQPHHRRYS-SQ 303
           V               + QQI  RG N  +EN R  RP++ PQSAWP LQ  ++  + +Q
Sbjct: 247 VGWSASAQQQQQQSHYQQQQIQNRGRNCGYENGRCGRPLNLPQSAWPPLQVQNQNQNQNQ 306

Query: 304 PSTPTIPAAYHGV---GSAPKRECAGTGVFLPRRCDNNPPQSRKRADCASIALLPGKNIQ 363
              P+ PA   GV   GS  K+ECAGTGVFLPRR   NPP+ RK++ C ++ LLP K +Q
Sbjct: 307 QHHPSRPAGMGGVFAGGSTVKKECAGTGVFLPRRY-TNPPEPRKKSGCPNV-LLPAKVVQ 366

Query: 364 DLNRSVPQMTSNRR-------LLPSYEVLMSQRNAIFTQQRLSYPRPAERGKSHEFLLPQ 367
            LN S   M +            P +E LM++RNA+  QQR S  RP E   +HE  LPQ
Sbjct: 367 ALNLSFEDMNNGHSQPRFGCGFAPDHEALMARRNALLEQQRRSL-RP-EGALNHEVRLPQ 419

BLAST of Cla003970 vs. TrEMBL
Match: M5XJR9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006379mg PE=4 SV=1)

HSP 1 Score: 268.9 bits (686), Expect = 9.4e-69
Identity = 177/421 (42.04%), Positives = 238/421 (56.53%), Query Frame = 1

Query: 4   DSNYWLPPHFLSD------------HDNLTGKPTSVVAFPTDFPYDFNSS----AVHSPV 63
           D  ++LP HFL+D            H N  G   SV  FPT+FPY+F+SS    A+ SPV
Sbjct: 7   DPEFYLPTHFLTDDVVLHNMDDNSFHQNGVG---SVARFPTEFPYEFDSSDSNSALSSPV 66

Query: 64  ESVLG--DDDNDDEEDFLAALTQRLTQSTLRDSPK-----LPSVNKSQAKTAMAGSPQST 123
           ESV+G  + ++ DEEDFL+ LT+RL QS+L+ + +     +P+ NK + +  MAGSPQS 
Sbjct: 67  ESVVGSTETESSDEEDFLSGLTRRLAQSSLQQTHQTQKLSVPNFNKDKPEWVMAGSPQSI 126

Query: 124 LTGVGSWSAWSSVSSDGSPNGPS-QAPSPPTTPFGGDNNTWDLIYAAAGQVARLKM---- 183
           L+G+GSWS      S+GSP GPS Q PSPPTTPFG  N+TWDLIYAAAGQVARLKM    
Sbjct: 127 LSGIGSWS------SNGSPTGPSSQVPSPPTTPFGAQNDTWDLIYAAAGQVARLKMTNGV 186

Query: 184 ------NTNRDGIIGPSQSSS--------NLVSSMKNVGFYSHPSQFGTEPPIYKPENCF 243
                 + +  G++GP +S S        N    + +   ++ P        + KP+   
Sbjct: 187 EGATKFSNHSRGLLGPPRSPSPSSLPCVKNPAPGLCSNQSFNQPQHVRQNQVLNKPQCSA 246

Query: 244 NWGRQVKV-------ENQQIHCRGGNF-HHENERFLRPVDFPQSAWPSL--QPHHRRYSS 303
            WG+Q ++       + QQI  RG +   +E+ R    V  PQSAWP L  Q H  ++  
Sbjct: 247 AWGKQGQLPWSAYQQQQQQIQSRGRSIPGYESGRCGHGVSIPQSAWPPLQVQQHQNQHPQ 306

Query: 304 QPSTPTIPAAYHGVGSAPKRECAGTGVFLPRRCDNNPPQSRKRADCASIALLPGKNIQDL 363
           + +    P   +  GS  KRECAGTGVFLPRR  N  P+ RK+A C ++ LLP K +Q L
Sbjct: 307 RNNASVRPILPN--GSNIKRECAGTGVFLPRRYSNPAPEPRKKAGCPTV-LLPAKVVQAL 366

Query: 364 NRSVPQMTS------NRRLLPSYEVLMSQRNAIFTQQRLSYPRPAERGKSHEFLLPQEWT 367
           N +   M S      N  L P +E L+++RNA+  QQRL   RP E   ++E  LPQEWT
Sbjct: 367 NLNFEDMNSQAPPRFNSGLAPDHEALLARRNALLAQQRLGGLRP-EGPLNYEVRLPQEWT 414

BLAST of Cla003970 vs. TrEMBL
Match: A0A061FLH8_THECC (WAS/WASL-interacting protein family member 2, putative isoform 1 OS=Theobroma cacao GN=TCM_042642 PE=4 SV=1)

HSP 1 Score: 254.6 bits (649), Expect = 1.8e-64
Identity = 175/418 (41.87%), Positives = 236/418 (56.46%), Query Frame = 1

Query: 4   DSNYWLPPHFLSDHD------NLT----GKPTSVV----AFPTDFPYDFNS----SAVHS 63
           D+ +WLP  FL+D D      NL     G  T ++     FPT+FPY+F+S    SA+ S
Sbjct: 7   DAEFWLPAKFLTDDDIVMEKENLKNKNGGNNTELLIPSHGFPTEFPYEFDSFDSSSALSS 66

Query: 64  PVESVLGDDDND--DEEDFLAALTQRLTQSTLRD-SPKLPSVNKSQAKTAMAGSPQSTLT 123
           PVESV+G  + +  DE++FLA LT+RL  ST +  +  + S++K++    +A SPQSTL+
Sbjct: 67  PVESVVGSTETESGDEDEFLAGLTRRLAHSTSQKFTVPVLSLDKTEKSGVLASSPQSTLS 126

Query: 124 GVGSWSAWSSVSSDGSPNGPSQAPSPPTTPFGGDNNTWDLIYAAAGQVARLKMN------ 183
           G+GSWS     SS+GSPNGPSQ PSPPTTPFG  N+TWDLIYAAAGQVARLKM+      
Sbjct: 127 GLGSWST----SSNGSPNGPSQVPSPPTTPFGAQNDTWDLIYAAAGQVARLKMSNEAPKY 186

Query: 184 TNRDGIIGPSQSSSNLVSSMKNVGFYSHPSQ------------FGTEPPIYKPENCFNWG 243
           T+ +   G  ++ S+ V    + G Y  PSQ             G +  + KP+      
Sbjct: 187 TSFNYGRGLPKAQSHAVMRNSSSGLY--PSQGLSYNLAQTNQYHGRQEQVLKPQCGAVMA 246

Query: 244 RQVKVENQQIHCRGGNFHHENERF-------LRPVDFPQSAWPSLQPHHRRYSSQP---S 303
           RQVK  N Q   +     H   R        +RP+  PQS+WP LQ   ++   QP   S
Sbjct: 247 RQVKASNWQAQLQQQQQQHIQSRARNNNVVGVRPLGLPQSSWPPLQVQSQQ-QQQPQHNS 306

Query: 304 TPTIPAAYHGVGSAPKRECAGTGVFLPRRCDNNPPQSRKRADCASIALLPGKNIQDLNRS 363
              + A +     + KRECAGTGVFLPRR   NPP+ RK++ C+++ LLP K +Q LN +
Sbjct: 307 GSGMRAMFLSGSGSVKRECAGTGVFLPRRY-GNPPEPRKKSGCSTV-LLPAKVVQALNLN 366

Query: 364 VP------QMTSNRRLLPSYEVLMSQRNAIFTQQRLSYPRPAERGKSHEFLLPQEWTY 367
                   Q   N     +Y+ L+++RNA+ TQ R  Y RP E G +HE  LPQEWTY
Sbjct: 367 FDDTNGHVQPHINPSFASNYDALLARRNALLTQARRGY-RP-EGGLNHEIHLPQEWTY 413

BLAST of Cla003970 vs. TrEMBL
Match: A5AD59_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_030013 PE=4 SV=1)

HSP 1 Score: 248.8 bits (634), Expect = 1.0e-62
Identity = 173/426 (40.61%), Positives = 233/426 (54.69%), Query Frame = 1

Query: 7   YWLPPHFLSDHDNLTGKPT-----------SVVAFPTDFPYDFNS----SAVHSPVESVL 66
           +WLP HFL+D D L  K              +  FP++FPY+F+S    SA++SPVESV+
Sbjct: 10  FWLPSHFLTDEDXLMDKENFNDNGANPVGAEIHGFPSEFPYEFDSFGSSSALNSPVESVM 69

Query: 67  GDDDND-DEEDFLAALTQRLTQSTLRDSPKL-PSVNKSQAKTA------------MAGSP 126
              + + DEED L  L ++L  STL D+ KL PS +    + A            ++GSP
Sbjct: 70  SSTETESDEEDLLTQLKRQLAHSTLHDTQKLAPSFSSENQERADLGFVLTQKTWVLSGSP 129

Query: 127 QSTLTGVGSWSAWSSVSSDGSPNGPSQAPSPPTTPFGGDNNTWDLIYAAAGQVARLKMNT 186
           QSTL+ VG+WS  S+VSS+GSPNG S+  SPPTTP     + WDLIYAAAGQVARLKM+ 
Sbjct: 130 QSTLSAVGNWSGRSTVSSNGSPNGRSRVSSPPTTPLSEKTDAWDLIYAAAGQVARLKMSG 189

Query: 187 NRD------GIIGPSQSSSNL---VSSMKNVGFYSH-----------PSQFGTEPPIYKP 246
           +        G++GP +S   +    +   N GFYS+            SQ   +  + K 
Sbjct: 190 DGPKYQQGRGLLGPPRSPMPVPTQPAKNANTGFYSYQSLSNNISQTSQSQXIRQEQVLK- 249

Query: 247 ENCFNWGRQVK-----------VENQQIHCRGGNFHHENERFLRPVDFPQSAWPSLQPHH 306
           + C  WGR+ K            + QQIH R  +   E+ R  RP+  P SAWP LQ  H
Sbjct: 250 QQCSVWGREAKEAWFSQQQQQLQQQQQIHSR-RSVGLESGRCGRPLGLPPSAWPPLQHQH 309

Query: 307 RRYSSQPSTPTIPAAYHGVGSAPKRECAGTGVFLPRRCDNNPPQSRKRADCASIALLPGK 366
           +    Q S+  + A + G GS  KRE AGTGVFLPRR   N   SRK+  C+++ LLP +
Sbjct: 310 QH---QQSSSGMRAVFLG-GSGLKRESAGTGVFLPRRF-GNXSDSRKKPGCSTV-LLPAR 369

BLAST of Cla003970 vs. NCBI nr
Match: gi|659129883|ref|XP_008464895.1| (PREDICTED: uncharacterized protein LOC103502654 [Cucumis melo])

HSP 1 Score: 622.1 bits (1603), Expect = 6.3e-175
Identity = 302/367 (82.29%), Positives = 326/367 (88.83%), Query Frame = 1

Query: 1   MASDSNYWLPPHFLSDHDNLTGKPTSVVAFPTDFPYDFNSSAVHSPVESVLGDDDNDDEE 60
           MASDS ++LPPHFLSDHDNL  KPTS   FPTDFPYDF SS+VHSPV+SVLGDDDNDDE+
Sbjct: 1   MASDSTFYLPPHFLSDHDNLPPKPTSSALFPTDFPYDFTSSSVHSPVDSVLGDDDNDDEQ 60

Query: 61  DFLAALTQRLTQSTLRDSPKLPSVNKSQAKTAMAGSPQSTLTGVGSWSAWSSVSSDGSPN 120
           DFLAALTQRLTQSTLRDS KLPSV+KSQAK AMAGSPQSTL+GVGSWSAWSSVSSDGSPN
Sbjct: 61  DFLAALTQRLTQSTLRDSQKLPSVHKSQAKMAMAGSPQSTLSGVGSWSAWSSVSSDGSPN 120

Query: 121 GPSQAPSPPTTPFGGDNNTWDLIYAAAGQVARLKMNTNRDGIIGPSQSSSNLVSSMKNVG 180
           GPS APSPPTTPFGG+NNTWDLIYAAAGQVARLKMNT+RDGIIGPSQSSSNLVSS+ N G
Sbjct: 121 GPSLAPSPPTTPFGGENNTWDLIYAAAGQVARLKMNTHRDGIIGPSQSSSNLVSSVHNAG 180

Query: 181 FYSHPSQFGTEPPIYKPENCFNWG-RQVKVENQQIHCRGGNFHHENERFLRPVDFPQSAW 240
            YSHPSQFGT+PPIYKPEN  +WG RQVKVENQQIH RG +F+HENERFLRP+D  QSAW
Sbjct: 181 LYSHPSQFGTDPPIYKPENSSHWGRRQVKVENQQIHYRGQDFYHENERFLRPLDITQSAW 240

Query: 241 PSLQPHHRRYSSQPSTPTIPAAYHGVGSAPKRECAGTGVFLPRRCDNNPPQSRKRADCAS 300
           PSL PHHR Y SQPSTP   AAYHGVGSAPK+ECAGTGVFLPRR DNNPPQSR+RAD  S
Sbjct: 241 PSLHPHHRSYPSQPSTPAAHAAYHGVGSAPKKECAGTGVFLPRRYDNNPPQSRRRADSPS 300

Query: 301 IALLPGKNIQDLNRSVPQMTSNRRLLPSYEVLMSQRNAIFTQQRLSYPRPAERGKSHEFL 360
           +AL+P KNIQ LN S+P   SNRRL PSY+ L++QRN IF QQRLSYPR AER K+HEFL
Sbjct: 301 VALVPAKNIQGLNGSIP--PSNRRLQPSYDALIAQRNTIFAQQRLSYPRLAERSKTHEFL 360

Query: 361 LPQEWTY 367
           LPQEWTY
Sbjct: 361 LPQEWTY 365

BLAST of Cla003970 vs. NCBI nr
Match: gi|449464456|ref|XP_004149945.1| (PREDICTED: uncharacterized protein LOC101215147 [Cucumis sativus])

HSP 1 Score: 612.5 bits (1578), Expect = 5.0e-172
Identity = 299/367 (81.47%), Positives = 322/367 (87.74%), Query Frame = 1

Query: 1   MASDSNYWLPPHFLSDHDNLTGKPTSVVAFPTDFPYDFNSSAVHSPVESVLGDDDNDDEE 60
           MASDS ++LPPHFLSDHDNL  KPTS   FPTDFPYDF SS+VHSPV+SVLGDDDNDDE+
Sbjct: 1   MASDSTFYLPPHFLSDHDNLPPKPTSSALFPTDFPYDFTSSSVHSPVDSVLGDDDNDDEQ 60

Query: 61  DFLAALTQRLTQSTLRDSPKLPSVNKSQAKTAMAGSPQSTLTGVGSWSAWSSVSSDGSPN 120
           DFLAALTQRLTQSTLRDS KLPSV+KSQAK AMAGSPQSTL+GVGSWSAWSSVSSDGSPN
Sbjct: 61  DFLAALTQRLTQSTLRDSQKLPSVHKSQAKMAMAGSPQSTLSGVGSWSAWSSVSSDGSPN 120

Query: 121 GPSQAPSPPTTPFGGDNNTWDLIYAAAGQVARLKMNTNRDGIIGPSQSSSNLVSSMKNVG 180
           GPS APSPPTTPFGG+NNTWDLIYAAAGQVARLKMNT RDGIIGPSQSSSNLVS   N G
Sbjct: 121 GPSLAPSPPTTPFGGENNTWDLIYAAAGQVARLKMNTYRDGIIGPSQSSSNLVSPTNNAG 180

Query: 181 FYSHPSQFGTEPPIYKPENCFNWG-RQVKVENQQIHCRGGNFHHENERFLRPVDFPQSAW 240
           F+SHPSQFGT+PPIYKP+N  +W  RQVKVENQQIH RG   + ENERFLRP+D  QSAW
Sbjct: 181 FHSHPSQFGTDPPIYKPDNSSHWARRQVKVENQQIHYRGQEVYPENERFLRPLDITQSAW 240

Query: 241 PSLQPHHRRYSSQPSTPTIPAAYHGVGSAPKRECAGTGVFLPRRCDNNPPQSRKRADCAS 300
           PSL PHHRRY S PSTP  PAAYHGVGSAPK+ECAGTGVFLPRR D+N PQSRKRAD  S
Sbjct: 241 PSLHPHHRRYPSHPSTPAAPAAYHGVGSAPKKECAGTGVFLPRRYDSNTPQSRKRADSPS 300

Query: 301 IALLPGKNIQDLNRSVPQMTSNRRLLPSYEVLMSQRNAIFTQQRLSYPRPAERGKSHEFL 360
           +AL+P KNIQ+LN S+P   SNRRL PSYE L++QRNAIF QQRLSYPR AER K+HEFL
Sbjct: 301 VALVPAKNIQELNGSIP--PSNRRLQPSYEALIAQRNAIFAQQRLSYPRLAERSKTHEFL 360

Query: 361 LPQEWTY 367
           LPQEWTY
Sbjct: 361 LPQEWTY 365

BLAST of Cla003970 vs. NCBI nr
Match: gi|703136256|ref|XP_010106106.1| (hypothetical protein L484_005867 [Morus notabilis])

HSP 1 Score: 294.7 bits (753), Expect = 2.3e-76
Identity = 191/424 (45.05%), Positives = 245/424 (57.78%), Query Frame = 1

Query: 4   DSNYWLPPHFLSDHDNL-----------------TGKPTSVVAFPTDFPYDFNS----SA 63
           D+ +WLPP  L++ D +                 T    S +AFPT+FPY+F+S    SA
Sbjct: 7   DAEFWLPPQILAEDDVVFVDKENFQFKNGATATSTALGASNMAFPTEFPYEFDSFGSNSA 66

Query: 64  VHSPVESVLG--DDDNDDEEDFLAALTQRLTQSTLRDSPKLPSVNKSQAKTAMAGSPQST 123
           + SPVESV+   + D+ DEEDF A LT+R  QSTLRDS KL      + +  ++GSPQST
Sbjct: 67  LSSPVESVVSSTETDSSDEEDFFAGLTRRFAQSTLRDSQKL------KPEWVLSGSPQST 126

Query: 124 LTGVGSWSAWSSVSSDGSPNGPSQAPSPPTTPFGGDNNTWDLIYAAAGQVARLKMNTNRD 183
           L+G+GSWS  S++S +GSPNGPSQ  SPPTTPFG  N+TWDLIYAAAGQVARLK+N    
Sbjct: 127 LSGIGSWSFRSTISRNGSPNGPSQVASPPTTPFGAKNDTWDLIYAAAGQVARLKVNGEEH 186

Query: 184 ---------GIIGPSQSSSNLVSSMKNVGFYSHPS--QFGTEPPIYKPENCFN-WGRQVK 243
                    G++ P   + N   S    GFYS+ S  Q  T+     P+ C + WGRQVK
Sbjct: 187 PKLSHHHGRGLLVPPARNPNNTGSC-GAGFYSNQSLAQNLTQFQGVIPQQCGSAWGRQVK 246

Query: 244 V---------------ENQQIHCRGGNFHHENERFLRPVDFPQSAWPSLQPHHRRYS-SQ 303
           V               + QQI  RG N  +EN R  RP++ PQSAWP LQ  ++  + +Q
Sbjct: 247 VGWSASAQQQQQQSHYQQQQIQNRGRNCGYENGRCGRPLNLPQSAWPPLQVQNQNQNQNQ 306

Query: 304 PSTPTIPAAYHGV---GSAPKRECAGTGVFLPRRCDNNPPQSRKRADCASIALLPGKNIQ 363
              P+ PA   GV   GS  K+ECAGTGVFLPRR   NPP+ RK++ C ++ LLP K +Q
Sbjct: 307 QHHPSRPAGMGGVFAGGSTVKKECAGTGVFLPRRY-TNPPEPRKKSGCPNV-LLPAKVVQ 366

Query: 364 DLNRSVPQMTSNRR-------LLPSYEVLMSQRNAIFTQQRLSYPRPAERGKSHEFLLPQ 367
            LN S   M +            P +E LM++RNA+  QQR S  RP E   +HE  LPQ
Sbjct: 367 ALNLSFEDMNNGHSQPRFGCGFAPDHEALMARRNALLEQQRRSL-RP-EGALNHEVRLPQ 419

BLAST of Cla003970 vs. NCBI nr
Match: gi|596000548|ref|XP_007218081.1| (hypothetical protein PRUPE_ppa006379mg [Prunus persica])

HSP 1 Score: 268.9 bits (686), Expect = 1.3e-68
Identity = 177/421 (42.04%), Positives = 238/421 (56.53%), Query Frame = 1

Query: 4   DSNYWLPPHFLSD------------HDNLTGKPTSVVAFPTDFPYDFNSS----AVHSPV 63
           D  ++LP HFL+D            H N  G   SV  FPT+FPY+F+SS    A+ SPV
Sbjct: 7   DPEFYLPTHFLTDDVVLHNMDDNSFHQNGVG---SVARFPTEFPYEFDSSDSNSALSSPV 66

Query: 64  ESVLG--DDDNDDEEDFLAALTQRLTQSTLRDSPK-----LPSVNKSQAKTAMAGSPQST 123
           ESV+G  + ++ DEEDFL+ LT+RL QS+L+ + +     +P+ NK + +  MAGSPQS 
Sbjct: 67  ESVVGSTETESSDEEDFLSGLTRRLAQSSLQQTHQTQKLSVPNFNKDKPEWVMAGSPQSI 126

Query: 124 LTGVGSWSAWSSVSSDGSPNGPS-QAPSPPTTPFGGDNNTWDLIYAAAGQVARLKM---- 183
           L+G+GSWS      S+GSP GPS Q PSPPTTPFG  N+TWDLIYAAAGQVARLKM    
Sbjct: 127 LSGIGSWS------SNGSPTGPSSQVPSPPTTPFGAQNDTWDLIYAAAGQVARLKMTNGV 186

Query: 184 ------NTNRDGIIGPSQSSS--------NLVSSMKNVGFYSHPSQFGTEPPIYKPENCF 243
                 + +  G++GP +S S        N    + +   ++ P        + KP+   
Sbjct: 187 EGATKFSNHSRGLLGPPRSPSPSSLPCVKNPAPGLCSNQSFNQPQHVRQNQVLNKPQCSA 246

Query: 244 NWGRQVKV-------ENQQIHCRGGNF-HHENERFLRPVDFPQSAWPSL--QPHHRRYSS 303
            WG+Q ++       + QQI  RG +   +E+ R    V  PQSAWP L  Q H  ++  
Sbjct: 247 AWGKQGQLPWSAYQQQQQQIQSRGRSIPGYESGRCGHGVSIPQSAWPPLQVQQHQNQHPQ 306

Query: 304 QPSTPTIPAAYHGVGSAPKRECAGTGVFLPRRCDNNPPQSRKRADCASIALLPGKNIQDL 363
           + +    P   +  GS  KRECAGTGVFLPRR  N  P+ RK+A C ++ LLP K +Q L
Sbjct: 307 RNNASVRPILPN--GSNIKRECAGTGVFLPRRYSNPAPEPRKKAGCPTV-LLPAKVVQAL 366

Query: 364 NRSVPQMTS------NRRLLPSYEVLMSQRNAIFTQQRLSYPRPAERGKSHEFLLPQEWT 367
           N +   M S      N  L P +E L+++RNA+  QQRL   RP E   ++E  LPQEWT
Sbjct: 367 NLNFEDMNSQAPPRFNSGLAPDHEALLARRNALLAQQRLGGLRP-EGPLNYEVRLPQEWT 414

BLAST of Cla003970 vs. NCBI nr
Match: gi|1009177346|ref|XP_015869919.1| (PREDICTED: uncharacterized protein LOC107407188 [Ziziphus jujuba])

HSP 1 Score: 266.2 bits (679), Expect = 8.7e-68
Identity = 180/422 (42.65%), Positives = 237/422 (56.16%), Query Frame = 1

Query: 4   DSNYWLPPHFLSDHDNL---------------TGKPTSVVAFPTDFPYDFNS----SAVH 63
           D+ +WLPP FL+D D L               TG   S   FP++FPY+F+S    SA+ 
Sbjct: 7   DAEFWLPPKFLTDDDVLMDKDGFDKNGGGGGTTGFGHSHDGFPSEFPYEFDSFSSNSALS 66

Query: 64  SPVESVLG--DDDNDDEEDFLAALTQRLTQSTLRDSPKLPSVNKSQAKTAM--AGSPQST 123
           SPVESV+G  + ++ DEE+FL  LT+RL QSTL DS KL   +  Q K  M  +GSPQST
Sbjct: 67  SPVESVVGSTEAESSDEEEFLFGLTRRLAQSTLHDSQKLAVTSFPQDKHEMVLSGSPQST 126

Query: 124 LTGVGSWSAWSSVSSDGSPNGPSQAPSPPTTPFGGDNNTWDLIYAAAGQVARLKMN---- 183
           L+G+GSW A S++SSDGSPNGPSQ PSPPTTPFG  N+TWDLIY AAGQVARLKM     
Sbjct: 127 LSGIGSWCARSTISSDGSPNGPSQVPSPPTTPFGARNDTWDLIYEAAGQVARLKMTDEET 186

Query: 184 --TNR-DGIIGPSQSSSNLVSSMK--NVGFYSHPSQFGTE----------PPIYKPENCF 243
             +NR  G++GP  S +  V  ++  N G Y + +Q  ++              K +   
Sbjct: 187 KFSNRGKGLLGPHGSPNPSVPCVRNPNSGLYKNNNQGFSQNLARLQQVRPDQALKSQCSA 246

Query: 244 NWGRQVKVENQQIHCRGGNFHHENER----------FLRPVDFPQSAWPSLQPHHRRYSS 303
            W R+VK  N   H    N H++N++          + R ++ P S W   Q    ++  
Sbjct: 247 AWVREVK--NGWAHNHNHNHHNQNQQQIRNRGRNIGYERAMNLPHSVWRPQQIQQPQH-- 306

Query: 304 QPSTPTIPAAYHGVGSAPKRECAGTGVFLPRRCDNNPPQSRKRADCASIALLPGKNIQDL 363
              T T   A    GS  KREC+GTGVFLPR+   NPP+SRK+  C ++ LLP K +Q L
Sbjct: 307 ---TDTAMRAVLLGGSGVKRECSGTGVFLPRKY-GNPPESRKKTGCTTV-LLPAKVVQAL 366

Query: 364 NRSVPQMTSNRR-------LLPSYEVLMSQRNAIFTQQRLSYPRPAERGKSHEFLLPQEW 367
           N +   +   +          P +EVLM++RNA+  QQR S  RP E   +HE  LPQEW
Sbjct: 367 NLNFEAVNHGQAQPRFGTGFSPDHEVLMARRNALLAQQRRSL-RP-EGTLNHEVRLPQEW 417

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
A0A0A0KQD2_CUCSA  3.5e-172  81.47         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189920 PE=4 SV=1
W9S3V8_9ROSA      1.6e-76   45.05         Uncharacterized protein OS=Morus notabilis GN=L484_005867 PE=4 SV=1
M5XJR9_PRUPE      9.4e-69   42.04         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006379mg PE=4 SV=1
A0A061FLH8_THECC  1.8e-64   41.87         WAS/WASL-interacting protein family member 2, putative isoform 1 OS=Theobroma ca...
A5AD59_VITVI      1.0e-62   40.61         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_030013 PE=4 SV=1

Match Name                         E-value   Identity (%)  Description
gi|659129883|ref|XP_008464895.1|   6.3e-175  82.29         PREDICTED: uncharacterized protein LOC103502654 [Cucumis melo]
gi|449464456|ref|XP_004149945.1|   5.0e-172  81.47         PREDICTED: uncharacterized protein LOC101215147 [Cucumis sativus]
gi|703136256|ref|XP_010106106.1|   2.3e-76   45.05         hypothetical protein L484_005867 [Morus notabilis]
gi|596000548|ref|XP_007218081.1|   1.3e-68   42.04         hypothetical protein PRUPE_ppa006379mg [Prunus persica]
gi|1009177346|ref|XP_015869919.1|  8.7e-68   42.65         PREDICTED: uncharacterized protein LOC107407188 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                    Sequence type in Unigene
WMU16265      watermelon unigene v2 vs TrEMBL  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla003970     Cla003970.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU16265      WMU16265     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PANTHER  PTHR33356      FAMILY NOT NAMED     coord: 1..366, score: 1.3
None      No IPR available  PANTHER  PTHR33356:SF4  SUBFAMILY NOT NAMED  coord: 1..366, score: 1.3

The following gene(s) are paralogous to this gene:
Gene       Paralogue  Organism               Block
Cla003970  Cla017942  Watermelon (97103) v1  wmwmB044
Cla003970  Cla006563  Watermelon (97103) v1  wmwmB113