Cla002308 (gene) Watermelon (97103) v1

NameCla002308
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionUnknown Protein (AHRD V1)
LocationChr7 : 14431981 .. 14433222 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTCCCAGCAACCCACCTCCTCCAAAAGCCCTAAACCTATTGCTTCAGCCTCGAGGCATGGTAGTTGGAATCTGTCCGCTACCTCCACCACAACGGAACCCCTCGCCATTGTTCGCCCTAATAGTCCTGAGTTGGAAGCGACATCATCACCATCTTCGCCACACTCCAGGGACCTGTCTGAAGCTCAAGAGGTGGAAAGGAATGACGAGTCCACCGTATTCGAAGACATTCTAGACACCATTGATGAAGAAGACGATGACTGGTTGGACGGGTTGGTCGACCAGAGGATGAAAAATGAATCACGTGGGTCAGTCCCTGCGATGAGTGGTCTTGAGGAGTCGAACTCCGATGAAAGACCTATTGGAAAGGTAATAGTGGCGGTGGAAAAGAAGAAGTTCGCCACGAAAAGGAAAATCAGGCCAATAAGGGCCAGGGAAGTAGAAGCCGAAGAAGATGTACCTCCGCTAAGAAGGAAAAATGAGAAAGTAAAGGACGTGGGGACATCCGAGACTATCGCATCATCTTTCGCCGCATCATCTGTTGAGCGGTTAGCAGCGAGGGCGGCAGAGAAGTCCAAGAAGCTAAAAGAAGACATCAAAAAGATGAATGAAAGGACGAAGTCGTTTCTCACCGCAAGGAAGAGGAGAAAAGAGGAAATCGCATCTGGCGTACAAACTGTCAAAGAGGCCACCAGAAAAGCTGAGTTTCTCTGTCGCATAGCAAACATTGCGACGGAATTAGAAGTTGAGTTGGAAGTGACTGATGCGACGAGGTCACCCAAAGCTGAACGCATCAAGAAGAATGTGAAAAAAATACGGGAGGAGAAGAAGAAAGAAAAAGAGGTAGCAATCACCAAGGGGAAGCTACCGGAGAAAAATAAGACTGAGGTGGCAAAGGCACCTGCAGTGGTTATAAGGGACTTGGACGCAGGCAGAGTAAGGAGAACGCCTGCTAGTCCCATCAGTAGGAATGAGAAGGGAAAAGAGAAATTGGTTGAGGAACCAACAGAGGAGGCGGCGAAAAGCCAAAAGACACAATTCAGTGGGCTCTACACTGAAGTGGGATTTTTCCTAGAGCCAATTGAGCTGCCTGCCTTCATCATACAAGGAGTCGACGCAATGGGTTGGAGACAATTTTGTGAAAGTGCCCAAGTCATCCAACCCACCGCCGTGGAGGCATTTTACGAAGGGGAACCATTCACCGCAAGGCACACATTGTCAAAGTTGAAGATGAGGTGA

mRNA sequence

ATGGCTTCCCAGCAACCCACCTCCTCCAAAAGCCCTAAACCTATTGCTTCAGCCTCGAGGCATGGTAGTTGGAATCTGTCCGCTACCTCCACCACAACGGAACCCCTCGCCATTGTTCGCCCTAATAGTCCTGAGTTGGAAGCGACATCATCACCATCTTCGCCACACTCCAGGGACCTGTCTGAAGCTCAAGAGGTGGAAAGGAATGACGAGTCCACCGTATTCGAAGACATTCTAGACACCATTGATGAAGAAGACGATGACTGGTTGGACGGGTTGGTCGACCAGAGGATGAAAAATGAATCACGTGGGTCAGTCCCTGCGATGAGTGGTCTTGAGGAGTCGAACTCCGATGAAAGACCTATTGGAAAGGTAATAGTGGCGGTGGAAAAGAAGAAGTTCGCCACGAAAAGGAAAATCAGGCCAATAAGGGCCAGGGAAGTAGAAGCCGAAGAAGATGTACCTCCGCTAAGAAGGAAAAATGAGAAAGTAAAGGACGTGGGGACATCCGAGACTATCGCATCATCTTTCGCCGCATCATCTGTTGAGCGGTTAGCAGCGAGGGCGGCAGAGAAGTCCAAGAAGCTAAAAGAAGACATCAAAAAGATGAATGAAAGGACGAAGTCGTTTCTCACCGCAAGGAAGAGGAGAAAAGAGGAAATCGCATCTGGCGTACAAACTGTCAAAGAGGCCACCAGAAAAGCTGAGTTTCTCTGTCGCATAGCAAACATTGCGACGGAATTAGAAGTTGAGTTGGAAGTGACTGATGCGACGAGGTCACCCAAAGCTGAACGCATCAAGAAGAATGTGAAAAAAATACGGGAGGAGAAGAAGAAAGAAAAAGAGGTAGCAATCACCAAGGGGAAGCTACCGGAGAAAAATAAGACTGAGGTGGCAAAGGCACCTGCAGTGGTTATAAGGGACTTGGACGCAGGCAGAGTAAGGAGAACGCCTGCTAGTCCCATCAGTAGGAATGAGAAGGGAAAAGAGAAATTGGTTGAGGAACCAACAGAGGAGGCGGCGAAAAGCCAAAAGACACAATTCAGTGGGCTCTACACTGAAGTGGGATTTTTCCTAGAGCCAATTGAGCTGCCTGCCTTCATCATACAAGGAGTCGACGCAATGGGTTGGAGACAATTTTGTGAAAGTGCCCAAGTCATCCAACCCACCGCCGTGGAGGCATTTTACGAAGGGGAACCATTCACCGCAAGGCACACATTGTCAAAGTTGAAGATGAGGTGA

Coding sequence (CDS)

ATGGCTTCCCAGCAACCCACCTCCTCCAAAAGCCCTAAACCTATTGCTTCAGCCTCGAGGCATGGTAGTTGGAATCTGTCCGCTACCTCCACCACAACGGAACCCCTCGCCATTGTTCGCCCTAATAGTCCTGAGTTGGAAGCGACATCATCACCATCTTCGCCACACTCCAGGGACCTGTCTGAAGCTCAAGAGGTGGAAAGGAATGACGAGTCCACCGTATTCGAAGACATTCTAGACACCATTGATGAAGAAGACGATGACTGGTTGGACGGGTTGGTCGACCAGAGGATGAAAAATGAATCACGTGGGTCAGTCCCTGCGATGAGTGGTCTTGAGGAGTCGAACTCCGATGAAAGACCTATTGGAAAGGTAATAGTGGCGGTGGAAAAGAAGAAGTTCGCCACGAAAAGGAAAATCAGGCCAATAAGGGCCAGGGAAGTAGAAGCCGAAGAAGATGTACCTCCGCTAAGAAGGAAAAATGAGAAAGTAAAGGACGTGGGGACATCCGAGACTATCGCATCATCTTTCGCCGCATCATCTGTTGAGCGGTTAGCAGCGAGGGCGGCAGAGAAGTCCAAGAAGCTAAAAGAAGACATCAAAAAGATGAATGAAAGGACGAAGTCGTTTCTCACCGCAAGGAAGAGGAGAAAAGAGGAAATCGCATCTGGCGTACAAACTGTCAAAGAGGCCACCAGAAAAGCTGAGTTTCTCTGTCGCATAGCAAACATTGCGACGGAATTAGAAGTTGAGTTGGAAGTGACTGATGCGACGAGGTCACCCAAAGCTGAACGCATCAAGAAGAATGTGAAAAAAATACGGGAGGAGAAGAAGAAAGAAAAAGAGGTAGCAATCACCAAGGGGAAGCTACCGGAGAAAAATAAGACTGAGGTGGCAAAGGCACCTGCAGTGGTTATAAGGGACTTGGACGCAGGCAGAGTAAGGAGAACGCCTGCTAGTCCCATCAGTAGGAATGAGAAGGGAAAAGAGAAATTGGTTGAGGAACCAACAGAGGAGGCGGCGAAAAGCCAAAAGACACAATTCAGTGGGCTCTACACTGAAGTGGGATTTTTCCTAGAGCCAATTGAGCTGCCTGCCTTCATCATACAAGGAGTCGACGCAATGGGTTGGAGACAATTTTGTGAAAGTGCCCAAGTCATCCAACCCACCGCCGTGGAGGCATTTTACGAAGGGGAACCATTCACCGCAAGGCACACATTGTCAAAGTTGAAGATGAGGTGA

Protein sequence

MASQQPTSSKSPKPIASASRHGSWNLSATSTTTEPLAIVRPNSPELEATSSPSSPHSRDLSEAQEVERNDESTVFEDILDTIDEEDDDWLDGLVDQRMKNESRGSVPAMSGLEESNSDERPIGKVIVAVEKKKFATKRKIRPIRAREVEAEEDVPPLRRKNEKVKDVGTSETIASSFAASSVERLAARAAEKSKKLKEDIKKMNERTKSFLTARKRRKEEIASGVQTVKEATRKAEFLCRIANIATELEVELEVTDATRSPKAERIKKNVKKIREEKKKEKEVAITKGKLPEKNKTEVAKAPAVVIRDLDAGRVRRTPASPISRNEKGKEKLVEEPTEEAAKSQKTQFSGLYTEVGFFLEPIELPAFIIQGVDAMGWRQFCESAQVIQPTAVEAFYEGEPFTARHTLSKLKMR
BLAST of Cla002308 vs. Swiss-Prot
Match: NST1_YARLI (Stress response protein NST1 OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=NST1 PE=3 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 1.1e-07
Identity = 62/245 (25.31%), Postives = 105/245 (42.86%), Query Frame = 1

Query: 114 ESNSDERPIGKVIVAVEKKKFATKRKIRPIRAREVEAEEDVPPLRRKNEKVKDVGTSETI 173
           E   +++   K I   EK+K A +R+ R  R  E      +   R++ EK ++    +  
Sbjct: 398 ERKKEKKRAQKAIKEEEKRKAAAEREEREKREAEEAERLRLEAERKQQEKEREKAARKAA 457

Query: 174 ASSFAASSVERLAARAAEKSKKLKEDIKKMNERTKSFLTARKRRKEEIASGVQTVKEATR 233
           + +   +  E  A  A EK +K    I++M ER +      +R KEE+ +  Q  ++  R
Sbjct: 458 SEAKQKARQEEQARLAREKEEKRLARIREMEERMRLAKEKEEREKEELRARQQQEEDERR 517

Query: 234 KAEFL--CRIANIATE--------LEVELEVTDATRSPKAERIK--KNVKKIREEKKKEK 293
           + E L   RI N   E        LE E E        + +RIK  +  +K+ EE++K  
Sbjct: 518 EKERLEEERIENERLEAERIENERLEKEREQQRLEEEKERQRIKEEREKQKLEEEREKRA 577

Query: 294 EVAITKGKLPEKNKTEV-AKAPAVVIRDLDAGRVRRTPASPIS-RNEKGKEKLVEEPTEE 345
            ++I   K P   KT++ A  P   +  L     +  P +P++   +    +L    T+ 
Sbjct: 578 SMSIPLSK-PLPGKTQIPASQPGTSLGGLQQPVPQAAPVAPVAMMPQSPSPQLPPGLTQH 637

BLAST of Cla002308 vs. Swiss-Prot
Match: SMC_AQUAE (Chromosome partition protein Smc OS=Aquifex aeolicus (strain VF5) GN=smc PE=3 SV=1)

HSP 1 Score: 53.9 bits (128), Expect = 4.8e-06
Identity = 61/245 (24.90%), Postives = 103/245 (42.04%), Query Frame = 1

Query: 62  EAQEVERNDESTVFEDILDTIDEEDDDWLDGLVDQRMKNESRGSVPAMSGLEESNSDERP 121
           E Q ++R  E+ +     + + +E +  L+ L   R   ES   +       E   +ER 
Sbjct: 219 ELQRIKRETEAKILLKEKEKLLKERERILNELSSLR---ESLEDITFQIQENEKELNERE 278

Query: 122 IGKVIVAVEKKKFATKRKIRPIRAREVEAEEDVPPLRRK-NEKVKDVGTSETIASSFAAS 181
             +++  V +K    K K+    A    AE  +    R+  E    V   E + ++  + 
Sbjct: 279 --RLLKEVNEKIMPFKEKVGKFTAEIENAERSIKEKERELKESENRVKNLEELINNLLSD 338

Query: 182 --SVERLAARAAEKSKKLKEDIKKMNERTKSFLTARKRRKEEIASGVQTVKEATRKAEFL 241
             ++ER       + +KLKE+ K + E  +  L   +  +E +      VK+   + E L
Sbjct: 339 KENLEREVGTLQLELEKLKEEYKSLKEVEREKLRELEEEEERLKITFDEVKKLEEEKEKL 398

Query: 242 CRIANIATELEVELEVTDATRSPKAERIKKNVKKI---REEK---KKEKEVAITKGKLPE 298
               N   + + ELE+  A    K ERIK+++ K+   REEK    KEKE  I + K  +
Sbjct: 399 TEKLNSLNKEKQELEIQRANLKNKIERIKEDINKLISEREEKIKEIKEKEQEIKRLKAIK 458

BLAST of Cla002308 vs. TrEMBL
Match: B3L5S9_PLAKH (MAEBL OS=Plasmodium knowlesi (strain H) GN=PKH_094500 PE=4 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 4.7e-08
Identity = 69/298 (23.15%), Postives = 121/298 (40.60%), Query Frame = 1

Query: 57   SRDLSEAQEVERNDESTVFEDILDTIDEEDDDWLDGLVDQRMKNESRGSVPAMSGLEESN 116
            + +  +A+EV + +E+   E++    +    +      + R   E++         EE+ 
Sbjct: 1211 AEEARKAEEVRKAEEARKAEEVRKAEEVRKAEEARKAEEVRKVEEAKKKAEEARKAEEAK 1270

Query: 117  SDERPIGKVIVAVEKKKFATKRKIRPIRAREVEAEEDVPPLRRKNEKVKDVGTSETIASS 176
                   K   A +KK    K+K    + +   A++     ++K E  K   T E    +
Sbjct: 1271 KKAEEAKKKAEAAKKKAEEAKKKAEAAKKKAEAAKKKAEAAKKKAEAAKKK-TEEAKKKA 1330

Query: 177  FAASSVERLAARAAEKSKKLKEDIKKMNERTKSFLTARKRRKEEI---ASGVQTVKEATR 236
             AA        +A E  KK +ED KK  E  K+   A K++ EE    A  V+  +EA +
Sbjct: 1331 EAAKKKAEEVRKAEEAKKKAEEDKKKAEEVKKA--EAAKKKAEEAKKKAEEVRKAEEAKK 1390

Query: 237  KAEFLCRIANIATELEVELEVTDATR----SPKAERIKKNVKKIREEKKKEKEVAITKGK 296
            KAE   +      + E   +  +A +    + KAE  KK  ++ R+ ++ +K+    K K
Sbjct: 1391 KAEEARKAEEAKKKAEEARKAEEAKKKAEEARKAEEAKKKAEEARKAEEAKKKAEEAKKK 1450

Query: 297  LPEKNKTEVAKAPAVVIRDLDAGRVRRTPASPISRNEKGKEKLVEEPTEEAAKSQKTQ 348
              E  K E AK  A   R  +  R +   A       K +E    E   +A + +K +
Sbjct: 1451 AEEARKAEEAKKKAEEARKAEEAR-KAEEARKAEEARKAEEARKAEEVRKAEEVRKAE 1504

BLAST of Cla002308 vs. TrEMBL
Match: Q64JV4_PLAVI (Merozoite surface protein 3b (Fragment) OS=Plasmodium vivax PE=4 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 1.4e-07
Identity = 87/351 (24.79%), Postives = 139/351 (39.60%), Query Frame = 1

Query: 10  KSPKPIASASRHGSWN---------LSATSTTTEPLAIVRPNSPELEATSSPSSPHSRDL 69
           KS K +AS++   S N         ++A     E   I   N+  +  T+S ++      
Sbjct: 261 KSVKELASSAEDASKNAKKEMAKAQIAAEVAKAEKAKIEAENAKLIADTASKAAEDIAKS 320

Query: 70  SEAQEVERNDESTVFEDILDTIDEEDDDWLDGLVDQRMKNESRGSVPAMSGLEESNSDER 129
           S+A ++ +N                    +    +++ K  +  +  A + L E+ + E 
Sbjct: 321 SKAAQIAKN--------------------VSAKAEEKSKVATEAADEAANALNEAENPES 380

Query: 130 PIG----KVIVAVEKKKFATKRKIRPIRAREVEAEEDVPPLRRKNEKVKDVGTSETIASS 189
            I     K   AV   + A K K +   A EV   E         E  K+ G ++  A  
Sbjct: 381 KIDDVRKKATEAVNAAEEAKKEKSKAEIAVEVAKAE---------EAKKEAGKAKVAAKQ 440

Query: 190 FAASSVERLAARAAEKSKKLKEDIKKMNERTKSFLTARKRRKEEIASGVQTVKEATRKAE 249
            A  S    A +AA+K+ K  ++  K+ E   S L + ++   EI + V  +KE  + A 
Sbjct: 441 VADKSKLEKAIQAADKASKKTDEASKLAEEALSDLESLEKETGEIKTKVNEIKEKVQNA- 500

Query: 250 FLCRIANIATELEVELEVTDAT-RSPKAERIKKNV--KKIREEKKKEKEVAITK-GKLPE 309
                 N A E   E  + + T    KAE  KK     K+  EK KE    I K  K  E
Sbjct: 501 -----INAALEAHKEKTIAEITVEVAKAEEAKKEADNAKVAAEKAKETAEKIAKTSKSTE 560

Query: 310 KNKTEVAKAPAVVIRDLDAGRVRRTPASPISRNEKGKEKLVEEPTEEAAKS 344
           K   EV KA        DA     T A+    +E+ K+K++ E  ++ A+S
Sbjct: 561 KITEEVRKATEFAKTAGDATTQAATEAAGDVSSEEQKQKVLLESIKQKAES 576

BLAST of Cla002308 vs. TrEMBL
Match: A0A0D4JCH9_STREE (Pneumococcal surface protein (Fragment) OS=Streptococcus pneumoniae GN=pspK PE=4 SV=1)

HSP 1 Score: 64.3 bits (155), Expect = 4.0e-07
Identity = 85/339 (25.07%), Postives = 135/339 (39.82%), Query Frame = 1

Query: 31  TTTEPLAIVRPNSPELEATSSPSSPHSRDLSEAQEVERNDESTVFE--DILDTIDEE--- 90
           T    +A     S  +E   S  +   +D+S+ QE+    +  V E  D++  IDEE   
Sbjct: 28  TVKAKMASTASASSAVEQAKSKVAEAEKDVSKGQEIYDEAQKKVQEKADVIRKIDEERQI 87

Query: 91  -DDDWLDGLVDQRMKNESRGSVPAMSGLEESNSDERPIGKVIVAVEKKKFATKRKIRPIR 150
            + +     ++ R   E +   P   G +E         K +  + KK+   K K     
Sbjct: 88  INGEVQKAYLELRQAQEEKNKYP---GRQEYTKKFDEANKKVDEITKKQ---KAKDEEYT 147

Query: 151 AREVEAEEDVPPL--RRKNEKVKDVGTSETIASSFAASSVERLAARAAEKSKKLKEDIKK 210
            +  E+  DV PL  + K EK K       +  +  A   E+ A     K KKLK ++ +
Sbjct: 148 KKMYESIPDVGPLLTKLKAEKDKLAAAKRQLEKAKVAEKTEKEA-----KVKKLKTELDQ 207

Query: 211 MNERTKSFLTARKRR---------KEEIASGVQTVKEAT-----RKAEFLCRIANIATEL 270
             E+ K      KR          + EIA     VKEA       +A  L     ++++ 
Sbjct: 208 AREKVKKQAEEDKRNYPTNTLKTLELEIAESYVKVKEAELELAKEEARELQDEDKLSSKR 267

Query: 271 EVELEVTDATRSPKAERIKKNVKKIREEKKKEKEVAITKGKLPEKNKTEVA--KAPAVVI 330
           +VE E  +A R    E +K + KK  EE K++ E A  K +   K K E A  KA     
Sbjct: 268 KVESEKAEAKR---LEELKTDRKKAAEEAKRKAEEAKQKAEEEAKRKAEEAKQKAEEEAK 327

Query: 331 RDLDAGRVRRTPASPISRNEKGKEKLVEEPTEEAAKSQK 346
           R  +  + +    +     E+ K K  EE  ++A +  K
Sbjct: 328 RKAEEAKQKAEEEAKRKAEEEAKRKAEEEAKQKAEEEAK 352

BLAST of Cla002308 vs. TrEMBL
Match: A0A0R0M2V0_9MICR (Uncharacterized protein (Fragment) OS=Pseudoloma neurophilia GN=M153_982000585 PE=4 SV=1)

HSP 1 Score: 63.5 bits (153), Expect = 6.7e-07
Identity = 63/250 (25.20%), Postives = 112/250 (44.80%), Query Frame = 1

Query: 109 MSGLEESNSDERPIGKVIVAVEKKKFATKRKIRPIRAREVEAEEDVPPLRRKNEKVKDVG 168
           +S ++ESN  ++ +   + A + K  A K+KI   + +  E ++ +   +++ EK K+  
Sbjct: 550 LSFIKESNFLDK-LKSTLAAGKNKMEAAKKKIEEAKKKAEEKKKQLEEKKKQLEKKKEEL 609

Query: 169 TSETIASSFAASSVERLAARAAEKSKKLKEDIKKMNERTKSFLTARKRRKEEIASGVQTV 228
             +       A   ++L  +  ++ +K KE++KK  E  K  L A+K+  E+    ++  
Sbjct: 610 KKQ-------AEEKKKLMEKKKQELEKKKEELKKQAEAKKQQLEAKKKELEKKKEELKKQ 669

Query: 229 KEATRKAEFLCRIANIATELEVELEVTDATRSPKAERIKKNVKKIREEKKKEKEVAITKG 288
            EA ++            E+E + +        K + + K ++K + E +K+K     K 
Sbjct: 670 AEAKKQ------------EMEKKKDELKKQAEAKKQELTKELEKKKGELEKKKGELEGKN 729

Query: 289 KLPEKNKTEVAK------APAVVIRDLDAGRVRRTPASPISRNEKGK---EKLVEEPTEE 348
             PEK   E+ K      A    + DLD     +TP     R E+ K   EK  EE TEE
Sbjct: 730 GEPEKKDEEIEKKDNNDGAKGEEVADLDKPGDEKTPV----RTEEEKTEDEKTEEEKTEE 775

Query: 349 AAKSQKTQFS 350
           A    +TQ S
Sbjct: 790 AKNDPETQES 775

BLAST of Cla002308 vs. TrEMBL
Match: A0A0R0M2V0_9MICR (Uncharacterized protein (Fragment) OS=Pseudoloma neurophilia GN=M153_982000585 PE=4 SV=1)

HSP 1 Score: 43.9 bits (102), Expect = 5.5e-01
Identity = 71/322 (22.05%), Postives = 134/322 (41.61%), Query Frame = 1

Query: 28  ATSTTTEPLAIVRPNSPELEATSSPSS-PHSRDLSEAQEVERNDESTVFEDILDTIDEED 87
           A     E ++ V P++ EL+    P+  P   D  E  E E+ ++  + +D  + I+ E 
Sbjct: 402 ADKAELEKMSRVEPDA-ELKKIIDPNDFPSPEDQKEKIETEQLEKEKLEKDQKEKIETE- 461

Query: 88  DDWLDGLVDQRMKNESRGSVPAMSGLEESNSDERPIGKVIVAVEKKKFATKRKIRPIRAR 147
                    Q  K+++         LE+    +  I       EK++   KRK + ++ +
Sbjct: 462 ---------QLEKDQTEKEQLEKEKLEDLKEKDEKIKDEKDEKEKEENLFKRKFKELKEK 521

Query: 148 EVEAEEDVPPLRRKNEKVKDVGTSETIASSFAASSVERLAARAAEKSKKLKEDIKKMNER 207
              A+E +  LR+          S +I+S F+      L     E+   +KE      ++
Sbjct: 522 ---AKEKIDSLRKSTLLNPIFSISHSISSFFSYL----LDFNENEELSFIKES--NFLDK 581

Query: 208 TKSFLTARKRRKEEIASGVQTVKEATRKAEFLCRIANIATELEVELEVTDATRSPKAERI 267
            KS L A K + E   +  + ++EA +KAE                         K +++
Sbjct: 582 LKSTLAAGKNKME---AAKKKIEEAKKKAE------------------------EKKKQL 641

Query: 268 KKNVKKIREEKKKEKEVAITKGKLPEKNKTEVAKAPAVVIRDLDAGRVRRTPASPISRNE 327
           ++  K++ ++K++ K+ A  K KL EK K E+ K    + +  +A +        +   +
Sbjct: 642 EEKKKQLEKKKEELKKQAEEKKKLMEKKKQELEKKKEELKKQAEAKK------QQLEAKK 670

Query: 328 KGKEKLVEEPTEEA-AKSQKTQ 348
           K  EK  EE  ++A AK Q+ +
Sbjct: 702 KELEKKKEELKKQAEAKKQEME 670


HSP 2 Score: 63.2 bits (152), Expect = 8.8e-07
Identity = 68/259 (26.25%), Postives = 106/259 (40.93%), Query Frame = 1

Query: 95  DQRMKNESRGSVPAMSGLEESNSDERPIGKVIVA-------VEK-KKFATKRKIRPIRAR 154
           D   K E   +  A +G EE    ++   K   A       VEK KK A K++       
Sbjct: 101 DAARKAEKEQAKQANAGKEEQQKAQKEADKAAKAAKEAEQKVEKAKKDAEKKEKEAADKA 160

Query: 155 EVEAEEDVPPLRRKNEKVKDVGTSETIASSFAASSVERLAARAAEKSKKLKEDIKKMNER 214
             EA++    +R+  EK +    +   A   AA   E+ AA AAEK +  KE  +K +  
Sbjct: 161 AKEAKKKEEEVRKAQEKAEKEAAAAVEAERKAAEKAEKEAAAAAEKERLAKEKAEK-DAA 220

Query: 215 TKSFLTARKRRKEEIASGVQTVKEATRKAEFLCRIANIATELEVELEVTDATRSPKAERI 274
            K+   A+K+ +EE  +  +  KEA +KAE   + A  A +   +    +     KAE  
Sbjct: 221 DKAAKEAKKKEEEERKAVEKAEKEAAKKAEEERKAAEKAEKEAAKKAEKERVAKEKAEAE 280

Query: 275 KKNVKKIREEKKKEKEVAITKGKLPEKNKTEVAKAPAVVIRDLDAGRVRRTPASPISRNE 334
                K  EE++  KE A       EK   E A A A   ++          A+     +
Sbjct: 281 AAAAVKKAEEEQAAKEKA-------EKEAAEAAAAKAE--QEAKEATAAAAAAAAAQAEQ 340

Query: 335 KGKEKLVEEPTEEAAKSQK 346
           +  EK  +E  +  A+ +K
Sbjct: 341 EANEKAAKEAADRKAEEEK 349

BLAST of Cla002308 vs. NCBI nr
Match: gi|805789896|ref|XP_012141575.1| (PREDICTED: microtubule-associated protein futsch, partial [Megachile rotundata])

HSP 1 Score: 71.2 bits (173), Expect = 4.6e-09
Identity = 107/400 (26.75%), Postives = 170/400 (42.50%), Query Frame = 1

Query: 42   NSPELEATSSPSSPHSRDLSEAQEVERNDESTVFEDILDTIDEEDDDWLDGLVDQRMKNE 101
            +SPE ++ S   SP  +D+ +    +  ++S V  D LD+  ++ +D L+  V ++ K  
Sbjct: 2474 SSPEKDSISE-KSPEVKDVGKVTPEDSLEKSPV--DRLDSGSQDLEDSLEKAVTEKRKES 2533

Query: 102  SRGSVPAMSGLEESNSDERPIGKVIVAVEKKKFATKRK----IRPIRAREVEAEEDVPPL 161
               SV  +S   E    +    KVI A+EK+  A   K    I   ++ E    EDV P 
Sbjct: 2534 I--SVEKLSKTPEEKEAKIDEEKVIEAIEKQIEAHPEKKTLDITAAKSPESIRSEDVSPA 2593

Query: 162  RRKNEK--------------VKDVGTSETIASSFAASSVER---LAARAAEKSKKLKEDI 221
                EK              +KD+  ++ I S+   + VE    L+   AEK+K  +E  
Sbjct: 2594 ETIFEKSPTEKRESIQILDAIKDMEPADAIVSAKEEAKVEAPKPLSKEEAEKAKTPEEKA 2653

Query: 222  KKMNERTKSFLTARKRRKEEIASGVQTVKEATRKAEFLCRIANIATELEVELEVTDATRS 281
            K   E+ KS     K  +E+  +  + VK    KA+         T  E E    +  ++
Sbjct: 2654 KTPEEKAKSPEEKAKSPEEKAKTPEEKVKTPEEKAK---------TPEEKEKTPEEKVKT 2713

Query: 282  PKAERIK---KNVKKIREEKKKE--KEVAITKGKLPEKNKTEVAKAPAVVIRDLDAGRVR 341
            P+ E++K   K  K    EK  E  KE  ITK    EK + EV+    + + D+     +
Sbjct: 2714 PE-EKVKIEEKEEKVTPREKSPESPKEKEITK---EEKEEKEVSVDRKLSVADVAVDEAK 2773

Query: 342  RTPASPISRNEK-------------GKEKLVEEP-TEEAAKSQKTQFSGLYTEVGFFLEP 401
            ++P SP+S  +K              +E + E P T E  K +KT+ + L  E     E 
Sbjct: 2774 KSP-SPVSAEDKISKEEKAEEIKEERRESVAERPVTVELDKEKKTEVADLAKEAEDKPEV 2833

BLAST of Cla002308 vs. NCBI nr
Match: gi|41814736|gb|AAS10478.1| (merozoite surface protein 3b, partial [Plasmodium vivax])

HSP 1 Score: 65.9 bits (159), Expect = 1.9e-07
Identity = 87/351 (24.79%), Postives = 139/351 (39.60%), Query Frame = 1

Query: 10  KSPKPIASASRHGSWN---------LSATSTTTEPLAIVRPNSPELEATSSPSSPHSRDL 69
           KS K +AS++   S N         ++A     E   I   N+  +  T+S ++      
Sbjct: 261 KSVKELASSAEDASKNAKKEMAKAQIAAEVAKAEKAKIEAENAKLIADTASKAAEDIAKS 320

Query: 70  SEAQEVERNDESTVFEDILDTIDEEDDDWLDGLVDQRMKNESRGSVPAMSGLEESNSDER 129
           S+A ++ +N                    +    +++ K  +  +  A + L E+ + E 
Sbjct: 321 SKAAQIAKN--------------------VSAKAEEKSKVATEAADEAANALNEAENPES 380

Query: 130 PIG----KVIVAVEKKKFATKRKIRPIRAREVEAEEDVPPLRRKNEKVKDVGTSETIASS 189
            I     K   AV   + A K K +   A EV   E         E  K+ G ++  A  
Sbjct: 381 KIDDVRKKATEAVNAAEEAKKEKSKAEIAVEVAKAE---------EAKKEAGKAKVAAKQ 440

Query: 190 FAASSVERLAARAAEKSKKLKEDIKKMNERTKSFLTARKRRKEEIASGVQTVKEATRKAE 249
            A  S    A +AA+K+ K  ++  K+ E   S L + ++   EI + V  +KE  + A 
Sbjct: 441 VADKSKLEKAIQAADKASKKTDEASKLAEEALSDLESLEKETGEIKTKVNEIKEKVQNA- 500

Query: 250 FLCRIANIATELEVELEVTDAT-RSPKAERIKKNV--KKIREEKKKEKEVAITK-GKLPE 309
                 N A E   E  + + T    KAE  KK     K+  EK KE    I K  K  E
Sbjct: 501 -----INAALEAHKEKTIAEITVEVAKAEEAKKEADNAKVAAEKAKETAEKIAKTSKSTE 560

Query: 310 KNKTEVAKAPAVVIRDLDAGRVRRTPASPISRNEKGKEKLVEEPTEEAAKS 344
           K   EV KA        DA     T A+    +E+ K+K++ E  ++ A+S
Sbjct: 561 KITEEVRKATEFAKTAGDATTQAATEAAGDVSSEEQKQKVLLESIKQKAES 576

BLAST of Cla002308 vs. NCBI nr
Match: gi|41814714|gb|AAS10467.1| (merozoite surface protein 3b, partial [Plasmodium vivax])

HSP 1 Score: 62.4 bits (150), Expect = 2.2e-06
Identity = 84/368 (22.83%), Postives = 151/368 (41.03%), Query Frame = 1

Query: 1   MASQQPTSSKSPKPIASASRHGSWNLSATSTTTEPLAIVRPNSPELEATSSPSSPHSRDL 60
           +AS    +SK+ K   + ++     ++A     E   I   N+  +  T+S ++      
Sbjct: 304 LASNAEDASKNAKKEMAKAQ-----IAAEVAKAEKAKIEAENAKLIADTASKAAEDIAKS 363

Query: 61  SEAQEVERNDESTVFEDILDTIDEEDDDWLDGLVDQRMKNESRGSVPAMSGLEESNSDER 120
           S+A ++ +N                    +    +++ K  +  +  A + L E+ + E 
Sbjct: 364 SKAAQIAKN--------------------VSAKAEEKSKVATEAADEAANALNEAENPES 423

Query: 121 PIGKVIVAVEKKKFATKRKIRPIRAREVEAEEDVP-PLRRKNEKVKDVGTSETIASSFAA 180
            I  V      KK AT+       A++ +++ ++   + +  E  K+ G ++  A   A 
Sbjct: 424 KIDDV------KKKATEAVNAAEEAKKEKSKAEIAVEVAKAEEAKKEAGKAKVAAKQVAD 483

Query: 181 SSVERLAARAAEKSKKLKEDIKKMNERTKSFLTARKRRKEEIASGVQTVKEATRKAEFLC 240
            S    A +AA+K+ K  ++  K+ E   S L + ++   EI + V  +KE  + A    
Sbjct: 484 KSKLEKAIQAADKASKKTDEASKLAEEALSDLESLEKETGEIKTKVNEIKEKVQNA---- 543

Query: 241 RIANIATELEVELEVTDAT-RSPKAERIKKNV--KKIREEKKKEKEVAITK-GKLPEKNK 300
              N A E   E  + + T    KAE  KK     K+  EK KE    I K  K  EK  
Sbjct: 544 --INAALEAHKEKTIAEITVEVAKAEEAKKEADNAKVAAEKAKETAEKIAKTSKSTEKIT 603

Query: 301 TEVAKAPAVVIRDLDAGRVRRTPAS-PISRNEKGKEKLVEEPTEEAAKSQKTQFSGL--Y 360
            EV KA        DA     T A+  +S  E+ ++K+++   ++A  + +     +   
Sbjct: 604 EEVRKATEFAKTAGDATTQAATEAAGDVSSEEQNQKKMLQSIKQKAESALEASQEAIKAK 634

BLAST of Cla002308 vs. NCBI nr
Match: gi|817063392|ref|XP_012253395.1| (PREDICTED: uncharacterized protein LOC105684553 isoform X5 [Athalia rosae])

HSP 1 Score: 62.0 bits (149), Expect = 2.8e-06
Identity = 75/302 (24.83%), Postives = 124/302 (41.06%), Query Frame = 1

Query: 1    MASQQPTSSKSPKPIASASRHGSWNLSATSTTTEPLAIVRPNSPELEATSSPSSPHSRDL 60
            + SQ+ TS+K P      S+          +  +P   V  +  + E   +P    S D 
Sbjct: 8882 ITSQKETSAKIPPVATQVSK---------KSEAKPKDQVHKSKTKPEKQETPKKDISLD- 8941

Query: 61   SEAQEVERNDESTVFEDILDTIDEEDDDWLDGLVDQRMKNESRGSVPAMSGLEESNSDER 120
             E Q +   D  T    +L +    + + L   + + +K+E   +V ++    E    E+
Sbjct: 8942 -EKQIISPVDSQTTTNKVLQSEVTVEQNIL---IKEDVKSEQSSTVDSIESKREEQKQEQ 9001

Query: 121  PIGKVIVAVEKKKFATKRKIRPIRAREVEAEEDVPPLRRKNEKVKDVGTSETIASSFAAS 180
                      K K A K+      A+  + E+   P   K EK K  G  E I +S  + 
Sbjct: 9002 -------TTSKSKKAQKQNASKKNAKPEQPEKSSKP--DKTEKSKKSG--EAITTSTTSC 9061

Query: 181  SVERLAARAAEKSKKLKEDIKKMNERTKSFLTARKRRKEEIASGVQTVKEATRKAEFLCR 240
            SV +    + E+ KK++E + K+    K      K  + EI S  Q +  A  K E    
Sbjct: 9062 SVTQSTDVSKEEIKKIEETVTKITAHPKE----EKTIETEIKSEKQILSTAVNKLE---- 9121

Query: 241  IANIATELEVELEVTDATRSPKAE-RIKKNVKKIRE-----EKKKEKEVAITKGKLPEKN 297
                  E  V+++ T     PK E ++KK+ K+  E     +KK EKE+A+ +  LP K 
Sbjct: 9122 -----EEQPVKIKAT-----PKVEHKLKKDTKREEETPQKGQKKSEKEIAVKQDSLPRKQ 9140

BLAST of Cla002308 vs. NCBI nr
Match: gi|817063398|ref|XP_012253398.1| (PREDICTED: nesprin-1 isoform X7 [Athalia rosae])

HSP 1 Score: 62.0 bits (149), Expect = 2.8e-06
Identity = 75/302 (24.83%), Postives = 124/302 (41.06%), Query Frame = 1

Query: 1    MASQQPTSSKSPKPIASASRHGSWNLSATSTTTEPLAIVRPNSPELEATSSPSSPHSRDL 60
            + SQ+ TS+K P      S+          +  +P   V  +  + E   +P    S D 
Sbjct: 7518 ITSQKETSAKIPPVATQVSK---------KSEAKPKDQVHKSKTKPEKQETPKKDISLD- 7577

Query: 61   SEAQEVERNDESTVFEDILDTIDEEDDDWLDGLVDQRMKNESRGSVPAMSGLEESNSDER 120
             E Q +   D  T    +L +    + + L   + + +K+E   +V ++    E    E+
Sbjct: 7578 -EKQIISPVDSQTTTNKVLQSEVTVEQNIL---IKEDVKSEQSSTVDSIESKREEQKQEQ 7637

Query: 121  PIGKVIVAVEKKKFATKRKIRPIRAREVEAEEDVPPLRRKNEKVKDVGTSETIASSFAAS 180
                      K K A K+      A+  + E+   P   K EK K  G  E I +S  + 
Sbjct: 7638 -------TTSKSKKAQKQNASKKNAKPEQPEKSSKP--DKTEKSKKSG--EAITTSTTSC 7697

Query: 181  SVERLAARAAEKSKKLKEDIKKMNERTKSFLTARKRRKEEIASGVQTVKEATRKAEFLCR 240
            SV +    + E+ KK++E + K+    K      K  + EI S  Q +  A  K E    
Sbjct: 7698 SVTQSTDVSKEEIKKIEETVTKITAHPKE----EKTIETEIKSEKQILSTAVNKLE---- 7757

Query: 241  IANIATELEVELEVTDATRSPKAE-RIKKNVKKIRE-----EKKKEKEVAITKGKLPEKN 297
                  E  V+++ T     PK E ++KK+ K+  E     +KK EKE+A+ +  LP K 
Sbjct: 7758 -----EEQPVKIKAT-----PKVEHKLKKDTKREEETPQKGQKKSEKEIAVKQDSLPRKQ 7776

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
NST1_YARLI1.1e-0725.31Stress response protein NST1 OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN... [more]
SMC_AQUAE4.8e-0624.90Chromosome partition protein Smc OS=Aquifex aeolicus (strain VF5) GN=smc PE=3 SV... [more]
Match NameE-valueIdentityDescription
B3L5S9_PLAKH4.7e-0823.15MAEBL OS=Plasmodium knowlesi (strain H) GN=PKH_094500 PE=4 SV=1[more]
Q64JV4_PLAVI1.4e-0724.79Merozoite surface protein 3b (Fragment) OS=Plasmodium vivax PE=4 SV=1[more]
A0A0D4JCH9_STREE4.0e-0725.07Pneumococcal surface protein (Fragment) OS=Streptococcus pneumoniae GN=pspK PE=4... [more]
A0A0R0M2V0_9MICR6.7e-0725.20Uncharacterized protein (Fragment) OS=Pseudoloma neurophilia GN=M153_982000585 P... [more]
A0A0R0M2V0_9MICR5.5e-0122.05Uncharacterized protein (Fragment) OS=Pseudoloma neurophilia GN=M153_982000585 P... [more]
Match NameE-valueIdentityDescription
gi|805789896|ref|XP_012141575.1|4.6e-0926.75PREDICTED: microtubule-associated protein futsch, partial [Megachile rotundata][more]
gi|41814736|gb|AAS10478.1|1.9e-0724.79merozoite surface protein 3b, partial [Plasmodium vivax][more]
gi|41814714|gb|AAS10467.1|2.2e-0622.83merozoite surface protein 3b, partial [Plasmodium vivax][more]
gi|817063392|ref|XP_012253395.1|2.8e-0624.83PREDICTED: uncharacterized protein LOC105684553 isoform X5 [Athalia rosae][more]
gi|817063398|ref|XP_012253398.1|2.8e-0624.83PREDICTED: nesprin-1 isoform X7 [Athalia rosae][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla002308Cla002308.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableunknownCoilCoilcoord: 263..283
score: -coord: 179..206
scor

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla002308Bottle gourd (USVL1VR-Ls)lsiwmB437
Cla002308Watermelon (97103) v2wmwmbB171