Cla97C11G207100 (gene) Watermelon (97103) v2

Name: Cla97C11G207100
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: p-loop containing nucleoside triphosphate hydrolase superfamily protein
Location: Cla97Chr11 : 992741 .. 996380 (+)
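
The Location field packs the chromosome, start, end, and strand into one string. Below is a minimal sketch of how it could be parsed and turned into a span length. It assumes the coordinates are 1-based and inclusive, which is the usual GFF convention but is not stated on this page. The function names `parse_location` and `span_length` are hypothetical.

```python
import re

# Pattern for strings such as "Cla97Chr11 : 992741 .. 996380 (+)".
_LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = _LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cla97Chr11 : 992741 .. 996380 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 996380 - 992741 + 1 = 3640 bases on the forward strand.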
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGGCCCCAAGCACGAAAGAGGAGTGGACCTCATTGGCGCTTGAGGTGTGACCAATTTTTCAATGCTAAATTACAGATGACAAGTACCCCTCTTCATTATTCTAATCAGTATTGACACTCTTAATTCATGTACTACTTGTTATAAAACCCTCCAGGATTGGATCTATATACCTTTGCTAACAAACAAAATGAATTTCAAATATTTAGGAATGTTTGTGTGTAAATATTAGTTATTCAAAATTATATATTTTTGGTCAAATTACATTTTTAATCCCTAAAATTTCAAGAACTAAAAAAATTTATTTAGCTTTGAGTTTAATTTTTATTTAGCCTCTAGTCCTTTAAATTTAAAAGTCGAAACTAAGTTTTCAAGTTCAATTGACTAATAGAAAGAGTGTTATGTAGAGTTTTCCCGTGTAGAGCATACTAGTAACAAACGGTAGTTCTCAAGCTCAAGTGACAATTATTTAATATAGTGACAATTATTTCAAACAAACAAAGAAAGAGAGAGAGTTTTTTTAAGAGACAAACTAATAATAATAATAATAATAATAATAATAATAATAATAATAATGGAAAACAAAGAAAATATGAAACTCTCCCAACTACGAAAATAAACCAAGGTTGAGGCTTCAGGAAATTGTTTTAATCTATAGAGAGACTTCTTGAGTATACACTCCTTTCCTTTAGTATAGTCTTCAAATCTCTAAGGTATGTTCGAATACATTCCTACACTATGCCGCCATTTAGAAATGCATTTCATCAAATTGAAACAACGGCCAATCTAACTTTGTTGCAATAGAAAGAAGTATCCAAATAGTATTTAACTTGGCAACTAGAGCAAATGTTTTGTGCTAGTGCTGGTCAATCCCATAAACTTGAGTGAATCCTCTAATCTAAGGAGTCACAGAAGCTCTTAAAGGTGGTGACAAAATAGTGTAAGATAAATTATATTGAATAGCATGTTGGAGTTGTTAGTTTGCAAGATATGTGTGGAATTGTGGGGCCCACAAACTCCTTGGACCAAACAAGGAGTGGAATTTACAACTCCATTATCCCCAACTCCAACTCCTTGGGCCAAACACCCCCTTAGTGTTTTAGTGAAGGAGTATATGTATATGTAGCAACTAGTGAAAAGGAAATGACAGCTTGGGGTGTGTGGAAAGTTGATAGTTGGGATTCTTGTTTGGCATCTGGGGAATGTTGAACTTCAAAATATTTTTAGTTGTGGAACTTGTCCATTTTAATCTACCCAGACAAATATATATACTGGTTTGGTTGTTAGGATATTAACTGCCAGATAGATTTCATCCGCATTTCTGTTGCAAGTTATTTTCTGCTTATTATTTTATACATTATGGCAGTTCTTTTCAGAGGGTGTTTATGGGGCAACATCTTTGTGCTTTAAAAGAGCTGAAGACAGACGTAGAAGTGAATGGGCCAGGGCTGCTTCTCTTTGTGCAACTGCTGGCATTTTGGATGGTTCAAATCCTCAAATAGCTTGTAATGGCCTTCGAGAAGCTGCTAAATTTTATATTTCTGTGGATCGTGCTGAGATTGCTGCTAAGTGCTACATTAAGTTAAAAGAATATAAAACAGCAGGTACACAACTTCGGTTTTTGGAAGCATCATACATGTTTTTGAGCTAGTTTGAAGTCTAAGACTGTTCAAGGATCATAATGAGGAGTCAAATAGGCCTAGATGGTAAGGAAGGAATAATATGATAATTAGCAAGATAGTTTGTCAGGCCATTGTGGTGATATGATGTGGAACACCCATCTTAAATTGTCTTATTAATATGTCTCTGTTGTTACTGTAGCTCATACATATCTAACAAAATGTGGAGAAGCAAGGTTAGAGAATGCTGGTGATTGTTATATGTTGGCCAAATGCTACAAATTGGCTGCTGAGGCATATTCAAGGGGCAGATGTTTCTTAAAATTCTTTGATGTCTGCACTGCTGCAAATCTTTTTGACATGGGGTTGCAAGTGATCTG
CAGCTGGAGGAAACATGATGATGTTAATCTGATTAAAAAATGTCAACATATCAAAAAGACCTGGCATTTGTTTCTGGTGAAAGGTGCCCTTCCCTATCACCAGCTTCAAAATTTTTGTTCCACGATGAAGTTTGTCAAAAGCTTCGACTCCATTGATGAAAAATATTCATTCCTCAGGACTTTAGGTCTCTCTGAGAAAAAATTGTTGCAAGAGGAAGAACTAAACGAGGTGGTGCATAAGGAAACCATATCTCAACATGAAGGGTCGTTTTCACTAGGATTGCAGCTTCAACCAAAACTTGAGTCAGTCTTAGTACACAAGGAAACGTCTCAAAATGAAACAAAGACTAAGGATAAGATGAATGTTGCTAATAACATGTTAACAACTAAAGGGTCTTCACGAGGGTCGAAGTTTCAACCTAAGCTCAAGTTGGTATGGAAGGAAACAACATCACAAAATGATACAAAGACGAAGGATAGGATGAAAGTGGCCGATAACATGTCTGTGGCTAAAGGGTCTTCACAAGGATTGCAATTTCAATCTAAGCTCGAGATGAAAACAGTATCACAAAATGATACGACAACGAGGGATAAGATGAAAATTGTTGAAACCTTGTCAACGGCTAAAGGGTATTCACAAGGATTGAAGTGTCAATCTGAGCTCATGTCGGTATTGAAGGAAACAACATCCCAAAATGATACAAAGACGAAGGATAAGATGAAACTGGCAGCTAAAGAGTCTTCACAAGGTTTGCAGTTTCAATGTAAGCTCGAGTTGGAAACCATATCACAAAATGGTACGACAACAAGGGATAATATGAAAGTTGCTGAAGACATGTCAATAGCTAAGGGGTCTTCACAAGGGTTGAAGTTTCAACCTAAGTTCAAGTCGGTATGGAAGGAAACAACATACCAAAATGGTACAAATACAAAGGGTAAGATGAAACTGGCTGATAACATGTCTACAGCGAAGGGGTCTTCACAAGGATTGCAGTTTAAATCCAAGCTTGAGTCGAGAACAGTGTCTCAAAATGATGTGATGACAATGGATAAGATCCAAGTTTGCAGAAGTCATGTCAACAAGTGAAGAGTCTACAAAAGGATTGCAGTTTCAACCAAAGCAGGAGTCAGTATGCAAGGAGAAAGCATCTCAAAATGATTCGAAGATTGGGGATAATCTGAAAGTTGCCCCTTTCATCTCAACCACTAAAGACTCTTCATACAAGTTCCAAATTAAGCCAAAGATTGCGTATGCCAAAGAGGAAATTGCAGCTCAAAATAATGTGAAGATTGAGAAAGACGCAGTGAATAATGTAAACAACAAGGCAGAAGCTTCACAAAAGCTGCAGCAGTGCAATCAGAAGCTCAAAAATGTACAAAAGGAAACAACAAGCTCGAGCGATTCAAGAGTGAAGAAGGATAAGATGAAAGAATCTGTTAACTTGTCAGAAGCTGGAGATCCATCACAACAGCTGCAAACTGAACAGAAACAGCTAAAACAAAAGGATGGGGAAGTTGAGAAGGGTAAACAGAAAGTGGCAGATCACAAGTTCATAGCCAAGCGTTACTGGAGAAAGGTAACAGAAAATGGTATGAAATCCAACTTTCAAGAGCATTTAGACATACGGAGGTTTCAGTAG

mRNA sequence

ATGAAGGCCCCAAGCACGAAAGAGGAGTGGACCTCATTGGCGCTTGAGTTCTTTTCAGAGGGTGTTTATGGGGCAACATCTTTGTGCTTTAAAAGAGCTGAAGACAGACGTAGAAGTGAATGGGCCAGGGCTGCTTCTCTTTGTGCAACTGCTGGCATTTTGGATGGTTCAAATCCTCAAATAGCTTGTAATGGCCTTCGAGAAGCTGCTAAATTTTATATTTCTGTGGATCGTGCTGAGATTGCTGCTAAGTGCTACATTAAGTTAAAAGAATATAAAACAGCAGCTCATACATATCTAACAAAATGTGGAGAAGCAAGGTTAGAGAATGCTGGTGATTGTTATATGTTGGCCAAATGCTACAAATTGGCTGCTGAGGCATATTCAAGGGGCAGATGTTTCTTAAAATTCTTTGATGTCTGCACTGCTGCAAATCTTTTTGACATGGGGTTGCAAGTGATCTGCAGCTGGAGGAAACATGATGATGTTAATCTGATTAAAAAATGTCAACATATCAAAAAGACCTGGCATTTGTTTCTGGTGAAAGGTGCCCTTCCCTATCACCAGCTTCAAAATTTTTGTTCCACGATGAAGTTTGTCAAAAGCTTCGACTCCATTGATGAAAAATATTCATTCCTCAGGACTTTAGGTCTCTCTGAGAAAAAATTGTTGCAAGAGGAAGAACTAAACGAGGTGGTGCATAAGGAAACCATATCTCAACATGAAGGGTCGTTTTCACTAGGATTGCAGCTTCAACCAAAACTTGAGTCAGTCTTAGTACACAAGGAAACGTCTCAAAATGAAACAAAGACTAAGGATAAGATGAATGTTGCTAATAACATGTTAACAACTAAAGGGTCTTCACGAGGGTCGAAGTTTCAACCTAAGCTCAAGTTGGTATGGAAGGAAACAACATCACAAAATGATACAAAGACGAAGGATAGGATGAAAGTGGCCGATAACATGTCTGTGGCTAAAGGGTCTTCACAAGGATTGCAATTTCAATCTAAGCTCGAGATGAAAACAGTATCACAAAATGATACGACAACGAGGGATAAGATGAAAATTGTTGAAACCTTGTCAACGGCTAAAGGGTATTCACAAGGATTGAAGTGTCAATCTGAGCTCATGTCGGTATTGAAGGAAACAACATCCCAAAATGATACAAAGACGAAGGATAAGATGAAACTGGCAGCTAAAGAGTCTTCACAAGGTTTGCAGTTTCAATGTAAGCTCGAGTTGGAAACCATATCACAAAATGGTACGACAACAAGGGATAATATGAAAGTTGCTGAAGACATGTCAATAGCTAAGGGGTCTTCACAAGGGTTGAAGTTTCAACCTAAGTTCAAGTCGGTATGGAAGGAAACAACATACCAAAATGGTACAAATACAAAGGGTAAGATGAAACTGGCTGATAACATGTCTACAGCGAAGGGGTCTTCACAAGGATTGCAGTTTAAATCCAAGCTTGAGTCGAGAACAGTGTCTCAAAATGATGTGATGACAATGGATAAGATCCAAGAGTCAGTATGCAAGGAGAAAGCATCTCAAAATGATTCGAAGATTGGGGATAATCTGAAAGTTGCCCCTTTCATCTCAACCACTAAAGACTCTTCATACAAGTTCCAAATTAAGCCAAAGATTGCGTATGCCAAAGAGGAAATTGCAGCTCAAAATAATGTGAAGATTGAGAAAGACGCAGTGAATAATGTAAACAACAAGGCAGAAGCTTCACAAAAGCTGCAGCAGTGCAATCAGAAGCTCAAAAATGTACAAAAGGAAACAACAAGCTCGAGCGATTCAAGAGTGAAGAAGGATAAGATGAAAGAATCTGTTAACTTGTCAGAAGCTGGAGATCCATCACAACAGCTGCAAACTGAACAGAAACAGCTAAAACAAAAGGATGGGGAAGTTGAGAAGGGTAAACAGAAAGTGGCAGATCACAAGTTCATAGCCAAGCGTTACTGGAGAAAGGTAACAGAAAATGGTATGAAATC
CAACTTTCAAGAGCATTTAGACATACGGAGGTTTCAGTAG

Coding sequence (CDS)

ATGAAGGCCCCAAGCACGAAAGAGGAGTGGACCTCATTGGCGCTTGAGTTCTTTTCAGAGGGTGTTTATGGGGCAACATCTTTGTGCTTTAAAAGAGCTGAAGACAGACGTAGAAGTGAATGGGCCAGGGCTGCTTCTCTTTGTGCAACTGCTGGCATTTTGGATGGTTCAAATCCTCAAATAGCTTGTAATGGCCTTCGAGAAGCTGCTAAATTTTATATTTCTGTGGATCGTGCTGAGATTGCTGCTAAGTGCTACATTAAGTTAAAAGAATATAAAACAGCAGCTCATACATATCTAACAAAATGTGGAGAAGCAAGGTTAGAGAATGCTGGTGATTGTTATATGTTGGCCAAATGCTACAAATTGGCTGCTGAGGCATATTCAAGGGGCAGATGTTTCTTAAAATTCTTTGATGTCTGCACTGCTGCAAATCTTTTTGACATGGGGTTGCAAGTGATCTGCAGCTGGAGGAAACATGATGATGTTAATCTGATTAAAAAATGTCAACATATCAAAAAGACCTGGCATTTGTTTCTGGTGAAAGGTGCCCTTCCCTATCACCAGCTTCAAAATTTTTGTTCCACGATGAAGTTTGTCAAAAGCTTCGACTCCATTGATGAAAAATATTCATTCCTCAGGACTTTAGGTCTCTCTGAGAAAAAATTGTTGCAAGAGGAAGAACTAAACGAGGTGGTGCATAAGGAAACCATATCTCAACATGAAGGGTCGTTTTCACTAGGATTGCAGCTTCAACCAAAACTTGAGTCAGTCTTAGTACACAAGGAAACGTCTCAAAATGAAACAAAGACTAAGGATAAGATGAATGTTGCTAATAACATGTTAACAACTAAAGGGTCTTCACGAGGGTCGAAGTTTCAACCTAAGCTCAAGTTGGTATGGAAGGAAACAACATCACAAAATGATACAAAGACGAAGGATAGGATGAAAGTGGCCGATAACATGTCTGTGGCTAAAGGGTCTTCACAAGGATTGCAATTTCAATCTAAGCTCGAGATGAAAACAGTATCACAAAATGATACGACAACGAGGGATAAGATGAAAATTGTTGAAACCTTGTCAACGGCTAAAGGGTATTCACAAGGATTGAAGTGTCAATCTGAGCTCATGTCGGTATTGAAGGAAACAACATCCCAAAATGATACAAAGACGAAGGATAAGATGAAACTGGCAGCTAAAGAGTCTTCACAAGGTTTGCAGTTTCAATGTAAGCTCGAGTTGGAAACCATATCACAAAATGGTACGACAACAAGGGATAATATGAAAGTTGCTGAAGACATGTCAATAGCTAAGGGGTCTTCACAAGGGTTGAAGTTTCAACCTAAGTTCAAGTCGGTATGGAAGGAAACAACATACCAAAATGGTACAAATACAAAGGGTAAGATGAAACTGGCTGATAACATGTCTACAGCGAAGGGGTCTTCACAAGGATTGCAGTTTAAATCCAAGCTTGAGTCGAGAACAGTGTCTCAAAATGATGTGATGACAATGGATAAGATCCAAGAGTCAGTATGCAAGGAGAAAGCATCTCAAAATGATTCGAAGATTGGGGATAATCTGAAAGTTGCCCCTTTCATCTCAACCACTAAAGACTCTTCATACAAGTTCCAAATTAAGCCAAAGATTGCGTATGCCAAAGAGGAAATTGCAGCTCAAAATAATGTGAAGATTGAGAAAGACGCAGTGAATAATGTAAACAACAAGGCAGAAGCTTCACAAAAGCTGCAGCAGTGCAATCAGAAGCTCAAAAATGTACAAAAGGAAACAACAAGCTCGAGCGATTCAAGAGTGAAGAAGGATAAGATGAAAGAATCTGTTAACTTGTCAGAAGCTGGAGATCCATCACAACAGCTGCAAACTGAACAGAAACAGCTAAAACAAAAGGATGGGGAAGTTGAGAAGGGTAAACAGAAAGTGGCAGATCACAAGTTCATAGCCAAGCGTTACTGGAGAAAGGTAACAGAAAATGGTATGAAATC
CAACTTTCAAGAGCATTTAGACATACGGAGGTTTCAGTAG

Protein sequence

MKAPSTKEEWTSLALEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQIACNGLREAAKFYISVDRAEIAAKCYIKLKEYKTAAHTYLTKCGEARLENAGDCYMLAKCYKLAAEAYSRGRCFLKFFDVCTAANLFDMGLQVICSWRKHDDVNLIKKCQHIKKTWHLFLVKGALPYHQLQNFCSTMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNEVVHKETISQHEGSFSLGLQLQPKLESVLVHKETSQNETKTKDKMNVANNMLTTKGSSRGSKFQPKLKLVWKETTSQNDTKTKDRMKVADNMSVAKGSSQGLQFQSKLEMKTVSQNDTTTRDKMKIVETLSTAKGYSQGLKCQSELMSVLKETTSQNDTKTKDKMKLAAKESSQGLQFQCKLELETISQNGTTTRDNMKVAEDMSIAKGSSQGLKFQPKFKSVWKETTYQNGTNTKGKMKLADNMSTAKGSSQGLQFKSKLESRTVSQNDVMTMDKIQESVCKEKASQNDSKIGDNLKVAPFISTTKDSSYKFQIKPKIAYAKEEIAAQNNVKIEKDAVNNVNNKAEASQKLQQCNQKLKNVQKETTSSSDSRVKKDKMKESVNLSEAGDPSQQLQTEQKQLKQKDGEVEKGKQKVADHKFIAKRYWRKVTENGMKSNFQEHLDIRRFQ
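
The protein sequence above should follow from the CDS by reading it three bases at a time. Below is a minimal sketch of that check, assuming the standard nuclear genetic code (the page does not name a translation table). `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order for each position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGAAGGCCCCAAGCACGAAAGAGGAGTGG"))
# Last five codons of the CDS, ending in the TAG stop.
print(translate("CGGAGGTTTCAGTAG"))
```

The two fragments translate to MKAPSTKEEW and RRFQ, which match the start and end of the protein sequence listed on this page.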
BLAST of Cla97C11G207100 vs. NCBI nr
Match: KGN50807.1 (hypothetical protein Csa_5G266850 [Cucumis sativus])

HSP 1 Score: 381.7 bits (979), Expect = 5.1e-102
Identity = 233/302 (77.15%), Positives = 251/302 (83.11%), Query Frame = 0

Query: 1    MKAPSTKEEWTSLALEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQ 60
            MK PSTKEEW+SL LEFFSEGVYGA SLCF+RAEDRRRSEWARAAS CATA      NPQ
Sbjct: 970  MKVPSTKEEWSSLGLEFFSEGVYGAASLCFERAEDRRRSEWARAASFCATA------NPQ 1029

Query: 61   IACNGLREAAKFYISVDRAEXXXXXXXXXXXXXXXXHTYLTKCGEAXXXXXXXXXXXXXX 120
            I+ N LREAA+ YIS+DRAE  XXXXXXXXX     +TYLTKCGEA  XXXXXXXXXXXX
Sbjct: 1030 ISRNALREAAEIYISLDRAEIAXXXXXXXXXYKTAAYTYLTKCGEARLXXXXXXXXXXXX 1089

Query: 121  XXXXXXXXSRGRCFLKFFDVCTAANLFDMGLQVICSWRKHDDVNLIKKCQHIKKTWHLFL 180
            XXXXXXXXS GRCFLKFFDVCTAANLFD GLQ ICSWRK+D+V+LIKKC+HIK+ WHLFL
Sbjct: 1090 XXXXXXXXSMGRCFLKFFDVCTAANLFDTGLQGICSWRKYDNVDLIKKCKHIKEAWHLFL 1149

Query: 181  VKGALPYHQLQNFCSTMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNEVVHKETISQ 240
             KGAL YHQLQNF S M+FV+SFDSIDEKY FL TLGLSE K+LQEEEL       TIS+
Sbjct: 1150 WKGALHYHQLQNFGSMMRFVESFDSIDEKYLFLGTLGLSENKMLQEEEL-------TISE 1209

Query: 241  HEGSFSLGLQLQPKLESVLVHKETSQNETKTKDKMNVANNMLTTKGSSRGSKFQPKLKLV 300
            +EG  S GL LQPKL SV VHKETSQN+TKTK KM VANN+ T KGSSRGSKFQPKLK V
Sbjct: 1210 NEGFHSPGLHLQPKLVSVSVHKETSQNDTKTKGKMKVANNISTAKGSSRGSKFQPKLKSV 1258

Query: 301  WK 303
            WK
Sbjct: 1270 WK 1258
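
The Identity and Positives figures in each HSP header are counts over the alignment length, with the percentage in parentheses. Below is a minimal sketch that recomputes those percentages from a header line as a sanity check. The regular expression and the function name `recompute_percentages` are assumptions made for illustration, and the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches e.g. "Identity = 233/302 (77.15%), Positives = 251/302 (83.11%)".
_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def recompute_percentages(header_line):
    """Return (identity %, positives %) rounded to two decimals."""
    match = _STATS.search(header_line)
    if match is None:
        raise ValueError("no Identity/Positives statistics found")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 233/302 (77.15%), Positives = 251/302 (83.11%), Query Frame = 0"
print(recompute_percentages(line))
```

For the first hit, 233/302 and 251/302 give 77.15% and 83.11%, in agreement with the reported values.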

BLAST of Cla97C11G207100 vs. NCBI nr
Match: XP_011656193.1 (PREDICTED: uncharacterized protein LOC101212224 [Cucumis sativus])

HSP 1 Score: 357.1 bits (915), Expect = 1.4e-94
Identity = 221/288 (76.74%), Positives = 239/288 (82.99%), Query Frame = 0

Query: 15   LEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQIACNGLREAAKFYI 74
            +EFFSEGVYGA SLCF+RAEDRRRSEWARAAS CATA      NPQI+ N LREAA+ YI
Sbjct: 1797 VEFFSEGVYGAASLCFERAEDRRRSEWARAASFCATA------NPQISRNALREAAEIYI 1856

Query: 75   SVDRAEXXXXXXXXXXXXXXXXHTYLTKCGEAXXXXXXXXXXXXXXXXXXXXXXSRGRCF 134
            S+DRAE  XXXXXXXXX     +TYLTKCGEA  XXXXXXXXXXXXXXXXXXXXS GRCF
Sbjct: 1857 SLDRAEIAXXXXXXXXXYKTAAYTYLTKCGEARLXXXXXXXXXXXXXXXXXXXXSMGRCF 1916

Query: 135  LKFFDVCTAANLFDMGLQVICSWRKHDDVNLIKKCQHIKKTWHLFLVKGALPYHQLQNFC 194
            LKFFDVCTAANLFD GLQ ICSWRK+D+V+LIKKC+HIK+ WHLFL KGAL YHQLQNF 
Sbjct: 1917 LKFFDVCTAANLFDTGLQGICSWRKYDNVDLIKKCKHIKEAWHLFLWKGALHYHQLQNFG 1976

Query: 195  STMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNEVVHKETISQHEGSFSLGLQLQPK 254
            S M+FV+SFDSIDEKY FL TLGLSE K+LQEEEL       TIS++EG  S GL LQPK
Sbjct: 1977 SMMRFVESFDSIDEKYLFLGTLGLSENKMLQEEEL-------TISENEGFHSPGLHLQPK 2036

Query: 255  LESVLVHKETSQNETKTKDKMNVANNMLTTKGSSRGSKFQPKLKLVWK 303
            L SV VHKETSQN+TKTK KM VANN+ T KGSSRGSKFQPKLK VWK
Sbjct: 2037 LVSVSVHKETSQNDTKTKGKMKVANNISTAKGSSRGSKFQPKLKSVWK 2071

BLAST of Cla97C11G207100 vs. NCBI nr
Match: XP_022157076.1 (uncharacterized protein LOC111023887 [Momordica charantia] >XP_022157077.1 uncharacterized protein LOC111023887 [Momordica charantia])

HSP 1 Score: 265.0 bits (676), Expect = 7.0e-67
Identity = 138/234 (58.97%), Positives = 161/234 (68.80%), Query Frame = 0

Query: 1    MKAPSTKEEWTSLALEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQ 60
            MK PS KEEW+S+ +E FSEGVYGA SLCF+RA+DR R E ARAASL ATAGILDGSN Q
Sbjct: 1651 MKVPSMKEEWSSVGVELFSEGVYGAASLCFERAQDRHRKELARAASLRATAGILDGSNSQ 1710

Query: 61   IACNGLREAAKFYISVDRAEXXXXXXXXXXXXXXXXHTYLTKCGEAXXXXXXXXXXXXXX 120
            + CN LREAA+ YIS+DRAE                + YLTKCGEA              
Sbjct: 1711 MTCNALREAAEIYISMDRAEVAAKCFIELKEYKTAAYIYLTKCGEAKLEDAGDCYMLAEC 1770

Query: 121  XXXXXXXXSRGRCFLKFFDVCTAANLFDMGLQVICSWRKHDDVNLIKKCQHIKKTWHLFL 180
                    SRG+CFLKF +VCT AN+FDMGLQVI  W K DDV+LI+K Q I   WH+FL
Sbjct: 1771 YKLAAEAYSRGKCFLKFLNVCTVANIFDMGLQVISRWSKFDDVDLIEKSQDI---WHVFL 1830

Query: 181  VKGALPYHQLQNFCSTMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNEVVH 235
             KGAL YHQLQ+F S MKFV SF+S+DEKYSFLR LGLSEK+LL E+++ E  +
Sbjct: 1831 EKGALHYHQLQDFSSMMKFVDSFNSMDEKYSFLRNLGLSEKELLLEKDVKETAN 1881

BLAST of Cla97C11G207100 vs. NCBI nr
Match: XP_016901636.1 (PREDICTED: uncharacterized protein LOC103495157 [Cucumis melo])

HSP 1 Score: 263.1 bits (671), Expect = 2.7e-66
Identity = 138/233 (59.23%), Positives = 159/233 (68.24%), Query Frame = 0

Query: 1    MKAPSTKEEWTSLALEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQ 60
            MKA STKEEW+SL LE FS+GVYGA SLCF+RAEDR R EW RAASL ATAG L+ SNPQ
Sbjct: 1670 MKAQSTKEEWSSLGLELFSDGVYGAASLCFERAEDRLRKEWTRAASLRATAGSLNASNPQ 1729

Query: 61   IACNGLREAAKFYISVDRAEXXXXXXXXXXXXXXXXHTYLTKCGEAXXXXXXXXXXXXXX 120
            +ACN LREAA+ YIS+D AE                + YLTKCGEA              
Sbjct: 1730 MACNLLREAAEIYISMDHAEAAAKCFLELKEYKTAAYIYLTKCGEAKLEDAGDCYMLAEC 1789

Query: 121  XXXXXXXXSRGRCFLKFFDVCTAANLFDMGLQVICSWRKHDDVNLIKKCQHIKKTWHLFL 180
                    SRGRC  KF +VCT ANLF+M LQVI  WRK D+ +LI+KC+ IKK W +FL
Sbjct: 1790 YKLAAEAYSRGRCVFKFLNVCTVANLFEMALQVISDWRKCDNDDLIEKCEDIKKVWQVFL 1849

Query: 181  VKGALPYHQLQNFCSTMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNEVV 234
             KGAL YH+LQ+F S MKFVKSFDS+ EK SFLRTLGLSEK LL EE++ E +
Sbjct: 1850 EKGALHYHELQDFHSMMKFVKSFDSMVEKCSFLRTLGLSEKILLLEEDVEESI 1902

BLAST of Cla97C11G207100 vs. NCBI nr
Match: XP_022956552.1 (uncharacterized protein LOC111458260 isoform X2 [Cucurbita moschata])

HSP 1 Score: 262.3 bits (669), Expect = 4.5e-66
Identity = 141/233 (60.52%), Positives = 160/233 (68.67%), Query Frame = 0

Query: 1    MKAPSTKEEWTSLALEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQ 60
            MKAPSTKEEW+SL LEFF EGVY A SLCF+RA+DR R EWARAASL ATA ILDGSNPQ
Sbjct: 1631 MKAPSTKEEWSSLGLEFFCEGVYVAASLCFERADDRLRREWARAASLRATACILDGSNPQ 1690

Query: 61   IACNGLREAAKFYISVDRAEXXXXXXXXXXXXXXXXHTYLTKCGEAXXXXXXXXXXXXXX 120
            +A N L+EAA+ YIS+DRAE                + Y  KCGEA              
Sbjct: 1691 MARNALQEAAEIYISMDRAEVAAKCFIELKEYQTAAYIYSKKCGEAKLEDAGDCYMLAEC 1750

Query: 121  XXXXXXXXSRGRCFLKFFDVCTAANLFDMGLQVICSWRKH--DDVNLIKKCQHIKKTWHL 180
                    SRGR FLKF +VCT ANLFDMGLQVICSWRKH   D +LI+KC   K+ WH+
Sbjct: 1751 YELAAEAYSRGRFFLKFLNVCTVANLFDMGLQVICSWRKHCDHDDDLIEKCLDFKEIWHV 1810

Query: 181  FLVKGALPYHQLQNFCSTMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNE 232
            FL KGAL YHQLQ+F S +KFV  FDS+DEK SFLRTLGLSEK LL E+++ E
Sbjct: 1811 FLQKGALHYHQLQDFRSILKFVDIFDSMDEKCSFLRTLGLSEKILLLEKDVEE 1863

BLAST of Cla97C11G207100 vs. TrEMBL
Match: tr|A0A0A0KQV9|A0A0A0KQV9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G266850 PE=4 SV=1)

HSP 1 Score: 381.7 bits (979), Expect = 3.4e-102
Identity = 233/302 (77.15%), Positives = 251/302 (83.11%), Query Frame = 0

Query: 1    MKAPSTKEEWTSLALEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQ 60
            MK PSTKEEW+SL LEFFSEGVYGA SLCF+RAEDRRRSEWARAAS CATA      NPQ
Sbjct: 970  MKVPSTKEEWSSLGLEFFSEGVYGAASLCFERAEDRRRSEWARAASFCATA------NPQ 1029

Query: 61   IACNGLREAAKFYISVDRAEXXXXXXXXXXXXXXXXHTYLTKCGEAXXXXXXXXXXXXXX 120
            I+ N LREAA+ YIS+DRAE  XXXXXXXXX     +TYLTKCGEA  XXXXXXXXXXXX
Sbjct: 1030 ISRNALREAAEIYISLDRAEIAXXXXXXXXXYKTAAYTYLTKCGEARLXXXXXXXXXXXX 1089

Query: 121  XXXXXXXXSRGRCFLKFFDVCTAANLFDMGLQVICSWRKHDDVNLIKKCQHIKKTWHLFL 180
            XXXXXXXXS GRCFLKFFDVCTAANLFD GLQ ICSWRK+D+V+LIKKC+HIK+ WHLFL
Sbjct: 1090 XXXXXXXXSMGRCFLKFFDVCTAANLFDTGLQGICSWRKYDNVDLIKKCKHIKEAWHLFL 1149

Query: 181  VKGALPYHQLQNFCSTMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNEVVHKETISQ 240
             KGAL YHQLQNF S M+FV+SFDSIDEKY FL TLGLSE K+LQEEEL       TIS+
Sbjct: 1150 WKGALHYHQLQNFGSMMRFVESFDSIDEKYLFLGTLGLSENKMLQEEEL-------TISE 1209

Query: 241  HEGSFSLGLQLQPKLESVLVHKETSQNETKTKDKMNVANNMLTTKGSSRGSKFQPKLKLV 300
            +EG  S GL LQPKL SV VHKETSQN+TKTK KM VANN+ T KGSSRGSKFQPKLK V
Sbjct: 1210 NEGFHSPGLHLQPKLVSVSVHKETSQNDTKTKGKMKVANNISTAKGSSRGSKFQPKLKSV 1258

Query: 301  WK 303
            WK
Sbjct: 1270 WK 1258

BLAST of Cla97C11G207100 vs. TrEMBL
Match: tr|A0A1S4E082|A0A1S4E082_CUCME (uncharacterized protein LOC103495157 OS=Cucumis melo OX=3656 GN=LOC103495157 PE=4 SV=1)

HSP 1 Score: 263.1 bits (671), Expect = 1.8e-66
Identity = 138/233 (59.23%), Positives = 159/233 (68.24%), Query Frame = 0

Query: 1    MKAPSTKEEWTSLALEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQ 60
            MKA STKEEW+SL LE FS+GVYGA SLCF+RAEDR R EW RAASL ATAG L+ SNPQ
Sbjct: 1670 MKAQSTKEEWSSLGLELFSDGVYGAASLCFERAEDRLRKEWTRAASLRATAGSLNASNPQ 1729

Query: 61   IACNGLREAAKFYISVDRAEXXXXXXXXXXXXXXXXHTYLTKCGEAXXXXXXXXXXXXXX 120
            +ACN LREAA+ YIS+D AE                + YLTKCGEA              
Sbjct: 1730 MACNLLREAAEIYISMDHAEAAAKCFLELKEYKTAAYIYLTKCGEAKLEDAGDCYMLAEC 1789

Query: 121  XXXXXXXXSRGRCFLKFFDVCTAANLFDMGLQVICSWRKHDDVNLIKKCQHIKKTWHLFL 180
                    SRGRC  KF +VCT ANLF+M LQVI  WRK D+ +LI+KC+ IKK W +FL
Sbjct: 1790 YKLAAEAYSRGRCVFKFLNVCTVANLFEMALQVISDWRKCDNDDLIEKCEDIKKVWQVFL 1849

Query: 181  VKGALPYHQLQNFCSTMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNEVV 234
             KGAL YH+LQ+F S MKFVKSFDS+ EK SFLRTLGLSEK LL EE++ E +
Sbjct: 1850 EKGALHYHELQDFHSMMKFVKSFDSMVEKCSFLRTLGLSEKILLLEEDVEESI 1902

BLAST of Cla97C11G207100 vs. TrEMBL
Match: tr|A0A0A0KQT9|A0A0A0KQT9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G262250 PE=4 SV=1)

HSP 1 Score: 258.8 bits (660), Expect = 3.3e-65
Identity = 135/233 (57.94%), Positives = 158/233 (67.81%), Query Frame = 0

Query: 1    MKAPSTKEEWTSLALEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQ 60
            MKA STKEEW+SL LE FSEGVYGA SLCF+RAEDR R EW RAASL ATA  L+ SNPQ
Sbjct: 1056 MKAQSTKEEWSSLGLELFSEGVYGAASLCFERAEDRLRKEWTRAASLRATAATLNASNPQ 1115

Query: 61   IACNGLREAAKFYISVDRAEXXXXXXXXXXXXXXXXHTYLTKCGEAXXXXXXXXXXXXXX 120
            +ACN LREAA+ YIS+D AE                + YL+KCGEA              
Sbjct: 1116 MACNVLREAAEIYISMDHAEAAAKCFLELKEYKTAAYIYLSKCGEAKLEDAGDCYMLAEC 1175

Query: 121  XXXXXXXXSRGRCFLKFFDVCTAANLFDMGLQVICSWRKHDDVNLIKKCQHIKKTWHLFL 180
                    SRGRCF KF +VCT A+LF+M LQVI  WRK DD +LI+KC+ IKK W +FL
Sbjct: 1176 YKLAAEAYSRGRCFFKFLNVCTVAHLFEMALQVISDWRKCDDDDLIEKCEDIKKVWQVFL 1235

Query: 181  VKGALPYHQLQNFCSTMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNEVV 234
             KGAL YH+L++  S MKFVKSFDS+ +K SFLRTLGLSEK LL EE++ E +
Sbjct: 1236 EKGALHYHELEDVHSMMKFVKSFDSMVDKCSFLRTLGLSEKILLLEEDVEESI 1288

BLAST of Cla97C11G207100 vs. TrEMBL
Match: tr|A0A2G5E3C9|A0A2G5E3C9_AQUCA (Uncharacterized protein OS=Aquilegia coerulea OX=218851 GN=AQUCO_01300743v1 PE=4 SV=1)

HSP 1 Score: 131.7 bits (330), Expect = 6.1e-27
Identity = 82/257 (31.91%), Positives = 128/257 (49.81%), Query Frame = 0

Query: 1    MKAPSTKEEWTSLALEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQ 60
            M+  S+KEEW++  ++ F+EG +   ++CF+RA D  R +W+RAA L A+A  + GSN +
Sbjct: 1674 MQVASSKEEWSARGVKLFNEGNFEMATMCFERAADSYREKWSRAAGLRASADRIQGSNSE 1733

Query: 61   IACNGLREAAKFYISVDRAEXXXXXXXXXXXXXXXXHTYLTKCGEAXXXXXXXXXXXXXX 120
             A    ++AAK Y S+ +A+                  YL KCGE+              
Sbjct: 1734 HANVARKQAAKIYESIGKADLAAKCFIELKQFKRAGKLYLEKCGESRLDDAGDCFSLAGC 1793

Query: 121  XXXXXXXXSRGRCFLKFFDVCTAANLFDMGLQVICSWRK--HDDVNLI-KKCQHIKKTWH 180
                    +RG+ F K   VCT   LF MGL  I  W++  H  ++++ ++ Q  +    
Sbjct: 1794 WSEAAEVYARGKNFSKCLSVCTKGELFGMGLNFIKHWKEDAHQVIDVVMQRQQDFEGMEQ 1853

Query: 181  LFLVKGALPYHQLQNFCSTMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNEVVHKET 240
             FL + A  YH+L++  S MKFVK F SID   +FLR+     + LL EEE    +   +
Sbjct: 1854 NFLERCAHHYHELRDTQSMMKFVKEFKSIDSIRTFLRSCNYLTELLLLEEEWGNYIEAAS 1913

Query: 241  ISQHEGSFSLGLQLQPK 255
            I++ +G      +L  K
Sbjct: 1914 IARDKGDLLFEAELLGK 1930

BLAST of Cla97C11G207100 vs. TrEMBL
Match: tr|A0A1U7Z1T2|A0A1U7Z1T2_NELNU (uncharacterized protein LOC104589402 OS=Nelumbo nucifera OX=4432 GN=LOC104589402 PE=4 SV=1)

HSP 1 Score: 125.6 bits (314), Expect = 4.4e-25
Identity = 83/260 (31.92%), Positives = 125/260 (48.08%), Query Frame = 0

Query: 1    MKAPSTKEEWTSLALEFFSEGVYGATSLCFKRAEDRRRSEWARAASLCATAGILDGSNPQ 60
            M+  S+KEEW+   ++ F+EG Y   ++CF+RA D  R +WA+AA L A A  + GSNP+
Sbjct: 1699 MQVASSKEEWSLRGIKLFNEGNYEMATMCFERAGDAYREKWAKAAGLRAAADRMRGSNPE 1758

Query: 61   IACNGLREAAKFYISVDRAEXXXXXXXXXXXXXXXXHTYLTKCGEAXXXXXXXXXXXXXX 120
            +A   L EAA+ + ++ RAE                  Y  KCG +              
Sbjct: 1759 MARIVLMEAAEIFQNIGRAEYAAKCFIELKEFQRAGMLYREKCGASSLEDAGDCFSMAEC 1818

Query: 121  XXXXXXXXSRGRCFLKFFDVCTAANLFDMGLQVICSWRKH----DDVNLIKKCQHIKKTW 180
                    ++G+ F K   VC    LF+MGL  I  W+++    DD   I   + + +  
Sbjct: 1819 WNFAAEVYAKGKYFSKCLSVCIRGKLFNMGLNFIEYWKENSTTGDDTFAI--TEELLEME 1878

Query: 181  HLFLVKGALPYHQLQNFCSTMKFVKSFDSIDEKYSFLRTLGLSEKKLLQEEELNEVVHKE 240
              FL K AL YH+L +  + M FV++F SID K  FLR+    ++ +L EEE    V   
Sbjct: 1879 RTFLEKCALHYHELNDTKAMMNFVRAFHSIDLKRVFLRSHNYLDELVLLEEESGNFVEAA 1938

Query: 241  TISQHEGSFSLGLQLQPKLE 257
            +I++ +G   L      K E
Sbjct: 1939 SIARLKGDLLLEADFLGKAE 1956

The following BLAST results are available for this feature:

BLAST vs. NCBI nr
Match Name      E-value   Identity (%)  Description
KGN50807.1      5.1e-102  77.15         hypothetical protein Csa_5G266850 [Cucumis sativus]
XP_011656193.1  1.4e-94   76.74         PREDICTED: uncharacterized protein LOC101212224 [Cucumis sativus]
XP_022157076.1  7.0e-67   58.97         uncharacterized protein LOC111023887 [Momordica charantia] >XP_022157077.1 uncha...
XP_016901636.1  2.7e-66   59.23         PREDICTED: uncharacterized protein LOC103495157 [Cucumis melo]
XP_022956552.1  4.5e-66   60.52         uncharacterized protein LOC111458260 isoform X2 [Cucurbita moschata]

BLAST vs. TrEMBL
Match Name                      E-value   Identity (%)  Description
tr|A0A0A0KQV9|A0A0A0KQV9_CUCSA  3.4e-102  77.15         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G266850 PE=4 SV=1
tr|A0A1S4E082|A0A1S4E082_CUCME  1.8e-66   59.23         uncharacterized protein LOC103495157 OS=Cucumis melo OX=3656 GN=LOC103495157 PE=...
tr|A0A0A0KQT9|A0A0A0KQT9_CUCSA  3.3e-65   57.94         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G262250 PE=4 SV=1
tr|A0A2G5E3C9|A0A2G5E3C9_AQUCA  6.1e-27   31.91         Uncharacterized protein OS=Aquilegia coerulea OX=218851 GN=AQUCO_01300743v1 PE=4...
tr|A0A1U7Z1T2|A0A1U7Z1T2_NELNU  4.4e-25   31.92         uncharacterized protein LOC104589402 OS=Nelumbo nucifera OX=4432 GN=LOC104589402...
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005524 ATP binding
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C11G207100.1  Cla97C11G207100.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  COILS        Coil         Coil                 coord: 625..645
None      No IPR available  COILS        Coil         Coil                 coord: 575..595
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 588..646
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 631..646
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 599..614
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 616..630

The following gene(s) are paralogous to this gene:

None