CaUC01G000420 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC01G000420
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: nephrocystin-3
Location: Ciama_Chr01: 291921 .. 298642 (+)
RNA-Seq Expression: CaUC01G000420
Synteny: CaUC01G000420
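The Location field above gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page), the gene span can be derived as follows. The function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re


def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Ciama_Chr01: 291921 .. 298642 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 6722 bp on the plus strand of Ciama_Chr01.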
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGTAAAATGAACCTACAAAAAATAATGTGGAAAAGGCATACCTCCGCCATTGCAAGAGCATGAACAGTCTTATTACAAGTCCTATTACAAAAAGAAAACTTGAAACTATATGCTTAAAAAAATATCTCTAGTTAGAATCGATTTGTACTATAGATTTTAGTAAATACGTGCCAACTGAGATTAGTCTCACAATTCTTTATTCTCCAACCTTAATTAATAATTATGATGGAGAGAAGTAAACCATGTTTAAATTAGTTGATATAATTCCTCCTTTTTTACTTTTTTGTCCTTGCTTCTTGGACTTTCTCTAATGGGCTAGTGTGGGCTTGGGCCTCATGGACGGAAAAGCCCAAAAATATTGGTTCTAGGTAACTCGGCAGCTTAAAATCAGCTTGACTCAGTCCACTTCGTCCTCTCCCTTTATCCAGCTATTGTTCACCACGAGCTCAGTTTTTTCGGCTGCTCAACTCTCTAGTCCCTGCAATGTCGACTTTCCTTCTTCTATCTTCTCAATCTTCCACCTCGCGCAGGTTTTTTCTTTTCCTTATCTTCATCCCAACATTTTGCAACTTTGATGCCTTCTTTTACTTCTTCTTATCGGCGTTGTCTCTGTTTGTTTATCGCAATTCTTTTTTTTCAGTTATCACTTATGTATTTTGCTTCTATTACTTCATTTTTTGAAATTTGAATGCTGTTGATTTCACATCCATTTTCTCAAATTTTTTTTAGTATCTTTTCGAAGTCGGGTAGCGAGTTAGAATATTTCGTTATGACTCTTTATCGTTGAAATGTGTAACGTTGAGAAATCTTGTGGTTTCGTTGTTTTCACGGTTCTATCCAAATATTTCATTTGTTCTTCAATTTACGTGCATCAGTTACATGATAGAATACTCCGGTTTTAAGCTGCGAATTTGCAGCAAAATAGTAAAATCCTTTGAAAGTTTTAGTATTGTAGTAGGAAGCGTTTGATTATCCTTTGAAATAGAACCCTTTGTTTATTGAGAGTTTCTCATCTCCAAGGGATGGACTCCCCTTTCTTAGCAATGAATTTGGCCTAATACCTTTTCGCTGCATGAAAAGTGGTAAAATCATGTCTCTGTTGGCATTGAGGAAAGGAGGGGGAAAAAGAAAACGAACTAGCACAACCCAATATAATTTTGAATGTGGGCGATGATCTATTTCCATGTATCGTACATCTCTAGTTAAATATTTGATGCTCTATGTGGTGTAACTTAACAACACTTAGAATTACGGAGTAGCCACTGTTTCTTCTGCTTGTTACATTTGTTTTTAGGCCAATATATGCATCCACTACTGTACAATAGAGATGCTCAATGCATTTTAGGATATTAATTATTCTAATATATCTTTCTCGAACATCTTAGGGTGAGCGAAATCTGTCTTTGCGTGCAGTCATTGCAGTTGCAAAAGGGCATCACATGCCCTTCAGTTTGCCTTCAGAAACAGAAATATAATATCAAGCTTTATGCTGTACCTGTTGGAGCCTTTTCTTGTCGCTTTGCACCAACACCTTCTGCTTCTAATAGAGCTGATTTGGGGGGTCAGAGGAAACATATTTCTTCTGCCTTCACTGCTCCAAATGGATATCAGAGGTTGGGACCAAATTTAGTAAATTTCATGCTGTCAAGAGTTCTCTATGACATCCACATAAAATATTGAATGAAGGTTGTTTATTTTTTAGTAGGCCTGAGCTCATGCACAACCGTACTGATGGGAACAGCATGATTGAGTTTGAAGTTCAGTTGGAAGAATTATTCAATGAGGTCAAAATGATGGCTATGAGCGGGAGGAAGAATGATGCTGTCGAGCTACTTCAAGCTAATTATGAAGCTGTAAAAGAACAGATGGAATCAGGTGCTATTGGCATCGAACAAGCCGCTGTTCTTGACATCGTGGCTTTGGGGTATATAACTGTTGGAGACTTGAAGTTTGTTGCTTCTGTATTGGATATAGTAACTTCTTTTTCCCTTATCAT
CTGTGCAATAATTGATTTGAAAAGCTCAGTGCTTCTCTATTTTGTGCAACAATGATTGTACTTCCATGACTTTGTATTTTCTTCTTCTGTTTAGGTTATTATCTAGCATAGTTACTATCGAGGGACTTTTATGGGCTAAAAATTTTAATCAATAACCTAGGAACTGGCATTAGTTAGTTGGAGTTGTATGGCCCATGTTGAGTTGAGTTTATGATAGCCGGTGTGTATAGCGGGTCAGCAACCATGGACATATAGGACTTGGGAATTTTTTTTTTCCTCTTGATGACGTTTTTATTCAGTATTAAAAATTATAAGAGCATGTCTGGGCCTTATTCTTGCTCAAAGACAAGTAGACATGGTCATATAGCAATACAGTCACATTTTTTTAGGCAATTTTGGAGTAATAAAAAAATAGCTTTTAATGTTTGCCTCAGTCTTACGCTCCAAAATTATCCCATGCTTATACAGCTAAAGTTTCAGTTGGTCAATCTTCGAGACTCACTTCAATTTGAGTGAAATTCATTTCCTCCTTCTGATGGATTTTCTTTTATGCAGTTGAACAAGGTTGTTGACAGTCTGGAGGACAGTGAACCCTTTCTGGATTCAGTGCTTCTGCATATGGGGAGCATGTACTCAACTTTGAAAAAGTTTGAGAAATCAATGTCTGCATACAAGAGGGCTATTGATATCATAGAGAAGAAAAGTGGTAAGGGAGCTTTTGGGGACATGTATTTAATTGTCCACCTTATGCTTTAATGATTTTGCTCAAGTTCTACTGTTGCCGTTTATTTTCTGATTCAATTTATCTATTATTGTTATTATTATTATTATATAACCAAGTTTTCATTAAAAAAAATTTTTGAAAGACTACAAGGGCATACAAAAGGCAACTCCAAAATACGGGAGTCCGACATTAGCTACAAGAAGGACTCTAATTTAAAAGAAGAAGGCTAAGGATAATTACAAAAAGGTGTAGTGATTGACTCACGAAGAAGCATTAAACCTTATTACCTCCCAAACCTCCCCCAATTTCTCAAACATCTCTAAAACTTCAATTGTTTTTCTCAAGTCAAATACCCCACAAAACAGCAAAGAAGTTAGCATATGCCACACTTTGCCTTTCTCCCTATCGGAAGGATTCAGAAGCACGTTCTCTAACAAAGATTGCCCATCTCCATTATGCACCCACTAACACCAGAAGAACTCATCCATCAACCCCAAAGGAAATTGGCAGACAACCTAGGGCCGAGATGGAGAGAAGTGAGAACCTCCACCCAAAAGCTTCAGAAGCAAAACCTTCCCATTGTCAATAATCTATCACCACCTTAAATAAGGTTTTCCGATGATCTTGTTAAGAAGAAAATATTTCTCTAATTAAACCAGAAAGAAGAAAAATCTAATAAGCTGTTTAAAATGGAAATCAACAGCTATAATTAGGTATTGCGACTTTGGTGGATTTTATCTTGTTTTCCTTTATTTTCTATTCCCCTTTAGACCTTACAATGAATCCTTTGCAGGAGAGGACAGTACCTTTCTTATCACTCCAATTTTAGGGATGGCTAAAGTTCTTGGTATCATTGGAAGAACTGCAAAGGCTGTAGAGTTTTACCATCGTGCAATTTCACTTCTGGAATCAACCAGAGGCCTTGAGAACGAGGATTTGGTTATACCTTTATTTGGTCTGGGCAATCTAATGCTCAAAGAAGGAAAAGGCAAGGATGCAGAAACTTGTTTTGCTAGGTACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTTCTCTCCCCTTGCTTTTCACTATTGACTTACTTTCATGGCCCTATTTAGGATTGTGGACATATATAAAAAGTTATATGGAGAAAAGGATGGAAAAGTCGGAATGGCTATGTATTCTCTGGCTAATGCAAAGTGTGCAAGAGGTAGCTTCGGTCACTTGTCATCTTTCATACACCAGCTAAAATTTTCATTTGCAAAATATTCTTCC
ATTGGTTCGTAGGGGAAGCAGACGAAGCTGTTACCCTATATAGAAGAGCTTTGCACATTATCAAGGATTCAAATGATATGGCTTTAGACGACAGCGTGATGGAGAAGATGAGGATTGAGTTGGCAGAACTATTGCATGTTGTTGGAAGGTAATGTCATATTGATGTTGGGTCTTTTTTTTTCTCTGTTCATGTTTGGAGCTTGTATAAATGACCAATTGTTCTTGCTCAATTTATGTCTCTTTATTTCTTTCACATATCACATATAGTTATTGATAGAACCCAAGTTACTGGAATATATTGAAAGATAAAAAATAGAACTATAATAATTTCCAAGTGATTAAGGTACTTCACCCTCCTTCTCTTTGGATCACTGAAACCTCAACTCACTAAACAAATTGCCTACCTTCCGCCCATTCCACTCCGTCCATTTATATCAATTCTTGGCTAATAACTAACATGCTCTTATTATTCCAATAATATTCCTAACGTCCCAAGGGCATTCCTAGCTCTTTGCTTTTACCACCAGTGTTCATTAATACCTGGAATTTGGTTTAATACAGGGGGAGTGAAGGCAGAGAAATTCTAGAAGAGTGTTTGTTGATCAATGAAAGATCGAAAGGAAAAGAGCATCCCAGCTCAGTGAAGCACCTTGTAAACCTTGCAGCTTCTTATTCAAGGTCAAAGAATTATGCTGAGGCTGAGCGTCTGTTGCGAATTGGATTGGACATTATGATAAAGGCAGTGGGACCTGATGATCAATCAATTACAGTCCCCATGTTGAATCTTGCTGTCACTCTTTACAATCTAAAACAAGACGACAATGCTGAGCAACTTGCACTGGAAGTTTTGCGAATACGGGAAAGTGCTTTTGGAAAAGATTCTCTTCCTGTTGGTGAGACCTGAGATTTACTTCTATTTCTACTCAATGGATAATGGCTTTTGGATCTGATTTGTCATCTTCACATGAGCTCCTCTTTCTGCCGAATTGCATACAGGTGAGGCTCTAGATTGTCTGGTTTCCATTCAAAGCAGGCTAGGGAAGGATGAAAACGAGCTACTGAAGCTGCTTAAGAGAATTCTTAGAATCCAGGAGAGAGAGTTCGGGCATGATGGAAAAGAGGTCATTGATACCCTCAAGAAAATAGTGTTCTACGTGGAAAAACTAGGAATGAAAGATGAGAAGTTTCTACTTCAAAAACGACTGTCCAAGCTGCGGATGAAATTCAAGAACCAGATGCAATACTAAAGGTATTTATCATCCATTTTTCTTAATCTTGTGATTTATATTTTCCCTTCTTTGAAATAGAGATGCAGCTTTTGGATTAATTGTTTCGATAGATGTATATAATATTCCTAAACTTTAAAGAGTTCTAAACACACGTCTCCATCTTTGAATTATGATTCATGTGATCTTTTAATGTTAACTGAATATTTGCGTGATGTGTGGACAAAATTTAGCTTGCAAAGTGTAATGTGCCAGACATGTGATGGAAGGAAAGTCCAAAACTAGAAAAAATAAATTGATAAAGATATTTGATACCTTGCTTAAAATAACAATAGAATGGTATGTTATACGTTTCACTTTATTTTGTAATAAGTTAGTAACATAATTTATTAAGAACACACTTATGTTTTTTCTTTTACTCATCTTCTTTCCTTTCTTCTCTTCTCTTTCTCCTCATCCTCCTCACGTTTACCAAAAAAAGAAATGTAACAACCCCAACTTTTATGACCTCTAACTGCAGTCATTACTACGTGGATGATATTTTCAATATAAAATCTCAATTGATACTTTGTCATTGCGAGGTATGTATTATCAGTAAAATGTTTTATGGAAGATTATGCATAAAAGTATTTGTTTAGCTTAATGGACCAGTAGCTTTATTATTGCTTGTTCATTGACTTGATAAAGAATAGTTTTGTGTTGGTTGGTAGGCAACGACACCACGCAAAATAGCTTCAACTAGATGAGGTAGGCCCGCTAGAGGTTAGCCAGCGG
TAGGGTCACTGGAACAATCACTCTAGAGCCAAGCAGGTCAGGCTAGGGACCCAATGCGGTATCCAGATTGAATTTGGCTCTTTGGCATAGCGGGCTGAGGGCAACATTTATTGTTGTCATGATGGAGAGAATGCAAACGTTGTTCCAGATTGCAATCACCAACCAGATGGCCTAGTCGAACCAAAGTCAACAAAACATGTAAGAGGAAGCAAAACATTTGGGAGATTTCAAGAAGTACAACCCAAACTTTTGATGGGTCACTAGCAAATCTCATAAAGATGGAGACCTAGTTGTGGTGCTGTCGTTGATAGAAATAATATTTGGTTATATAAAATGTCCCGAGCGTGAGGAATTGCTTTGTGTCGTTGACGGATGATGCCCAGATACGGTGGGAGTCTGTCGAGAGGTTGATCGACACTAGCGAAGGTCCGATGGATTGATCACGGTTTAACAAAGCCTTCTACTAGTGGTGCTTTGCTGCAGTAACAAGATTTAGCAAGCAAGCAGAGTTCTCGAACTTTAGACGGGTGAATCAATTGGTAGATGAGTAAGAAAATTCTAACAATTGCCAAAAAACAAATCAATTATTTATCTGTCGATTGATCGTTAGTCAAACGACCTTTCAGAACTATTTCAAAAGTTATGGATCAAATTTTCAATTCTTGAAATTGGTGACCAAATAAAAACTTTAAATTTAGACGTATTATACCCTTGAATTTATT

mRNA sequence

ATGGTCTATTGTTCACCACGAGCTCAGTTTTTTCGGCTGCTCAACTCTCTAGTCCCTGCAATGTCGACTTTCCTTCTTCTATCTTCTCAATCTTCCACCTCGCGCAGGGTGAGCGAAATCTGTCTTTGCGTGCAGTCATTGCAGTTGCAAAAGGGCATCACATGCCCTTCAGTTTGCCTTCAGAAACAGAAATATAATATCAAGCTTTATGCTGTACCTGTTGGAGCCTTTTCTTGTCGCTTTGCACCAACACCTTCTGCTTCTAATAGAGCTGATTTGGGGGGTCAGAGGAAACATATTTCTTCTGCCTTCACTGCTCCAAATGGATATCAGAGTAGGCCTGAGCTCATGCACAACCGTACTGATGGGAACAGCATGATTGAGTTTGAAGTTCAGTTGGAAGAATTATTCAATGAGGTCAAAATGATGGCTATGAGCGGGAGGAAGAATGATGCTGTCGAGCTACTTCAAGCTAATTATGAAGCTGTAAAAGAACAGATGGAATCAGGTGCTATTGGCATCGAACAAGCCGCTGTTCTTGACATCGTGGCTTTGGGGTATATAACTGTTGGAGACTTGAAGTTTGTTGCTTCTGTATTGGATATATTGAACAAGGTTGTTGACAGTCTGGAGGACAGTGAACCCTTTCTGGATTCAGTGCTTCTGCATATGGGGAGCATGTACTCAACTTTGAAAAAGTTTGAGAAATCAATGTCTGCATACAAGAGGGCTATTGATATCATAGAGAAGAAAAGTGGTAAGGGAGCTTTTGGGGACATACCTTACAATGAATCCTTTGCAGGAGAGGACAGTACCTTTCTTATCACTCCAATTTTAGGGATGGCTAAAGTTCTTGGTATCATTGGAAGAACTGCAAAGGCTGTAGAGTTTTACCATCGTGCAATTTCACTTCTGGAATCAACCAGAGGCCTTGAGAACGAGGATTTGGTTATACCTTTATTTGGTCTGGGCAATCTAATGCTCAAAGAAGGAAAAGGCAAGGATGCAGAAACTTGTTTTGCTAGGATTGTGGACATATATAAAAAGTTATATGGAGAAAAGGATGGAAAAGTCGGAATGGCTATGTATTCTCTGGCTAATGCAAAGTGTGCAAGAGGTAGCTTCGGTCACTTGTCATCTTTCATACACCAGCTAAAATTTTCATTTGCAAAATATTCTTCCATTGGGGAAGCAGACGAAGCTGTTACCCTATATAGAAGAGCTTTGCACATTATCAAGGATTCAAATGATATGGCTTTAGACGACAGCGTGATGGAGAAGATGAGGATTGAGTTGGCAGAACTATTGCATGTTGTTGGAAGGGGGAGTGAAGGCAGAGAAATTCTAGAAGAGTGTTTGTTGATCAATGAAAGATCGAAAGGAAAAGAGCATCCCAGCTCAGTGAAGCACCTTGTAAACCTTGCAGCTTCTTATTCAAGGTCAAAGAATTATGCTGAGGCTGAGCGTCTGTTGCGAATTGGATTGGACATTATGATAAAGGCAGTGGGACCTGATGATCAATCAATTACAGTCCCCATGTTGAATCTTGCTGTCACTCTTTACAATCTAAAACAAGACGACAATGCTGAGCAACTTGCACTGGAAGTTTTGCGAATACGGGAAAGTGCTTTTGGAAAAGATTCTCTTCCTGTTGGTGAGGCTCTAGATTGTCTGGTTTCCATTCAAAGCAGGCTAGGGAAGGATGAAAACGAGCTACTGAAGCTGCTTAAGAGAATTCTTAGAATCCAGGAGAGAGAGTTCGGGCATGATGGAAAAGAGGTCATTGATACCCTCAAGAAAATAGTGTTCTACGTGGAAAAACTAGGAATGAAAGATGAGAAGTTTCTACTTCAAAAACGACTGTCCAAGCTGCGGATGAAATTCAAGAACCAGATGCAATACTAAAGGCAACGACACCACGCAAAATAGCTTCAACTAGATGAGGTAGGCCCGCTAGAGGTTAGCCAGCGGTAGGGTCACTGGAACAATCACTCTAGAGC
CAAGCAGGTCAGGCTAGGGACCCAATGCGGTATCCAGATTGAATTTGGCTCTTTGGCATAGCGGGCTGAGGGCAACATTTATTGTTGTCATGATGGAGAGAATGCAAACGTTGTTCCAGATTGCAATCACCAACCAGATGGCCTAGTCGAACCAAAGTCAACAAAACATGTAAGAGGAAGCAAAACATTTGGGAGATTTCAAGAAGTACAACCCAAACTTTTGATGGGTCACTAGCAAATCTCATAAAGATGGAGACCTAGTTGTGGTGCTGTCGTTGATAGAAATAATATTTGGTTATATAAAATGTCCCGAGCGTGAGGAATTGCTTTGTGTCGTTGACGGATGATGCCCAGATACGGTGGGAGTCTGTCGAGAGGTTGATCGACACTAGCGAAGGTCCGATGGATTGATCACGGTTTAACAAAGCCTTCTACTAGTGGTGCTTTGCTGCAGTAACAAGATTTAGCAAGCAAGCAGAGTTCTCGAACTTTAGACGGGTGAATCAATTGGTAGATGAGTAAGAAAATTCTAACAATTGCCAAAAAACAAATCAATTATTTATCTGTCGATTGATCGTTAGTCAAACGACCTTTCAGAACTATTTCAAAAGTTATGGATCAAATTTTCAATTCTTGAAATTGGTGACCAAATAAAAACTTTAAATTTAGACGTATTATACCCTTGAATTTATT

Coding sequence (CDS)

ATGGTCTATTGTTCACCACGAGCTCAGTTTTTTCGGCTGCTCAACTCTCTAGTCCCTGCAATGTCGACTTTCCTTCTTCTATCTTCTCAATCTTCCACCTCGCGCAGGGTGAGCGAAATCTGTCTTTGCGTGCAGTCATTGCAGTTGCAAAAGGGCATCACATGCCCTTCAGTTTGCCTTCAGAAACAGAAATATAATATCAAGCTTTATGCTGTACCTGTTGGAGCCTTTTCTTGTCGCTTTGCACCAACACCTTCTGCTTCTAATAGAGCTGATTTGGGGGGTCAGAGGAAACATATTTCTTCTGCCTTCACTGCTCCAAATGGATATCAGAGTAGGCCTGAGCTCATGCACAACCGTACTGATGGGAACAGCATGATTGAGTTTGAAGTTCAGTTGGAAGAATTATTCAATGAGGTCAAAATGATGGCTATGAGCGGGAGGAAGAATGATGCTGTCGAGCTACTTCAAGCTAATTATGAAGCTGTAAAAGAACAGATGGAATCAGGTGCTATTGGCATCGAACAAGCCGCTGTTCTTGACATCGTGGCTTTGGGGTATATAACTGTTGGAGACTTGAAGTTTGTTGCTTCTGTATTGGATATATTGAACAAGGTTGTTGACAGTCTGGAGGACAGTGAACCCTTTCTGGATTCAGTGCTTCTGCATATGGGGAGCATGTACTCAACTTTGAAAAAGTTTGAGAAATCAATGTCTGCATACAAGAGGGCTATTGATATCATAGAGAAGAAAAGTGGTAAGGGAGCTTTTGGGGACATACCTTACAATGAATCCTTTGCAGGAGAGGACAGTACCTTTCTTATCACTCCAATTTTAGGGATGGCTAAAGTTCTTGGTATCATTGGAAGAACTGCAAAGGCTGTAGAGTTTTACCATCGTGCAATTTCACTTCTGGAATCAACCAGAGGCCTTGAGAACGAGGATTTGGTTATACCTTTATTTGGTCTGGGCAATCTAATGCTCAAAGAAGGAAAAGGCAAGGATGCAGAAACTTGTTTTGCTAGGATTGTGGACATATATAAAAAGTTATATGGAGAAAAGGATGGAAAAGTCGGAATGGCTATGTATTCTCTGGCTAATGCAAAGTGTGCAAGAGGTAGCTTCGGTCACTTGTCATCTTTCATACACCAGCTAAAATTTTCATTTGCAAAATATTCTTCCATTGGGGAAGCAGACGAAGCTGTTACCCTATATAGAAGAGCTTTGCACATTATCAAGGATTCAAATGATATGGCTTTAGACGACAGCGTGATGGAGAAGATGAGGATTGAGTTGGCAGAACTATTGCATGTTGTTGGAAGGGGGAGTGAAGGCAGAGAAATTCTAGAAGAGTGTTTGTTGATCAATGAAAGATCGAAAGGAAAAGAGCATCCCAGCTCAGTGAAGCACCTTGTAAACCTTGCAGCTTCTTATTCAAGGTCAAAGAATTATGCTGAGGCTGAGCGTCTGTTGCGAATTGGATTGGACATTATGATAAAGGCAGTGGGACCTGATGATCAATCAATTACAGTCCCCATGTTGAATCTTGCTGTCACTCTTTACAATCTAAAACAAGACGACAATGCTGAGCAACTTGCACTGGAAGTTTTGCGAATACGGGAAAGTGCTTTTGGAAAAGATTCTCTTCCTGTTGGTGAGGCTCTAGATTGTCTGGTTTCCATTCAAAGCAGGCTAGGGAAGGATGAAAACGAGCTACTGAAGCTGCTTAAGAGAATTCTTAGAATCCAGGAGAGAGAGTTCGGGCATGATGGAAAAGAGGTCATTGATACCCTCAAGAAAATAGTGTTCTACGTGGAAAAACTAGGAATGAAAGATGAGAAGTTTCTACTTCAAAAACGACTGTCCAAGCTGCGGATGAAATTCAAGAACCAGATGCAATACTAA

Protein sequence

MVYCSPRAQFFRLLNSLVPAMSTFLLLSSQSSTSRRVSEICLCVQSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCRFAPTPSASNRADLGGQRKHISSAFTAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEVKMMAMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVLDILNKVVDSLEDSEPFLDSVLLHMGSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDIPYNESFAGEDSTFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPLFGLGNLMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSSFIHQLKFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVGRGSEGREILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIKAVGPDDQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVSIQSRLGKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKRLSKLRMKFKNQMQY
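The protein sequence above can be related to the coding sequence by codon-wise translation. The sketch below assumes the standard genetic code (the page does not state which translation table was used) and builds the codon table from the conventional TCAG ordering. The helper name `translate` is illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base);
# '*' marks a stop codon. This is the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # 'X' for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last five codons of the CDS listed on this page.
print(translate("ATGGTCTATTGTTCACCACGA"))
print(translate("CAGATGCAATACTAA"))
```

The two fragments translate to MVYCSPR and QMQY, which match the start and end of the protein sequence shown above.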
Homology
BLAST of CaUC01G000420 vs. NCBI nr
Match: XP_038875999.1 (nephrocystin-3 isoform X1 [Benincasa hispida] >XP_038876001.1 nephrocystin-3 isoform X1 [Benincasa hispida] >XP_038876002.1 nephrocystin-3 isoform X1 [Benincasa hispida])

HSP 1 Score: 948.3 bits (2450), Expect = 3.3e-272
Identity = 517/614 (84.20%), Positives = 537/614 (87.46%), Query Frame = 0

Query: 21  MSTFLLLSSQSSTSRRVSEICLCVQSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCR 80
           M+TFLLLS  S T  R+SEICL VQSLQLQKGITC SVCLQKQK +IKLYAVPV AFSC 
Sbjct: 1   MATFLLLSPPSFTCNRMSEICLSVQSLQLQKGITCSSVCLQKQKCDIKLYAVPVRAFSCC 60

Query: 81  FAPTPSASNRADLGGQRKHISSAFTAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEV 140
           FA TPSAS+RAD G QRKHI SAFTAPNGYQ            N M EFEVQLEELFNEV
Sbjct: 61  FASTPSASSRADSGRQRKHIPSAFTAPNGYQR-----------NRMTEFEVQLEELFNEV 120

Query: 141 KMMAMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL 200
           +MM MSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL
Sbjct: 121 RMMTMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL 180

Query: 201 DILNKVVDSLEDSEPFLDSVLLHMGSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDI 260
           DILNKVVDSL+DSEPFLDSVLLHMGSMYSTLKKFEKS+SAYKR+IDIIEKK+        
Sbjct: 181 DILNKVVDSLKDSEPFLDSVLLHMGSMYSTLKKFEKSISAYKRSIDIIEKKN-------- 240

Query: 261 PYNESFAGEDSTFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPL 320
                  GEDS+FLITPILGMAKV G IGRT KAVEFYHRAISLLES+RG ENEDLVIPL
Sbjct: 241 -------GEDSSFLITPILGMAKVFGTIGRTGKAVEFYHRAISLLESSRGFENEDLVIPL 300

Query: 321 FGLGNLMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSS 380
            GLGNLMLKEGKGKDAETCFARIV+IYKKLYGEKDGKVGMAMYSLANAKCAR        
Sbjct: 301 SGLGNLMLKEGKGKDAETCFARIVNIYKKLYGEKDGKVGMAMYSLANAKCAR-------- 360

Query: 381 FIHQLKFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVG 440
                          GEADEAVTLYRRAL IIKDSNDMALD SVMEKMRI+LAELLHVVG
Sbjct: 361 ---------------GEADEAVTLYRRALQIIKDSNDMALDASVMEKMRIDLAELLHVVG 420

Query: 441 RGSEGREILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK 500
           RG+EGRE+LEECLLINER KGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK
Sbjct: 421 RGNEGRELLEECLLINERLKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK 480

Query: 501 AVGPDDQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVS 560
           AVGPDDQSITVPMLNLAVTLYNLK+DD+AEQLALEVLRIRE+AFGKDSLPVGEALDCLVS
Sbjct: 481 AVGPDDQSITVPMLNLAVTLYNLKRDDDAEQLALEVLRIRENAFGKDSLPVGEALDCLVS 540

Query: 561 IQSRLGKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKR 620
           IQSRLGKDENELLKLL+RIL IQEREFGHDGKEVIDTLKKIVFY+EKLGMKDEKFLLQKR
Sbjct: 541 IQSRLGKDENELLKLLERILIIQEREFGHDGKEVIDTLKKIVFYMEKLGMKDEKFLLQKR 565

Query: 621 LSKLRMKFKNQMQY 635
           LS LRMKFKNQMQY
Sbjct: 601 LSMLRMKFKNQMQY 565
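The percentages in the HSP header of the alignment above are simple ratios: the count of identical (or positively scoring) columns divided by the alignment length. A small sketch for recomputing them from a header line follows. The function name `hsp_percentages` is hypothetical, and the parsing assumes the `n/m` layout seen on this page.

```python
import re


def hsp_percentages(stats_line):
    """Return (count, alignment_length, percent) for each 'n/m' ratio in the line."""
    results = []
    for count, length in re.findall(r"(\d+)/(\d+)", stats_line):
        count, length = int(count), int(length)
        results.append((count, length, round(100.0 * count / length, 2)))
    return results


line = "Identity = 517/614 (84.20%), Positives = 537/614 (87.46%), Query Frame = 0"
for count, length, percent in hsp_percentages(line):
    print(f"{count}/{length} = {percent:.2f}%")
```

The first ratio is the identity and the second is the positives, so the recomputed values can be checked against the figures in parentheses.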

BLAST of CaUC01G000420 vs. NCBI nr
Match: XP_038876003.1 (nephrocystin-3 isoform X2 [Benincasa hispida])

HSP 1 Score: 945.7 bits (2443), Expect = 2.1e-271
Identity = 516/614 (84.04%), Positives = 536/614 (87.30%), Query Frame = 0

Query: 21  MSTFLLLSSQSSTSRRVSEICLCVQSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCR 80
           M+TFLLLS  S T  R+SEICL VQSLQLQKGITC SVCLQKQK +IKLYAVPV AFSC 
Sbjct: 1   MATFLLLSPPSFTCNRMSEICLSVQSLQLQKGITCSSVCLQKQKCDIKLYAVPVRAFSCC 60

Query: 81  FAPTPSASNRADLGGQRKHISSAFTAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEV 140
           FA TPSAS+RAD G QRKHI SAFTAPNGYQ              M EFEVQLEELFNEV
Sbjct: 61  FASTPSASSRADSGRQRKHIPSAFTAPNGYQ-------------RMTEFEVQLEELFNEV 120

Query: 141 KMMAMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL 200
           +MM MSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL
Sbjct: 121 RMMTMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL 180

Query: 201 DILNKVVDSLEDSEPFLDSVLLHMGSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDI 260
           DILNKVVDSL+DSEPFLDSVLLHMGSMYSTLKKFEKS+SAYKR+IDIIEKK+        
Sbjct: 181 DILNKVVDSLKDSEPFLDSVLLHMGSMYSTLKKFEKSISAYKRSIDIIEKKN-------- 240

Query: 261 PYNESFAGEDSTFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPL 320
                  GEDS+FLITPILGMAKV G IGRT KAVEFYHRAISLLES+RG ENEDLVIPL
Sbjct: 241 -------GEDSSFLITPILGMAKVFGTIGRTGKAVEFYHRAISLLESSRGFENEDLVIPL 300

Query: 321 FGLGNLMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSS 380
            GLGNLMLKEGKGKDAETCFARIV+IYKKLYGEKDGKVGMAMYSLANAKCAR        
Sbjct: 301 SGLGNLMLKEGKGKDAETCFARIVNIYKKLYGEKDGKVGMAMYSLANAKCAR-------- 360

Query: 381 FIHQLKFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVG 440
                          GEADEAVTLYRRAL IIKDSNDMALD SVMEKMRI+LAELLHVVG
Sbjct: 361 ---------------GEADEAVTLYRRALQIIKDSNDMALDASVMEKMRIDLAELLHVVG 420

Query: 441 RGSEGREILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK 500
           RG+EGRE+LEECLLINER KGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK
Sbjct: 421 RGNEGRELLEECLLINERLKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK 480

Query: 501 AVGPDDQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVS 560
           AVGPDDQSITVPMLNLAVTLYNLK+DD+AEQLALEVLRIRE+AFGKDSLPVGEALDCLVS
Sbjct: 481 AVGPDDQSITVPMLNLAVTLYNLKRDDDAEQLALEVLRIRENAFGKDSLPVGEALDCLVS 540

Query: 561 IQSRLGKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKR 620
           IQSRLGKDENELLKLL+RIL IQEREFGHDGKEVIDTLKKIVFY+EKLGMKDEKFLLQKR
Sbjct: 541 IQSRLGKDENELLKLLERILIIQEREFGHDGKEVIDTLKKIVFYMEKLGMKDEKFLLQKR 563

Query: 621 LSKLRMKFKNQMQY 635
           LS LRMKFKNQMQY
Sbjct: 601 LSMLRMKFKNQMQY 563

BLAST of CaUC01G000420 vs. NCBI nr
Match: XP_022958168.1 (nephrocystin-3 isoform X3 [Cucurbita moschata] >XP_022958169.1 nephrocystin-3 isoform X3 [Cucurbita moschata])

HSP 1 Score: 927.5 bits (2396), Expect = 5.9e-266
Identity = 497/614 (80.94%), Positives = 534/614 (86.97%), Query Frame = 0

Query: 21  MSTFLLLSSQSSTSRRVSEICLCVQSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCR 80
           M+ FLLLSS S T  R+SEICLC+QSLQLQKGITC SV LQKQK NIKLYAVPV AFSC+
Sbjct: 1   MAAFLLLSSPSFTYHRMSEICLCMQSLQLQKGITCSSVSLQKQKCNIKLYAVPVRAFSCQ 60

Query: 81  FAPTPSASNRADLGGQRKHISSAFTAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEV 140
           FA T SAS+RADLG QRKHI+SAFT PNGYQSRP  MH RTDG S  EFE QL+ELFNEV
Sbjct: 61  FASTGSASSRADLGSQRKHIASAFTVPNGYQSRPGHMHYRTDGTSTSEFEGQLDELFNEV 120

Query: 141 KMMAMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL 200
           +M+ +SGRK+DAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVAS+L
Sbjct: 121 RMLIVSGRKSDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASIL 180

Query: 201 DILNKVVDSLEDSEPFLDSVLLHMGSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDI 260
           DILN +VDSL+D+EPFLDSVLLHMGSMYSTLKK EKSMSAYKRAIDIIEKKSGK      
Sbjct: 181 DILNNIVDSLKDNEPFLDSVLLHMGSMYSTLKKLEKSMSAYKRAIDIIEKKSGK------ 240

Query: 261 PYNESFAGEDSTFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPL 320
                    DS+FLITPILGMAKVLG  G+T KAVE YHRAIS+LESTRG E+EDLVIPL
Sbjct: 241 ---------DSSFLITPILGMAKVLGTSGKTTKAVESYHRAISILESTRGFEDEDLVIPL 300

Query: 321 FGLGNLMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSS 380
           F LGNL+LKEGKGKDAETCFARIV+IYKKLYGEKDGKVGMAMYSLANAKCAR        
Sbjct: 301 FSLGNLLLKEGKGKDAETCFARIVNIYKKLYGEKDGKVGMAMYSLANAKCAR-------- 360

Query: 381 FIHQLKFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVG 440
                          GEADEA+TLYRRAL IIKDSN MALDDS MEKMRI+LAELLH VG
Sbjct: 361 ---------------GEADEAITLYRRALQIIKDSNYMALDDSEMEKMRIDLAELLHAVG 420

Query: 441 RGSEGREILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK 500
           RG EGRE+LEECLLINE+SKGK+HPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIM+K
Sbjct: 421 RGKEGRELLEECLLINEKSKGKDHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMVK 480

Query: 501 AVGPDDQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVS 560
           AVGPDDQSIT PMLNLAVTLYNLK+DD+AEQLALEVLRIRE+AFGKD LPVGEALDCLVS
Sbjct: 481 AVGPDDQSITNPMLNLAVTLYNLKRDDDAEQLALEVLRIRENAFGKDCLPVGEALDCLVS 540

Query: 561 IQSRLGKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKR 620
           IQSRLGKD+ ELLKLLKRILRIQE+ FG++ KEVIDTLKKIVFY++KLGMKDEKF +QKR
Sbjct: 541 IQSRLGKDDTELLKLLKRILRIQEKAFGYEAKEVIDTLKKIVFYMDKLGMKDEKFPVQKR 576

Query: 621 LSKLRMKFKNQMQY 635
           LS LR KFKNQMQY
Sbjct: 601 LSLLRTKFKNQMQY 576

BLAST of CaUC01G000420 vs. NCBI nr
Match: XP_022995907.1 (nephrocystin-3 isoform X3 [Cucurbita maxima] >XP_022995908.1 nephrocystin-3 isoform X3 [Cucurbita maxima])

HSP 1 Score: 921.4 bits (2380), Expect = 4.3e-264
Identity = 494/614 (80.46%), Positives = 532/614 (86.64%), Query Frame = 0

Query: 21  MSTFLLLSSQSSTSRRVSEICLCVQSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCR 80
           M+ FLLLSS S T  R+SEICLC+QSLQLQKGITC SV LQKQK NIKLYAVPV AFSCR
Sbjct: 1   MAAFLLLSSPSFTYHRMSEICLCMQSLQLQKGITCSSVSLQKQKCNIKLYAVPVRAFSCR 60

Query: 81  FAPTPSASNRADLGGQRKHISSAFTAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEV 140
           FA T SAS+R DLG QRKHI+SAFT PNGYQSR   MH RTDGNS  EFE QL+ELFNEV
Sbjct: 61  FASTGSASSRDDLGSQRKHIASAFTVPNGYQSRAGHMHYRTDGNSASEFEGQLDELFNEV 120

Query: 141 KMMAMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL 200
           +M+ +S RK+DAVELLQANYEAVKEQMESGA GIEQAAVLDIVALGYITVGDLKFVAS+L
Sbjct: 121 RMLIVSRRKSDAVELLQANYEAVKEQMESGACGIEQAAVLDIVALGYITVGDLKFVASIL 180

Query: 201 DILNKVVDSLEDSEPFLDSVLLHMGSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDI 260
           DILN +VDSL+D+EPFLDSVLLHMGSMYSTLKK +KS+SAYKRAIDIIEKKSGK      
Sbjct: 181 DILNNIVDSLKDNEPFLDSVLLHMGSMYSTLKKLDKSVSAYKRAIDIIEKKSGK------ 240

Query: 261 PYNESFAGEDSTFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPL 320
                    DS+FLITPILGMAKVLG  G+T KAVEFYHRAIS+LES RG E+EDLVIPL
Sbjct: 241 ---------DSSFLITPILGMAKVLGTSGKTTKAVEFYHRAISILESIRGFEDEDLVIPL 300

Query: 321 FGLGNLMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSS 380
           F LGNL+LKEGKGKDAETCFARIV+IYKKLYGEKDGKVGMAMYSLANAKCAR        
Sbjct: 301 FSLGNLLLKEGKGKDAETCFARIVNIYKKLYGEKDGKVGMAMYSLANAKCAR-------- 360

Query: 381 FIHQLKFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVG 440
                          GEADEA+TLYRRAL IIKDSN MALDDS MEKMRI+LAELLH VG
Sbjct: 361 ---------------GEADEAITLYRRALQIIKDSNYMALDDSEMEKMRIDLAELLHAVG 420

Query: 441 RGSEGREILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK 500
           RG EGRE+LEECLLINE+SKGK+HPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIM+K
Sbjct: 421 RGKEGRELLEECLLINEKSKGKDHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMVK 480

Query: 501 AVGPDDQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVS 560
           AVGPDDQSIT PMLNLAVTLYNLK+DD+AEQLALEVLRIRE+AFGKD LPVGEALDCLVS
Sbjct: 481 AVGPDDQSITNPMLNLAVTLYNLKRDDDAEQLALEVLRIRENAFGKDCLPVGEALDCLVS 540

Query: 561 IQSRLGKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKR 620
           IQSRLGKDE ELLKLLKRILRIQE+ FG++ KEVIDTLKKIVFY++KLG+KDEKF +QKR
Sbjct: 541 IQSRLGKDETELLKLLKRILRIQEKAFGYEAKEVIDTLKKIVFYMDKLGLKDEKFPVQKR 576

Query: 621 LSKLRMKFKNQMQY 635
           LS LRMKFKNQMQY
Sbjct: 601 LSLLRMKFKNQMQY 576

BLAST of CaUC01G000420 vs. NCBI nr
Match: XP_023533270.1 (nephrocystin-3-like isoform X3 [Cucurbita pepo subsp. pepo] >XP_023533271.1 nephrocystin-3-like isoform X3 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 917.1 bits (2369), Expect = 8.0e-263
Identity = 493/614 (80.29%), Positives = 531/614 (86.48%), Query Frame = 0

Query: 21  MSTFLLLSSQSSTSRRVSEICLCVQSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCR 80
           M+ FLLLSS S T  R+SEICLC+QSLQLQKG TC SV LQKQK NIKLYAVPV AFSCR
Sbjct: 1   MAAFLLLSSPSFTCHRMSEICLCMQSLQLQKGTTCSSVSLQKQKCNIKLYAVPVRAFSCR 60

Query: 81  FAPTPSASNRADLGGQRKHISSAFTAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEV 140
           FA T SAS+RADLG QRKHI+SAFT PNGYQSR   MH RTDGNS  EFE QL+ELFNEV
Sbjct: 61  FASTGSASSRADLGSQRKHIASAFTVPNGYQSRAGHMHYRTDGNSTSEFEGQLDELFNEV 120

Query: 141 KMMAMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL 200
           +M+ +SGRKNDAVELLQANYEAVKEQMESGA GIEQAAVLDIVALGYITVGDLKFVAS+L
Sbjct: 121 RMLIVSGRKNDAVELLQANYEAVKEQMESGASGIEQAAVLDIVALGYITVGDLKFVASIL 180

Query: 201 DILNKVVDSLEDSEPFLDSVLLHMGSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDI 260
           DILN +VDSL+DSEPFLDSVLLHMGSMYSTLKK EKS+SAYKRAIDIIEKKS        
Sbjct: 181 DILNSIVDSLKDSEPFLDSVLLHMGSMYSTLKKLEKSVSAYKRAIDIIEKKS-------- 240

Query: 261 PYNESFAGEDSTFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPL 320
                  G+DS+FLITPILGMAKVLG  GRT KAVE YHRAIS+LESTRG E+EDLVIPL
Sbjct: 241 -------GQDSSFLITPILGMAKVLGTNGRTTKAVECYHRAISILESTRGFEDEDLVIPL 300

Query: 321 FGLGNLMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSS 380
           F LGNL+LKEGKGKDAETCFARI++IYKKLYGEKDGKVGMAMYSLANAKCAR        
Sbjct: 301 FSLGNLLLKEGKGKDAETCFARIMNIYKKLYGEKDGKVGMAMYSLANAKCAR-------- 360

Query: 381 FIHQLKFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVG 440
                          GEADEA+TLYRRAL II+DSN MALDDS MEKMRI+LAELLH VG
Sbjct: 361 ---------------GEADEAITLYRRALQIIEDSNYMALDDSEMEKMRIDLAELLHAVG 420

Query: 441 RGSEGREILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK 500
           R  EGRE+LEE LLINE+SKGK+HPSSVKHLVNLAASYSRSKNYAEAERLLRIGL+IM+K
Sbjct: 421 RRKEGRELLEESLLINEKSKGKDHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLNIMVK 480

Query: 501 AVGPDDQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVS 560
           AVGPDDQSIT PMLNLAVTLYNLK+DD+AEQLALEVLRIRE+AFGKD LPVGEALDCLVS
Sbjct: 481 AVGPDDQSITNPMLNLAVTLYNLKRDDDAEQLALEVLRIRENAFGKDCLPVGEALDCLVS 540

Query: 561 IQSRLGKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKR 620
           IQSRLGKDE ELLKLLKRILRIQE+ FG++ KEVIDTLKKIVFY++KLG+KDEKF +QKR
Sbjct: 541 IQSRLGKDETELLKLLKRILRIQEKAFGYEAKEVIDTLKKIVFYMDKLGLKDEKFPVQKR 576

Query: 621 LSKLRMKFKNQMQY 635
           LS LRMKFKNQMQY
Sbjct: 601 LSMLRMKFKNQMQY 576

BLAST of CaUC01G000420 vs. ExPASy Swiss-Prot
Match: Q6AZT7 (Nephrocystin-3 OS=Xenopus laevis OX=8355 GN=nphp3 PE=2 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 1.3e-16
Identity = 89/366 (24.32%), Positives = 158/366 (43.17%), Query Frame = 0

Query: 262  YNESFAGEDS-TFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPL 321
            Y ++  GE+  T L      + + L  +G  ++AV    R++ + E+    ++  +   L
Sbjct: 902  YEKNCEGEEKMTSLADLYETLGRFLKDLGLLSQAVTPLQRSLEIRETALDPDHPSVAQSL 961

Query: 322  FGLGNLMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSS 381
              L  + ++  K  +AE  + + ++I +  YG +  +V   + +LA     +  F     
Sbjct: 962  HQLAGVYMQSKKFGNAEQLYKQALEISENAYGSEHLRVARELDALAVLYQKQNKFEQAEQ 1021

Query: 382  FIHQ-LKFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVV 441
               + LK         G       L RRAL +  +   +  D S   +   EL  L ++ 
Sbjct: 1022 LRKKSLKIRQKSARRKGSMYGFALLRRRALQL--EELTLGKDTSDNARTLNELGVLYYLQ 1081

Query: 442  GRGSEGREILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMI 501
                     L+  L + ER  G +HP   + + NLAA Y+  K Y +AE L    LDI  
Sbjct: 1082 NNLETAETFLKRSLEMRERVLGADHPDCAQSINNLAALYNEKKQYDKAEELYERALDIRR 1141

Query: 502  KAVGPDDQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLV 561
            +A+ PD  S+   + +LAV      + D A  L    + IR+ +FG     V  AL  L 
Sbjct: 1142 RALSPDHPSLAYTVKHLAVLYKRKGKLDKAVPLYELAVDIRQKSFGPKHPSVATALVNLA 1201

Query: 562  SIQSRLGKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQK 621
             +  ++ K +++ L L +R ++I E   G     V +TLK +     + G  ++   L K
Sbjct: 1202 VLYCQM-KKQDDALPLYERAMKIYEDSLGRMHPRVGETLKNLAVLRYEEGDYEKAAELYK 1261

Query: 622  RLSKLR 626
            R  +++
Sbjct: 1262 RAMEIK 1264

BLAST of CaUC01G000420 vs. ExPASy Swiss-Prot
Match: P0CI65 (Nephrocystin-3 OS=Danio rerio OX=7955 GN=nphp3 PE=3 SV=1)

HSP 1 Score: 87.8 bits (216), Expect = 4.8e-16
Identity = 99/420 (23.57%), Positives = 168/420 (40.00%), Query Frame = 0

Query: 208  DSLEDSEPFLDS--VLLHMGSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDIPYNES 267
            D+L++ E   +S   +  + ++Y TL +F K +    +A+  +++         +   E+
Sbjct: 902  DALKEFEKTCESEQSMSRLANLYETLGRFLKDLGLLSQAVAPLQR--------SLEIRET 961

Query: 268  FAGEDSTFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPLFGLGN 327
                D   +   +  +A V     +   A + Y +A+ + E+  G E+  +   L  L  
Sbjct: 962  ALDPDHPSVAQSLHQLAGVYVHWRKFGNAEQLYKQAMEICENAYGPEHSTVARELDSLSL 1021

Query: 328  LMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSSFIHQL 387
            L  K+ K + AE    R V I +K   +K                     GH+  F    
Sbjct: 1022 LYQKQNKYEQAEKLRKRSVKIRQKTARQK---------------------GHMYGF---- 1081

Query: 388  KFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVGRGSEG 447
                              L RRAL +  +   +  D +   K   EL  L ++       
Sbjct: 1082 ----------------ALLKRRALQL--EELTLGKDSTDCAKTLNELGVLYYLQNNLDAA 1141

Query: 448  REILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIKAVGPD 507
            +  L   L + +R  G +HP   + L NLAA +S  K Y  AE L    LDI  +A+ PD
Sbjct: 1142 KLFLTRSLEMRQRVLGPDHPDCAQSLNNLAALHSERKEYESAEELYERALDIRKRALAPD 1201

Query: 508  DQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVSIQSRL 567
              S+   + +LA+      + + A  L    L IRE +FG     V  AL  L  +  +L
Sbjct: 1202 HPSLAYTLKHLAMLYKRRGKLEKAVPLYELALEIREKSFGPKHPSVATALVNLAVLYCQL 1261

Query: 568  GKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKRLSKLR 626
             K  ++ L L +R L++ E   G     V +TLK +     + G  ++   L KR  +++
Sbjct: 1262 -KQHSDALPLYERALKVYEDSLGRLHPRVGETLKNLAVLSYEEGDFEKAAELYKRAMEIK 1269

BLAST of CaUC01G000420 vs. ExPASy Swiss-Prot
Match: Q07866 (Kinesin light chain 1 OS=Homo sapiens OX=9606 GN=KLC1 PE=1 SV=2)

HSP 1 Score: 75.1 bits (183), Expect = 3.2e-12
Identity = 69/280 (24.64%), Positives = 120/280 (42.86%), Query Frame = 0

Query: 328 LKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSSFIHQLKF 387
           +   + KD ++    + D++     E D   G+     + A  A+     + + +  L  
Sbjct: 161 ISPSEDKDTDSTKEPLDDLFPN--DEDDPGQGIQQQHSSAAAAAQQGGYEIPARLRTLHN 220

Query: 388 SFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVGRGSEGRE 447
              +Y+S G  + AV L ++AL  ++ ++    D   +  M   LA +     +  +   
Sbjct: 221 LVIQYASQGRYEVAVPLCKQALEDLEKTS--GHDHPDVATMLNILALVYRDQNKYKDAAN 280

Query: 448 ILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIKAVGPDDQ 507
           +L + L I E++ GK+HP+    L NLA  Y +   Y EAE L +  L+I  K +G D  
Sbjct: 281 LLNDALAIREKTLGKDHPAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKDHP 340

Query: 508 SITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVSIQSRLGK 567
            +   + NLA+   N  + +  E      L I ++  G D   V +  + L S   + GK
Sbjct: 341 DVAKQLNNLALLCQNQGKYEEVEYYYQRALEIYQTKLGPDDPNVAKTKNNLASCYLKQGK 400

Query: 568 DENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEK 608
            +       + + R  EREFG     V D  K I  + E+
Sbjct: 401 FKQAETLYKEILTRAHEREFG----SVDDENKPIWMHAEE 432

BLAST of CaUC01G000420 vs. ExPASy Swiss-Prot
Match: Q5R581 (Kinesin light chain 1 OS=Pongo abelii OX=9601 GN=KLC1 PE=2 SV=3)

HSP 1 Score: 75.1 bits (183), Expect = 3.2e-12
Identity = 69/280 (24.64%), Positives = 120/280 (42.86%), Query Frame = 0

Query: 328 LKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSSFIHQLKF 387
           +   + KD ++    + D++     E D   G+     + A  A+     + + +  L  
Sbjct: 161 ISPSEDKDTDSTKEPLDDLFPN--DEDDPGQGIQQQHSSAAAAAQQGDYEIPARLRTLHN 220

Query: 388 SFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVGRGSEGRE 447
              +Y+S G  + AV L ++AL  ++ ++    D   +  M   LA +     +  +   
Sbjct: 221 LVIQYASQGRYEVAVPLCKQALEDLEKTS--GHDHPDVATMLNILALVYRDQNKYKDAAN 280

Query: 448 ILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIKAVGPDDQ 507
           +L + L I E++ GK+HP+    L NLA  Y +   Y EAE L +  L+I  K +G D  
Sbjct: 281 LLNDALAIREKTLGKDHPAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKDHP 340

Query: 508 SITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVSIQSRLGK 567
            +   + NLA+   N  + +  E      L I ++  G D   V +  + L S   + GK
Sbjct: 341 DVAKQLNNLALLCQNQGKYEEVEYYYQRALEIYQTKLGPDDPNVAKTKNNLASCYLKQGK 400

Query: 568 DENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEK 608
            +       + + R  EREFG     V D  K I  + E+
Sbjct: 401 FKQAETLYKEILTRAHEREFG----SVDDENKPIWMHAEE 432

BLAST of CaUC01G000420 vs. ExPASy Swiss-Prot
Match: P37285 (Kinesin light chain 1 OS=Rattus norvegicus OX=10116 GN=Klc1 PE=1 SV=2)

HSP 1 Score: 74.3 bits (181), Expect = 5.4e-12
Identity = 68/280 (24.29%), Positives = 121/280 (43.21%), Query Frame = 0

Query: 328 LKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSSFIHQLKF 387
           +   + KD+++    + D++     E D   G+     + A  A+     + + +  L  
Sbjct: 161 ISPSEDKDSDSSKEPLDDLFPN--DEDDPGQGIQQQHSSAAAAAQQGGYEIPARLRTLHN 220

Query: 388 SFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVGRGSEGRE 447
              +Y+S G  + AV L ++AL  ++ ++    D   +  M   LA +     +  +   
Sbjct: 221 LVIQYASQGRYEVAVPLCKQALEDLEKTS--GHDHPDVATMLNILALVYRDQNKYKDAAN 280

Query: 448 ILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIKAVGPDDQ 507
           +L + L I E++ G++HP+    L NLA  Y +   Y EAE L +  L+I  K +G D  
Sbjct: 281 LLNDALAIREKTLGRDHPAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKDHP 340

Query: 508 SITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVSIQSRLGK 567
            +   + NLA+   N  + +  E      L I ++  G D   V +  + L S   + GK
Sbjct: 341 DVAKQLNNLALLCQNQGKYEEVEYYYQRALEIYQTKLGPDDPNVAKTKNNLASCYLKQGK 400

Query: 568 DENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEK 608
            +       + + R  EREFG     V D  K I  + E+
Sbjct: 401 FKQAETLYKEILTRAHEREFG----SVDDENKPIWMHAEE 432

BLAST of CaUC01G000420 vs. ExPASy TrEMBL
Match: A0A6J1H2E0 (nephrocystin-3 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111459474 PE=4 SV=1)

HSP 1 Score: 927.5 bits (2396), Expect = 2.9e-266
Identity = 497/614 (80.94%), Positives = 534/614 (86.97%), Query Frame = 0

Query: 21  MSTFLLLSSQSSTSRRVSEICLCVQSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCR 80
           M+ FLLLSS S T  R+SEICLC+QSLQLQKGITC SV LQKQK NIKLYAVPV AFSC+
Sbjct: 1   MAAFLLLSSPSFTYHRMSEICLCMQSLQLQKGITCSSVSLQKQKCNIKLYAVPVRAFSCQ 60

Query: 81  FAPTPSASNRADLGGQRKHISSAFTAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEV 140
           FA T SAS+RADLG QRKHI+SAFT PNGYQSRP  MH RTDG S  EFE QL+ELFNEV
Sbjct: 61  FASTGSASSRADLGSQRKHIASAFTVPNGYQSRPGHMHYRTDGTSTSEFEGQLDELFNEV 120

Query: 141 KMMAMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL 200
           +M+ +SGRK+DAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVAS+L
Sbjct: 121 RMLIVSGRKSDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASIL 180

Query: 201 DILNKVVDSLEDSEPFLDSVLLHMGSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDI 260
           DILN +VDSL+D+EPFLDSVLLHMGSMYSTLKK EKSMSAYKRAIDIIEKKSGK      
Sbjct: 181 DILNNIVDSLKDNEPFLDSVLLHMGSMYSTLKKLEKSMSAYKRAIDIIEKKSGK------ 240

Query: 261 PYNESFAGEDSTFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPL 320
                    DS+FLITPILGMAKVLG  G+T KAVE YHRAIS+LESTRG E+EDLVIPL
Sbjct: 241 ---------DSSFLITPILGMAKVLGTSGKTTKAVESYHRAISILESTRGFEDEDLVIPL 300

Query: 321 FGLGNLMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSS 380
           F LGNL+LKEGKGKDAETCFARIV+IYKKLYGEKDGKVGMAMYSLANAKCAR        
Sbjct: 301 FSLGNLLLKEGKGKDAETCFARIVNIYKKLYGEKDGKVGMAMYSLANAKCAR-------- 360

Query: 381 FIHQLKFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVG 440
                          GEADEA+TLYRRAL IIKDSN MALDDS MEKMRI+LAELLH VG
Sbjct: 361 ---------------GEADEAITLYRRALQIIKDSNYMALDDSEMEKMRIDLAELLHAVG 420

Query: 441 RGSEGREILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK 500
           RG EGRE+LEECLLINE+SKGK+HPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIM+K
Sbjct: 421 RGKEGRELLEECLLINEKSKGKDHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMVK 480

Query: 501 AVGPDDQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVS 560
           AVGPDDQSIT PMLNLAVTLYNLK+DD+AEQLALEVLRIRE+AFGKD LPVGEALDCLVS
Sbjct: 481 AVGPDDQSITNPMLNLAVTLYNLKRDDDAEQLALEVLRIRENAFGKDCLPVGEALDCLVS 540

Query: 561 IQSRLGKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKR 620
           IQSRLGKD+ ELLKLLKRILRIQE+ FG++ KEVIDTLKKIVFY++KLGMKDEKF +QKR
Sbjct: 541 IQSRLGKDDTELLKLLKRILRIQEKAFGYEAKEVIDTLKKIVFYMDKLGMKDEKFPVQKR 576

Query: 621 LSKLRMKFKNQMQY 635
           LS LR KFKNQMQY
Sbjct: 601 LSLLRTKFKNQMQY 576

BLAST of CaUC01G000420 vs. ExPASy TrEMBL
Match: A0A6J1K9C6 (nephrocystin-3 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111491290 PE=4 SV=1)

HSP 1 Score: 921.4 bits (2380), Expect = 2.1e-264
Identity = 494/614 (80.46%), Positives = 532/614 (86.64%), Query Frame = 0

Query: 21  MSTFLLLSSQSSTSRRVSEICLCVQSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCR 80
           M+ FLLLSS S T  R+SEICLC+QSLQLQKGITC SV LQKQK NIKLYAVPV AFSCR
Sbjct: 1   MAAFLLLSSPSFTYHRMSEICLCMQSLQLQKGITCSSVSLQKQKCNIKLYAVPVRAFSCR 60

Query: 81  FAPTPSASNRADLGGQRKHISSAFTAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEV 140
           FA T SAS+R DLG QRKHI+SAFT PNGYQSR   MH RTDGNS  EFE QL+ELFNEV
Sbjct: 61  FASTGSASSRDDLGSQRKHIASAFTVPNGYQSRAGHMHYRTDGNSASEFEGQLDELFNEV 120

Query: 141 KMMAMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVL 200
           +M+ +S RK+DAVELLQANYEAVKEQMESGA GIEQAAVLDIVALGYITVGDLKFVAS+L
Sbjct: 121 RMLIVSRRKSDAVELLQANYEAVKEQMESGACGIEQAAVLDIVALGYITVGDLKFVASIL 180

Query: 201 DILNKVVDSLEDSEPFLDSVLLHMGSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDI 260
           DILN +VDSL+D+EPFLDSVLLHMGSMYSTLKK +KS+SAYKRAIDIIEKKSGK      
Sbjct: 181 DILNNIVDSLKDNEPFLDSVLLHMGSMYSTLKKLDKSVSAYKRAIDIIEKKSGK------ 240

Query: 261 PYNESFAGEDSTFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPL 320
                    DS+FLITPILGMAKVLG  G+T KAVEFYHRAIS+LES RG E+EDLVIPL
Sbjct: 241 ---------DSSFLITPILGMAKVLGTSGKTTKAVEFYHRAISILESIRGFEDEDLVIPL 300

Query: 321 FGLGNLMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSS 380
           F LGNL+LKEGKGKDAETCFARIV+IYKKLYGEKDGKVGMAMYSLANAKCAR        
Sbjct: 301 FSLGNLLLKEGKGKDAETCFARIVNIYKKLYGEKDGKVGMAMYSLANAKCAR-------- 360

Query: 381 FIHQLKFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVG 440
                          GEADEA+TLYRRAL IIKDSN MALDDS MEKMRI+LAELLH VG
Sbjct: 361 ---------------GEADEAITLYRRALQIIKDSNYMALDDSEMEKMRIDLAELLHAVG 420

Query: 441 RGSEGREILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIK 500
           RG EGRE+LEECLLINE+SKGK+HPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIM+K
Sbjct: 421 RGKEGRELLEECLLINEKSKGKDHPSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMVK 480

Query: 501 AVGPDDQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVS 560
           AVGPDDQSIT PMLNLAVTLYNLK+DD+AEQLALEVLRIRE+AFGKD LPVGEALDCLVS
Sbjct: 481 AVGPDDQSITNPMLNLAVTLYNLKRDDDAEQLALEVLRIRENAFGKDCLPVGEALDCLVS 540

Query: 561 IQSRLGKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKR 620
           IQSRLGKDE ELLKLLKRILRIQE+ FG++ KEVIDTLKKIVFY++KLG+KDEKF +QKR
Sbjct: 541 IQSRLGKDETELLKLLKRILRIQEKAFGYEAKEVIDTLKKIVFYMDKLGLKDEKFPVQKR 576

Query: 621 LSKLRMKFKNQMQY 635
           LS LRMKFKNQMQY
Sbjct: 601 LSLLRMKFKNQMQY 576

BLAST of CaUC01G000420 vs. ExPASy TrEMBL
Match: A0A6J1H1D9 (nephrocystin-3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111459474 PE=4 SV=1)

HSP 1 Score: 900.2 bits (2325), Expect = 4.9e-258
Identity = 481/590 (81.53%), Positives = 515/590 (87.29%), Query Frame = 0

Query: 45  QSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCRFAPTPSASNRADLGGQRKHISSAF 104
           QSLQLQKGITC SV LQKQK NIKLYAVPV AFSC+FA T SAS+RADLG QRKHI+SAF
Sbjct: 34  QSLQLQKGITCSSVSLQKQKCNIKLYAVPVRAFSCQFASTGSASSRADLGSQRKHIASAF 93

Query: 105 TAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEVKMMAMSGRKNDAVELLQANYEAVK 164
           T PNGYQSRP  MH RTDG S  EFE QL+ELFNEV+M+ +SGRK+DAVELLQANYEAVK
Sbjct: 94  TVPNGYQSRPGHMHYRTDGTSTSEFEGQLDELFNEVRMLIVSGRKSDAVELLQANYEAVK 153

Query: 165 EQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVLDILNKVVDSLEDSEPFLDSVLLHM 224
           EQMESGAIGIEQAAVLDIVALGYITVGDLKFVAS+LDILN +VDSL+D+EPFLDSVLLHM
Sbjct: 154 EQMESGAIGIEQAAVLDIVALGYITVGDLKFVASILDILNNIVDSLKDNEPFLDSVLLHM 213

Query: 225 GSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDIPYNESFAGEDSTFLITPILGMAKV 284
           GSMYSTLKK EKSMSAYKRAIDIIEKKSGK               DS+FLITPILGMAKV
Sbjct: 214 GSMYSTLKKLEKSMSAYKRAIDIIEKKSGK---------------DSSFLITPILGMAKV 273

Query: 285 LGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPLFGLGNLMLKEGKGKDAETCFARIV 344
           LG  G+T KAVE YHRAIS+LESTRG E+EDLVIPLF LGNL+LKEGKGKDAETCFARIV
Sbjct: 274 LGTSGKTTKAVESYHRAISILESTRGFEDEDLVIPLFSLGNLLLKEGKGKDAETCFARIV 333

Query: 345 DIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSSFIHQLKFSFAKYSSIGEADEAVTL 404
           +IYKKLYGEKDGKVGMAMYSLANAKCAR                       GEADEA+TL
Sbjct: 334 NIYKKLYGEKDGKVGMAMYSLANAKCAR-----------------------GEADEAITL 393

Query: 405 YRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVGRGSEGREILEECLLINERSKGKEH 464
           YRRAL IIKDSN MALDDS MEKMRI+LAELLH VGRG EGRE+LEECLLINE+SKGK+H
Sbjct: 394 YRRALQIIKDSNYMALDDSEMEKMRIDLAELLHAVGRGKEGRELLEECLLINEKSKGKDH 453

Query: 465 PSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIKAVGPDDQSITVPMLNLAVTLYNLK 524
           PSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIM+KAVGPDDQSIT PMLNLAVTLYNLK
Sbjct: 454 PSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMVKAVGPDDQSITNPMLNLAVTLYNLK 513

Query: 525 QDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVSIQSRLGKDENELLKLLKRILRIQE 584
           +DD+AEQLALEVLRIRE+AFGKD LPVGEALDCLVSIQSRLGKD+ ELLKLLKRILRIQE
Sbjct: 514 RDDDAEQLALEVLRIRENAFGKDCLPVGEALDCLVSIQSRLGKDDTELLKLLKRILRIQE 573

Query: 585 REFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKRLSKLRMKFKNQMQY 635
           + FG++ KEVIDTLKKIVFY++KLGMKDEKF +QKRLS LR KFKNQMQY
Sbjct: 574 KAFGYEAKEVIDTLKKIVFYMDKLGMKDEKFPVQKRLSLLRTKFKNQMQY 585

BLAST of CaUC01G000420 vs. ExPASy TrEMBL
Match: A0A6J1K577 (nephrocystin-3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111491290 PE=4 SV=1)

HSP 1 Score: 894.0 bits (2309), Expect = 3.5e-256
Identity = 478/590 (81.02%), Positives = 513/590 (86.95%), Query Frame = 0

Query: 45  QSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCRFAPTPSASNRADLGGQRKHISSAF 104
           QSLQLQKGITC SV LQKQK NIKLYAVPV AFSCRFA T SAS+R DLG QRKHI+SAF
Sbjct: 34  QSLQLQKGITCSSVSLQKQKCNIKLYAVPVRAFSCRFASTGSASSRDDLGSQRKHIASAF 93

Query: 105 TAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEVKMMAMSGRKNDAVELLQANYEAVK 164
           T PNGYQSR   MH RTDGNS  EFE QL+ELFNEV+M+ +S RK+DAVELLQANYEAVK
Sbjct: 94  TVPNGYQSRAGHMHYRTDGNSASEFEGQLDELFNEVRMLIVSRRKSDAVELLQANYEAVK 153

Query: 165 EQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVLDILNKVVDSLEDSEPFLDSVLLHM 224
           EQMESGA GIEQAAVLDIVALGYITVGDLKFVAS+LDILN +VDSL+D+EPFLDSVLLHM
Sbjct: 154 EQMESGACGIEQAAVLDIVALGYITVGDLKFVASILDILNNIVDSLKDNEPFLDSVLLHM 213

Query: 225 GSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDIPYNESFAGEDSTFLITPILGMAKV 284
           GSMYSTLKK +KS+SAYKRAIDIIEKKSGK               DS+FLITPILGMAKV
Sbjct: 214 GSMYSTLKKLDKSVSAYKRAIDIIEKKSGK---------------DSSFLITPILGMAKV 273

Query: 285 LGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPLFGLGNLMLKEGKGKDAETCFARIV 344
           LG  G+T KAVEFYHRAIS+LES RG E+EDLVIPLF LGNL+LKEGKGKDAETCFARIV
Sbjct: 274 LGTSGKTTKAVEFYHRAISILESIRGFEDEDLVIPLFSLGNLLLKEGKGKDAETCFARIV 333

Query: 345 DIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSSFIHQLKFSFAKYSSIGEADEAVTL 404
           +IYKKLYGEKDGKVGMAMYSLANAKCAR                       GEADEA+TL
Sbjct: 334 NIYKKLYGEKDGKVGMAMYSLANAKCAR-----------------------GEADEAITL 393

Query: 405 YRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVGRGSEGREILEECLLINERSKGKEH 464
           YRRAL IIKDSN MALDDS MEKMRI+LAELLH VGRG EGRE+LEECLLINE+SKGK+H
Sbjct: 394 YRRALQIIKDSNYMALDDSEMEKMRIDLAELLHAVGRGKEGRELLEECLLINEKSKGKDH 453

Query: 465 PSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIKAVGPDDQSITVPMLNLAVTLYNLK 524
           PSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIM+KAVGPDDQSIT PMLNLAVTLYNLK
Sbjct: 454 PSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMVKAVGPDDQSITNPMLNLAVTLYNLK 513

Query: 525 QDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVSIQSRLGKDENELLKLLKRILRIQE 584
           +DD+AEQLALEVLRIRE+AFGKD LPVGEALDCLVSIQSRLGKDE ELLKLLKRILRIQE
Sbjct: 514 RDDDAEQLALEVLRIRENAFGKDCLPVGEALDCLVSIQSRLGKDETELLKLLKRILRIQE 573

Query: 585 REFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKRLSKLRMKFKNQMQY 635
           + FG++ KEVIDTLKKIVFY++KLG+KDEKF +QKRLS LRMKFKNQMQY
Sbjct: 574 KAFGYEAKEVIDTLKKIVFYMDKLGLKDEKFPVQKRLSLLRMKFKNQMQY 585

BLAST of CaUC01G000420 vs. ExPASy TrEMBL
Match: A0A6J1H150 (nephrocystin-3 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111459474 PE=4 SV=1)

HSP 1 Score: 894.0 bits (2309), Expect = 3.5e-256
Identity = 480/590 (81.36%), Positives = 514/590 (87.12%), Query Frame = 0

Query: 45  QSLQLQKGITCPSVCLQKQKYNIKLYAVPVGAFSCRFAPTPSASNRADLGGQRKHISSAF 104
           QSLQLQKGITC SV LQKQK NIKLYAVPV AFSC+FA T SAS+RADLG QRKHI+SAF
Sbjct: 34  QSLQLQKGITCSSVSLQKQKCNIKLYAVPVRAFSCQFASTGSASSRADLGSQRKHIASAF 93

Query: 105 TAPNGYQSRPELMHNRTDGNSMIEFEVQLEELFNEVKMMAMSGRKNDAVELLQANYEAVK 164
           T PNGYQ RP  MH RTDG S  EFE QL+ELFNEV+M+ +SGRK+DAVELLQANYEAVK
Sbjct: 94  TVPNGYQ-RPGHMHYRTDGTSTSEFEGQLDELFNEVRMLIVSGRKSDAVELLQANYEAVK 153

Query: 165 EQMESGAIGIEQAAVLDIVALGYITVGDLKFVASVLDILNKVVDSLEDSEPFLDSVLLHM 224
           EQMESGAIGIEQAAVLDIVALGYITVGDLKFVAS+LDILN +VDSL+D+EPFLDSVLLHM
Sbjct: 154 EQMESGAIGIEQAAVLDIVALGYITVGDLKFVASILDILNNIVDSLKDNEPFLDSVLLHM 213

Query: 225 GSMYSTLKKFEKSMSAYKRAIDIIEKKSGKGAFGDIPYNESFAGEDSTFLITPILGMAKV 284
           GSMYSTLKK EKSMSAYKRAIDIIEKKSGK               DS+FLITPILGMAKV
Sbjct: 214 GSMYSTLKKLEKSMSAYKRAIDIIEKKSGK---------------DSSFLITPILGMAKV 273

Query: 285 LGIIGRTAKAVEFYHRAISLLESTRGLENEDLVIPLFGLGNLMLKEGKGKDAETCFARIV 344
           LG  G+T KAVE YHRAIS+LESTRG E+EDLVIPLF LGNL+LKEGKGKDAETCFARIV
Sbjct: 274 LGTSGKTTKAVESYHRAISILESTRGFEDEDLVIPLFSLGNLLLKEGKGKDAETCFARIV 333

Query: 345 DIYKKLYGEKDGKVGMAMYSLANAKCARGSFGHLSSFIHQLKFSFAKYSSIGEADEAVTL 404
           +IYKKLYGEKDGKVGMAMYSLANAKCAR                       GEADEA+TL
Sbjct: 334 NIYKKLYGEKDGKVGMAMYSLANAKCAR-----------------------GEADEAITL 393

Query: 405 YRRALHIIKDSNDMALDDSVMEKMRIELAELLHVVGRGSEGREILEECLLINERSKGKEH 464
           YRRAL IIKDSN MALDDS MEKMRI+LAELLH VGRG EGRE+LEECLLINE+SKGK+H
Sbjct: 394 YRRALQIIKDSNYMALDDSEMEKMRIDLAELLHAVGRGKEGRELLEECLLINEKSKGKDH 453

Query: 465 PSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMIKAVGPDDQSITVPMLNLAVTLYNLK 524
           PSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIM+KAVGPDDQSIT PMLNLAVTLYNLK
Sbjct: 454 PSSVKHLVNLAASYSRSKNYAEAERLLRIGLDIMVKAVGPDDQSITNPMLNLAVTLYNLK 513

Query: 525 QDDNAEQLALEVLRIRESAFGKDSLPVGEALDCLVSIQSRLGKDENELLKLLKRILRIQE 584
           +DD+AEQLALEVLRIRE+AFGKD LPVGEALDCLVSIQSRLGKD+ ELLKLLKRILRIQE
Sbjct: 514 RDDDAEQLALEVLRIRENAFGKDCLPVGEALDCLVSIQSRLGKDDTELLKLLKRILRIQE 573

Query: 585 REFGHDGKEVIDTLKKIVFYVEKLGMKDEKFLLQKRLSKLRMKFKNQMQY 635
           + FG++ KEVIDTLKKIVFY++KLGMKDEKF +QKRLS LR KFKNQMQY
Sbjct: 574 KAFGYEAKEVIDTLKKIVFYMDKLGMKDEKFPVQKRLSLLRTKFKNQMQY 584

BLAST of CaUC01G000420 vs. TAIR 10
Match: AT5G53080.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 577.0 bits (1486), Expect = 1.8e-164
Identity = 327/623 (52.49%), Positives = 426/623 (68.38%), Query Frame = 0

Query: 15  NSLVPAMSTFLLLSSQSSTSRRVSEICLCVQSLQLQKGITCPSVCLQKQKYNIKLYAVPV 74
           NSL+ + ++    ++QSS+           QS    +  T   VCL+ QK   KLY +P 
Sbjct: 5   NSLLSSTTSLTTWANQSSS-----------QSSLSPRYSTWQCVCLRNQKRKPKLYLIPA 64

Query: 75  GAFSCRFAPTPSASNRADLGGQRKHISSAFTAPNGYQSRPELMHNRTDGNS---MIEFEV 134
             F     P  S S            SS+ TA     S    +   T  N+   M EFE+
Sbjct: 65  RHFLS--TPIDSVS------------SSSITASRYATSGVSEVQRSTSSNNVTEMEEFEM 124

Query: 135 QLEELFNEVKMMAMSGRKNDAVELLQANYEAVKEQMESGAIGIEQAAVLDIVALGYITVG 194
           +L+ELFNEVK M   G+++DA++LL+ANY AVKE+++SG  GIEQAAVLDI+ALGY+ VG
Sbjct: 125 ELQELFNEVKSMVKIGKESDAMDLLRANYVAVKEELDSGLKGIEQAAVLDIIALGYMAVG 184

Query: 195 DLKFVASVLDILNKVVDSLEDSEPFLDSVLLHMGSMYSTLKKFEKSMSAYKRAIDIIEKK 254
           DLK V ++LD++NK+VD+L+DSEP LDSVL+H+GSMYS + KFE ++  ++RAI I+E +
Sbjct: 185 DLKPVPALLDMINKIVDNLKDSEPLLDSVLMHVGSMYSVIGKFENAILVHQRAIRILENR 244

Query: 255 SGKGAFGDIPYNESFAGEDSTFLITPILGMAKVLGIIGRTAKAVEFYHRAISLLESTRGL 314
            GK                +T L+TP+LGMAK     G+  KA+  Y R +++LE  RG 
Sbjct: 245 YGK---------------CNTLLVTPLLGMAKSFASDGKATKAIGVYERTLTILERNRGS 304

Query: 315 ENEDLVIPLFGLGNLMLKEGKGKDAETCFARIVDIYKKLYGEKDGKVGMAMYSLANAKCA 374
           E+EDLV+PLF LG L+LKEGK  +AE  F  IV+IYKK+YGE+DG+VGMAM SLANAKC+
Sbjct: 305 ESEDLVVPLFSLGKLLLKEGKAAEAEIPFTSIVNIYKKIYGERDGRVGMAMCSLANAKCS 364

Query: 375 RGSFGHLSSFIHQLKFSFAKYSSIGEADEAVTLYRRALHIIKDSNDMALDDSVMEKMRIE 434
           +                       G+A+EAV +YR AL IIKDSN M +D+S++E MRI+
Sbjct: 365 K-----------------------GDANEAVDIYRNALRIIKDSNYMTIDNSILENMRID 424

Query: 435 LAELLHVVGRGSEGREILEECLLINERSKGKEHPSSVKHLVNLAASYSRSKNYAEAERLL 494
           LAELLH VGRG EGRE+LEECLLINER KGK HPS   HL+NLAASYSRSKNY EAERLL
Sbjct: 425 LAELLHFVGRGDEGRELLEECLLINERFKGKNHPSMATHLINLAASYSRSKNYVEAERLL 484

Query: 495 RIGLDIMIKAVGPDDQSITVPMLNLAVTLYNLKQDDNAEQLALEVLRIRESAFGKDSLPV 554
           R  L+IM  +VG + QSIT PMLNLAVTL  L +D+ AEQ+AL+VLRIRE AFG+DSLPV
Sbjct: 485 RTCLNIMEVSVGSEGQSITFPMLNLAVTLSQLNRDEEAEQIALKVLRIREKAFGEDSLPV 544

Query: 555 GEALDCLVSIQSRLGKDENELLKLLKRILRIQEREFGHDGKEVIDTLKKIVFYVEKLGMK 614
           GEALDCLVSIQ+RLG+D+ E+L LLKR++ IQE+EFG   +E+I TL+KI+ ++EKL MK
Sbjct: 545 GEALDCLVSIQARLGRDDGEILGLLKRVMMIQEKEFGPSAQELIVTLQKIIHFLEKLEMK 564

Query: 615 DEKFLLQKRLSKLRMKFKNQMQY 635
           D+KF  ++RL+ LR ++K  + Y
Sbjct: 605 DDKFKFRRRLALLRERYKQSLSY 564

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038875999.1	3.3e-272	84.20	nephrocystin-3 isoform X1 [Benincasa hispida] >XP_038876001.1 nephrocystin-3 iso...
XP_038876003.1	2.1e-271	84.04	nephrocystin-3 isoform X2 [Benincasa hispida]
XP_022958168.1	5.9e-266	80.94	nephrocystin-3 isoform X3 [Cucurbita moschata] >XP_022958169.1 nephrocystin-3 is...
XP_022995907.1	4.3e-264	80.46	nephrocystin-3 isoform X3 [Cucurbita maxima] >XP_022995908.1 nephrocystin-3 isof...
XP_023533270.1	8.0e-263	80.29	nephrocystin-3-like isoform X3 [Cucurbita pepo subsp. pepo] >XP_023533271.1 neph...
Match Name	E-value	Identity	Description
Q6AZT7	1.3e-16	24.32	Nephrocystin-3 OS=Xenopus laevis OX=8355 GN=nphp3 PE=2 SV=1
P0CI65	4.8e-16	23.57	Nephrocystin-3 OS=Danio rerio OX=7955 GN=nphp3 PE=3 SV=1
Q07866	3.2e-12	24.64	Kinesin light chain 1 OS=Homo sapiens OX=9606 GN=KLC1 PE=1 SV=2
Q5R581	3.2e-12	24.64	Kinesin light chain 1 OS=Pongo abelii OX=9601 GN=KLC1 PE=2 SV=3
P37285	5.4e-12	24.29	Kinesin light chain 1 OS=Rattus norvegicus OX=10116 GN=Klc1 PE=1 SV=2
Match Name	E-value	Identity	Description
A0A6J1H2E0	2.9e-266	80.94	nephrocystin-3 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111459474 PE=4 SV=...
A0A6J1K9C6	2.1e-264	80.46	nephrocystin-3 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111491290 PE=4 SV=1
A0A6J1H1D9	4.9e-258	81.53	nephrocystin-3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111459474 PE=4 SV=...
A0A6J1K577	3.5e-256	81.02	nephrocystin-3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111491290 PE=4 SV=1
A0A6J1H150	3.5e-256	81.36	nephrocystin-3 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111459474 PE=4 SV=...
Match Name	E-value	Identity	Description
AT5G53080.1	1.8e-164	52.49	Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 231..251
None	No IPR available	PFAM	PF13374	TPR_10	coord: 513..548; e-value: 3.0E-6; score: 26.9
None	No IPR available	PFAM	PF13424	TPR_12	coord: 280..349; e-value: 3.8E-7; score: 30.3
None	No IPR available	PFAM	PF13424	TPR_12	coord: 428..497; e-value: 6.0E-10; score: 39.3
None	No IPR available	PANTHER	PTHR45641	TETRATRICOPEPTIDE REPEAT PROTEIN (AFU_ORTHOLOGUE AFUA_6G03870)	coord: 133..621
IPR019734	Tetratricopeptide repeat	SMART	SM00028	tpr_5	coord: 468..501; e-value: 59.0; score: 8.2
IPR019734	Tetratricopeptide repeat	SMART	SM00028	tpr_5	coord: 317..350; e-value: 84.0; score: 6.8
IPR019734	Tetratricopeptide repeat	SMART	SM00028	tpr_5	coord: 275..308; e-value: 30.0; score: 10.8
IPR019734	Tetratricopeptide repeat	SMART	SM00028	tpr_5	coord: 510..543; e-value: 94.0; score: 6.4
IPR019734	Tetratricopeptide repeat	SMART	SM00028	tpr_5	coord: 218..251; e-value: 7.9; score: 15.4
IPR019734	Tetratricopeptide repeat	PFAM	PF13181	TPR_8	coord: 220..249; e-value: 0.0018; score: 18.3
IPR019734	Tetratricopeptide repeat	PROSITE	PS50005	TPR	coord: 218..251; score: 9.2929
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 199..426; e-value: 4.7E-22; score: 80.3
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 427..633; e-value: 7.5E-29; score: 102.4
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 220..497

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CaUC01G000420.1	CaUC01G000420.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0009658	chloroplast organization
biological_process	GO:0031425	chloroplast RNA processing
cellular_component	GO:0009507	chloroplast
molecular_function	GO:0005515	protein binding