Revisiting chameleon sequences in the Protein Data Bank
Abstract: The steady growth of the Protein Data Bank suggests the periodic repetition of searches for sequences that form different secondary structures in different protein structures, called chameleon sequences. This paper presents a fast (n log(n)) algorithm for such search and presents the results on all protein structures in the PDB. The longest such sequence found consists of 20 residues.