On the number of distinct squares in strings

Jiang, Mei

On the number of distinct squares in strings

dc.contributor.advisor	Franek, Frantisek	en_US
dc.contributor.advisor	Deza, Antoine	en_US
dc.contributor.advisor	Fred Hoppe, Nedialko Nedialkov	en_US
dc.contributor.author	Jiang, Mei	en_US
dc.contributor.department	Computing and Software	en_US
dc.date.accessioned	2014-06-18T17:05:41Z
dc.date.available	2014-06-18T17:05:41Z
dc.date.created	2014-01-30	en_US
dc.date.issued	2014-04	en_US
dc.description.abstract	<p>We investigate the problem of the maximum number of distinct primitively rooted squares in a string. In comparison to considering general strings, the number of distinct symbols in the string is introduced as an additional parameter of the problem. Let S(d,n) = max {s(x) \| x is a (d,n)-string}, where s(x) denotes the number of distinct primitively rooted squares in a string x and a (d,n)-string denotes a string of length n with exactly d distinct symbols.</p> <p>Inspired by the d-step approach which was instrumental in Santos' tackling of the Hirsch conjecture, we introduce a (d,n-d) table with entries S(d,n) where d is the index for the rows and n-d is the index for the columns. We examine the properties of the S(d,n) function in the context of (d,n-d) table and conjecture that the value of S(d,n) is no more than n-d. We present several equivalent properties with the conjecture. We discuss the significance of the main diagonal of the (d,n-d) table, i.e. the square-maximal (d, 2d)-strings for their relevance to the conjectured bound for all strings. We explore their structural properties under both assumptions, complying or not complying with the conjecture, with the intention to derive a contradiction. The result yields novel properties and statements equivalent with the conjecture with computational application to the determination of the values S(d,n).</p> <p>To further populate the (d,n-d) table, we design and implement an efficient computational framework for computing S(d,n). Instead of generating all possible (d,n)-strings as the brute-force approach needs to do, the computational effort is significantly reduced by narrowing down the search space for square-maximal strings. With an easily accessible lower bound obtained either from the previously computed values inductively or by an effective heuristic search, only a relatively small set of candidate strings that might possibly exceed the lower bound is generated. To this end, the notions of s-cover and the density of a string are introduced and utilized. In special circumstances, the computational efficiency can be further improved by starting the s-cover with a double square structure. In addition, we present an auxiliary algorithm that returns the required information including the number of distinct squares for each generated candidate string. This algorithm is a modified version of FJW algorithm, an implementation based on Crochemore's partition algorithm, developed by Franek, Jiang and Weng. As of writing of this thesis, we have been able to obtain the maximum number of distinct squares in binary strings till the length of 70.</p>	en_US
dc.description.degree	Doctor of Philosophy (PhD)	en_US
dc.identifier.other	opendissertations/8776	en_US
dc.identifier.other	9853	en_US
dc.identifier.other	5044059	en_US
dc.identifier.uri	http://hdl.handle.net/11375/13945
dc.subject	string	en_US
dc.subject	square	en_US
dc.subject	primitively rooted square	en_US
dc.subject	parameterized approach	en_US
dc.subject	d-step approach	en_US
dc.subject	run	en_US
dc.subject	Theory and Algorithms	en_US
dc.subject	Theory and Algorithms	en_US
dc.title	On the number of distinct squares in strings	en_US
dc.type	thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: fulltext.pdf
Size:: 1.76 MB
Format:: Adobe Portable Document Format

Download

Collections

Open Access Dissertations and Theses