Welcome to the upgraded MacSphere! We're putting the finishing touches on it; if you notice anything amiss, email macsphere@mcmaster.ca

On the number of distinct squares in strings

dc.contributor.advisorFranek, Frantiseken_US
dc.contributor.advisorDeza, Antoineen_US
dc.contributor.advisorFred Hoppe, Nedialko Nedialkoven_US
dc.contributor.authorJiang, Meien_US
dc.contributor.departmentComputing and Softwareen_US
dc.date.accessioned2014-06-18T17:05:41Z
dc.date.available2014-06-18T17:05:41Z
dc.date.created2014-01-30en_US
dc.date.issued2014-04en_US
dc.description.abstract<p>We investigate the problem of the maximum number of distinct primitively rooted squares in a string. In comparison to considering general strings, the number of distinct symbols in the string is introduced as an additional parameter of the problem. Let S(d,n) = max {s(x) | x is a (d,n)-string}, where s(x) denotes the number of distinct primitively rooted squares in a string x and a (d,n)-string denotes a string of length n with exactly d distinct symbols.</p> <p>Inspired by the d-step approach which was instrumental in Santos' tackling of the Hirsch conjecture, we introduce a (d,n-d) table with entries S(d,n) where d is the index for the rows and n-d is the index for the columns. We examine the properties of the S(d,n) function in the context of (d,n-d) table and conjecture that the value of S(d,n) is no more than n-d. We present several equivalent properties with the conjecture. We discuss the significance of the main diagonal of the (d,n-d) table, i.e. the square-maximal (d, 2d)-strings for their relevance to the conjectured bound for all strings. We explore their structural properties under both assumptions, complying or not complying with the conjecture, with the intention to derive a contradiction. The result yields novel properties and statements equivalent with the conjecture with computational application to the determination of the values S(d,n).</p> <p>To further populate the (d,n-d) table, we design and implement an efficient computational framework for computing S(d,n). Instead of generating all possible (d,n)-strings as the brute-force approach needs to do, the computational effort is significantly reduced by narrowing down the search space for square-maximal strings. With an easily accessible lower bound obtained either from the previously computed values inductively or by an effective heuristic search, only a relatively small set of candidate strings that might possibly exceed the lower bound is generated. To this end, the notions of s-cover and the density of a string are introduced and utilized. In special circumstances, the computational efficiency can be further improved by starting the s-cover with a double square structure. In addition, we present an auxiliary algorithm that returns the required information including the number of distinct squares for each generated candidate string. This algorithm is a modified version of FJW algorithm, an implementation based on Crochemore's partition algorithm, developed by Franek, Jiang and Weng. As of writing of this thesis, we have been able to obtain the maximum number of distinct squares in binary strings till the length of 70.</p>en_US
dc.description.degreeDoctor of Philosophy (PhD)en_US
dc.identifier.otheropendissertations/8776en_US
dc.identifier.other9853en_US
dc.identifier.other5044059en_US
dc.identifier.urihttp://hdl.handle.net/11375/13945
dc.subjectstringen_US
dc.subjectsquareen_US
dc.subjectprimitively rooted squareen_US
dc.subjectparameterized approachen_US
dc.subjectd-step approachen_US
dc.subjectrunen_US
dc.subjectTheory and Algorithmsen_US
dc.subjectTheory and Algorithmsen_US
dc.titleOn the number of distinct squares in stringsen_US
dc.typethesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
fulltext.pdf
Size:
1.76 MB
Format:
Adobe Portable Document Format