Bio-inspired constrained clustering: A case study on aspect-based sentiment analysis

dc.contributor.authorQasem, Mohammed
dc.contributor.examiningcommitteeWang, Yang (Computer Science) Annakkage, Udaya (Electrical and Computer Engineering) Yang, Laurence T. (Computer Science, St. Francis Xavier University)en_US
dc.contributor.supervisorThulasiraman, Parimala (Computer Science)en_US
dc.date.accessioned2018-07-11T18:51:48Z
dc.date.available2018-07-11T18:51:48Z
dc.date.issued2018-07-04en_US
dc.date.submitted2018-07-04T14:46:09Zen
dc.degree.disciplineComputer Scienceen_US
dc.degree.levelDoctor of Philosophy (Ph.D.)en_US
dc.description.abstractClustering is an important problem in the era of big data. Exact algorithmic clustering approaches are not affordable for many real-world applications (RWA), requiring innovative, and approximation algorithms. Among them are bio or nature-inspired techniques such as “ant brood clustering algorithm” (ACA) inspired by how real ants brood sort their nests. ACA's mathematical model assumes a static radius of perception which is not adaptable to RWA. I address this issue by developing an adaptive clustering algorithm, called “ACA with Adaptive Radius (ACA-AR)” using kernel density estimation, a non-parametric statistical model, to measure average dissimilarity of data objects in ant’s neighborhood. I extend this algorithm to a search-based semi-supervised constrained clustering algorithm (CACA-AR) that incorporates supervisory information to guide the clustering algorithm towards solutions where constraints are minimally violated. I evaluate the accuracy of CACA-AR on benchmark datasets and provide a feasibility study on one RWA, aspect-based sentiment analysis. The F1-score results show that CACA-AR outperforms baseline techniques, multi-class logistic regression, and lexicon based approaches by 20%.en_US
dc.description.noteOctober 2018en_US
dc.identifier.urihttp://hdl.handle.net/1993/33133
dc.language.isoengen_US
dc.rightsopen accessen_US
dc.subjectant, constrained clustering, sentiment analysisen_US
dc.titleBio-inspired constrained clustering: A case study on aspect-based sentiment analysisen_US
dc.typedoctoral thesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Qasem_Mohammed.pdf
Size:
2.27 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.2 KB
Format:
Item-specific license agreed to upon submission
Description: