A Comparative Study on Two Strategies for Distributed Classification

dc.contributor.advisorWu, Qiang
dc.contributor.authorXu, Honglan
dc.contributor.committeememberHong, Don
dc.contributor.committeememberLiu, Yeqian
dc.contributor.committeememberGreen, Lisa
dc.contributor.departmentBasic & Applied Sciencesen_US
dc.date.accessioned2018-06-05T20:04:58Z
dc.date.available2018-06-05T20:04:58Z
dc.date.issued2018-05-30
dc.description.abstractDistributed learning is an effective tool to process big data. An easy and effective distributed learning approach is the divide and conquer method. It first partitions the whole data set into multiple subsets. A base learning algorithm is then applied to each subset. Finally the results from these subsets are coupled together. In the classification setting, many classification algorithms can be used in the second stage. Typical ones include the logistic regression and support vector machines. For the third stage, both voting and averaging can be used as the coupling strategies. In this thesis, empirical studies are done to thoroughly compare the effectiveness of these two coupling strategies. Averaging is found to be more effective in most scenarios.
dc.description.degreeM.S.
dc.identifier.urihttp://jewlscholar.mtsu.edu/xmlui/handle/mtsu/5685
dc.publisherMiddle Tennessee State University
dc.subject.umiMathematics
dc.thesis.degreegrantorMiddle Tennessee State University
dc.thesis.degreelevelMasters
dc.titleA Comparative Study on Two Strategies for Distributed Classification
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Xu_mtsu_0170N_10966.pdf
Size:
268.18 KB
Format:
Adobe Portable Document Format

Collections