<rdf:RDF xmlns:crm='http://www.cidoc-crm.org/rdfs/cidoc_crm_v5.0.2_english_label.rdfs#' xmlns:dc='http://purl.org/dc/elements/1.1/' xmlns:dcterms='http://purl.org/dc/terms/' xmlns:doap='http://usefulinc.com/ns/doap#' xmlns:edm='http://www.europeana.eu/schemas/edm/' xmlns:ekt='https://www.semantics.gr/authorities/schemanamespaces/ekt#' xmlns:foaf='http://xmlns.com/foaf/0.1/' xmlns:ore='http://www.openarchives.org/ore/terms/' xmlns:owl='http://www.w3.org/2002/07/owl#' xmlns:rdaGr2='http://rdvocab.info/ElementsGr2/' xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#' xmlns:rdfs='http://www.w3.org/2000/01/rdf-schema#' xmlns:skos='http://www.w3.org/2004/02/skos/core#' xmlns:svcs='http://rdfs.org/sioc/services#' xmlns:wgs84_pos='http://www.w3.org/2003/01/geo/wgs84_pos#' xmlns:xalan='http://xml.apache.org/xalan'><edm:ProvidedCHO rdf:about='https://www.openarchives.gr/aggregator-openarchives/edm/psepheda/000004-2159_27904'><dc:contributor xml:lang='el'>Ευαγγελίδης, Γεώργιος</dc:contributor><dc:creator xml:lang='el'>Παπανικολάου, Μερκούριος</dc:creator><dc:description xml:lang='el'>Διπλωματική εργασία--Πανεπιστήμιο Μακεδονίας, Θεσσαλονίκη, 2022.</dc:description><dc:description xml:lang='en'>Approved for entry into archive by Κυριακή Μπαλτά (balta@uom.gr) on 2022-11-22T11:25:29Z (GMT) No. of bitstreams: 2
license_rdf: 914 bytes, checksum: 24013099e9e6abb1575dc6ce0855efd5 (MD5)
PapanikolaouMerkouriosMSc2022.pdf: 791181 bytes, checksum: 9603df5796e570350b6ade2315776ff7 (MD5)</dc:description><dc:description xml:lang='en'>Made available in DSpace on 2022-11-22T11:25:30Z (GMT). No. of bitstreams: 2
license_rdf: 914 bytes, checksum: 24013099e9e6abb1575dc6ce0855efd5 (MD5)
PapanikolaouMerkouriosMSc2022.pdf: 791181 bytes, checksum: 9603df5796e570350b6ade2315776ff7 (MD5)
  Previous issue date: 2022-10-14</dc:description><dc:description xml:lang='en'>Submitted by ΜΕΡΚΟΥΡΙΟΣ ΠΑΠΑΝΙΚΟΛΑΟΥ (aid20006@uom.edu.gr) on 2022-11-18T09:26:57Z
No. of bitstreams: 2
license_rdf: 914 bytes, checksum: 24013099e9e6abb1575dc6ce0855efd5 (MD5)
PapanikolaouMerkouriosMSc2022.pdf: 791181 bytes, checksum: 9603df5796e570350b6ade2315776ff7 (MD5)</dc:description><dc:description xml:lang='en'>K – Nearest Neighbor (k-NN) classifier is one of them most widely used supervised  
learning algorithms. Its popularity is mainly due to its simplicity, effectiveness, ease of 
implementation and ability to add new data in the training set at any time. However, one  
of the major drawbacks of this algorithm is the fact that its performance mainly depends 
on the parameter value k, i.e. the number of nearest neighbors that the algorithm examines 
in order to classify the unlabeled instance. In most cases, it is a fixed value, independent 
of data distribution. The most frequently used technique for the “best” k determination is 
the cross validation as there is no general rule for choosing the k value due to its 
dependency on the training dataset. A large k value results in a noise tolerant classifier, as 
its search area is large. On the other hand, a small k value results in a noise sensitive 
classifier, as the search area is limited. So, selecting a ﬁxed k value throughout the dataset 
does not take into account its special features, like data distribution, class separation, 
imbalanced classes, sparse and dense neighborhoods and noisy subspaces. 
In recent years, a lot of research have been conducted in order to tackle the above-
mentioned disadvantage. The research has led to various approaches of k-NN classiﬁer, 
which mainly combine it with various other techniques for k value determination. In the 
present study, a thorough literature review is conducted in order to summarize all the 
achievements made to date in this ﬁeld. This procedure led to a pool of twenty-eight (28) 
publications, covering a time period from 1986 till 2020 (with median value 2009). These 
studies are presented in this work, describing the techniques used for dynamic k 
determination. For each study, several indicators are recorded, namely the technique used 
for k selection, the level of k selection, the number of datasets used for experiments, 
whether statistical tests were conducted or not, the total number of citations each research 
has received as well as the average citations per year. 
Apart from the above, a new alternative version of k-NN algorithm is proposed. 
The proposed algorithm is an extension of a previous work, found in the literature. The 
approach is a k-free k-NN variation, in the sense that the user does not select the parameter, 
but it is selected dynamically, depending on the area where each unlabeled data point lies. 
The algorithm falls into the group of the studies that exploit prototype and clustering 
techniques in order to represent the initial dataset. Through a recursive process, 
homogenous clusters are created, each of which are represented by a representative. Moreover, a new term is introduced, namely the Sphere of Influnce (SoI), an index 
which indicates the size of each created cluster. This index, in combination with the 
indicator depth (d), provides useful information about the subspace that each representative 
lies. Finally, heuristics are proposed in order to exploit the information provided from SoI 
and d and convert it in a k value, unique for every unlabeled instance. 
Extensive experiments were conducted on thirty (30) datasets for all proposed 
heuristics. Some of these datasets contained artificial noise in order to test the proposed 
algorithm in real life situations. The experiments showed a very competitive performance 
– in terms of accuracy – of the proposed algorithm in comparison with some commonly 
used k-NN variations. Moreover, Wilcoxon statistical test was used to find statistically 
significant differences.</dc:description><dc:identifier>http://dspace.lib.uom.gr/handle/2159/27904</dc:identifier><dc:publisher xml:lang='el'>Πανεπιστήμιο Μακεδονίας</dc:publisher><dc:rights xml:lang='el'>Αναφορά Δημιουργού-Μη Εμπορική Χρήση 4.0 Διεθνές</dc:rights><dc:rights xml:lang='en'>http://creativecommons.org/licenses/by-nc/4.0/</dc:rights><dc:subject rdf:resource='http://semantics.gr/authorities/EKT-voc-classifier/605963148'></dc:subject><dc:subject xml:lang='en'>k-nearest neighbor (k- NN)</dc:subject><dc:subject xml:lang='en'>Supervised machine learning</dc:subject><dc:subject xml:lang='en'>Dynamic k value determination</dc:subject><dc:subject xml:lang='en'>Heuristics</dc:subject><dc:subject xml:lang='en'>Classification</dc:subject><dc:title xml:lang='en'>Dynamic / Adaptive determination of parameter k IN k-NN classifier</dc:title><dc:type rdf:resource='http://semantics.gr/authorities/openarchives-item-types/metaptyxiakh-ergasia'></dc:type><dc:type xml:lang='en'>Electronic Thesis or Dissertation</dc:type><dc:type xml:lang='en'>Text</dc:type><dcterms:created>2022</dcterms:created></edm:ProvidedCHO><skos:Concept rdf:about='http://semantics.gr/authorities/EKT-voc-classifier/605963148'><skos:prefLabel xml:lang='el'>Τεχνητή νοημοσύνη</skos:prefLabel><skos:prefLabel xml:lang='en'>Artificial Intelligence</skos:prefLabel><skos:broader rdf:resource='http://semantics.gr/authorities/EKT-voc-classifier/1532468312'></skos:broader><skos:relatedMatch rdf:resource='http://id.loc.gov/authorities/subjects/sh85079324'></skos:relatedMatch><skos:relatedMatch rdf:resource='http://id.loc.gov/authorities/subjects/sh85031234'></skos:relatedMatch><skos:exactMatch rdf:resource='http://vocabularies.unesco.org/thesaurus/concept3052'></skos:exactMatch><skos:exactMatch rdf:resource='http://id.loc.gov/authorities/subjects/sh85008180'></skos:exactMatch><skos:exactMatch rdf:resource='http://semantics.gr/authorities/EKT-voc/605963148'></skos:exactMatch><skos:closeMatch rdf:resource='http://id.loc.gov/authorities/subjects/sh94004659'></skos:closeMatch><skos:note xml:lang='en'>isi - Computer Science, Artificial Intelligence covers resources that focus on research and techniques to create machines that attempt to efficiently reason, problem-solve, use knowledge representation, and perform analysis of contradictory or ambiguous information. This category includes resources on artificial intelligence technologies such as expert systems, fuzzy systems, natural language processing, speech recognition, pattern recognition, computer vision, decision-support systems, knowledge bases, and neural networks.</skos:note></skos:Concept><skos:Concept rdf:about='http://semantics.gr/authorities/openarchives-item-types/metaptyxiakh-ergasia'><skos:prefLabel xml:lang='el'>Μεταπτυχιακή εργασία</skos:prefLabel><skos:prefLabel xml:lang='en'>Master thesis</skos:prefLabel><skos:broader rdf:resource='http://semantics.gr/authorities/openarchives-item-types/Research-Paper-'></skos:broader><skos:exactMatch rdf:resource='http://vocab.getty.edu/aat/300077723'></skos:exactMatch></skos:Concept><ore:Aggregation rdf:about='https://www.openarchives.gr/aggregator-openarchives/edm/aggregation/provider/000004-2159_27904%231'><edm:aggregatedCHO rdf:resource='https://www.openarchives.gr/aggregator-openarchives/edm/psepheda/000004-2159_27904'></edm:aggregatedCHO><edm:dataProvider>Πανεπιστήμιο Μακεδονίας</edm:dataProvider><edm:isShownAt rdf:resource='https://dspace.lib.uom.gr/handle/2159/27904'></edm:isShownAt><edm:provider>Greek Aggregator OpenArchives.gr | National Documentation Centre (EKT)</edm:provider><edm:rights rdf:resource='http://creativecommons.org/licenses/by-nc/4.0/'></edm:rights></ore:Aggregation></rdf:RDF>