<?xml version="1.0"?>
<Articles JournalTitle="Journal of Biostatistics and Epidemiology">
  <Article>
    <Journal>
      <PublisherName>Tehran University of Medical Sciences</PublisherName>
      <JournalTitle>Journal of Biostatistics and Epidemiology</JournalTitle>
      <Issn>2383-4196</Issn>
      <Volume>4</Volume>
      <Issue>2</Issue>
      <PubDate PubStatus="epublish">
        <Year>2018</Year>
        <Month>10</Month>
        <Day>31</Day>
      </PubDate>
    </Journal>
    <title locale="en_US">Na&#xEF;ve Bayes evidence accumulation K-modes clustering: A new method for classifying binary data and its application on real data of injecting drug users</title>
    <FirstPage>72</FirstPage>
    <LastPage>78</LastPage>
    <Language>EN</Language>
    <AuthorList>
      <Author>
        <FirstName>Zahra</FirstName>
        <LastName>Zamaninasab</LastName>
        <affiliation locale="en_US">HIV/STI Surveillance Research Center, and WHO Collaborating Center for HIV Surveillance, Institute for Futures Studies in Health, Department of Biostatistics and Epidemiology, School of Public Health, Kerman university of Medical Sciences, Kerman, Iran</affiliation>
      </Author>
      <Author>
        <FirstName>Hamid</FirstName>
        <LastName>Sharifi</LastName>
        <affiliation locale="en_US">HIV/STI Surveillance Research Center, and WHO Collaborating Center for HIV Surveillance, Institute for Futures Studies in Health, Kerman University of Medical Sciences, Kerman, Iran AND Department of Biostatistics and Epidemiology, Faculty of Public Health, Kerman University of Medical Sciences, Kerman, Iran</affiliation>
      </Author>
      <Author>
        <FirstName>Abbas</FirstName>
        <LastName>Bahrampour</LastName>
        <affiliation locale="en_US">Modelling in Health Research Center, Institute for Futures Studies in Health, Department of Biostatistics and Epidemiology, Health Faculty, Kerman University of Medical Sciences, Kerman, Iran</affiliation>
      </Author>
    </AuthorList>
    <History>
      <PubDate PubStatus="received">
        <Year>2018</Year>
        <Month>04</Month>
        <Day>12</Day>
      </PubDate>
    </History>
    <abstract locale="en_US">Background &amp; Aim: Clustering is the method of classifying discrete data such as Kmodes, and Na&#xEF;ve Bayes classifier is the classification to predict the unknown real classes. In this research, we improve the K-modes results by applying the Evidence Accumulation (EA) method to keep the initial mode vector to use in the Na&#xEF;ve Bayes EA K-Mode.
Methods &amp; Materials: The methods are applied to four real datasets, which the true classes are specified, for checking the external validity and purity of our methods. The free programming software R with package klaR for K-modes, EA, and package e1071 for Na&#xEF;ve Bayes is used. In addition, the methods are applied to the data of Injecting Drug Users (IDU) national dataset with sample size 2546.
Results: The EA K-modes algorithm applied to five real datasets then with the kept initial mode vector, rerun the K-modes. The results indicate the purity in the EA K-modes (0.544, 0.862, 0.914, 0.944, 0.625) has significant different with classic K-modes (0.497, 0.610, 0.404, 0.650, 0.625). Finally, we applied the Na&#xEF;ve Bayes classifier with prior probability finds in EA K-modes. For K=2 Na&#xEF;ve Bayes EA K-modes made better clustering (0.71, 0.873 against 0.625, 0.862 EA k-mode and 0.497, 0.61 K-mode).
Conclusion: In this paper, we proposed Na&#xEF;ve Bayes EA K-modes as a new method for clustering of binary data. Our new method leads to stable clustering compare with the previous studies. The Na&#xEF;ve Bayes EA K-modes method improves the purity and establishes a better separation.</abstract>
    <web_url>https://jbe.tums.ac.ir/index.php/jbe/article/view/179</web_url>
    <pdf_url>https://jbe.tums.ac.ir/index.php/jbe/article/download/179/160</pdf_url>
  </Article>
</Articles>
