Skip to main content

Showing 1–23 of 23 results for author: Kowsari, K

  1. arXiv:2106.11077  [pdf, other

    cs.CY cs.CL

    Toward a Knowledge Discovery Framework for Data Science Job Market in the United States

    Authors: Mojtaba Heidarysafa, Kamran Kowsari, Masoud Bashiri, Donald E. Brown

    Abstract: The growth of the data science field requires better tools to understand such a fast-paced growing domain. Moreover, individuals from different backgrounds became interested in following a career as data scientists. Therefore, providing a quantitative guide for individuals and organizations to understand the skills required in the job market would be crucial. This paper introduces a framework to a… ▽ More

    Submitted 20 July, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

  2. arXiv:2011.04802  [pdf, other

    cs.LG cs.AI q-bio.PE q-bio.QM stat.ML

    Sparse Longitudinal Representations of Electronic Health Record Data for the Early Detection of Chronic Kidney Disease in Diabetic Patients

    Authors: Jinghe Zhang, Kamran Kowsari, Mehdi Boukhechba, James Harrison, Jennifer Lobo, Laura Barnes

    Abstract: Chronic kidney disease (CKD) is a gradual loss of renal function over time, and it increases the risk of mortality, decreased quality of life, as well as serious complications. The prevalence of CKD has been increasing in the last couple of decades, which is partly due to the increased prevalence of diabetes and hypertension. To accurately detect CKD in diabetic patients, we propose a novel framew… ▽ More

    Submitted 17 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted in IEEE BIBM 2020

  3. arXiv:2010.16052  [pdf, other

    eess.SP cs.AI cs.HC cs.LG stat.ML

    HHAR-net: Hierarchical Human Activity Recognition using Neural Networks

    Authors: Mehrdad Fazli, Kamran Kowsari, Erfaneh Gharavi, Laura Barnes, Afsaneh Doryab

    Abstract: Activity recognition using built-in sensors in smart and wearable devices provides great opportunities to understand and detect human behavior in the wild and gives a more holistic view of individuals' health and well being. Numerous computational methods have been applied to sensor streams to recognize different daily activities. However, most methods are unable to capture different layers of act… ▽ More

    Submitted 10 November, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: Accepted in IHCI2020

  4. arXiv:2006.07187  [pdf, other

    eess.IV cs.AI cs.CV cs.LG stat.ML

    HMIC: Hierarchical Medical Image Classification, A Deep Learning Approach

    Authors: Kamran Kowsari, Rasoul Sali, Lubaina Ehsan, William Adorno, Asad Ali, Sean Moore, Beatrice Amadi, Paul Kelly, Sana Syed, Donald Brown

    Abstract: Image classification is central to the big data revolution in medicine. Improved information processing methods for diagnosis and classification of digital medical images have shown to be successful via deep learning approaches. As this field is explored, there are limitations to the performance of traditional supervised classifiers. This paper outlines an approach that is different from the curre… ▽ More

    Submitted 23 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Journal ref: Information 11, no. 6 (2020): 318

  5. arXiv:2006.06627  [pdf, other

    cs.LG cs.CV eess.IV q-bio.TO stat.ML

    Diagnosis and Analysis of Celiac Disease and Environmental Enteropathy on Biopsy Images using Deep Learning Approaches

    Authors: Kamran Kowsari

    Abstract: Celiac Disease (CD) and Environmental Enteropathy (EE) are common causes of malnutrition and adversely impact normal childhood development. Both conditions require a tissue biopsy for diagnosis and a major challenge of interpreting clinical biopsy images to differentiate between these gastrointestinal diseases is striking histopathologic overlap between them. In the current study, we propose four… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: PhD dissertation, Univ Virginia (May 2020)

  6. arXiv:2004.06518  [pdf, other

    cs.SI cs.AI cs.CL cs.LG stat.ML

    Gender Detection on Social Networks using Ensemble Deep Learning

    Authors: Kamran Kowsari, Mojtaba Heidarysafa, Tolu Odukoya, Philip Potter, Laura E. Barnes, Donald E. Brown

    Abstract: Analyzing the ever-increasing volume of posts on social media sites such as Facebook and Twitter requires improved information processing methods for profiling authorship. Document classification is central to this task, but the performance of traditional supervised classifiers has degraded as the volume of social media has increased. This paper addresses this problem in the context of gender dete… ▽ More

    Submitted 9 September, 2020; v1 submitted 13 April, 2020; originally announced April 2020.

  7. arXiv:1912.03804  [pdf, other

    cs.CL cs.IR

    Women in ISIS Propaganda: A Natural Language Processing Analysis of Topics and Emotions in a Comparison with Mainstream Religious Group

    Authors: Mojtaba Heidarysafa, Kamran Kowsari, Tolu Odukoya, Philip Potter, Laura E. Barnes, Donald E. Brown

    Abstract: Online propaganda is central to the recruitment strategies of extremist groups and in recent years these efforts have increasingly extended to women. To investigate ISIS' approach to targeting women in their online propaganda and uncover implications for counterterrorism, we rely on text mining and natural language processing (NLP). Specifically, we extract articles published in Dabiq and Rumiyah… ▽ More

    Submitted 8 December, 2019; originally announced December 2019.

  8. arXiv:1910.03084  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM stat.ML

    CeliacNet: Celiac Disease Severity Diagnosis on Duodenal Histopathological Images Using Deep Residual Networks

    Authors: Rasoul Sali, Lubaina Ehsan, Kamran Kowsari, Marium Khan, Christopher A. Moskaluk, Sana Syed, Donald E. Brown

    Abstract: Celiac Disease (CD) is a chronic autoimmune disease that affects the small intestine in genetically predisposed children and adults. Gluten exposure triggers an inflammatory cascade which leads to compromised intestinal barrier function. If this enteropathy is unrecognized, this can lead to anemia, decreased bone density, and, in longstanding cases, intestinal cancer. The prevalence of the disorde… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: accepted at IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM 2019)

  9. arXiv:1904.08067  [pdf, other

    cs.LG cs.AI cs.CL cs.IR stat.ML

    Text Classification Algorithms: A Survey

    Authors: Kamran Kowsari, Kiana Jafari Meimandi, Mojtaba Heidarysafa, Sanjana Mendu, Laura E. Barnes, Donald E. Brown

    Abstract: In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine learning approaches have achieved surpassing results in natural language processing. The success of these learning algorithms relies on their capacity to understa… ▽ More

    Submitted 20 May, 2020; v1 submitted 16 April, 2019; originally announced April 2019.

  10. arXiv:1904.05773  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM stat.ML

    Diagnosis of Celiac Disease and Environmental Enteropathy on Biopsy Images Using Color Balancing on Convolutional Neural Networks

    Authors: Kamran Kowsari, Rasoul Sali, Marium N. Khan, William Adorno, S. Asad Ali, Sean R. Moore, Beatrice C. Amadi, Paul Kelly, Sana Syed, Donald E. Brown

    Abstract: Celiac Disease (CD) and Environmental Enteropathy (EE) are common causes of malnutrition and adversely impact normal childhood development. CD is an autoimmune disorder that is prevalent worldwide and is caused by an increased sensitivity to gluten. Gluten exposure destructs the small intestinal epithelial barrier, resulting in nutrient mal-absorption and childhood under-nutrition. EE also results… ▽ More

    Submitted 9 October, 2019; v1 submitted 10 April, 2019; originally announced April 2019.

  11. arXiv:1811.06193  [pdf, other

    cs.CV cs.AI cs.MM cs.RO cs.SE

    From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition

    Authors: Mojtaba Heidarysafa, James Reed, Kamran Kowsari, April Celeste R. Leviton, Janet I. Warren, Donald E. Brown

    Abstract: Tracking users' activities on the World Wide Web (WWW) allows researchers to analyze each user's internet behavior as time passes and for the amount of time spent on a particular domain. This analysis can be used in research design, as researchers may access to their participant's behaviors while browsing the web. Web search behavior has been a subject of interest because of its real-world applica… ▽ More

    Submitted 19 May, 2020; v1 submitted 15 November, 2018; originally announced November 2018.

  12. arXiv:1810.07382  [pdf, other

    cs.CL cs.IR cs.LG cs.NE stat.ML

    Analysis of Railway Accidents' Narratives Using Deep Learning

    Authors: Mojtaba Heidarysafa, Kamran Kowsari, Laura E. Barnes, Donald E. Brown

    Abstract: Automatic understanding of domain specific texts in order to extract useful relationships for later use is a non-trivial task. One such relationship would be between railroad accidents' causes and their correspondent descriptions in reports. From 2001 to 2016 rail accidents in the U.S. cost more than $4.6B. Railroads involved in accidents are required to submit an accident report to the Federal Ra… ▽ More

    Submitted 20 May, 2020; v1 submitted 17 October, 2018; originally announced October 2018.

    Comments: accepted in IEEE International Conference on Machine Learning and Applications (IEEE ICMLA)

  13. arXiv:1810.04793  [pdf, other

    q-bio.QM cs.AI cs.IR cs.LG stat.ML

    Patient2Vec: A Personalized Interpretable Deep Representation of the Longitudinal Electronic Health Record

    Authors: Jinghe Zhang, Kamran Kowsari, James H. Harrison, Jennifer M. Lobo, Laura E. Barnes

    Abstract: The wide implementation of electronic health record (EHR) systems facilitates the collection of large-scale health data from real clinical settings. Despite the significant increase in adoption of EHR systems, this data remains largely unexplored, but presents a rich data source for knowledge discovery from patient health histories in tasks such as understanding disease correlations and predicting… ▽ More

    Submitted 25 October, 2018; v1 submitted 10 October, 2018; originally announced October 2018.

    Comments: Accepted by IEEE Access

  14. arXiv:1808.08121  [pdf

    cs.LG cs.CV cs.IR cs.NE stat.ML

    An Improvement of Data Classification Using Random Multimodel Deep Learning (RMDL)

    Authors: Mojtaba Heidarysafa, Kamran Kowsari, Donald E. Brown, Kiana Jafari Meimandi, Laura E. Barnes

    Abstract: The exponential growth in the number of complex datasets every year requires more enhancement in machine learning methods to provide robust and accurate data classification. Lately, deep learning approaches have achieved surpassing results in comparison to previous machine learning algorithms. However, finding the suitable structure for these models has been a challenge for researchers. This paper… ▽ More

    Submitted 22 August, 2018; originally announced August 2018.

    Comments: published in International Journal of Machine Learning and Computing (IJMLC). arXiv admin note: substantial text overlap with arXiv:1805.01890

  15. arXiv:1805.01890  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    RMDL: Random Multimodel Deep Learning for Classification

    Authors: Kamran Kowsari, Mojtaba Heidarysafa, Donald E. Brown, Kiana Jafari Meimandi, Laura E. Barnes

    Abstract: The continually increasing number of complex datasets each year necessitates ever improving machine learning methods for robust and accurate categorization of these data. This paper introduces Random Multimodel Deep Learning (RMDL): a new ensemble, deep learning approach for classification. Deep learning models have achieved state-of-the-art results across many domains. RMDL solves the problem of… ▽ More

    Submitted 31 May, 2018; v1 submitted 3 May, 2018; originally announced May 2018.

    Comments: Best Paper award ACM ICISDM

  16. arXiv:1709.09268  [pdf, other

    cs.LG cs.AI stat.AP stat.ML

    FSL-BM: Fuzzy Supervised Learning with Binary Meta-Feature for Classification

    Authors: Kamran Kowsari, Nima Bari, Roman Vichr, Farhad A. Goodarzi

    Abstract: This paper introduces a novel real-time Fuzzy Supervised Learning with Binary Meta-Feature (FSL-BM) for big data classification task. The study of real-time algorithms addresses several major concerns, which are namely: accuracy, memory consumption, and ability to stretch assumptions and time complexity. Attaining a fast computational model providing fuzzy logic and supervised learning is one of t… ▽ More

    Submitted 15 November, 2017; v1 submitted 26 September, 2017; originally announced September 2017.

    Comments: FICC2018

  17. arXiv:1709.08267  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.IR

    HDLTex: Hierarchical Deep Learning for Text Classification

    Authors: Kamran Kowsari, Donald E. Brown, Mojtaba Heidarysafa, Kiana Jafari Meimandi, Matthew S. Gerber, Laura E. Barnes

    Abstract: The continually increasing number of documents produced each year necessitates ever improving information processing methods for searching, retrieving, and organizing text. Central to these information processing methods is document classification, which has become an important application for supervised learning. Recently the performance of these traditional classifiers has degraded as the number… ▽ More

    Submitted 6 October, 2017; v1 submitted 24 September, 2017; originally announced September 2017.

    Comments: ICMLA 2017

  18. arXiv:1704.07468  [pdf, other

    cs.LG cs.AI cs.CC cs.CL cs.DS

    GaKCo: a Fast GApped k-mer string Kernel using COunting

    Authors: Ritambhara Singh, Arshdeep Sekhon, Kamran Kowsari, Jack Lanchantin, Beilun Wang, Yanjun Qi

    Abstract: String Kernel (SK) techniques, especially those using gapped $k$-mers as features (gk), have obtained great success in classifying sequences like DNA, protein, and text. However, the state-of-the-art gk-SK runs extremely slow when we increase the dictionary size ($Σ$) or allow more mismatches ($M$). This is because current gk-SK uses a trie-based algorithm to calculate co-occurrence of mismatched… ▽ More

    Submitted 18 September, 2017; v1 submitted 24 April, 2017; originally announced April 2017.

    Comments: @ECML 2017

  19. arXiv:1602.05920  [pdf, other

    cs.CV cs.GR cs.LG cs.MM cs.RO

    Weighted Unsupervised Learning for 3D Object Detection

    Authors: Kamran Kowsari, Manal H. Alassaf

    Abstract: This paper introduces a novel weighted unsupervised learning for object detection using an RGB-D camera. This technique is feasible for detecting the moving objects in the noisy environments that are captured by an RGB-D camera. The main contribution of this paper is a real-time algorithm for detecting each object using weighted clustering as a separate cluster. In a preprocessing step, the algori… ▽ More

    Submitted 4 June, 2018; v1 submitted 18 February, 2016; originally announced February 2016.

    Comments: IJACSA

  20. arXiv:1503.06483  [pdf

    cs.DB cs.AI cs.DS cs.IR cs.LG

    Construction of FuzzyFind Dictionary using Golay Coding Transformation for Searching Applications

    Authors: Kamran Kowsari, Maryam Yammahi, Nima Bari, Roman Vichr, Faisal Alsaby, Simon Y. Berkovich

    Abstract: Searching through a large volume of data is very critical for companies, scientists, and searching engines applications due to time complexity and memory complexity. In this paper, a new technique of generating FuzzyFind Dictionary for text mining was introduced. We simply mapped the 23 bits of the English alphabet into a FuzzyFind Dictionary or more than 23 bits by using more FuzzyFind Dictionary… ▽ More

    Submitted 22 March, 2015; originally announced March 2015.

  21. arXiv:1503.00245  [pdf

    cs.DB cs.AI cs.IR cs.MM

    Novel Metaknowledge-based Processing Technique for Multimedia Big Data clustering challenges

    Authors: Nima Bari, Roman Vichr, Kamran Kowsari, Simon Y. Berkovich

    Abstract: Past research has challenged us with the task of showing relational patterns between text-based data and then clustering for predictive analysis using Golay Code technique. We focus on a novel approach to extract metaknowledge in multimedia datasets. Our collaboration has been an on-going task of studying the relational patterns between datapoints based on metafeatures extracted from metaknowledge… ▽ More

    Submitted 1 March, 2015; originally announced March 2015.

    Comments: IEEE Multimedia Big Data (BigMM 2015)

  22. arXiv:1503.00244  [pdf

    cs.DB cs.AI cs.IR cs.LG

    23-bit Metaknowledge Template Towards Big Data Knowledge Discovery and Management

    Authors: Nima Bari, Roman Vichr, Kamran Kowsari, Simon Y. Berkovich

    Abstract: The global influence of Big Data is not only growing but seemingly endless. The trend is leaning towards knowledge that is attained easily and quickly from massive pools of Big Data. Today we are living in the technological world that Dr. Usama Fayyad and his distinguished research fellows discussed in the introductory explanations of Knowledge Discovery in Databases (KDD) predicted nearly two dec… ▽ More

    Submitted 1 March, 2015; originally announced March 2015.

    Comments: IEEE Data Science and Advanced Analytics (DSAA'2014)

  23. arXiv:1312.6117   

    cs.LG

    Comparison three methods of clustering: k-means, spectral clustering and hierarchical clustering

    Authors: Kamran Kowsari

    Abstract: Comparison of three kind of the clustering and find cost function and loss function and calculate them. Error rate of the clustering methods and how to calculate the error percentage always be one on the important factor for evaluating the clustering methods, so this paper introduce one way to calculate the error rate of clustering methods. Clustering algorithms can be divided into several categor… ▽ More

    Submitted 13 November, 2014; v1 submitted 19 December, 2013; originally announced December 2013.

    Comments: This paper has been withdrawn by the author due to improve add more results

    MSC Class: 68T10 ACM Class: H.3.3; I.5.3