1 Introduction

Parkinson's disease is a chronic, progressive neurodegenerative disorder that impairs the performance of daily activities [1]. The disease is primarily characterized by motor symptoms such as tremor, slowness of movement (bradykinesia), muscle rigidity, and postural instability [2,3,4]. In addition, patients experience nonmotor symptoms that further increase disability levels [5]. Disease severity is closely associated with disability, with loss of independent function reported at specific score ranges on the Unified Parkinson's Disease Rating Scale (UPDRS) [6]. Unfortunately, the disease has no cure, but medications and surgical treatments can significantly alleviate its symptoms and enhance patients' quality of life [7]. According to the World Health Organisation, Parkinson's disease can lead to physical disability, which highlights the importance of early intervention and rehabilitation to mitigate disability and improve long-term outcomes.

Artificial Intelligence (AI) has become increasingly pervasive across diverse domains, revolutionizing industries through applications in finance, data analysis, smart agriculture, natural language processing, sentiment analysis [8, 9], medicine [10], healthcare [11] and more. Traditionally, disease diagnosis has relied on the experience of physicians, but thanks to tremendous advances in computational power and hardware, AI and Machine Learning (ML) algorithms now play a critical role in the medical field [10] and in decision support systems that provide valuable insights and recommendations to healthcare professionals. Furthermore, these technologies have revolutionized treatment strategies and advanced medical diagnostics for many diseases, including cardiovascular diagnosis [12, 13], breast cancer [14, 15], diabetes [16,17,18,19], brain tumours [20, 21], liver illnesses [22], COVID-19 [23, 24], skin disease classification [25], and Alzheimer's and Parkinson's disease [26], among others.

AI techniques and ML algorithms exploit the measurable differences between healthy people and people with Parkinson's disease to predict the illness, leveraging data analysis, pattern recognition and predictive modelling to identify early signs, monitor progression and develop treatment approaches. Pen-pressure variation and drawing speed are signs of hand tremor caused by the patients' impaired control of movement [27,28,29]. Accordingly, this paper proposes a hybrid classifier for early prediction of Parkinson's disease via hand tremors based on drawing spirals and waveforms.

The spiral-wave dataset used in this work is relatively small, while Convolutional Neural Networks (CNNs) typically require large amounts of data for effective training. Even when data augmentation is employed, a CNN model may still overfit the training dataset, achieving high accuracy on the training set but poor performance on unseen data, because the model can learn the noise or outliers in the training dataset. Several studies in the literature have proposed classifiers to predict Parkinson's disease from drawings (or sketches). These classifiers are often based on transfer learning (TL) and fine-tuning of pre-trained models such as VGG16, VGG19 and ResNet50. Additionally, some studies have proposed ML classifiers that predict Parkinson's disease using features extracted manually from drawing images or through separate feature-engineering pipelines. This work therefore aims to automate feature extraction using convolutional layers, reduce computational complexity and increase classification speed by reducing the number of trainable parameters (VGG16, for example, has about 138 million parameters), and enhance performance by combining deep learning techniques with ML algorithms.

In this work, we developed, trained and evaluated two Convolutional Neural Network (CNN) models to predict PD using a spiral and wave dataset. Using the transfer learning (TL) approach, we froze their convolutional layers and replaced the last layers (the fully connected artificial neural network) with K-Nearest Neighbours (KNN) classifiers, exploiting the advantages of both DL and ML to build strong hybrid CNN-KNN models for predicting PD, reduce computational complexity and increase classification speed. After training and evaluation, these CNN-KNN architectures predicted faster and more accurately than the first two models. Based on the last two models (Spiral-CNN-KNN and Wave-CNN-KNN), we propose an ensemble voting classifier that predicts Parkinson's disease from the average of the prediction probabilities of the spiral and wave drawings. The contributions of this work can be summarized as follows:

  • Provide insight into the possibility of diagnosing Parkinson's disease from hand sketching, based on tremors, in the early stages, which is crucial to enhancing the quality of life of individuals suffering from this disease.

  • Present a summary of the available databases for training disease-diagnosis models, supporting decision-makers in predicting Parkinson's disease in its early stages so that early intervention and rehabilitation can mitigate disability and improve long-term outcomes.

  • Provide the best adjustment of data augmentation parameters to increase the limited training dataset's size and diversity, improving the generalization and robustness of a model while avoiding the deformation of the data, which would otherwise degrade performance.

  • Build an ensemble classifier for predicting Parkinson's disease based on a hybrid architecture that combines DL and ML using TL to enhance the prediction.

  • Clarify the seriousness of misclassification in the medical field and propose solutions to address these critical problems, which can be fatal, primarily when the model classifies a person as healthy while he/she is actually carrying the disease.

This paper is organized as follows: Sect. 2 provides an overview of the state of the art concerning the available approaches for diagnosing Parkinson's disease and the related works. Sect. 3 details the materials and methodology employed to develop and evaluate the proposed classifier. The results of the development and evaluation processes and the findings are presented in Sect. 4. Finally, Sect. 5 concludes the paper with the research outcomes and essential findings discussed throughout the previous sections.

2 Related work

Diagnosing Parkinson's disease often involves a multi-faceted approach, which includes conducting a thorough physical examination, evaluating neurological symptoms, and utilizing imaging techniques like MRI or CT scans to identify possible abnormalities in the brain. Sometimes, healthcare professionals may also employ dopamine transporter imaging and genetic testing as supplementary diagnostic tools. Laboratory tests may be conducted to eliminate the possibility of other conditions presenting similar symptoms. Typically, the diagnosis of Parkinson's disease is established by a specialist, such as a neurologist, and this process may entail multiple appointments and tests to validate the diagnosis and establish an optimal treatment plan.

Recently, Artificial Intelligence techniques and Machine Learning algorithms have played a primary role in medical diagnostics, improving the diagnosis of many diseases and, in particular, of Parkinson's disease through signs and symptoms such as speech disorders [30,31,32], handwriting disorders [33], EEG signals [34, 35], hand tremor [36], nocturnal breathing signals [37], smell signatures [38], early diagnosis using brain MRI scans [39, 40], urine biomarker discovery via metabolomics and machine learning [41], and sketching of spirals and waves.

Patients with Parkinson's disease (PD) face challenges in executing motor-based tasks, such as writing and drawing, due to alterations in the neuronal mechanisms that control body (or limb) movement. This symptom has therefore attracted the attention of the research community, which has built several handwriting and drawing datasets in order to find patterns, discover contrasts between PD patients and healthy people, and use these differences for early PD prediction. PaHaW (Parkinson's Disease Handwriting) is a composite handwriting and drawing dataset collected from 75 persons (PD/healthy: 37/38; 19 males and 18 females / 20 males and 18 females, respectively). The dataset acquisition was a collaborative effort involving St. Anne's University Hospital in Brno, Czech Republic, and the Movement Disorders Centre at the First Department of Neurology, Masaryk University [42, 43]. HandPD is a dataset of spiral and meander sketching collected from 92 participants (PD/healthy: 74/18; 59 males and 15 females / 6 males and 12 females) at Botucatu Medical School, São Paulo State University, Brazil. It contains 368 composite spiral and meander samples, obtained by asking every volunteer to repeat each drawing four times. NewHandPD, an upgraded version of HandPD, is a spiral, meander, circle and signal dataset collected from 66 participants (PD/healthy: 31/35; 21 males and 10 females / 18 males and 17 females) by asking every person to repeat the spiral, meander and circle drawings four times, while the signals (handwritten dynamics) were gathered with the smart pen used for sketching. Both HandPD and NewHandPD are available online (Footnote 1).

Moreover, Spiral and Wave (Footnote 2) is a composite dataset comprising 102 samples (PD/healthy: 51/51), in which the participants were asked to draw an Archimedean spiral and a sinusoidal wave [44]. Many other Parkinson's-disease datasets exist that we do not detail here, such as cube-triangle, handwriting, acoustic [45], accelerometer and gyroscope datasets.

The study [46] aimed to differentiate individuals at different stages of Parkinson's disease by analyzing speed and pen pressure in sketches. The authors recruited and assessed 55 volunteers (27 PD patients and 28 healthy persons), asking each to draw a spiral on an A3 sheet. They extracted features from the sketches and established a correlation factor with Parkinson's disease severity. The study validated the methods using the Mann–Whitney test, revealing a significant difference in the correlation factor across Parkinson's disease stages.

The authors of the study [47] developed two CNN architectures for classifying Parkinson's disease using two separate spiral and wave sketch datasets. They then built an ensemble voting classifier based on two sub-classifiers, Random Forest (RF) and Logistic Regression (LR), and trained this ensemble using the prediction probabilities of the two CNN architectures. The model provided an accuracy of 93.3%, a sensitivity of 94% and a precision of 93.5%.

The paper [48] compared three approaches for predicting Parkinson's disease via hand tremor using two datasets, Spiral-Wave and Cube-Triangle. In the first approach, the architectures of Inception-ResNet-v2, Xception, Inception-v3, MobileNet-v2, ResNet50 and VGG19 were trained from scratch with random weights. In the second, Transfer Learning (TL) with fine-tuning was applied to the same datasets and pre-trained models. In the last, two shallow CNNs were built, trained and evaluated. The authors found that TL with fine-tuning provided the best accuracies, 91.6% and 100% on the first and second datasets, respectively.

In paper [49], the authors used the transfer learning (TL) approach to fine-tune the pre-trained VGG-19 model and adapted it for predicting Parkinson's disease using the spiral-wave dataset. The model was trained and evaluated with fourfold and tenfold cross-validation. With fourfold cross-validation, it achieved accuracies of 86.5% and 87.3% on the spiral and wave sets, respectively, while with tenfold cross-validation it reached 88.5% on the spiral set and 88% on the wave set.

Also, Drotar published several studies predicting Parkinson's disease with different techniques and classifiers [42, 43, 50,51,52,53]. Pereira likewise focused on analysing handwriting movements and extracting features for predicting Parkinson's disease using different deep-learning models and machine-learning algorithms [54,55,56,57].

The study [58] introduces a system for PD diagnosis based on pre-trained CNN models, transfer learning, and bilinear pooling. The study uses CNN architectures such as EfficientNet-B0, MobileNet-V2, and a custom CNN model, initially trained on ImageNet and adapted with TL. These are combined with bilinear pooling, forming three Bilinear CNN models applied to DaTSCAN images of PD. Using 2720 images from the Parkinson's Progression Marker Initiative (PPMI) dataset, the Bilinear CNN EfficientNet-B0-MobileNet-V2 model achieved the highest accuracy of 98.47% compared with other methods.

The study [44] sought to enhance the objective assessment of tremor in Parkinson's disease by incorporating histograms of oriented gradients (HOG) in the analysis of sinusoidal and spiral handwriting patterns. To automate tremor detection in participants with Parkinson's disease, the authors employed the HOG descriptor to extract features from spiral and wave drawings and fed them to four classifiers: a support vector machine, K-Nearest Neighbours (KNN), a random forest and a one-dimensional CNN. The latter reached an accuracy of 83.1%, the best among the classifiers used in the study.

The scientific research community continues to seek robust models for the early prediction of Parkinson's disease using different datasets, techniques and classifiers [59, 60] in order to improve the quality of life of people with the illness. In this section, we reviewed related work and datasets that reflect the state of the art in this field. Table 1 compares several studies to highlight the techniques and approaches mentioned in this section.

Table 1 Comparison between the classifiers and approaches of related works in the literature

3 Materials and methods

3.1 Disability levels and stages of Parkinson’s disease

Parkinson's disease is a progressive neurological disorder that impacts daily life and can lead to disability [1] in its advanced stages. According to the Parkinson's Foundation, the progression of Parkinson's disease can be summarised into five stages.

In stage 1, individuals exhibit mild symptoms that predominantly affect one side of the body. These symptoms can include tremors, changes in posture, walking difficulties, and alterations in facial expressions. As they progress to stage 2, these symptoms intensify, impacting both sides of the body or the midline, manifesting as challenges in walking and maintaining proper posture. However, living independently remains feasible at this stage, although everyday tasks become increasingly cumbersome.

By stage 3, considered the mid-stage, the hallmark becomes a notable loss of balance, especially during turns or when pushed. Falls become more frequent, and while motor symptoms continue to escalate, the individual can still maintain an independent lifestyle, albeit with growing restrictions in their daily activities. Disabilities are mild to moderately pronounced. In stage 4, symptoms reach a fully developed state and become severely incapacitating. Individuals can still stand and walk without aid but may rely on devices like canes or walkers for safety. They require considerable assistance for daily tasks, making independent living unviable. Finally, in stage 5, the most debilitating phase, stiffness in the legs can render standing or walking impossible. Individuals become either bedridden or confined to a wheelchair and need external assistance. Constant care becomes imperative for all daily activities.

3.2 Proposed classifier global overview

Unfortunately, Parkinson's disease has no cure, but medications and surgical treatments can significantly alleviate its symptoms [7]. Therefore, early diagnosis is crucial to enhance the quality of life of individuals suffering from this disease. One of the early symptoms of Parkinson's disease is hand tremor, and for early diagnosis we rely on a composite index of drawing speed and pen pressure captured in the Spiral-Wave dataset. In this work, we propose an ensemble voting classifier based on a hybrid CNN-KNN architecture to predict Parkinson's disease via spiral-wave drawing.

As shown in Fig. 1, the flowchart represents the stages and techniques used to reach the target. The first step of the development process is obtaining, processing and splitting the dataset. Next, we augmented this dataset to train the model and evaluate its performance. Then, hyperparameter tuning was used to obtain the best architecture for the model, helping it to learn effectively on the training set and provide optimal performance on an unseen dataset.

Fig. 1
figure 1

Flowchart of developing process of a proposed classifier to predict PD via hand tremor

3.3 Dataset description

The Spiral and Wave dataset used in this study is available on the Kaggle repository. It has 102 composite samples (102 spiral images and 102 wave images) of two classes, Parkinson's and healthy (PD/healthy: 51/51), with a mean age in years (mean ± SD) of 67.65 ± 9.10 for PD and 67.05 ± 8.40 for healthy participants. According to the study [44], the original dataset was created by Paulo Folador and his partners at the Federal University of Uberlândia, Brazil. Figure 2 shows samples of the spiral-wave drawings of the two classes, Parkinson's and healthy.

Fig. 2
figure 2

Samples from spiral and wave sets of the two classes

The creator of this dataset was inspired by the study [46], which found that Parkinson's disease could be detected by asking the patient to draw a spiral and then tracking the drawing speed and the pen pressure. The authors mentioned that the drawing speed was slower and the pen pressure lower among Parkinson's patients, and that this was especially pronounced for patients with more acute or advanced forms of the disease [61].

The dataset consists of 204 pre-processed images and is pre-split into training and testing sets: the spiral set has 102 images (72 training, 30 testing) and the wave set has 102 images (72 training, 30 testing). Table 2 summarizes the dataset type, splitting, age average and number of participants, including the people with Parkinson's disease and the control group (healthy).

Table 2 Description of spiral and wave dataset of Parkinson’s disease

3.4 Dataset visualization

Hand tremor, drawing speed and pen pressure may vary from person to person [46]. A skeleton image can be generated to understand the dissimilarities between the spiral-wave drawings of the two classes. This skeleton image can then be transformed into a new data frame, where each row represents the coordinates of a non-zero pixel in the respective image.

Creating these skeletons involves several steps: each image in the dataset is read as grayscale, and each pixel is subtracted from one to invert the image, turning the background black and the sketch white. The image is then smoothed with a median filter and thresholded using Yen's method. These operations refine and segment the images, ensuring that the drawings are distinctly extracted and that the drawing pixels are separated from the noise. The final operation is skeletonization using the pre-developed algorithm from the skimage package. For more details, the source code is available on the Kaggle repository (Footnote 3).
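To make the pipeline concrete, the sketch below reproduces these steps with scikit-image; it is a minimal illustration, and the file path is a hypothetical placeholder rather than a path from the repository.

```python
# A minimal sketch of the skeletonization pipeline described above,
# assuming scikit-image (skimage) is installed.
import numpy as np
from skimage import io, filters, morphology

def skeleton_coordinates(path):
    img = io.imread(path, as_gray=True)          # grayscale in [0, 1]
    inverted = 1.0 - img                         # black background, white sketch
    smoothed = filters.median(inverted)          # suppress scanning noise
    binary = smoothed > filters.threshold_yen(smoothed)  # Yen's threshold
    skeleton = morphology.skeletonize(binary)    # one-pixel-wide skeleton
    # Each row of the returned array holds the (row, col) coordinates of a
    # non-zero skeleton pixel, as used for the data-frame visualization.
    return np.column_stack(np.nonzero(skeleton))

coords = skeleton_coordinates("spiral/healthy/V01HE01.png")  # hypothetical path
```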

Figure 3 shows the results of the skeletonization process, reflecting the entire dataset in one image. As we can see, all drawings start and finish at a single point. The drawings of the control group (healthy individuals) are more regular and smoother than those of the people with Parkinson's disease, and classification algorithms can detect these tremors or irregularities to predict Parkinson's disease.

Fig. 3
figure 3

Visualization of dissimilarities of spiral and wave dataset’s classes (healthy and Parkinson) using skeletonization approach

3.5 Dataset augmentation

The spiral images and wave images used in this study were resized to 256 × 256 pixels and 256 × 512 pixels (height × width), respectively. To prepare the dataset for training, increase its diversity, improve performance on unseen examples and enhance the reliability of deep learning (DL) models, we applied data augmentation, a valuable technique that helps combat overfitting, enhances model generalization [62] and extracts the optimal benefit from this limited Spiral-Wave dataset [63]. For this task, we used the pre-developed data augmentation model from TensorFlow. The augmentation parameters were selected based on previous studies [47] and on the functionality of each parameter for enriching the drawing dataset [63], with conditions that prevent deformation of the images, using a trial-and-error approach. The parameters are shifting (top, bottom, right and left), brightness range, shearing range, zoom-in, zoom-out and rotation in both directions. The remaining parameters, such as horizontal and vertical flip, were left at their default values, which do not change the original image. Table 3 lists the parameters applied to both the spiral and wave image augmentation models.

Table 3 Data augmentation parameters of spiral and wave images

We generated fifteen new images from every original image in the training set, giving ((15 × 72) + 72 = 1152) spiral images and 1152 wave images for training, while the testing set (30 + 30 = 60 images) remained unchanged. Every generated image is unique thanks to the random combination of parameters drawn from the mentioned ranges.
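The sketch below illustrates how such an augmentation set-up can be expressed with TensorFlow/Keras' ImageDataGenerator; it is a minimal illustration, and the numeric values are stand-ins for the ranges reported in Table 3 rather than the exact values used in this work.

```python
# A minimal sketch of the augmentation set-up; parameter values are illustrative.
import numpy as np
from tensorflow.keras.preprocessing.image import ImageDataGenerator

augmenter = ImageDataGenerator(
    rotation_range=15,            # rotation in both directions
    width_shift_range=0.05,       # left/right shifting
    height_shift_range=0.05,      # top/bottom shifting
    shear_range=0.05,             # shearing range
    zoom_range=0.10,              # zoom-in and zoom-out
    brightness_range=(0.8, 1.2),  # brightness range
    fill_mode="nearest",
)

def augment(image, n_variants=15):
    """Return the original image plus n_variants randomly augmented copies."""
    flow = augmenter.flow(np.expand_dims(image, axis=0), batch_size=1)
    return [image] + [next(flow)[0] for _ in range(n_variants)]
```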

As the results of this augmentation in Fig. 4a show, some spiral-wave images were distorted and some lost part of their content because the sketches are close to the image borders. To avoid this deformation, we propose padding the images before applying the data augmentation. If padding P is applied to an image of size W × H pixels, the output image has dimensions (W + 2P) × (H + 2P). Because we aimed to preserve all the information in the middle and on the borders of the images, we added 25 pixels (P = 25) on the left, right, top and bottom of all images, using a constant colour of (239, 239, 239) for the (R, G, B) channels. Figure 4b and Fig. 4c show the padding approach and the results of this process, respectively.

Fig. 4
figure 4

The proposed solution for deformation of images. (a): displays an example of images deformation while data augmentation, (b): Images Padding approach and (c): Original images versus Padded Images of spiral and wave dataset, twenty-five pixels were added to each side with constant colours equal to 239 for each channel (RGB)

After padding the images, we repeated the data augmentation with the same parameter settings listed in Table 3. Both the original (non-padded) images and the padded images are used for training and evaluating our models so that their effect on performance can be compared.

As mentioned previously, twenty-five pixels were added to each side (top, bottom, right and left) of the original image with a constant colour of 239 for each (RGB) channel to match the background as closely as possible. Figure 4c shows two examples of the original and padded images. The padded images were then resized to 256 × 256 and 256 × 512 for the spiral and wave images, respectively. As the examples show, the images retained most of their characteristics and resolution thanks to the minor change they underwent.
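The padding step itself can be illustrated with a few lines of OpenCV, as sketched below; the file paths and names are hypothetical placeholders.

```python
# A minimal sketch of the padding step, assuming OpenCV (cv2) is available.
import cv2

P = 25                          # pixels added to each side
BACKGROUND = (239, 239, 239)    # constant grey background

img = cv2.imread("spiral/healthy/V01HE01.png")            # hypothetical path
padded = cv2.copyMakeBorder(img, P, P, P, P,
                            borderType=cv2.BORDER_CONSTANT, value=BACKGROUND)
# A W x H image becomes (W + 2P) x (H + 2P); it is then resized back to the
# model's input size (256 x 256 for spirals, 256 x 512 for waves).
resized = cv2.resize(padded, (256, 256))
cv2.imwrite("spiral_padded/healthy/V01HE01.png", resized)
```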

3.6 Performance evaluation

The confusion matrix is especially valuable because it does not provide just a single metric, like accuracy, but offers insight into the types of mistakes the model makes. The performance of all classifiers used in this paper was therefore evaluated using multiple criteria, including accuracy, specificity, sensitivity, precision, F1-score, and the Matthews correlation coefficient (MCC). All these key indicators are calculated from the confusion matrix in Table 4.

Table 4 Confusion matrix

In the context of a classifier for Parkinson's disease, several vital terms [64] describe its performance. True Positives (TP) are cases where the classifier accurately identifies a patient with Parkinson's; the actual condition and the classifier's prediction concur in diagnosing the disease. Conversely, True Negatives (TN) represent instances where the classifier correctly confirms that an individual does not have Parkinson's and is genuinely healthy. However, not all predictions are accurate. False Positives (FP) occur when the classifier mistakenly indicates that a healthy person has Parkinson's; such errors can lead to unwarranted stress and potentially unnecessary medical interventions. On the other hand, False Negatives (FN) represent a scenario where the classifier overlooks the presence of Parkinson's in a patient who indeed has the disease, potentially leading to a lack of necessary medical attention. In medical diagnostics, minimizing both FP and FN is paramount, given the significant ramifications they can have on patient care and well-being.

Several performance metrics, such as accuracy, precision, recall, and the F1 score, can be derived from the confusion matrix to evaluate the classifier's performance more holistically, as illustrated by the short computational sketch following the list below.

  • Accuracy: a metric that quantifies the overall correctness of the classifier in predicting the actual labels across both the Parkinson's disease class and the healthy class; it is calculated using Eq. 1.

    $${\mathrm{Accuracy}}=\frac{TP+TN}{TP+TN+FP+FN}\times 100$$
    (1)
  • Recall (Sensitivity or True Positive Rate): quantifies the model's capability to capture all actual positive cases (Parkinson's) and to avoid misclassifying positive cases, as defined in Eq. 2.

    $${\mathrm{Sensitivity}}=\frac{TP}{TP+FN}\times 100$$
    (2)
  • Precision: provides insight into the reliability of the positive classifications made by the model; it reflects the classifier's ability to avoid misclassifying members of the control group (healthy instances) as Parkinson's, as shown in Eq. 3.

    $${\mathrm{Precision}}=\frac{TP}{TP+FP}\times 100$$
    (3)
  • F1-score: represents the harmonic mean of precision and recall. It offers a balanced measure between the two metrics, which is especially useful with uneven class distributions, and is calculated using Eq. 4.

    $$F1=2\times \frac{{Recall}\times{Precision}}{{Recall}+{Precision}}$$
    (4)
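The short sketch below computes the indicators defined in Eqs. 1–4 from illustrative confusion-matrix counts; it is a plain-Python illustration, not part of the evaluation pipeline.

```python
# A minimal sketch of Eqs. 1-4 from illustrative confusion-matrix counts.
TP, TN, FP, FN = 14, 15, 0, 1

accuracy  = (TP + TN) / (TP + TN + FP + FN) * 100            # Eq. 1
recall    = TP / (TP + FN) * 100                             # Eq. 2 (sensitivity)
precision = TP / (TP + FP) * 100                             # Eq. 3
f1        = 2 * recall * precision / (recall + precision)    # Eq. 4

print(f"{accuracy:.2f} {recall:.2f} {precision:.2f} {f1:.2f}")
# 96.67 93.33 100.00 96.55
```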

3.7 Classifiers development overview

In this work, we propose a Convolutional Neural Network with K-Nearest Neighbours (CNN-KNN) architecture (Fig. 5) instead of the standard Convolutional Neural Network architecture (Fig. 6) to predict Parkinson's disease from hand tremors using spiral and wave sketching. Combining a CNN with traditional ML algorithms can offer several advantages in specific scenarios. CNNs excel at processing grid-structured data such as images or audio, extracting meaningful features through convolutional and pooling layers, whereas traditional ML algorithms are distinguished by their speed and robust performance and are well suited to structured tabular data. We adopted this hybrid architecture to leverage the strengths of both approaches and build a robust and trustworthy model for predicting Parkinson's disease compared with the models in the previous literature.

Fig. 5
figure 5

The proposed Convolutional Neural Network with K-Nearest Neighbours algorithm (CNN-KNN) architecture

Fig. 6
figure 6

Standard architecture of Convolutional Neural Network

3.8 The development of the proposed CNN architecture

Firstly, two convolutional neural network (CNN) models were constructed with identical architectures for the Spiral and Wave datasets, as illustrated in Fig. 7: Fig. 7a depicts the architecture of the spiral model, while Fig. 7b shows the architecture of the wave model. These diagrams show all the hyperparameters, including the size of the input layers, convolution layers, kernels, max pooling, activation functions, and output. We selected these hyperparameters using the pre-developed Random Search approach available in the TensorFlow packages, comparing and sorting the testing results. The search ranges were 8 to 46 for the numbers of filters, 3, 5 or 7 for the kernel size, 'same' for padding, 'sigmoid', 'tanh' or 'ReLU' for the activation functions, 50 to 3000 units for the dense layers and 'SoftMax' for the two-unit output layer. Moreover, we used 'Adam' as the optimizer with a learning rate of 0.00001 and 'sparse categorical cross-entropy' as the loss function. We then trained and evaluated each model separately on the spiral and wave sets and saved these models for later use.
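For illustration, the sketch below expresses an architecture of this shape in Keras; the filter counts and kernel sizes are assumptions (the actual values were selected by random search), while the dense-layer sizes, optimizer, learning rate and loss follow the description above.

```python
# A minimal sketch of a CNN of the described shape: three convolution +
# max-pooling blocks, a flatten layer, 3000/1000/500/100 dense units and a
# two-unit SoftMax output. Filter counts and kernel sizes are illustrative;
# with a 256 x 256 input, the last block (8 filters) flattens to
# 32 * 32 * 8 = 8192 features, matching the number reported in Sect. 3.9.
import tensorflow as tf
from tensorflow.keras import layers, models

def build_cnn(input_shape=(256, 256, 3)):
    model = models.Sequential([
        layers.Input(shape=input_shape),
        layers.Conv2D(16, 3, padding="same", activation="relu"),
        layers.MaxPooling2D(),
        layers.Conv2D(16, 3, padding="same", activation="relu"),
        layers.MaxPooling2D(),
        layers.Conv2D(8, 5, padding="same", activation="relu"),
        layers.MaxPooling2D(),
        layers.Flatten(),
        layers.Dense(3000, activation="relu"),
        layers.Dense(1000, activation="relu"),
        layers.Dense(500, activation="relu"),
        layers.Dense(100, activation="relu"),
        layers.Dense(2, activation="softmax"),
    ])
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-5),
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```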

Fig. 7
figure 7

The proposed CNN models for PD classification based on spiral and wave images. (a) represents the architecture and the hyperparameters of the spiral model, while (b) depicts the architecture of the wave model

3.9 The development of the proposed CNN-KNN architecture

Based on the models in the previous section, we loaded the pre-trained models. These models consist of a sequence of layers: an input layer, three convolutional-plus-pooling blocks, a flatten layer, dense layers of (3000, 1000, 500, 100) units, and one output layer.

To preserve the values of the trained parameters, we froze all layers up to and including the flatten layer and removed all subsequent dense layers. The idea is to use the convolutional layers of the pre-trained models as feature extractors after removing the last layers. By passing all images through the frozen convolutional layers, we extracted relevant features from the augmented spiral and wave datasets separately.

Passing the spiral dataset through the spiral model up to its flatten layer produced a new numerical dataset consisting of 1152 samples and 8192 features, and the same result was obtained from the wave model using the augmented wave images.

Each new numerical dataset was then used to train a KNN model separately. The first classifier is the spiral KNN, whose hyperparameters were set to (n_neighbors: 4, weights: 'distance', algorithm: 'brute', leaf_size: 25, p: 2). The second classifier is the wave KNN, whose hyperparameters were set to (n_neighbors: 2, weights: 'distance', algorithm: 'brute', leaf_size: 33, p: 1). These hyperparameters were selected manually from the testing results using a trial-and-error approach.
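The sketch below illustrates this feature-extraction and KNN-training step for the spiral branch; the model filename, the flatten-layer name ("flatten", Keras' default) and the X_train / y_train arrays are assumptions introduced for illustration.

```python
# A minimal sketch of the feature-extraction + KNN step for the spiral branch.
import tensorflow as tf
from sklearn.neighbors import KNeighborsClassifier

cnn = tf.keras.models.load_model("spiral_cnn.h5")        # illustrative filename
extractor = tf.keras.Model(inputs=cnn.input,
                           outputs=cnn.get_layer("flatten").output)
extractor.trainable = False                              # freeze the convolutional base

features = extractor.predict(X_train)                    # expected shape: (1152, 8192)

spiral_knn = KNeighborsClassifier(n_neighbors=4, weights="distance",
                                  algorithm="brute", leaf_size=25, p=2)
spiral_knn.fit(features, y_train)
```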

We automated the proposed architecture (CNN-KNN) to receive the spiral test images one by one, extract their features and feed them to the KNN classifier for prediction, with the same process for the wave model. The final prediction is based on the average of the prediction probabilities produced by the spiral and wave classifiers for one spiral image and one wave image.

Figure 8 represents the proposed CNN-KNN architecture to predict Parkinson’s disease. This architecture consists of three main sections: Input, Feature Extraction and Voting Classifier.

  • Input: It performs image preprocessing such as padding, resizing, normalization and data augmentation to feed the model with significant, uniform and relevant images.

  • Feature Extraction: It represents the convolutional layers that play a crucial role in feature extraction and provide flat numerical features that will be the foundation for subsequent classifiers (KNNs).

  • Voting Classifier: It is an ensemble voting classifier which consists of two sub-classifiers and produces the final prediction based on the average of the prediction probabilities.

Fig. 8
figure 8

The proposed ensemble voting classifier of PD via spiral and wave drawing based on hybrid CNN-KNN architectures

Table 5 lists the optimal hyperparameters of all the elements used in this study. Note that the spiral and wave models have the same architectures, while the KNN classifiers have different hyperparameters. All these hyperparameters were selected using the random search approach or manually, based on training and testing outcomes, to reach the required performance.

Table 5 The optimal hyperparameters of the models and classifiers used in this work

3.10 Feature extraction based convolutional layers

Convolutional layers in neural networks are instrumental for feature extraction from an image. Figure 9 clarifies how these layers scan the input data, capturing local patterns and features using learnable filters (kernels). During training, the filters adapt their weights through backpropagation and gradient descent, enabling them to specialize in detecting task-relevant features. The output of the convolutional layers consists of feature maps, representing high-level representations of the input data. These hierarchical features, ranging from edges to objects, can then be used in subsequent layers for tasks such as classification or regression. Overall, convolutional layers play a vital role in deep learning models, extracting meaningful features that enhance performance and generalization.

Fig. 9
figure 9

Feature extraction of RGB image based on convolutional layer, activation function and MaxPooling

According to Eq. 5, the dimensions of the output matrices of a convolutional layer can be calculated from the size of the input image (width: Col, height: Row, number of channels: Nc), the kernel size (width, height and depth of the kernels: Fw, Fh and Fc) and the number of filters (Fn), taking into consideration the padding of the image (p) and the stride of the kernel across it (s).

$$\begin{array}{c}\left[Col,Row,{N}_{c}\right]*\left[{F}_{w},{F}_{h},{F}_{c}\right]=\\ \left[\left[\frac{Col+2p-{F}_{w}}{s}+1\right],\left[\frac{Row+2p-{F}_{h}}{s}+1\right],{F}_{n}\right]\end{array}$$
(5)

In Conv2D, a filter must have the same depth as the input matrix and produces one 2D matrix, according to the default architecture of TensorFlow. As shown in Fig. 9, the input is an RGB image of size (6 × 6 × 3) and there are two kernels of size (3 × 3 × 3). Each layer (channel) of a kernel scans one channel of the input image, and each kernel produces one primary matrix of size (4 × 4).
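This output shape can be verified directly in TensorFlow, as the short check below shows.

```python
# A quick check of the example above: a 6 x 6 x 3 input convolved with two
# 3 x 3 x 3 kernels (stride 1, no padding) yields a 4 x 4 x 2 output,
# in line with Eq. 5.
import tensorflow as tf

x = tf.random.uniform((1, 6, 6, 3))                     # one RGB "image"
conv = tf.keras.layers.Conv2D(filters=2, kernel_size=3,
                              strides=1, padding="valid")
print(conv(x).shape)                                    # (1, 4, 4, 2)
```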

Contrary to the correlation operation, the convolution operation uses a filter rotated by 180° before applying it to the input. This operation is known as kernel flipping, and it ensures that the operation aligns with the mathematical definition of convolution and simplifies its formulation.

The pixel values of each primary matrix can be calculated using Eq. 6, whereas the pixel values of the final matrix are calculated by adding the bias to the sum of the corresponding pixels of the primary matrices [65, 66], as defined in Eq. 7. In this example, the convolution returns two matrices of size (4 × 4 × 1) because two filters are used.

$$G\left[n,m\right]=\left({f}^{*}h\right)\left[n,m\right]=\sum_{y=0}^{{F}_{h}-1}\sum_{x=0}^{{F}_{W}-1}h\left[x,y\right]f\left[x,y\right]$$
(6)
$${G}_{\mathrm{out}}\left(n,m\right)=\sum_{z=0}^{{\mathrm{depth}}-1}\left(\left(\sum_{y=0}^{{F}_{h}-1}\sum_{x=0}^{{F}_{W}-1}h\left[x,y\right]*f[x,y]\right)+{\mathrm{bias}}\right)$$
(7)

where the Receptive Field in each channel of the input image has been denoted by f, one layer of the kernel (filter) by h and the primary matrix by G[n, m] with the indexes of columns and rows (n, m) respectively.

Moreover, the Rectified Linear Unit (ReLU) defined in Eq. 8 has been used as the activation function. It introduces non-linearity into the network, allowing it to learn complex patterns and, in some cases, improving learning efficiency. This function replaces the negative values in the output matrix with zero [67].

$$f\left(x\right)={x}^{+}=max\left(0,x\right)$$
(8)

Then, we used Max pooling to reduce the dimensions of output matrices while retaining their essential features [66]. Max pooling [68] is a subsampling operation that takes the maximum value from a group of values in a matrix.

We used a (2 × 2) max-pooling operation, given by Eq. 9, where y is the maximum value within R, a (2 × 2) window (or filter) that slides over the input matrix elements \({x}_{ij}\).

$$y={max}_{\left(i,j\right)\in R}{x}_{ij}$$
(9)

The output matrices become the input of the next convolutional layer, and the previous processes repeat until the flatten layer, which reshapes the most prominent features of the last matrices into a one-dimensional vector to be used as input for the fully connected layers or any ML algorithm.

3.11 Ensemble voting classifier based on KNN algorithm

In this work, we used the k-Nearest Neighbours (k-NN) classifier, a simple, instance-based supervised learning algorithm. Instead of constructing a general model during the training phase, as many other algorithms do, it memorizes the entire training dataset, and predictions for a new data point are made by considering the k training examples closest to that point [69, 70]. For a new, unseen instance, the algorithm searches for the k training samples closest to the point, with the distance determined by the hyperparameter p (p = 1 or p = 2) of the most common distance equations: when p equals one, the algorithm uses the Manhattan distance given by Eq. 10, and when p equals two, the distance is calculated using the Euclidean distance given by Eq. 11. To unify the two, the Minkowski distance [70] generalizes both the Euclidean and the Manhattan distances, as defined in Eq. 12; a short numerical check follows the equations below.

$${D}_{\mathrm{Manhattan}}\left(X,Y\right)={\Vert x-y\Vert }_{1}={\sum }_{i=1}^{n}\mid {x}_{i}-{y}_{i}\mid$$
(10)
$${D}_{\mathrm{Euclidean}}\left(X,Y\right)=\sqrt{{\sum }_{i=1}^{n}{\left({x}_{i}-{y}_{i}\right)}^{2}}$$
(11)
$${D}_{\mathrm{Minkowski}}\left(X,Y\right)={\left({\sum }_{i=1}^{n}{\left|{x}_{i}-{y}_{i}\right|}^{p}\right)}^\frac{1}{p}$$
(12)
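The check below confirms numerically that the Minkowski distance reduces to the Manhattan and Euclidean distances for p = 1 and p = 2; the vectors are illustrative.

```python
# A small check that the Minkowski distance (Eq. 12) reduces to the Manhattan
# distance for p = 1 (Eq. 10) and the Euclidean distance for p = 2 (Eq. 11).
from scipy.spatial import distance

x, y = [1.0, 2.0, 3.0], [4.0, 0.0, 3.0]
print(distance.minkowski(x, y, p=1), distance.cityblock(x, y))   # 5.0 5.0
print(distance.minkowski(x, y, p=2), distance.euclidean(x, y))   # 3.606 3.606
```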

As mentioned, the weights hyperparameter in the two KNN algorithms has been set to (distance). Thus, the influence of each neighbour on the prediction is weighted by the inverse of their distance to the query point. This means that closer neighbours will significantly influence the prediction more than those further away. For each of the k neighbours, the weight \({w}_{i}\)​ of the \({i}^{th}\) neighbour is calculated using Eq. 13, where \(x\) is the query point and \({x}_{i}\) is the \({i}^{th}\) neighbour.

$${w}_{i}=\frac{1}{{\mathrm{Distance}}\left(x,{x}_{i}\right)}$$
(13)

For classification, instead of each of the k neighbours getting one vote, each neighbour gets \({w}_{i}\) votes, and the class with the highest cumulative weighted vote is the predicted class. The proposed ensemble voting classifier is built from two independent KNN models: a spiral KNN to predict spiral images and a wave KNN to predict wave images. The final prediction of Parkinson's disease is the average of the probabilities returned by each model, as shown in Fig. 10; each model returns the probabilities in two columns corresponding to the classes (Parkinson's and healthy).
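A minimal sketch of this soft-voting step is given below; the classifier and feature variables are assumptions standing in for the fitted spiral/wave KNNs and the flattened CNN features of one person's drawings, and both KNNs are assumed to have been fitted on identically encoded labels.

```python
# A minimal sketch of the soft-voting step: average the two KNNs' probability
# vectors and take the class with the highest averaged probability.
import numpy as np

def predict_person(spiral_knn, wave_knn, spiral_feat, wave_feat):
    p_spiral = spiral_knn.predict_proba(spiral_feat.reshape(1, -1))[0]
    p_wave = wave_knn.predict_proba(wave_feat.reshape(1, -1))[0]
    p_avg = (p_spiral + p_wave) / 2.0       # average of the two probability vectors
    return spiral_knn.classes_[int(np.argmax(p_avg))], p_avg
```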

Fig. 10
figure 10

The proposed ensemble voting classifier through spiral and wave images based on KNN algorithm

The resulting value for each class is the probability that the input sample belongs to that class based on the weighted votes of its k nearest neighbours. To compute the weighted vote for each class c (Parkinson's or healthy), the weights of the neighbours belonging to class c are summed using Eq. 14, where yi is the actual label of the i-th neighbour and c is the target class. These weighted votes are then converted into probabilities by dividing the weighted vote for each class by the total weight of the k neighbours using Eq. 15, where P(c|x) is the probability of class c given the input sample x.

$${\mathrm{WeightedVote}}\left(c\right)={\sum }_{i:{{\mathrm{y}}}_{i}={\mathrm{c}}}{w}_{i}$$
(14)
$$P\left(c|x\right)=\frac{{\mathrm{WeightedVote}}\left(c\right)}{{\sum }_{i=1}^{k}{w}_{i}}$$
(15)

In mathematical terms, for a given class c and an input sample x, Eq. 16 provides the class probability, where \({w}_{i}\) is the weight of the \({i}^{th}\) neighbour and \(I\left({y}_{i}=c\right)\) is an indicator function equal to one if the label \({y}_{i}\) of the \({i}^{th}\) neighbour is c and zero otherwise; a short numerical sketch follows the equation.

$$P\left(c|x\right)=\frac{\sum_{i\in \mathrm{neighbours\ of\ }x}{w}_{i}\times I\left({y}_{i}=c\right)}{\sum_{i\in \mathrm{neighbours\ of\ }x}{w}_{i}}$$
(16)
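The short sketch below evaluates Eqs. 13–16 on illustrative neighbour distances and labels.

```python
# A minimal sketch of Eqs. 13-16: distance-weighted class probabilities from
# the k nearest neighbours (distances and labels are illustrative values).
import numpy as np

distances = np.array([0.5, 1.0, 2.0, 4.0])                   # k = 4 neighbours
labels = np.array(["parkinson", "parkinson", "healthy", "parkinson"])

weights = 1.0 / distances                                    # Eq. 13
for c in ("healthy", "parkinson"):
    p = weights[labels == c].sum() / weights.sum()           # Eqs. 14-16
    print(c, round(p, 3))
# healthy 0.133
# parkinson 0.867
```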

4 Results and discussion

This section presents the results of seven models: (1) a CNN model trained and tested on the original spiral set (Spiral-CNN No-Padding); (2) a CNN model trained and tested on the original wave set (Wave-CNN No-Padding); (3) a CNN model trained and tested on the padded spiral set (Spiral-CNN Padding); (4) a CNN model trained and tested on the padded wave set (Wave-CNN Padding); (5) a hybrid CNN-KNN model trained and tested on the padded spiral set (Spiral-CNN-KNN); (6) a hybrid CNN-KNN model trained and tested on the padded wave set (Wave-CNN-KNN); and (7) the Ensemble Voting Classifier developed from the last two models.

4.1 Spiral and wave models with No-Padding dataset

In the methodology, we mentioned that we developed two CNN models and tuned their hyperparameters; these models were trained, validated and tested using the original spiral and wave dataset. Training and validation curves are graphical representations commonly used in ML and DL to monitor the performance of a model during training. Figure 11a shows the training and validation accuracy curves of the spiral model on the No-Padding spiral set (Spiral-CNN No-Padding) over thirty successive epochs, and Fig. 11b shows the corresponding loss curves, which indicate how far the predictions were from the desired targets.

Fig. 11
figure 11

The results of the CNN Model using the No-Padding spiral set (a) represents the accuracy curves of training and validation, while (b) represents loss curves of training and validation

This model was then evaluated on unseen images (the test set) consisting of 30 images. Of these, 22 images were correctly classified as true positives (TP) or true negatives (TN), indicating the model's capability to identify healthy people and PD patients with an accuracy of 73.3%. Although the model demonstrated promising performance by correctly classifying 22 images, it also exhibited limitations, misclassifying 4 images as false positives (FP) and 4 as false negatives (FN). This suggests room for improving the model's sensitivity and precision, which are crucial for minimizing misclassifications. Figure 12 shows the symmetric confusion matrix of the model.

Fig. 12
figure 12

Confusion Matrix of CNN Model using the No-Padding spiral set

The wave model (Wave-CNN No-Padding) was likewise trained, validated and evaluated using the No-Padding wave set. The training and validation accuracy curves are shown in Fig. 13a, and the loss curves in Fig. 13b. As the figures show, the accuracy curves increased and stabilized at high values closer to each other than those of the spiral model, whereas the loss curves decreased and stabilized at low values. This convergence is reflected in the model's performance on the test set: as shown in the confusion matrix in Fig. 14, the model misclassified 2 images as FP and 4 images as FN.

Fig. 13
figure 13

The results of the CNN model using the No-Padding wave set (a) represents the accuracy curves of training and validation, while (b) represents loss curves of training and validation

Fig. 14
figure 14

Confusion matrix of CNN model using the No-Padding wave set

4.2 Spiral and wave models with padded dataset

When we augmented the spiral and wave dataset used with the previous models, we noticed that some images had been distorted and some had lost part of their content because the sketches were close to the image borders, as mentioned previously. Furthermore, the outcomes of the previous models show high misclassification caused by the deformation of the images during data augmentation. For these reasons, we padded all images to avoid excessive deformation, with the aim of obtaining meaningful images and preserving all the information in the middle of these images. These padded images were used to train and evaluate two CNN models.

The first model is the spiral model with the padded set (Spiral-CNN Padding). We trained and evaluated it using the padded spiral images and obtained more accurate performance. Figure 15a shows the training and validation accuracy curves, and Fig. 15b the loss curves. The confusion matrix in Fig. 16 presents the key performance indicators obtained on the testing set.

Fig. 15
figure 15

The results of the CNN model using the padded spiral set (a) represents the accuracy curves of training and validation, while (b) represents loss curves of training and validation

Fig. 16
figure 16

Confusion matrix of CNN Model using the padded spiral set

The second model is the wave model with the padded set (Wave-CNN Padding). This model was trained and evaluated using the padded wave images, and we observed that it learned meaningful patterns from the padded wave set and generalized well to the validation and testing sets. Figure 17a shows the training and validation accuracy curves, and Fig. 17b the loss curves. The confusion matrix in Fig. 18 presents the key performance indicators obtained on the testing set. The proposed solution minimizes the models' misclassification and increases the accuracy from 80% to 86.66%.

Fig. 17
figure 17

The results of the CNN model using the padded wave set (a) represents the accuracy curves of training and validation, while (b) represents loss curves of training and validation

Fig. 18
figure 18

Confusion matrix of CNN model using the padded wave set

4.3 Spiral and wave CNN-KNN architecture with padded dataset

Transfer learning (TL) is a deep-learning technique that leverages the knowledge gained from pre-trained models to solve new, related tasks. We aimed to enhance the performance of our models using the pre-trained CNN models trained on the padded spiral and wave datasets.

After removing the last layers, we used the convolutional layers of the pre-trained models as feature extractors: by passing all images through the frozen convolutional layers, we extracted relevant features from the images. The flattened numerical features formed a new dataset that was used to train the k-Nearest Neighbours (KNN) algorithm.

This hybrid CNN-KNN architecture was trained and evaluated using the padded spiral set; its performance is presented in the confusion matrix in Fig. 19. The second hybrid model was trained and evaluated using only the padded wave set, and the confusion matrix in Fig. 20 reflects how its performance increased compared with both wave models.

Fig. 19
figure 19

Confusion matrix of hybrid CNN-KNN architecture using the padded spiral set

Fig. 20
figure 20

Confusion matrix of hybrid CNN-KNN architecture using the padded wave set

4.4 Ensemble voting classifier

This work aims to predict Parkinson's disease via hand tremors using two different approaches, spiral sketching and wave sketching. To predict the disease from the spiral and wave drawings, we built two separate models, Spiral-CNN-KNN and Wave-CNN-KNN, which provide two independent predictions.

To obtain a single decision on whether or not a person has Parkinson's disease, we built, trained and evaluated an Ensemble Voting Classifier based on both the spiral and wave CNN-KNN architectures, whose final decision is the average of the probabilities of the sub-classifiers. This hybrid system was then evaluated using unseen images (the test set) comprising 60 combined samples, with each individual represented by one spiral and one wave image; the final classification decision is determined by averaging the predictions of the spiral- and wave-based sub-systems. As shown in Fig. 21, 29 individuals were correctly classified as true positives (TP) or true negatives (TN), indicating the model's capability to identify healthy people and PD patients with an accuracy of 96.67%. Although the model demonstrated strong performance by correctly classifying 29 individuals, it also exhibited a limitation, misclassifying 1 individual as healthy while he/she actually has Parkinson's disease (FN). The proposed hybrid system provides high performance compared with the previous models, and these results demonstrate the potential of our hybrid approach for effectively identifying individuals with PD using both spiral and wave images.

Fig. 21
figure 21

Confusion matrix of the proposed ensemble voting classifier of PD via spiral and wave drawing based on hybrid CNN-KNN architectures

Table 6 compares the key performance indicators of the seven models: the CNN models trained and tested on the original spiral and wave sets, the CNN models trained and tested on the padded spiral and wave sets, the hybrid CNN-KNN models trained and tested on the padded spiral and wave sets, and the Ensemble Voting Classifier developed from the last two models.

Table 6 Comparison between the six models developed as a foundation for the proposed classifier and the proposed ensemble voting classifier

For contrast and easy comparison between models across the different key performance indicators, Fig. 22 provides a visual representation of the models' performance side-by-side.

Fig. 22
figure 22

Visualization chart of the key performance indicators of models

Moreover, Fig. 23 shows three trends that support a practical analysis of the performance improvement of the spiral models, the wave models and the Ensemble Voting Classifier, which reached 96.67% accuracy, 93.33% sensitivity (recall), 100% precision and an F1-score of 96.55%.

Fig. 23
figure 23

Upward trends of performance enhancement of models divided into three groups

Table 7 compares the proposed classifier (CNN-KNN Ensemble Voting Classifier) with several classifiers from the previous literature in terms of publication date, the dataset used for training and evaluation, the feature-extraction methodology and the performance of each classifier.

Table 7 Comparison between the proposed classifier and related works in previous literature (only hand drawing dataset)

Diagnosing Parkinson's disease typically involves a comprehensive evaluation of multiple symptoms and clinical indicators rather than relying on a single symptom such as speech disorders, hand tremors or brain MRI scans. Parkinson's disease is a complex neurodegenerative disorder, and its diagnosis is often based on a combination of motor and non-motor symptoms. As elucidated in the literature review section, certain diagnostic methodologies, including the analysis of speech disorders [30] and scans [58], exhibit high accuracy; however, relying solely on these methods is insufficient for a precise diagnosis. Hence, the diversity of AI- and ML-based diagnostic methods for predicting Parkinson's disease can guide the construction of a complete early-diagnostic embedded system that predicts the disease from several symptoms.

The findings can be summarized in four points: 1) hyperparameter tuning is a crucial step in building a robust model compatible with a custom dataset, achieving high performance, reducing overfitting and enhancing generalization to unseen data (the test set); 2) deformation of the dataset during augmentation may negatively impact performance, and as a solution we padded all original images to preserve their essential information; 3) the CNN-KNN architecture may increase performance in some cases by exploiting the advantages of convolutional layers as feature extractors and ML algorithms as classifiers; 4) ensemble voting classifiers combine predictions from multiple individual models and can often achieve higher accuracy and better overall performance than a single model, because they combine the strengths and weaknesses of spiral drawing and wave drawing to make the final decision about people with Parkinson's disease and healthy people.

5 Conclusion and future work

In this study, we proposed a novel hybrid CNN-KNN voting classifier for predicting Parkinson's disease through hand tremors using a spiral-wave sketching dataset. Our classifier combines the advantages of both deep learning and machine learning techniques and leverages the strengths and weaknesses of spiral drawing and wave drawing to make accurate predictions. It achieved 96.67% accuracy, 93.33% sensitivity and 100% precision on our dataset, higher than existing methods that use only one type of drawing or one type of model. Our work enhances medical services by providing a fast and reliable tool for diagnosing Parkinson's disease at an early stage. However, our work also has limitations and challenges that must be addressed in future research. First, our dataset is relatively small, which may affect the performance and robustness of our classifier; we plan to collect more data from different sources and apply more advanced data augmentation techniques to increase the diversity and quality of our data. Second, our classifier is based on hand tremors only, which may not capture all the symptoms and features of Parkinson's disease; we intend to incorporate other modalities, such as voice, gait, or facial expressions, to improve our classifier's comprehensiveness and accuracy. Third, our classifier has not been tested on actual patients or in clinical settings, which may raise ethical or practical issues; we aim to conduct more experiments and evaluations with medical experts and patients to validate our classifier's effectiveness and usability. We hope our work will inspire more research on Parkinson's disease prediction using hand tremors and pave the way for developing more advanced and intelligent systems for enhancing medical services.