Handling dropout probability estimation in convolution neural networks using meta-heuristics
De Rosa, G., Papa, J. and Yang, X. 2018. Handling dropout probability estimation in convolution neural networks using meta-heuristics. Soft Computing. 22 (18), pp. 6147-6156. https://doi.org/10.1007/s00500-017-2678-4
Type | Article |
---|---|
Title | Handling dropout probability estimation in convolution neural networks using meta-heuristics |
Authors | De Rosa, G., Papa, J. and Yang, X. |
Abstract | Deep learning-based approaches have been paramount in recent years, mainly due to their outstanding results in several application domains, ranging from face and object recognition to handwritten digit identification. Convolutional Neural Networks (CNNs) have attracted considerable attention since they model the intrinsic and complex working mechanisms of the brain. However, one main shortcoming of such models is overfitting, which prevents the network from predicting unseen data effectively. In this paper, we address this problem by properly selecting a regularization parameter known as Dropout in the context of CNNs using meta-heuristic-driven techniques. As far as we know, this is the first attempt to tackle this issue using this methodology. Additionally, we take into account a default dropout parameter and a dropout-less CNN for comparison purposes. The results revealed that optimizing Dropout-based CNNs is worthwhile, mainly due to the ease of finding suitable dropout probability values without needing to set new parameters empirically. |
Keywords | Convolutional Neural Networks; Dropout; Meta-Heuristic Optimization |
Publisher | Springer |
Journal | Soft Computing |
ISSN | 1432-7643 |
Publication dates | |
Online | 23 Jun 2017 |
01 Sep 2018 | |
Publication process dates | |
Deposited | 26 Jun 2017 |
Accepted | 05 Jun 2017 |
Output status | Published |
Accepted author manuscript | |
Copyright Statement | This is a post-peer-review, pre-copyedit version of an article published in Soft Computing. The final authenticated version is available online at Springer via http://dx.doi.org/10.1007/s00500-017-2678-4 |
Digital Object Identifier (DOI) | https://doi.org/10.1007/s00500-017-2678-4 |
Web of Science identifier | WOS:000442576400018 |
Language | English |
https://repository.mdx.ac.uk/item/8710q