Malmö University Publications
Change search
Link to record
Permanent link

Direct link
Kovács, György, Postdoctoral researcherORCID iD iconorcid.org/0000-0002-0546-116X
Publications (5 of 5) Show all publications
Pirinen, A., Abid, N., Paszkowsky, N. A., Ohlson Timoudas, T., Scheirer, R., Ceccobello, C., . . . Persson, A. (2024). Creating and Leveraging a Synthetic Dataset of Cloud Optical Thickness Measures for Cloud Detection in MSI. Remote Sensing, 16(4), Article ID 694.
Open this publication in new window or tab >>Creating and Leveraging a Synthetic Dataset of Cloud Optical Thickness Measures for Cloud Detection in MSI
Show others...
2024 (English)In: Remote Sensing, E-ISSN 2072-4292, Vol. 16, no 4, article id 694Article in journal (Refereed) Published
Abstract [en]

Cloud formations often obscure optical satellite-based monitoring of the Earth’s surface, thus limiting Earth observation (EO) activities such as land cover mapping, ocean color analysis, and cropland monitoring. The integration of machine learning (ML) methods within the remote sensing domain has significantly improved performance for a wide range of EO tasks, including cloud detection and filtering, but there is still much room for improvement. A key bottleneck is that ML methods typically depend on large amounts of annotated data for training, which are often difficult to come by in EO contexts. This is especially true when it comes to cloud optical thickness (COT) estimation. A reliable estimation of COT enables more fine-grained and application-dependent control compared to using pre-specified cloud categories, as is common practice. To alleviate the COT data scarcity problem, in this work, we propose a novel synthetic dataset for COT estimation, which we subsequently leverage for obtaining reliable and versatile cloud masks on real data. In our dataset, top-of-atmosphere radiances have been simulated for 12 of the spectral bands of the Multispectral Imagery (MSI) sensor onboard Sentinel-2 platforms. These data points have been simulated under consideration of different cloud types, COTs, and ground surface and atmospheric profiles. Extensive experimentation of training several ML models to predict COT from the measured reflectivity of the spectral bands demonstrates the usefulness of our proposed dataset. In particular, by thresholding COT estimates from our ML models, we show on two satellite image datasets (one that is publicly available, and one which we have collected and annotated) that reliable cloud masks can be obtained. The synthetic data, the newly collected real dataset, code and models have been made publicly available.

Place, publisher, year, edition, pages
MDPI, 2024
Keywords
cloud detection, cloud optical thickness, datasets, machine learning
National Category
Earth Observation
Research subject
Machine Learning
Identifiers
urn:nbn:se:mau:diva-75693 (URN)10.3390/rs16040694 (DOI)001177031000001 ()2-s2.0-85185890836 (Scopus ID)
Funder
Vinnova, 2021-03643; 2023-02787
Note

Validerad;2024;Nivå 2;2024-04-09 (sofila);

Full text license: CC BY

Available from: 2025-04-30 Created: 2025-04-30 Last updated: 2025-05-07Bibliographically approved
Abid, N., Noman, M. K., Kovács, G., Islam, S. M., Adewumi, T., Lavery, P., . . . Liwicki, M. (2024). Seagrass classification using unsupervised curriculum learning (UCL). Ecological Informatics, 83, Article ID 102804.
Open this publication in new window or tab >>Seagrass classification using unsupervised curriculum learning (UCL)
Show others...
2024 (English)In: Ecological Informatics, ISSN 1574-9541, E-ISSN 1878-0512, Vol. 83, article id 102804Article in journal (Refereed) Published
Abstract [en]

Seagrass ecosystems are pivotal in marine environments, serving as crucial habitats for diverse marine species and contributing significantly to carbon sequestration. Accurate classification of seagrass species from underwater images is imperative for monitoring and preserving these ecosystems. This paper introduces Unsupervised Curriculum Learning (UCL) to seagrass classification using the DeepSeagrass dataset. UCL progressively learns from simpler to more complex examples, enhancing the model's ability to discern seagrass features in a curriculum-driven manner. Experiments employing state-of-the-art deep learning architectures, convolutional neural networks (CNNs), show that UCL achieved overall 90.12 % precision and 89 % recall, which significantly improves classification accuracy and robustness, outperforming some traditional supervised learning approaches like SimCLR, and unsupervised approaches like Zero-shot CLIP. The methodology of UCL involves four main steps: high-dimensional feature extraction, pseudo-label generation through clustering, reliable sample selection, and fine-tuning the model. The iterative UCL framework refines CNN's learning of underwater images, demonstrating superior accuracy, generalization, and adaptability to unseen seagrass and background samples of undersea images. The findings presented in this paper contribute to the advancement of seagrass classification techniques, providing valuable insights into the conservation and management of marine ecosystems. The code and dataset are made publicly available and can be assessed here: https://github.com/nabid69/Unsupervised-Curriculum-Learning—UCL.

 

Place, publisher, year, edition, pages
Elsevier, 2024
Keywords
Seagrass, Deep learning, Unsupervised classification, Curriculum learning, Unsupervised curriculum learning, Underwater digital imaging
National Category
Computer Sciences Computer graphics and computer vision
Research subject
Machine Learning
Identifiers
urn:nbn:se:mau:diva-75687 (URN)10.1016/j.ecoinf.2024.102804 (DOI)001307982900001 ()2-s2.0-85202895926 (Scopus ID)
Note

Validerad;2024;Nivå 2;2024-09-09 (hanlid);

Full text license: CC BY

Available from: 2025-04-30 Created: 2025-04-30 Last updated: 2025-05-07Bibliographically approved
Sabry, S. S., Adewumi, T., Abid, N., Kovács, G., Liwicki, F. & Liwicki, M. (2022). HaT5: Hate Language Identification using Text-to-Text Transfer Transformer. In: 2022 International Joint Conference on Neural Networks (IJCNN): Conference Proceedings: . Paper presented at IEEE World Congress on Computational Intelligence (IEEE WCCI 2022), Padua, Italy, July 18-23, 2022. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>HaT5: Hate Language Identification using Text-to-Text Transfer Transformer
Show others...
2022 (English)In: 2022 International Joint Conference on Neural Networks (IJCNN): Conference Proceedings, Institute of Electrical and Electronics Engineers (IEEE) , 2022Conference paper, Published paper (Refereed)
Abstract [en]

We investigate the performance of a state-of-the-art (SoTA) architecture T5 (available on the SuperGLUE) and compare it with 3 other previous SoTA architectures across 5 different tasks from 2 relatively diverse datasets. The datasets are diverse in terms of the number and types of tasks they have. To improve performance, we augment the training data by using a new autoregressive conversational AI model checkpoint. We achieve near-SoTA results on a couple of the tasks - macro F1 scores of 81.66% for task A of the OLID 2019 dataset and 82.54% for task A of the hate speech and offensive content (HASOC) 2021 dataset, where SoTA are 82.9% and 83.05%, respectively. We perform error analysis and explain why one of the models (Bi-LSTM) makes the predictions it does by using a publicly available algorithm: Integrated Gradient (IG). This is because explainable artificial intelligence (XAI) is essential for earning the trust of users. The main contributions of this work are the implementation method of T5, which is discussed; the data augmentation, which brought performance improvements; and the revelation on the shortcomings of the HASOC 2021 dataset. The revelation shows the difficulties of poor data annotation by using a small set of examples where the T5 model made the correct predictions, even when the ground truth of the test set were incorrect (in our opinion). We also provide our model checkpoints on the HuggingFace hub1. https://huggingface.co/sana-ngu/HaT5_augmentation https://huggingface.co/sana-ngu/HaT5.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2022
Keywords
Hate Speech, Data Augmentation, Transformer, T5
National Category
Natural Language Processing
Research subject
Machine Learning
Identifiers
urn:nbn:se:mau:diva-75694 (URN)10.1109/IJCNN55064.2022.9892696 (DOI)000867070906060 ()2-s2.0-85140754070 (Scopus ID)978-1-7281-8671-9 (ISBN)
Conference
IEEE World Congress on Computational Intelligence (IEEE WCCI 2022), Padua, Italy, July 18-23, 2022
Note

ISBN för värdpublikation: 978-1-7281-8671-9

Available from: 2025-04-30 Created: 2025-04-30 Last updated: 2025-05-06Bibliographically approved
Abid, N., Kovács, G., Wedin, J., Paszkowsky, N. A., Shafait, F. & Liwicki, M. (2022). UCL: Unsupervised Curriculum Learning for Utility Pole Detection from Aerial Imagery. In: Proceedings of the Digital Image Computing: Technqiues and Applications (DICTA): . Paper presented at 2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA), November 30 - December 2, 2022, Sydney, Australia. IEEE
Open this publication in new window or tab >>UCL: Unsupervised Curriculum Learning for Utility Pole Detection from Aerial Imagery
Show others...
2022 (English)In: Proceedings of the Digital Image Computing: Technqiues and Applications (DICTA), IEEE , 2022Conference paper, Published paper (Refereed)
Abstract [en]

This paper introduces a machine learning-based approach for detecting electric poles, an essential part of power grid maintenance. With the increasing popularity of deep learning, several such approaches have been proposed for electric pole detection. However, most of these approaches are supervised, requiring a large amount of labeled data, which is time-consuming and labor-intensive. Unsupervised deep learning approaches have the potential to overcome the need for huge amounts of training data. This paper presents an unsupervised deep learning framework for utility pole detection. The framework combines Convolutional Neural Network (CNN) and clustering algorithms with a selection operation. The CNN architecture for extracting meaningful features from aerial imagery, a clustering algorithm for generating pseudo labels for the resulting features, and a selection operation to filter out reliable samples to fine-tune the CNN architecture further. The fine-tuned version then replaces the initial CNN model, thus improving the framework, and we iteratively repeat this process so that the model learns the prominent patterns in the data progressively. The presented framework is trained and tested on a small dataset of utility poles provided by “Mention Fuvex” (a Spanish company utilizing long-range drones for power line inspection). Our extensive experimentation demonstrates the progressive learning behavior of the proposed method and results in promising classification scores with significance test having p−value<0.00005 on the utility pole dataset.

Place, publisher, year, edition, pages
IEEE, 2022
Keywords
Aerial Imagery, Electric Poles, Computer Vision, Deep Learning, Unsupervised Learning
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:mau:diva-75685 (URN)10.1109/DICTA56598.2022.10034610 (DOI)2-s2.0-85148606239 (Scopus ID)978-1-6654-5642-5 (ISBN)
Conference
2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA), November 30 - December 2, 2022, Sydney, Australia
Available from: 2025-04-30 Created: 2025-04-30 Last updated: 2025-05-06Bibliographically approved
Abid, N., Shahzad, M., Malik, M. I., Schwanecke, U., Ulges, A., Kovács, G. & Shafait, F. (2021). UCL: Unsupervised Curriculum Learning for Water Body Classification from Remote Sensing Imagery. International Journal of Applied Earth Observation and Geoinformation, 105, Article ID 102568.
Open this publication in new window or tab >>UCL: Unsupervised Curriculum Learning for Water Body Classification from Remote Sensing Imagery
Show others...
2021 (English)In: International Journal of Applied Earth Observation and Geoinformation, ISSN 1569-8432, E-ISSN 1872-826X, Vol. 105, article id 102568Article in journal (Refereed) Published
Abstract [en]

This paper presents a Convolutional Neural Networks (CNN) based Unsupervised Curriculum Learning approach for the recognition of water bodies to overcome the stated challenges for remote sensing based RGB imagery. The unsupervised nature of the presented algorithm eliminates the need for labelled training data. The problem is cast as a two class clustering problem (water and non-water), while clustering is done on deep features obtained by a pre-trained CNN. After initial clusters have been identified, representative samples from each cluster are chosen by the unsupervised curriculum learning algorithm for fine-tuning the feature extractor. The stated process is repeated iteratively until convergence. Three datasets have been used to evaluate the approach and show its effectiveness on varying scales: (i) SAT-6 dataset comprising high resolution aircraft images, (ii) Sentinel-2 of EuroSAT, comprising remote sensing images with low resolution, and (iii) PakSAT, a new dataset we created for this study. PakSAT is the first Pakistani Sentinel-2 dataset designed to classify water bodies of Pakistan. Extensive experiments on these datasets demonstrate the progressive learning behaviour of UCL and reported promising results of water classification on all three datasets. The obtained accuracies outperform the supervised methods in domain adaptation, demonstrating the effectiveness of the proposed algorithm.

Place, publisher, year, edition, pages
Elsevier, 2021
Keywords
Sentinel-2, Aircraft Imagery, Remote Sensing, Water classification, Deep Learning, Unsupervised Curriculum Learning, Multi-scale Classification
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:mau:diva-75688 (URN)10.1016/j.jag.2021.102568 (DOI)000716818200002 ()2-s2.0-85121593506 (Scopus ID)
Note

Validerad;2021;Nivå 2;2021-11-08 (johcin);

Full text license: CC BY-NC-ND

Available from: 2025-04-30 Created: 2025-04-30 Last updated: 2025-05-07Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-0546-116X

Search in DiVA

Show all publications