Analysing Maritime Piracy Reports Using a Text Mining–Based Frequency and Thematic Approach
DOI:
https://doi.org/10.5281/zenodo.20297788Keywords:
Text Mining, Maritime Piracy, Porter Stemming, Maritime Security, Incident AnalysisAbstract
Maritime piracy poses persistent risks to crew safety, vessel operations, and global trade efficiency. To identify dominant patterns and trends in piracy incidents, this study applies a text mining-based analysis to maritime piracy reports published between 2021 and 2023 by the International Maritime Bureau’s (IMB) Piracy Reporting Centre. The Porter Stemming algorithm is used to preprocess unstructured textual data and extract high-frequency terms related to piracy events. Based on word frequency distributions and semantic similarity, recurring terms are grouped into thematic categories representing attack characteristics, locations, operational methods, and incident timelines. The results indicate that piracy reports primarily emphasise attack locations, threat execution, vessel boarding actions, and the temporal progression of incidents, while post-incident response measures receive comparatively less attention. The study demonstrates that a purely data-driven text mining approach can effectively reveal structural patterns in maritime piracy reports and support situational awareness without relying on expert judgement or multi-criteria decision-making models.
References
Ahadh, A., Binish, G. V., & Srinivasan, R. (2021). Text mining of accident reports using semi-supervised keyword extraction and topic modeling. Process Safety and Environmental Protection, 155, 455–465. https://doi.org/10.1016/j.psep.2021.09.022
Ali, N. H., & Ibrahim, N. S. (2012). Porter Stemming Algorithm for Semantic Checking. Iccit, June, 253–258.
Bai, X., Zhang, X., Li, K. X., Zhou, Y., & Yuen, K. F. (2021). Research topics and trends in the maritime transport: A structural topic model. Transport Policy, 102(November 2020), 11–24. https://doi.org/10.1016/j.tranpol.2020.12.013
Chintalapudi, N., Battineni, G., Canio, M. Di, Sagaro, G. G., & Amenta, F. (2021). Text mining with sentiment analysis on seafarers’ medical documents. International Journal of Information Management Data Insights, 1(1), 100005. https://doi.org/10.1016/j.jjimei.2020.100005
He, Z., Wang, C., Gao, J., & Xie, Y. (2023). Assessment of global shipping risk caused by maritime piracy. Heliyon, 9(10), e20988. https://doi.org/10.1016/j.heliyon.2023.e20988
Liu, C., & Yang, S. (2022). Using text mining to establish knowledge graph from accident/incident reports in risk assessment. Expert Systems with Applications, 207(June), 117991. https://doi.org/10.1016/j.eswa.2022.117991
Papadimitriou, S., Lyridis, D. V., Koliousis, I. G., Tsioumas, V., Sdoukopoulos, E., & Stavroulakis, P. J. (2018). Strategic planning of short sea shipping within maritime clusters. In The Dynamics of Short Sea Shipping: New Practices and Trends (pp. 37-59). Cham: Springer International Publishing.
Paul, K. (2012). Classifying maritime near-miss and injury report using text mining. Lamar University-Beaumont.
Shi, H., Li, Y., Jiang, Z., & Zhang, J. (2021). Comprehensive power quality evaluation method of microgrid with dynamic weighting based on CRITIC. Measurement and Control (United Kingdom), 54(5–6), 1097–1104. https://doi.org/10.1177/00202940211016092
Tirunagari, S. (2015). Data Mining of Causal Relations from Text: Analysing Maritime Accident Investigation Reports. ArXiv Preprint ArXiv:1507.02447. http://arxiv.org/abs/1507.02447
XU, N., MA, L., Liu, Q., WANG, L., & Deng, Y. (2021). An improved text mining approach to extract safety risk factors from construction accident reports. Safety Science, 138(June 2020). https://doi.org/10.1016/j.ssci.2021.105216
Zhong, B., Pan, X., Love, P. E. D., Sun, J., & Tao, C. (2020). Hazard analysis: A deep learning and text mining framework for accident prevention. Advanced Engineering Informatics, 46(June), 101152. https://doi.org/10.1016/j.aei.2020.101152
Zhou, Y., Wang, X., & Yuen, K. F. (2021). Sustainability disclosure for container shipping: A text-mining approach. Transport Policy, 110, 465-477. https://doi.org/10.1016/j.tranpol.2021.06.020