HIERARCHICAL TEXT CLASSIFICATION FOR WEB OF SCIENCE SCIENTIFIC FIELDS

Pouyan Jahani Rad, Mahdi Bahaghighat

DOI Number
https://doi.org/10.2298/FUEE2404703J
First page
703
Last page
732

Abstract


This research focuses on making an efficient text classifier to map given corpora to specific scientific fields. Our study is set on the classification of different scientific fields according to the categories of the Web of Science (WOS). We design and develop various deep learning architectures such as Convolutional Neural Network (CNN), Deep Neural Network (DNN), and Recurrent Neural Network (RNN) at both the parent and child levels. To make our model perform better, we effectively use Hyperband Tuning. We aim to construct a precise hierarchical text classifier for lower levels, and smaller general model sizes. The evaluation employs a special metric known as the hierarchical confusion matrix. Results based on a broad investigation of Word Embedding, Document Embedding, and Hyperband Tuning show that the hierarchical combination of CNN and DNN in parent-child levels can achieve greater accuracy. Our model scored genuinely well, with an F1 score of 94.29% and an accuracy of 99.33%. Although using an RNN at the parent level and another at the child level led to lower accuracy, it effectively reduced the overall model size. We also conducted a comprehensive evaluation of various model architectures using the AoI2WoS dataset. By incorporating Google News word embeddings, we tested different combinations of RNN-DNN and RNN-RNN models on the AoI2WoS dataset. The RNN-DNN model yielded the best results, achieving an accuracy of 98.71% and an F1-score of 91.87%. These findings not only push forward the development of hierarchical text classification but also provide potent tools for utilizing research in scientometrics, and bibliometric researches.


Keywords

Hierarchical text classification, deep learning, scientometrics, Google Scholar, WOS

Full Text:

PDF

References


S. Hasani, M. Bahaghighat and M. Mirfatahia, "The mediating effect of the brand on the relationship between social network marketing and consumer behavior", Acta Technica Napocensis, vol. 60, no. 2, pp. 1-6, 2019.

E. Amouee, M. Mohammadi Zanjireh, M. Bahaghighat, and M. Ghorbani, "A new anomalous text detection approach using unsupervised methods", FU Elec. Energ., vol. 33, pp. 631-653, 2020.

M. Ghorbani, M. Bahaghighat, Q. Xin and F. Özen, "ConvLSTMConv network: a deep learning approach for sentiment analysis in cloud computing," J. Cloud Comput., vol. 9, no. 1, p. 16, 2020.

A. Shamseen, M. Mohammadi Zanjireh, M. Bahaghighat and Q. Xin, "Developing a parallel classifier for mining in big data sets", IIUM Eng. J., vol. 22, no. 2, pp. 119-134, 07/04 2021.

M. T. Sereshki, M. M. Zanjireh and M. Bahaghighat, "Textual outlier detection with an unsupervised method using text similarity and density peak", Acta Universitatis Sapientiae, Informatica, vol. 15, no. 1, pp. 91-110, 2023.

N. Hussain, H. Turab Mirza, G. Rasool, I. Hussain and M. Kaleem, "Spam review detection techniques: A systematic literature review", Appl. Sci., vol. 9, no. 5, p. 987, 2019.

R. A. Stein, P. A. Jaques and J. F. Valiati, "An analysis of hierarchical text classification using word embeddings", Inform. Sci., vol. 471, pp. 216-232, 2019.

F. Zhao, Z. Wu, L. He and X.-Y. Dai, "Label-correction capsule network for hierarchical text classification", IEEE/ACM Trans. Audio Speech Lang. Proc., vol. 31, pp. 2158-2168, 2023.

O. Birgin and K. Özkan, "Comparing the concept images and hierarchical classification skills of students at different educational levels regarding parallelograms: a cross-sectional study", Int. J. Math. Educ. Sci. Technol., vol. 55, no. 4, pp. 850-882, 2024.

A. Fernández, S. del Río, A. Bawakid and F. Herrera, "Fuzzy rule based classification systems for big data with MapReduce: granularity analysis", Adv. Data Anal. Classif., vol. 11, pp. 711-730, 2017.

A. Aldoseri, K. Al-Khalifa and A. Hamouda, "A roadmap for integrating automation with process optimization for AI-powered digital transformation", Preprints 2023, p. 2023101055 2023.

M. Kappi and B. Mallikarjuna, "Artificial intelligence and machine learning for disaster prediction: a scientometric analysis of highly cited papers", Natural Hazards, pp. 1-21, 2024.

F. Gargiulo, S. Silvestri, M. Ciampi and G. De Pietro, "Deep neural network for hierarchical extreme multi-label text classification," Appl. Soft. Comput., vol. 79, pp. 125-138, 2019.

Z. Rahimi and M. M. Homayounpour, "Tens-embedding: a tensor-based document embedding method," Expert Syst. Appl., vol. 162, p. 113770, 2020.

R. Miikkulainen et al., "Evolving deep neural networks," in Artificial intelligence in the age of neural networks and brain computing: Elsevier, 2024, pp. 269-287.

P. Cavalin and L. Oliveira, "Confusion matrix-based building of hierarchical classification", in Proceedings of the Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications: 23rd Iberoamerican Congress, CIARP 2018, Madrid, Spain, November 19-22, 2018, pp. 271-278.

L. Li, K. G. Jamieson, G. DeSalvo, A. Rostamizadeh and A. Talwalkar, "Hyperband: Bandit-Based Configuration Evaluation for Hyperparameter Optimization," in Proceedings of the ICLR (Poster), 2017, p. 53.

S. Minaee, N. Kalchbrenner, E. Cambria, N. Nikzad, M. Chenaghlu and J. Gao, "Deep learning--based text classification: a comprehensive review", ACM Comput. Surveys (CSUR), vol. 54, no. 3, pp. 1-40, 2021.

J. Zhang, Y. Li, F. Shen, Y. He, H. Tan and Y. He, "Hierarchical text classification with multi-label contrastive learning and KNN", Neurocomputing, vol. 577, p. 127323, 2024.

K. Kowsari, D. E. Brown, M. Heidarysafa, K. J. Meimandi, M. S. Gerber and L. E. Barnes, "HDLTex: Hierarchical Deep Learning for Text Classification", in Proceedings of the 16th IEEE International Conference on Machine Learning and Applications (ICMLA), 2017, pp. 364-371.

N. Jindal and B. Liu, "Review spam detection," in Proceedings of the 16th International Conference on World Wide Web, 2007, pp. 1189-1190.

N. C. Dang, M. N. Moreno-García and F. De la Prieta, "Sentiment analysis based on deep learning: A comparative study," Electronics, vol. 9, no. 3, p. 483, 2020.

M. Bahaghighat and P. Jahani Rad, "AoI2WoS: Mapping Area of Interest in Google Scholar Profile to Web Of Science (WoS) Scientific Fields Categories", Mendeley Data, vol. V1, 2024.

K. Srinidhi, A. Harika, S. S. Voodarla and J. Abhiram, "Sentiment Analysis of Social Media Comments using Natural Language Procesing", in Proceedings of International Conference on Inventive Computation Technologies (ICICT), 2024, pp. 762-768.

L. Dai, J. Mao, L. Xu, X. Fan and X. Zhou, "SecNLP: An NLP classification model watermarking framework based on multi-task learning", Comput. Speech Lang., vol. 86, p. 101606, 2024.

Q. Li, S. Jing and L. Chen, "Hierarchical Knowledge Distillation on Text Graph for Data-limited Attribute Inference", arXiv preprint, no: 2401.06802, 2024.

C. Montenegro, R. Santana and J. A. Lozano, "Introducing multi-dimensional hierarchical classification: Characterization, solving strategies and performance measures," Neurocomputing, vol. 533, pp. 141-160, 2023.

R. Cerri, R. C. Barros and A. C. de Carvalho, "Hierarchical multi-label classification for protein function prediction: A local approach based on neural networks," in Proceedings of the 11th IEEE International Conference on Intelligent Systems Design and Applications, pp. 337-343.

J. N. Hernandez, L. E. Sucar and E. F. Morales, "A hybrid global-local approach for hierarchical classification," in Proceedings of the 26th Florida Artificial Intelligence Research Society Conference (FLAIRS), 2013, pp. 432-437.

A. Rahman, V. Srikumar and A. D. Smith, "Predicting electricity consumption for commercial and residential buildings using deep recurrent neural networks", Appl. Energy, vol. 212, pp. 372-385, 2018.

M. Zulqarnain, R. Ghazali, M. G. Ghouse and M. F. Mushtaq, "Efficient processing of GRU based on word embedding for text classification" JOIV: Int. J. Inform. Vis., vol. 3, no. 4, pp. 377-383, 2019.

S. Wang, M. Huang and Z. Deng, "Densely connected CNN with multi-scale feature attention for text classification", in Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI), 2018, vol. 18, pp. 4468-4474.

H. Wang, J. He, X. Zhang and S. Liu, "A short text classification method based on N‐gram and CNN", Chin. J. Electron., vol. 29, no. 2, pp. 248-254, 2020.

R. Romero, P. Celard, J. M. Sorribes-Fdez, A. Seara Vieira, E. L. Iglesias and L. Borrajo, "MobyDeep: A lightweight CNN architecture to configure models for text classification," Knowledge-Based Syst., vol. 257, p. 109914, 2022.

I. Goodfellow, Y. Bengio and A. Courville, Deep learning. MIT press, 2016.

J. Huang, J. Zheng, S. Gao, W. Liu and J. Lin, "Grid text classification method based on DNN neural network," in Proceedings of the International Conference on Computer Science Communication and Network Security (CSCNS2019), MATEC Web of Conferences, 2020, vol. 309, p. 03016.

A. Kosmopoulos, I. Partalas, E. Gaussier, G. Paliouras and I. Androutsopoulos, "Evaluation measures for hierarchical classification: a unified view and novel approaches", Data Min. Knowl. Discov., vol. 29, pp. 820-865, 2015.

F. Brucker, F. Benites and E. Sapozhnikova, "An empirical comparison of flat and hierarchical performance measures for multi-label classification with hierarchy extraction", in Proceedings of the Knowledge-Based and Intelligent Information and Engineering Systems: 15th International Conference, KES 2011, Kaiserslautern, Germany, September 12-14, 2011, Proceedings, Part I 15, Springer, 2011, pp. 579-589.

E. Hindocha, V. Yazhiny, A. Arunkumar and P. Boobalan, "Short-text Semantic Similarity using GloVe word embedding", Int. Res. J. Eng. Technol., vol. 6, no. 4, 2019.

D. Karani, "Introduction to word embedding and word2vec", Towards Data Science, vol. 1, 2018.

I. N. Khasanah, "Sentiment Classification Using fastText Embedding and Deep Learning Model", Procedia Comput. Sci., vol. 189, pp. 343-350, 2021.

C. N. Mohammed and A. M. Ahmed, "A semantic-based model with a hybrid feature engineering process for accurate spam detection", J. Electr. Syst. Inform. Technol., vol. 11, no. 1, p. 26, 2024.

M. Khuntia and D. Gupta, "Indian News Headlines Classification using Word Embedding Techniques and LSTM Model", Procedia Comput. Sci., vol. 218, pp. 899-907, 2023.

J. Wang, W. Feng, Y. Chen, H. Yu, M. Huang and P. S. Yu, "Visual domain adaptation with manifold embedded distribution alignment", In Proceedings of the 26th ACM International Conference on Multimedia, 2018, pp. 402-410.

Q. Le and T. Mikolov, "Distributed representations of sentences and documents," in Proceedings of the 31st International Conference on Machine Learning, 2014, pp. 1188-1196.

G. Balikas and M.-R. Amini, "An empirical study on large scale text classification with skip-gram embeddings," arXiv e-prints, p. 1606.06623, 2016.

M. Khodak, T. Li, L. Li, M. Balcan, V. Smith and A. Talwalkar, "Weight sharing for hyperparameter optimization in federated learning", In Proceedings of the International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with ICML, 2020, vol. 2020.

Y. Qian, Y. Liu and Q. Z. Sheng, "Understanding hierarchical structural evolution in a scientific discipline: A case study of artificial intelligence", J. Informetrics, vol. 14, no. 3, p. 101047, 2020.

H. Teng, N. Wang, H. Zhao, Y. Hu and H. Jin, "Enhancing semantic text similarity with functional semantic knowledge (FOP) in patents", J. Informetrics, vol. 18, no. 1, p. 101467, 2024.


Refbacks

  • There are currently no refbacks.


ISSN: 0353-3670 (Print)

ISSN: 2217-5997 (Online)

COBISS.SR-ID 12826626