Machine learning and its use in the automatic extraction of metadata from academic articles

https://doi.org/10.21744/ijecs.v7n1.1782

Authors

  • Rashid Turgunbaev Tashkent state university of information technologies, Uzbekistan

Keywords:

extraction, machine learning, metadata, reinforcement learning, semi-supervised learning, supervised learning, unsupervised learning

Abstract

This article provides detailed information on machine learning processes, learning methods, supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, and the use of machine learning algorithms in the automatic extraction of metadata. Classification and regressive types of supervised learning, including decision trees, decision rules, Naive Bayes classifiers, Bayesian trust networks, nearest neighbor classifiers, linear discriminant functions, logistic regression, support vector machines, artificial neural networks, clustering and dimensionality reduction methods of unsupervised learning, semi-supervised learning, and reinforcement learning methods are also discussed.

Downloads

Download data is not yet available.

References

Al-Jarrah, O. Y., Yoo, P. D., Muhaidat, S., Karagiannidis, G. K., & Taha, K. (2015). Efficient machine learning for big data: A review. Big Data Research, 2(3), 87-93. https://doi.org/10.1016/j.bdr.2015.04.001

Anselmi, F., Leibo, J. Z., Rosasco, L., Mutch, J., Tacchetti, A., & Poggio, T. (2016). Unsupervised learning of invariant representations. Theoretical Computer Science, 633, 112-121. https://doi.org/10.1016/j.tcs.2015.06.048

Berrar, D. (2019). Bayes' Theorem and Naive Bayes Classifier.

Darlington, R. B., & Hayes, A. F. (2017). Regression analysis and linear models. New York, NY: Guilford, 603-611.

Day, M. Y., Tsai, R. T. H., Sung, C. L., Hsieh, C. C., Lee, C. W., Wu, S. H., ... & Hsu, W. L. (2007). Reference metadata extraction using a hierarchical knowledge representation framework. Decision Support Systems, 43(1), 152-167. https://doi.org/10.1016/j.dss.2006.08.006

Fukunaga, K. (2013). Introduction to statistical pattern recognition. Elsevier.

Gursoy, M. E., Inan, A., Nergiz, M. E., & Saygin, Y. (2017). Differentially private nearest neighbor classification. Data Mining and Knowledge Discovery, 31, 1544-1575.

Han, H., Giles, C. L., Manavoglu, E., Zha, H., Zhang, Z., & Fox, E. A. (2003). Automatic document metadata extraction using support vector machines. In 2003 Joint Conference on Digital Libraries, 2003. Proceedings. (pp. 37-48). IEEE.

Kononenko, I. (2007). Chapter 1-Introduction, Editor (s): Igor Kononenko, Matjaž Kukar. Machine Learning and Data Mining, Woodhead Publishing, 1-36.

Kubat, M., & Kubat, M. (2021). Artificial neural networks. An introduction to machine learning, 117-143.

Leng, Y., Xu, X., & Qi, G. (2013). Combining active learning and semi-supervised learning to construct SVM classifier. Knowledge-Based Systems, 44, 121-131. https://doi.org/10.1016/j.knosys.2013.01.032

Liu, L., & Özsu, M. T. (Eds.). (2009). Encyclopedia of database systems (Vol. 6). New York: Springer.

Mahesh, B. (2020). Machine learning algorithms-a review. International Journal of Science and Research (IJSR).[Internet], 9(1), 381-386.

Mair, C., Kadoda, G., Lefley, M., Phalp, K., Schofield, C., Shepperd, M., & Webster, S. (2000). An investigation of machine learning based prediction systems. Journal of systems and software, 53(1), 23-29. https://doi.org/10.1016/S0164-1212(00)00005-4

Mitchell, T. M. (1997). Does machine learning really work?. AI magazine, 18(3), 11-11.

O’Doherty, J. P., Lee, S. W., & McNamee, D. (2015). The structure of reinforcement-learning mechanisms in the human brain. Current Opinion in Behavioral Sciences, 1, 94-100. https://doi.org/10.1016/j.cobeha.2014.10.004

Safder, I., Hassan, S. U., Visvizi, A., Noraset, T., Nawaz, R., & Tuarob, S. (2020). Deep learning-based extraction of algorithmic metadata in full-text scholarly documents. Information processing & management, 57(6), 102269. https://doi.org/10.1016/j.ipm.2020.102269

Sharma, H., & Kumar, S. (2016). A survey on decision tree algorithms of classification in data mining. International Journal of Science and Research (IJSR), 5(4), 2094-2097..

Shimodaira, H. (2015). Discriminant functions. Learning and Data Note, 10.

Siswa, T. A. Y. (2020). The effectiveness of artificial intelligence on education: learning during the pandemic and in the future. International Journal of Engineering & Computer Science, 3(1), 24-30. https://doi.org/10.31295/ijecs.v3n1.195

Skluzacek, T. J., Kumar, R., Chard, R., Harrison, G., Beckman, P., Chard, K., & Foster, I. T. (2018). Skluma: An extensible metadata extraction pipeline for disorganized data. In 2018 IEEE 14th International Conference on e-Science (e-Science) (pp. 256-266). IEEE.

Suthaharan, S., & Suthaharan, S. (2016). Support vector machine. Machine learning models and algorithms for big data classification: thinking with examples for effective learning, 207-235.

Sutton, R. S., & Barto, A. G. (2018). Reinforcement learning: An introduction. MIT press.

Tiwari, A. (2022). Supervised learning: From theory to applications. In Artificial intelligence and machine learning for EDGE computing (pp. 23-32). Academic Press. https://doi.org/10.1016/B978-0-12-824054-0.00026-5

Tkaczyk, D., Szostek, P., Fedoryszak, M., Dendek, P. J., & Bolikowski, ?. (2015). CERMINE: automatic extraction of structured metadata from scientific literature. International Journal on Document Analysis and Recognition (IJDAR), 18, 317-335.

Tsai, C. F., Hsu, Y. F., Lin, C. Y., & Lin, W. Y. (2009). Intrusion detection by machine learning: A review. expert systems with applications, 36(10), 11994-12000. https://doi.org/10.1016/j.eswa.2009.05.029

Turgunbaev, R. (2021). Keysga asoslangan fikrlash va uni akademik metama’lumotlarni avtomatik ekstraksiya qilishda tadbiq qilinishi. Science and Education, 2(9), 129-144.

Turgunbaev, R. (2021a). Metadata in Data Search. In " ONLINE-CONFERENCES" PLATFORM (pp. 93-96).

Turgunbaev, R. (2021b). The role and importance of metadata in information retrieval. Science and Education, 2(8), 353-359.

Turgunbaev, R. (2021c). Metadata: characteristics, types and standards. Science and Education, 2(5), 167-175.

Vieira, S., Pinaya, W. H. L., & Mechelli, A. (2020). Introduction to machine learning. In Machine learning (pp. 1-20). Academic Press. https://doi.org/10.1016/B978-0-12-815739-8.00001-8

Zhou, X., & Belkin, M. (2014). Semi-supervised learning. In Academic press library in signal processing (Vol. 1, pp. 1239-1269). Elsevier. https://doi.org/10.1016/B978-0-12-396502-8.00022-X

Published

2024-03-28

How to Cite

Turgunbaev, R. (2024). Machine learning and its use in the automatic extraction of metadata from academic articles. International Journal of Engineering & Computer Science, 7(1), 1-7. https://doi.org/10.21744/ijecs.v7n1.1782