Machine learning and its use in the automatic extraction of metadata from academic articles
Keywords:
extraction, machine learning, metadata, reinforcement learning, semi-supervised learning, supervised learning, unsupervised learningAbstract
This article provides detailed information on machine learning processes, learning methods, supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, and the use of machine learning algorithms in the automatic extraction of metadata. Classification and regressive types of supervised learning, including decision trees, decision rules, Naive Bayes classifiers, Bayesian trust networks, nearest neighbor classifiers, linear discriminant functions, logistic regression, support vector machines, artificial neural networks, clustering and dimensionality reduction methods of unsupervised learning, semi-supervised learning, and reinforcement learning methods are also discussed.
Downloads
References
Al-Jarrah, O. Y., Yoo, P. D., Muhaidat, S., Karagiannidis, G. K., & Taha, K. (2015). Efficient machine learning for big data: A review. Big Data Research, 2(3), 87-93. https://doi.org/10.1016/j.bdr.2015.04.001
Anselmi, F., Leibo, J. Z., Rosasco, L., Mutch, J., Tacchetti, A., & Poggio, T. (2016). Unsupervised learning of invariant representations. Theoretical Computer Science, 633, 112-121. https://doi.org/10.1016/j.tcs.2015.06.048
Berrar, D. (2019). Bayes' Theorem and Naive Bayes Classifier.
Darlington, R. B., & Hayes, A. F. (2017). Regression analysis and linear models. New York, NY: Guilford, 603-611.
Day, M. Y., Tsai, R. T. H., Sung, C. L., Hsieh, C. C., Lee, C. W., Wu, S. H., ... & Hsu, W. L. (2007). Reference metadata extraction using a hierarchical knowledge representation framework. Decision Support Systems, 43(1), 152-167. https://doi.org/10.1016/j.dss.2006.08.006
Fukunaga, K. (2013). Introduction to statistical pattern recognition. Elsevier.
Gursoy, M. E., Inan, A., Nergiz, M. E., & Saygin, Y. (2017). Differentially private nearest neighbor classification. Data Mining and Knowledge Discovery, 31, 1544-1575.
Han, H., Giles, C. L., Manavoglu, E., Zha, H., Zhang, Z., & Fox, E. A. (2003). Automatic document metadata extraction using support vector machines. In 2003 Joint Conference on Digital Libraries, 2003. Proceedings. (pp. 37-48). IEEE.
Kononenko, I. (2007). Chapter 1-Introduction, Editor (s): Igor Kononenko, Matjaž Kukar. Machine Learning and Data Mining, Woodhead Publishing, 1-36.
Kubat, M., & Kubat, M. (2021). Artificial neural networks. An introduction to machine learning, 117-143.
Leng, Y., Xu, X., & Qi, G. (2013). Combining active learning and semi-supervised learning to construct SVM classifier. Knowledge-Based Systems, 44, 121-131. https://doi.org/10.1016/j.knosys.2013.01.032
Liu, L., & Özsu, M. T. (Eds.). (2009). Encyclopedia of database systems (Vol. 6). New York: Springer.
Mahesh, B. (2020). Machine learning algorithms-a review. International Journal of Science and Research (IJSR).[Internet], 9(1), 381-386.
Mair, C., Kadoda, G., Lefley, M., Phalp, K., Schofield, C., Shepperd, M., & Webster, S. (2000). An investigation of machine learning based prediction systems. Journal of systems and software, 53(1), 23-29. https://doi.org/10.1016/S0164-1212(00)00005-4
Mitchell, T. M. (1997). Does machine learning really work?. AI magazine, 18(3), 11-11.
O’Doherty, J. P., Lee, S. W., & McNamee, D. (2015). The structure of reinforcement-learning mechanisms in the human brain. Current Opinion in Behavioral Sciences, 1, 94-100. https://doi.org/10.1016/j.cobeha.2014.10.004
Safder, I., Hassan, S. U., Visvizi, A., Noraset, T., Nawaz, R., & Tuarob, S. (2020). Deep learning-based extraction of algorithmic metadata in full-text scholarly documents. Information processing & management, 57(6), 102269. https://doi.org/10.1016/j.ipm.2020.102269
Sharma, H., & Kumar, S. (2016). A survey on decision tree algorithms of classification in data mining. International Journal of Science and Research (IJSR), 5(4), 2094-2097..
Shimodaira, H. (2015). Discriminant functions. Learning and Data Note, 10.
Siswa, T. A. Y. (2020). The effectiveness of artificial intelligence on education: learning during the pandemic and in the future. International Journal of Engineering & Computer Science, 3(1), 24-30. https://doi.org/10.31295/ijecs.v3n1.195
Skluzacek, T. J., Kumar, R., Chard, R., Harrison, G., Beckman, P., Chard, K., & Foster, I. T. (2018). Skluma: An extensible metadata extraction pipeline for disorganized data. In 2018 IEEE 14th International Conference on e-Science (e-Science) (pp. 256-266). IEEE.
Suthaharan, S., & Suthaharan, S. (2016). Support vector machine. Machine learning models and algorithms for big data classification: thinking with examples for effective learning, 207-235.
Sutton, R. S., & Barto, A. G. (2018). Reinforcement learning: An introduction. MIT press.
Tiwari, A. (2022). Supervised learning: From theory to applications. In Artificial intelligence and machine learning for EDGE computing (pp. 23-32). Academic Press. https://doi.org/10.1016/B978-0-12-824054-0.00026-5
Tkaczyk, D., Szostek, P., Fedoryszak, M., Dendek, P. J., & Bolikowski, ?. (2015). CERMINE: automatic extraction of structured metadata from scientific literature. International Journal on Document Analysis and Recognition (IJDAR), 18, 317-335.
Tsai, C. F., Hsu, Y. F., Lin, C. Y., & Lin, W. Y. (2009). Intrusion detection by machine learning: A review. expert systems with applications, 36(10), 11994-12000. https://doi.org/10.1016/j.eswa.2009.05.029
Turgunbaev, R. (2021). Keysga asoslangan fikrlash va uni akademik metama’lumotlarni avtomatik ekstraksiya qilishda tadbiq qilinishi. Science and Education, 2(9), 129-144.
Turgunbaev, R. (2021a). Metadata in Data Search. In " ONLINE-CONFERENCES" PLATFORM (pp. 93-96).
Turgunbaev, R. (2021b). The role and importance of metadata in information retrieval. Science and Education, 2(8), 353-359.
Turgunbaev, R. (2021c). Metadata: characteristics, types and standards. Science and Education, 2(5), 167-175.
Vieira, S., Pinaya, W. H. L., & Mechelli, A. (2020). Introduction to machine learning. In Machine learning (pp. 1-20). Academic Press. https://doi.org/10.1016/B978-0-12-815739-8.00001-8
Zhou, X., & Belkin, M. (2014). Semi-supervised learning. In Academic press library in signal processing (Vol. 1, pp. 1239-1269). Elsevier. https://doi.org/10.1016/B978-0-12-396502-8.00022-X
Published
How to Cite
Issue
Section
Copyright (c) 2024 International journal of engineering & computer science
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Articles published in the International Journal of Engineering & Computer Science (IJECS) are available under Creative Commons Attribution Non-Commercial No Derivatives Licence (CC BY-NC-ND 4.0). Authors retain copyright in their work and grant IJECS right of first publication under CC BY-NC-ND 4.0. Users have the right to read, download, copy, distribute, print, search, or link to the full texts of articles in this journal, and to use them for any other lawful purpose.
Articles published in IJECS can be copied, communicated and shared in their published form for non-commercial purposes provided full attribution is given to the author and the journal. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.