Large language models and their role in modern scientific discoveries
https://doi.org/10.17726/philIT.2024.1.3
Abstract
Large language models are now powerful informational and analytical tools that significantly accelerate most existing methods of information processing. Scientific information is especially important in this respect, as work with it increasingly draws on the capabilities of large language models. The combination of science with qualitatively new ways of working with information leads to new and varied scientific discoveries. Research is accelerating and takes less time to carry out; the time freed up can be spent both on solving new scientific problems and on scientific creativity, which may not lead to a specific solution to a particular problem but can demonstrate the beauty of science across disciplines. As a result, the interaction of large language models and scientific information is at once a search for solutions to scientific problems and a form of scientific creativity. Solving scientific problems requires efficient processing of big data, which is impossible without an effective method. One such significant method was the Transformer architecture, introduced in 2017 and comprehensively integrated into the GPT‑3 model, which, as of September 2020, was the largest and most advanced language model in the world. GPT‑3 can therefore be regarded as the basis of most scientific developments that use large language models. The interaction of science and large language models has raised many questions, among them: "Is the result of data analysis new knowledge?" and "What are the prospects for scientific creativity in the era of big computing?" These questions are currently extremely important because they make it possible to develop the foundations for effective human‑computer interaction.
This study therefore analyzes the questions presented.
About the Author
V. Yu. Filimonov
Russian Federation
Vladimir Yu. Filimonov - postgraduate student at the Institute of Top-Qualification Personnel Training, Pyatigorsk State University.
Pyatigorsk
Review
For citations:
Filimonov V.Yu. Large language models and their role in modern scientific discoveries. Philosophical Problems of IT & Cyberspace (PhilIT&C). 2024;(1):42-57. (In Russ.) https://doi.org/10.17726/philIT.2024.1.3