Python: Empowering Data Science Applications and Research

Authors

  • Mritunjay Ranjan Assistant Professor, School of Computer Science and Engineering, Sandip University, Nashik, Maharashtra, India
  • Krishna Barot Student, School of Computer Science and Engineering, Sandip University Nashik, Maharashtra, India
  • Vaishnavi Khairnar Student, School of Computer Science and Engineering, Sandip University Nashik, Maharashtra, India
  • Vaishnavi Rawal Student, School of Computer Science and Engineering, Sandip University Nashik, Maharashtra, India
  • Anujaa Pimpalgaonkar Assistant Professor, School of Computer Science and Engineering, Sandip University, Nashik, Maharashtra, India
  • Shilpi Saxena Assistant Professor, School of Computer Science and Engineering, Sandip University, Nashik, Maharashtra, India
  • Arif Md. Sattar Assistant Professor, Department of Computer Science and Information Technology, Anugrah Memorial College, Gaya, Bihar, India

DOI:

https://doi.org/10.37591/joosdt.v10i1.576

Keywords:

Data Science (DS), Machine Learning (ML), IDE, GUI, Python, data analytics, artificial intelligence, Programming Language

Abstract

It was in 1991 that the first mention of Python came to light. Since it is one of the best programming languages, it is widely used in the data analytics industry. It is speedy, easy to use, and can smoothly alter data with no hiccups. It aids in the processes of data analytics generally, such as data collection, analysis, modelling, and visualization. Python stands out from other languages because it provides all the necessary tools for developing cutting-edge applications and machine learning quickly and easily. The fields of data science and analytics have benefited greatly from recent advances in computing power, the capacity to store vast volumes of data, and knowledge of relevant methodologies in areas such as Data Analytics, Artificial Intelligence, Machine Learning, etc. The goal of this study is to have a discussion on the. All the fundamental resources for developing Python applications may be found in a single place with the help of an IDE (Integrated Development Environment). Integrated development environments (IDEs) are research-oriented software packages that bring together a familiar Graphical User Interface (GUI) with the most widely used programming languages and tools. It often consists of a source code editor for writing programs and local build automation for creating a local version of the software, like compiling computer source code. Additionally, it enables you to duplicate the programs for use on your personal computer. These editors may be used by developers and software engineers to design apps for personal computers or the web; by DevOps engineers to carry out continuous integration; and to aid in job automation and boost the software engineer's productivity and expertise. Some of the many great advantages of adopting a top-tier Python integrated development environment are listed below. Build automation, code linking, testing, and debugging are just some of the features available in Python IDEs that may help you save time. This research will begin with an overview of the Python Data Science Library before moving on to a discussion of data science's many practical applications. In addition, this research provides a comprehensive evaluation of Python IDEs used in the fields of data science and research.

References

Cabo C. Effectiveness of flowcharting as a scaffolding tool to learn python. In 2018 IEEE Frontiers in Education Conference (FIE). 2018 Oct 3; 1–7.

Vadlamani A, Kalicheti R, Chimalakonda S. API Scanner - Towards Automated Detection of Deprecated APIs in Python Libraries. 2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion), Madrid, ES. 2021; 5–8. doi: 10.1109/ICSE-Companion52605.2021.00022.

Loulergue F, Philippe J. New List Skeletons for the Python Skeleton Library. 2019 20th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), Gold Coast, QLD, Australia. 2019; 392–397. doi: 10.1109/PDCAT46702.2019.00077.

Ponnapalli VS, sai Manish AV, Ramu P, Sudhiksha S, Greeshma M. Array Factor Code Development of Fractal Array Antenna using Python: A Mini-Study on Free and Open Source Software for Antennas. In 2021 IEEE International Conference on Recent Trends on Electronics, Information, Communication & Technology (RTEICT). 2021 Aug 27; 448–451.

Li Y, Schwiebert L. Boosting Python performance on Intel Processors: A case study of optimizing music recognition. In 2016 IEEE 6th Workshop on Python for High-Performance and Scientific Computing (PyHPC). 2016 Nov 14; 52–58.

Hoving R, Slot G, Jansen S. Python: Characteristics identification of a free open source software ecosystem. 2013 7th IEEE International Conference on Digital Ecosystems and Technologies (DEST), Menlo Park, CA, USA. 2013; 13–18. doi: 10.1109/DEST.2013.6611322.

Guillermo M, et al. Graph Database-modelled Public Transportation Data for Geographic Insight Web Application. 2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), Taichung, Taiwan. 2022; 2–7. doi: 10.1109/SNPD54884.2022.10051802.

Foyer C, Conejero J, Ejarque J, Badia RM, Tate A, McIntosh-Smith S. Enabling System Wide Shared Memory for Performance Improvement in PyCOMPSs Applications. 2020 IEEE/ACM 9th Workshop on Python for High-Performance and Scientific Computing (PyHPC), GA, USA. 2020; 22–31. doi: 10.1109/PyHPC51966.2020.00008.

Elewah A, Badawi AA, Khalil H, Rahnamayan S, Elgazzar K. 3D-RadViz: Three Dimensional Radial Visualization for Large-Scale Data Visualization. 2021 IEEE Congress on Evolutionary Computation (CEC), Kraków, Poland. 2021; 1037–1046. doi: 10.1109/CEC45853.2021.9504983.

van Oort B, Cruz L, Aniche M, van Deursen A. The Prevalence of Code Smells in Machine Learning projects. 2021 IEEE/ACM 1st Workshop on AI Engineering - Software Engineering for AI (WAIN), Madrid, Spain. 2021; 1–8. doi: 10.1109/WAIN52551.2021.00011.

Serrano-Muñoz A, Elguea-Aguinaco Í, Chrysostomou D, BØgh S, Arana-Arexolaleiba N. A Scalable and Unified Multi-Control Framework for KUKA LBR iiwa Collaborative Robots. 2023 IEEE/SICE International Symposium on System Integration (SII), Atlanta, GA, USA. 2023; 1–5. doi: 10.1109/SII55687.2023.10039308.

Pinard A, Hammerling DM, Baker AH. Assessing Differences in Large Spatio-temporal Climate Datasets with a New Python package. 2020 IEEE International Conference on Big Data (Big Data), Atlanta, GA, USA. 2020; 2699–2707. doi: 10.1109/BigData50022.2020.9378100.

Loulergue F, Philippe J. New List Skeletons for the Python Skeleton Library. 2019 20th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), Gold Coast, QLD, Australia. 2019; 392–397. doi: 10.1109/PDCAT46702.2019.00077.

Tohid R, et al. Asynchronous Execution of Python Code on Task-Based Runtime Systems. 2018 IEEE/ACM 4th International Workshop on Extreme Scale Programming Models and Middleware (ESPM2), Dallas, TX, USA. 2018; 37–45. doi: 10.1109/ESPM2.2018.00009.

Ibba P, et al. Fruit Meter: An AD5933-Based Portable Impedance Analyzer for Fruit Quality Characterization. 2020 IEEE International Symposium on Circuits and Systems (ISCAS), Seville, Spain. 2020; 1–5. doi: 10.1109/ISCAS45731.2020.9181287.

Stančin I, Jović A. An overview and comparison of free Python libraries for data mining and big data analysis. 2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), Opatija, Croatia. 2019; 977–982. doi: 10.23919/MIPRO.2019.8757088.

Singh G, Gupta I, Singh J, Kaur N. Face Recognition using Open Source Computer Vision Library (OpenCV) with Python. 2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, India. 2022; 1–6. doi: 10.1109/ICRITO56286.2022.9964836.

Nance D, Tomov S, Wong K. A Python Library for Matrix Algebra on GPU and Multicore Architectures. 2022 IEEE 19th International Conference on Mobile Ad Hoc and Smart Systems (MASS), Denver, CO, USA. 2022; 770–775. doi: 10.1109/MASS56207.2022.00121.

Mucesh S, et al. A machine learning approach to galaxy properties: joint redshift–stellar mass probability distributions with Random Forest. Mon Notices Royal Astron Soc. 2021 Jan; 502(2): 2770–2786. doi: 10.1093/mnras/stab164.

Ortigoza Capetillo GM, Lorandi Medina AP, Neri I. Modeling and simulating urban afforestation: an alternative for urban climate change mitigation. 2020 IEEE International Conference on Engineering Veracruz (ICEV), Boca del Rio, Mexico. 2020; 1–4. doi: 10.1109/ICEV50249.2020.9289691.

Qin M. Machine Translation Technology Based on Natural Language Processing. 2022 European Conference on Natural Language Processing and Information Retrieval (ECNLPIR), Hangzhou, China. 2022; 10–13. doi: 10.1109/ECNLPIR57021.2022.00014

López CD, Cvetković M, Palensky P. Enhancing Power Factory Dynamic Models with Python for Rapid Prototyping. 2019 IEEE 28th International Symposium on Industrial Electronics (ISIE), Vancouver, BC, Canada. 2019; 93–99. doi: 10.1109/ISIE.2019.8781432.

Published

2023-06-22

How to Cite

Ranjan, M., Krishna Barot, Vaishnavi Khairnar, Vaishnavi Rawal, Anujaa Pimpalgaonkar, Shilpi Saxena, & Arif Md. Sattar. (2023). Python: Empowering Data Science Applications and Research. JOURNAL OF OPERATING SYSTEMS DEVELOPMENT &Amp; TRENDS, 10(1), 27–33. https://doi.org/10.37591/joosdt.v10i1.576