Big Data Analytics Using Artificial Intelligence: Apache Spark For Scalable Batch Processing
Abstract:- The rapid proliferation of data in the digital age has made big data analytics a critical tool for deriving insights and making informed decisions. However, processing and analyzing large datasets, often reaching hundreds of terabytes, presents significant challenges. This paper explores the use of Apache Spark, a powerful distributed computing framework, for batch processing in big data analytics using artificial intelligence (AI) techniques. We evaluate the scalability, efficiency, and accuracy of AI models when applied to massive datasets processed in Spark. Our experiments demonstrate that Apache Spark, coupled with machine learning and deep learning techniques, offers a robust solution for handling large-scale data analytics tasks. We also discuss the challenges associated with such large-scale processing and propose strategies for optimizing performance and resource utilization.
I. INTRODUCTION

As the world becomes increasingly data-driven, the ability to process and analyze vast amounts of data has become crucial for businesses and researchers alike. Big data analytics enables the extraction of valuable insights from datasets that are too large, complex, or fast-changing for traditional data-processing software to handle. The advent of distributed computing frameworks like Apache Spark has revolutionized the field, offering the scalability and processing power required to manage these large datasets effectively.

Artificial Intelligence (AI) has become an indispensable tool in big data analytics, providing advanced techniques for data mining, pattern recognition, predictive analytics, and more. However, applying AI to big data, particularly when dealing with hundreds of terabytes of information, presents unique challenges, including data preprocessing, model training, and resource management.

This paper investigates the integration of AI techniques with Apache Spark for batch processing of big data. We focus on the challenges of processing large-scale datasets, evaluate the performance of AI models in this context, and suggest optimizations to improve efficiency and scalability.
II. METHODOLOGY

Data Description
The dataset used in this research consists of several hundred terabytes of log data from a global e-commerce platform, encompassing transaction records, user behavior analytics, and clickstream data. The dataset is stored in a Hadoop-compatible distributed file system, such as HDFS or Amazon S3.

Apache Spark for Batch Processing
Apache Spark was chosen for its ability to handle large-scale batch processing with high efficiency. The data was preprocessed using Spark's RDD and DataFrame APIs, which allowed for efficient manipulation and transformation of the data. Data transformation steps converted the raw records into formats suitable for analysis (e.g., VectorAssembler for feature engineering), as sketched below.
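To make the preprocessing step concrete, the following is a minimal PySpark sketch. The storage path, file format, and column names (user_id, amount, session_length, clicks) are illustrative assumptions, since the paper does not specify the schema of the e-commerce logs.

    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F
    from pyspark.ml.feature import VectorAssembler

    spark = SparkSession.builder.appName("batch-preprocessing").getOrCreate()

    # Read raw logs from a Hadoop-compatible store (HDFS shown; path is assumed).
    logs = spark.read.parquet("hdfs:///data/ecommerce/transactions/")

    # Basic cleaning with the DataFrame API.
    clean = (logs
             .dropna(subset=["user_id", "amount"])          # drop incomplete records
             .withColumn("amount", F.col("amount").cast("double"))
             .filter(F.col("amount") > 0))

    # Feature engineering: assemble numeric columns into one feature vector.
    assembler = VectorAssembler(
        inputCols=["amount", "session_length", "clicks"],   # assumed features
        outputCol="features")
    features = assembler.transform(clean)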
AI Techniques
We implemented a range of AI models, including:

Random Forest: For classification and regression tasks, particularly in predicting customer behavior.
K-Means Clustering: Used for customer segmentation based on transaction patterns.
Convolutional Neural Network (CNN): Applied to image data (see Section IV).

These models were trained on subsets of the data, leveraging Spark's MLlib and deep learning libraries, such as TensorFlow integrated with Spark; a sketch of the two MLlib models follows below.
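The sketch below shows how the two MLlib models above could be trained on the feature vectors assembled in the preprocessing sketch. The label column ("churned"), the train/test split, and the number of trees and clusters are illustrative assumptions, not values reported in the paper.

    from pyspark.ml.classification import RandomForestClassifier
    from pyspark.ml.clustering import KMeans

    # Random forest for customer-behavior prediction ("churned" label assumed).
    train, test = features.randomSplit([0.8, 0.2], seed=42)
    rf = RandomForestClassifier(labelCol="churned", featuresCol="features",
                                numTrees=100)
    predictions = rf.fit(train).transform(test)

    # K-means for customer segmentation on the same feature vectors;
    # k=5 is an illustrative choice.
    kmeans = KMeans(featuresCol="features", k=5, seed=42)
    segments = kmeans.fit(features).transform(features)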
III. EXPERIMENTAL SETUP

The experiments were conducted on a distributed cluster comprising 50 nodes, each equipped with 512 GB of RAM and 32 cores. The models were evaluated on metrics such as accuracy, processing time, and resource utilization. We also experimented with different configurations of Spark's in-memory processing to identify the optimal settings for large-scale data processing.
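The configuration values found to be optimal are not reported in the paper; as a hedged illustration only, tuning Spark's in-memory processing on nodes of this size might involve settings along these lines:

    from pyspark.sql import SparkSession

    # Illustrative values, sized loosely against 512 GB / 32-core nodes;
    # not the settings the authors actually selected.
    spark = (SparkSession.builder
             .appName("large-scale-batch")
             .config("spark.executor.memory", "400g")          # leave OS headroom per node
             .config("spark.executor.cores", "32")
             .config("spark.memory.fraction", "0.6")           # execution/storage memory share
             .config("spark.sql.shuffle.partitions", "2000")   # wide shuffles across 50 nodes
             .getOrCreate())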
IV. DISCUSSION

Performance Analysis
Our results indicate that Apache Spark is capable of processing several hundred terabytes of data within a reasonable timeframe, making it a suitable choice for batch processing in big data environments. The random forest model achieved an accuracy of 85% in predicting customer churn, while the CNN model performed exceptionally well with image data, reaching an accuracy of 92%.
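As a sketch of how such accuracy figures can be computed in Spark, assuming the prediction DataFrame and column names from the Section II examples:

    from pyspark.ml.evaluation import MulticlassClassificationEvaluator

    # Accuracy over the held-out test set (column names are assumptions).
    evaluator = MulticlassClassificationEvaluator(
        labelCol="churned", predictionCol="prediction", metricName="accuracy")
    print(f"Random forest churn accuracy: {evaluator.evaluate(predictions):.2%}")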
V. CONCLUSION