Group 37

1. The document describes a proposed system for unstructured data mining using open source tools like PostgreSQL, Hadoop, R and Jaspersoft. 2. The system aims to process unstructured data inputs like PDFs and images into structured data for analysis and visualization to help decision making. 3. Work done so far includes installation of tools, integration of R with PostgreSQL and Hadoop for processing and analyzing unstructured data, and using Talend for converting a PDF to text.

Uploaded by

Pooja Ban

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

105 views28 pages

Group 37

Uploaded by

Pooja Ban

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 28

WELCOME !!!!

Unstructured Data Mining

Using Open Source Stack

Jagruti Wagh B120204420

Ashvini Dukare B120204252
Jidnyasa Gondane B120204260

Guided By –
Neeta Maitre
Sponsorship and External
Guide

Principal Global Services Pvt

Ltd.

Guided by:
Mr.Ajit Aher
Miss.Deepa Nagaliker
Mr.Sunil Chawla
Problem Definition
Reading and transformation of unstructured
data inputs into semi-structured or
structured form.
Using a data warehouse and visualisation
tools for storing and displaying of the
processed data.
Processing of the data will be done by
analytic engine, which will help to get
insights for better managerial and strategic
decision making.
Displaying the processed data in dashboard
using various visualization tools.
Motivation
Flavours of existing system include many
online tools with functionalities for
conversion of:
◦ Pdf to text
◦ Excel or csv file to text
◦ Image to text
These current systems have limited end-
to-end functionality which motivated us
for designing the system with better
utilization of functionality of open source
stack.
Introduction
About 90% of today’s data is
unstructured.
The unstructured data is in various
forms like pdfs, images, videos, white
papers, researches, etc.
The proposed system will help in getting
insights for making quick decisions as it
is the need of any business.
The system should be user friendly,
compatible and could be enhanced
easily in the future.
Scope
The various forms of data could
be indentified and processed
accordingly by the proposed
system.
Implementation of the
functionality for better decision
making and giving the idea of
trending business world.
Visualization of data in form of
dashboard- pie charts, bar-
Domain Specific Keywords
Decision support systems
Data mining
Text mining
Sentiment analysis
 Clustering and classification
 Business intelligence
Proposed System
Module wise Description
Login:
◦ Access allowed to only authorized users.
◦ Proper authentication and password
change/recovery facilities
Data Processing Module:
◦ Asking for the type of data file to be
transformed
◦ Transform input file into intermediate file
most likely text file
◦ Storing this file on HDFS
Module wise Description contd…
Data Storing and Analysing Module:
◦ Using PostgreSQL for querying the large
datasets
◦ Analyzing the queried data using analytic
engine with R Programming
◦ Storing the analyzed data back in new
database
Data Visualization Module:
◦ Visualizing the data generated in various
forms of reports or dashboards using
JasperSoft.
Technology used
PostgreSQL
◦ Full support for outer-joins
◦ Easy processing of sub-queries.

Hadoop HDFS
◦ Storage functionality for
large data-sets of different forms.
◦ Use of rmr, rhbase, rhdfs for
integration with R.
Technology Used contd…
Talend ETL
◦ Simple graphical tools
◦ Simple creation of routines and jobs
◦ Cost effective

R Studio
◦ Open-source IDE
◦ Support for inbuilt functions of R
◦ Easy to integrate with other
softwares.
Technology used contd…
JasperSoft
◦ Open-source BI tool
◦ Supports variety of targets
◦ Generates dynamic content
◦ Used in Java EE.
Programming Languages
Java
◦ Talend ETL- Routine or Job
◦ Hadoop

R Programming
◦ R studio – Data Mining Algorithms
Literature Survey
ETL Tool – Talend, Informatica
◦ A tool for Java Programmers
◦ Save lots of time by generating
code for the user
◦ Multiple algorithms on a record.
◦ Easy to use.
◦ Software BUNDLED, unlike
Informatica.
Literature Survey contd…
Hadoop HDFS-
◦ For storing large business data
PostgreSQL –
◦ Support full outer join over MySQL
◦ Support wide variety of Languages
◦ Strong support for sub queries
◦ Less Hassel with custom data
type,table inheritance
Literature Survey contd…
Visualization
Tool- Jaspersoft,
Pentaho Report Design
◦ Heavy focus on reporting and
analysis.
◦ Better web user interface than
Pentaho and easier to use.
◦ Benefits from better marketing,
informational web sites and
documentation.
Conclusions
The proposed system will help
analyst to take quick decisions in
easy way by means of
visualization.
This analysis can be very well
implemented in following domains:
◦ News Reporters
◦ Politicians
◦ Medicine and Health-care
Time Line
Work Done so far
Installation of the following :
◦ Java
◦ R
◦ R-Studio
◦ Talend
◦ Hadoop
Integration of the following:
◦ R-Hadoop
◦ R-postgreSQL

Pdf-text conversion in Talend

Integration of R-
PostgreSQL
Installing of RPostgreSQL library in R and
connecting to the database created :
We have already installed R, so open a terminal
and type R, to get into the R-prompt.
A. Installing RPostgreSQL package
>system(‘gksudo “apt-get –y install postgresql-
9.3 libpq-dev” ‘)
>install.packages(“RPostgreSQL”)//select CRAN
as 0.cloud

>library(RPostgreSQL)
//if this command runs without giving any
error, means you have installed RPostgresql
package successfully.
Conversion of PDF to Text using
Talend
We first need to create a Routine which will
contain Java code to convert PDF to text.
Create a Job which will contain the graphical
representation with the Help of Palettes.
3 palettes required:

◦ tInputFileDelimited
◦ tMap
◦ tLogRow
Run the Job and we will get a text file of the
selected PDF file at the desired location
which is nothing but the Job folder at current.
Integration of R-Hadoop
Download rJava from link-
https://cran.r-project.org/web/packages/rJava/index.html
Open RStudio
Goto: Tools->install packages
Select Package Archive file from the Tab name
” Install from” ->browse from Package Archive Tab ->select downloaded
tar.gz file ->click on install.
Goto command prompt of RStudio and Type command-
install.packages(c("rJava", "Rcpp", "RJSONIO", "bitops", "digest",
"functional", "stringr", "plyr", "reshape2", "caTools"))

Download RHadoop Packages from link rmr2, rhbase, rhdfs

https://github.com/RevolutionAnalytics/RHadoop/wiki/Downloads
Open RStudio
Goto Tools->install packages
Select Package Archive file from the tab name
“Install from” ->browse from Package Archive Tab ->select downloaded
tar.gz file->click on install.
We are now ready to perform operation on file saved in HDFS using R
programming.
Integration of R-Hadoop contd…
For Exporting of data from the hdfs to R is done
by using the following commands in R:
Sys.setenv(HADOOP_CMD=”/usr/local/hadoop/b
in/hadoop”) //setting env variable
Library(rhdfs)
Hdfs.init()
F=hdfs.file(“/project/xyz.csv”,”r”,buffersize=10
3)
M=hdfs.read(F)
C=rawToChar(M)
Data=read.table(textConnection(C),sep=”,”)
References
A Study of Open-Source Data
Mining Tools for Forecasting
-Nurdatillah Hasim, Norhaidah Abu
Haris
Efficiency Evaluation of Open
Source ETL Tools - Tim A. Majchrzak,
Tobias Jansen, Herbert Kuchen
A Study of the Contributors of
PostgreSQL - Daniel M. German
Websites
We have referred the following YouTube
videos for these part:
1.For installing and getting started with
PostgreSql:
◦ https://youtu.be/67XGzdzv9k0
2. For Connectivity of R and PostgreSQL:
◦ https://youtu.be/90j5rX6iSGI
http://www.bogotobogo.com/Hadoop/BigData
_hadoop_Install_on_ubuntu_single_node_clus
ter.php
Integration - www.youtube.com
◦ R- Hadoop
◦ R PostgreSQL
◦ Jaspersoft PostgreSQL

Statistical Methods For Categorical Data Analysis
100% (1)
Statistical Methods For Categorical Data Analysis
301 pages
Statistics Made Easy Presentation
100% (2)
Statistics Made Easy Presentation
226 pages
BSC Microbiology Syllabus 1st Year PDF
60% (5)
BSC Microbiology Syllabus 1st Year PDF
22 pages
Developing and Using HR Audit Tools
No ratings yet
Developing and Using HR Audit Tools
32 pages
Robotics Appendix
No ratings yet
Robotics Appendix
97 pages
AEE Lab 7th Sem Btech
No ratings yet
AEE Lab 7th Sem Btech
53 pages
Biomedical Technology Assessment: The 3Q Method
No ratings yet
Biomedical Technology Assessment: The 3Q Method
101 pages
Design and Fabricate The Prototype Model of Drain
No ratings yet
Design and Fabricate The Prototype Model of Drain
20 pages
War Fighting Techniques
100% (5)
War Fighting Techniques
201 pages
Drums Syllabus Guide: WWW - Rockschool.co - Uk v1.0
No ratings yet
Drums Syllabus Guide: WWW - Rockschool.co - Uk v1.0
41 pages
High-Speed Low-Power Viterbi Decoder Design For TCM Decoders
No ratings yet
High-Speed Low-Power Viterbi Decoder Design For TCM Decoders
20 pages
BHEL Supplier Manual
No ratings yet
BHEL Supplier Manual
112 pages
8 - State Based Design
No ratings yet
8 - State Based Design
29 pages
Wind Ventillation System Final Report
No ratings yet
Wind Ventillation System Final Report
114 pages
2018 12 Rock N Rescue Catalog
No ratings yet
2018 12 Rock N Rescue Catalog
148 pages
Technical Specification For Under Sleeper Pads No E-01-0048 - 15.07.2021
No ratings yet
Technical Specification For Under Sleeper Pads No E-01-0048 - 15.07.2021
18 pages
Flex Sensor
No ratings yet
Flex Sensor
51 pages
Eco-Cut Hackathon
No ratings yet
Eco-Cut Hackathon
14 pages
STAT
No ratings yet
STAT
13 pages
The Design of Microlearning Experiences: A Research Agenda: Article
No ratings yet
The Design of Microlearning Experiences: A Research Agenda: Article
10 pages
Regenerative Braking System: Group Guide
No ratings yet
Regenerative Braking System: Group Guide
14 pages
Att - 1619504089277 - Fundamental Analysis On Hul1
No ratings yet
Att - 1619504089277 - Fundamental Analysis On Hul1
12 pages
Law of Contract 1 Group Assignment
No ratings yet
Law of Contract 1 Group Assignment
8 pages
Xii Biology Qp.docx
No ratings yet
Xii Biology Qp.docx
7 pages
High-Speed Low-Power Viterbi Decoder Design For TCM Decoders
No ratings yet
High-Speed Low-Power Viterbi Decoder Design For TCM Decoders
13 pages
Persistent Competition
No ratings yet
Persistent Competition
12 pages
An Accessible, Open-Source, Realtime AODV Simulation in MATLAB
No ratings yet
An Accessible, Open-Source, Realtime AODV Simulation in MATLAB
11 pages
Inglés Iii Unidad 2 - Let's Dream and Plan!
No ratings yet
Inglés Iii Unidad 2 - Let's Dream and Plan!
15 pages
32.light Intensity Control Circuit Using Electrical Device.
100% (1)
32.light Intensity Control Circuit Using Electrical Device.
6 pages
AMDashboard ASMCases 2023 Updated
No ratings yet
AMDashboard ASMCases 2023 Updated
3 pages
Fault Analysis For Protection of Machine From Over Voltage and Curent
No ratings yet
Fault Analysis For Protection of Machine From Over Voltage and Curent
5 pages
Omalista: An Approach For User Assistance To Rack Up The Tagged Wish Cart
No ratings yet
Omalista: An Approach For User Assistance To Rack Up The Tagged Wish Cart
5 pages
The Development of Solar Dryers Used For Grape Drying
No ratings yet
The Development of Solar Dryers Used For Grape Drying
10 pages
Industrial Temperature Invigilation &alerting Sysytem Through Voice.
No ratings yet
Industrial Temperature Invigilation &alerting Sysytem Through Voice.
4 pages
Child Tracking Device: Dhanalakshmi. M Hemamalini. S
No ratings yet
Child Tracking Device: Dhanalakshmi. M Hemamalini. S
4 pages
Automobiles Motorcycles: BMW (Bayerische Motoren Werke in German, or Bavarian Motor Works in English) Is A German
No ratings yet
Automobiles Motorcycles: BMW (Bayerische Motoren Werke in German, or Bavarian Motor Works in English) Is A German
5 pages
Appendix 4 - Results Frameworks FNS and Water
No ratings yet
Appendix 4 - Results Frameworks FNS and Water
3 pages
Details of Be Project (Industrial) : (Type The Document Title)
No ratings yet
Details of Be Project (Industrial) : (Type The Document Title)
3 pages
CheetahPerc JKM330 HC
No ratings yet
CheetahPerc JKM330 HC
2 pages
Proportional-Pressure Regulator
No ratings yet
Proportional-Pressure Regulator
4 pages
Design and Analysis of A Composite Helical Gear
No ratings yet
Design and Analysis of A Composite Helical Gear
1 page
Automatic Mains Change Over Switch For Ups.
No ratings yet
Automatic Mains Change Over Switch For Ups.
4 pages
Tiraboschi, J. - On Disgust - A Menippean Interview. Interview With Robert Wilson
No ratings yet
Tiraboschi, J. - On Disgust - A Menippean Interview. Interview With Robert Wilson
11 pages
Front
No ratings yet
Front
1 page
Lever Hub
No ratings yet
Lever Hub
1 page
Theoretical Analysis of Soil Nailing, Design, Performance and Future Aspect
No ratings yet
Theoretical Analysis of Soil Nailing, Design, Performance and Future Aspect
1 page
Effect of Earth Quake and Wind Load On Residential Building by Using Etabs
No ratings yet
Effect of Earth Quake and Wind Load On Residential Building by Using Etabs
1 page
9.an Integrated System For Regional
No ratings yet
9.an Integrated System For Regional
3 pages
Timothy
No ratings yet
Timothy
2 pages
Devising A Solar Powered Standalone Vehicle Using GSM Communication Network
No ratings yet
Devising A Solar Powered Standalone Vehicle Using GSM Communication Network
3 pages
Bush
No ratings yet
Bush
1 page
Metromax Q Series Spec Sheet
No ratings yet
Metromax Q Series Spec Sheet
2 pages
Baseline Programme Narrative - LIS-02
100% (1)
Baseline Programme Narrative - LIS-02
10 pages
1.protection of Busbar Distribution From Over Load
100% (1)
1.protection of Busbar Distribution From Over Load
4 pages
G-20 (Maths) Topicwise Test Series: Date DAY Unit Topics/Subtopics To Be Covered
No ratings yet
G-20 (Maths) Topicwise Test Series: Date DAY Unit Topics/Subtopics To Be Covered
2 pages
Data Engineering with Google Cloud Platform: A guide to leveling up as a data engineer by building a scalable data platform with Google Cloud
From Everand
Data Engineering with Google Cloud Platform: A guide to leveling up as a data engineer by building a scalable data platform with Google Cloud
Adi Wijaya
No ratings yet
The Ultimate Django Guide: From Beginner to Advanced Web Development
From Everand
The Ultimate Django Guide: From Beginner to Advanced Web Development
Jiho Seok
No ratings yet
Pentaho Solutions and Architecture: Definitive Reference for Developers and Engineers
From Everand
Pentaho Solutions and Architecture: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
SAP HANA SYSTEM REPLICATION SCENARIOS
From Everand
SAP HANA SYSTEM REPLICATION SCENARIOS
Giridhar Kankanala
No ratings yet
Programming APIs with C# and .NET: Develop high-performance APIs that ensure seamless application communication and enhanced security
From Everand
Programming APIs with C# and .NET: Develop high-performance APIs that ensure seamless application communication and enhanced security
Jesse Liberty
No ratings yet
Learn Hadoop in 24 Hours
From Everand
Learn Hadoop in 24 Hours
Alex Nordeen
No ratings yet
Big Data Analytics
From Everand
Big Data Analytics
Nitin Kumar Yadav
No ratings yet
Learn SAP BI in 24 Hours
From Everand
Learn SAP BI in 24 Hours
Alex Nordeen
3/5 (1)
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Mastering PostgreSQL 12 - Third Edition: Advanced techniques to build and administer scalable and reliable PostgreSQL database applications, 3rd Edition
From Everand
Mastering PostgreSQL 12 - Third Edition: Advanced techniques to build and administer scalable and reliable PostgreSQL database applications, 3rd Edition
Hans-Jurgen Schonig
No ratings yet
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
From Everand
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
alasdair gilchrist
5/5 (1)
Introduction to Oracle Database Administration
From Everand
Introduction to Oracle Database Administration
Ying Wang
5/5 (1)
SAS Interview Questions You'll Most Likely Be Asked
From Everand
SAS Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Hadoop Blueprints
From Everand
Hadoop Blueprints
Anurag Shrivastava
No ratings yet
Odoo 10 Development Essentials
From Everand
Odoo 10 Development Essentials
Daniel Reis
No ratings yet
Mastering RethinkDB
From Everand
Mastering RethinkDB
Shahid Shaikh
No ratings yet
Effective Business Intelligence with QuickSight
From Everand
Effective Business Intelligence with QuickSight
Rajesh Nadipalli
No ratings yet
Practical Business Intelligence
From Everand
Practical Business Intelligence
Ahmed Sherif
3/5 (1)
Hands-On Machine Learning Recommender Systems with Apache Spark
From Everand
Hands-On Machine Learning Recommender Systems with Apache Spark
Ernesto Lee
No ratings yet
Mastering Hadoop
From Everand
Mastering Hadoop
Sandeep Karanth
No ratings yet
SAP XI Exchange Infrastructure
From Everand
SAP XI Exchange Infrastructure
equitypress
1/5 (3)
Learning Informatica PowerCenter 9.x
From Everand
Learning Informatica PowerCenter 9.x
Rahul Malewar
3/5 (4)
Learning Powershell DSC: Get started with the fundamentals of PowerShell DSC and utilize its power to automate deployment and configuration of your servers
From Everand
Learning Powershell DSC: Get started with the fundamentals of PowerShell DSC and utilize its power to automate deployment and configuration of your servers
James Pogran
No ratings yet
Oracle GoldenGate 11g Implementer's guide
From Everand
Oracle GoldenGate 11g Implementer's guide
John P Jeffries
5/5 (1)
Learning Hunk: A quick, practical guide to rapidly visualizing and analyzing your Hadoop data using Hunk
From Everand
Learning Hunk: A quick, practical guide to rapidly visualizing and analyzing your Hadoop data using Hunk
Dmitry Anoshin
No ratings yet
HDInsight Essentials - Second Edition
From Everand
HDInsight Essentials - Second Edition
Rajesh Nadipalli
No ratings yet
Oracle Business Intelligence : The Condensed Guide to Analysis and Reporting
From Everand
Oracle Business Intelligence : The Condensed Guide to Analysis and Reporting
Yuli Vasiliev
No ratings yet
PostgreSQL 9 Administration Cookbook LITE: Configuration, Monitoring and Maintenance
From Everand
PostgreSQL 9 Administration Cookbook LITE: Configuration, Monitoring and Maintenance
Simon Riggs
3/5 (1)
Visual Basic 2010 Coding Briefs Data Access
From Everand
Visual Basic 2010 Coding Briefs Data Access
Kevin Hough
5/5 (1)
Six Minute Guide to IPv6
From Everand
Six Minute Guide to IPv6
Daryl Moon
5/5 (1)
Efficient Workflow with RStudio: Definitive Reference for Developers and Engineers
From Everand
Efficient Workflow with RStudio: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Data Driven Guide for Python Programming : Master Essentials to Advanced Data Structures
From Everand
Data Driven Guide for Python Programming : Master Essentials to Advanced Data Structures
Younes Hamdani
No ratings yet
Mastering DuckDB: High-Performance Analytics Made Easy
From Everand
Mastering DuckDB: High-Performance Analytics Made Easy
Robert Johnson
No ratings yet
Securing Hadoop
From Everand
Securing Hadoop
Sudheesh Narayanan
4/5 (2)
DeepSeek for Data Analysis: The Future of Data Analysis for Business Professionals
From Everand
DeepSeek for Data Analysis: The Future of Data Analysis for Business Professionals
Mohammod Shaharuzzaman
No ratings yet
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
From Everand
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
Eric Tome
No ratings yet
PostgreSQL Administration Essentials
From Everand
PostgreSQL Administration Essentials
Hans-Jurgen Schonig
No ratings yet
Building Modern Data Applications Using Databricks Lakehouse: Develop, optimize, and monitor data pipelines on Databricks
From Everand
Building Modern Data Applications Using Databricks Lakehouse: Develop, optimize, and monitor data pipelines on Databricks
Will Girten
No ratings yet
Exploring Hadoop Ecosystem (Volume 1): Batch Processing
From Everand
Exploring Hadoop Ecosystem (Volume 1): Batch Processing
Wei Liu
No ratings yet
Learning Jupyter
From Everand
Learning Jupyter
Dan Toomey
3.5/5 (4)
DynamoDB Applied Design Patterns
From Everand
DynamoDB Applied Design Patterns
Uchit Vyas
3/5 (1)
Windows Server 2012 Unified Remote Access Planning and Deployment
From Everand
Windows Server 2012 Unified Remote Access Planning and Deployment
Erez Ben-Ari
No ratings yet
Unstructured Data Analysis: Entity Resolution and Regular Expressions in SAS
From Everand
Unstructured Data Analysis: Entity Resolution and Regular Expressions in SAS
Matthew Windham
No ratings yet
Getting Started with Oracle Data Integrator 11g: A Hands-On Tutorial
From Everand
Getting Started with Oracle Data Integrator 11g: A Hands-On Tutorial
David Hecksel
5/5 (2)
Hyper-V 2016 Best Practices
From Everand
Hyper-V 2016 Best Practices
Benedict Berger
No ratings yet
Learning Azure DocumentDB
From Everand
Learning Azure DocumentDB
Becker Riccardo
No ratings yet
R Programming - a Comprehensive Guide: Software
From Everand
R Programming - a Comprehensive Guide: Software
Editor IJSMI
No ratings yet
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Oracle Warehouse Builder 11g: Getting Started
From Everand
Oracle Warehouse Builder 11g: Getting Started
Bob Griesemer
No ratings yet
End-to-End Data Science with SAS: A Hands-On Programming Guide
From Everand
End-to-End Data Science with SAS: A Hands-On Programming Guide
James Gearheart
No ratings yet
Mastering Apache Cassandra - Second Edition
From Everand
Mastering Apache Cassandra - Second Edition
Nishant Neeraj
No ratings yet
The Data Detective's Toolkit: Cutting-Edge Techniques and SAS Macros to Clean, Prepare, and Manage Data
From Everand
The Data Detective's Toolkit: Cutting-Edge Techniques and SAS Macros to Clean, Prepare, and Manage Data
Kim Chantala
No ratings yet
Learn SAP Basis in 24 Hours
From Everand
Learn SAP Basis in 24 Hours
Alex Nordeen
4.5/5 (2)
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
From Everand
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
Robert Johnson
No ratings yet
Oracle Quick Guides: Part 2 - Oracle Database Design
From Everand
Oracle Quick Guides: Part 2 - Oracle Database Design
Malcolm Coxall
No ratings yet
Software Development on the SAP HANA Platform
From Everand
Software Development on the SAP HANA Platform
Mark Walker
4.5/5 (2)
Learning Dynamics NAV Patterns: Create solutions that are easy to maintain, are quick to upgrade, and follow proven concepts and design
From Everand
Learning Dynamics NAV Patterns: Create solutions that are easy to maintain, are quick to upgrade, and follow proven concepts and design
Marije Brummel
No ratings yet
C# 2010 Coding Briefs Data Access
From Everand
C# 2010 Coding Briefs Data Access
Kevin Hough
No ratings yet
SAP Basis Configuration Frequently Asked Questions
From Everand
SAP Basis Configuration Frequently Asked Questions
Equity Press
3.5/5 (4)

Group 37

Uploaded by

Group 37

Uploaded by

WELCOME !!!!

Unstructured Data Mining

Jagruti Wagh B120204420

Principal Global Services Pvt

Pdf-text conversion in Talend

Download RHadoop Packages from link rmr2, rhbase, rhdfs

You might also like