22 WebIntelligence Tools Feb2008
22 WebIntelligence Tools Feb2008
2005
III. Visualization
NetDraw JUNG Analysts Notebook and Starlight
2005
2005
Offline Explorer
Project list
Download URLs
Download level
2005
SpidersRUs
SpidersRUs Digital Library Toolkit was developed by Artificial Intelligence Lab at the University of Arizona. http://ai.eller.arizona.edu/spidersrus/
Provide modular tools for spidering, indexing, searching for building digital libraries in different languages in a simple DIY (Do-ItYourself) way. Users can create their own search engines easily and quickly via the friendly user interface.
SpidersRUs can automate the development of vertical search engines in different domains and languages. It can work on nonEnglish languages such as Asian and Middle East languages.
2005
SpidersRUs
2005
Google Scholar
Google Scholar provides a simple way to broadly search for scholarly literature. http://scholar.google.com/ Features of Google Scholar: Search diverse sources from one convenient place Find papers, abstracts and citations Locate the complete paper through your library or on the web Learn about key papers and scholars in any area of research
2005
Google Scholar
Search for Bioterrorism in Google Scholar
List of papers citing this paper
366 citations
2005
2005
Results: all the related inlink Web pages Input link URL and search
2005
10
Google Translation
Google's Translate function. http://www.google.com/language_tools?hl=en The input and output languages can be Arabic, Chinese, Dutch, English, French, German, Greek, Italian, Japanese, Korean, Portugese, Russian or Spanish. Major functions of Google Translation include:
Search multilingual Web pages
Search the Internet in one language and get the results in another one.
Translate text
Translate free text into multiple languages.
2005
11
Google Translation
2005
12
GATE
Generalised Architecture for Text Engineering (GATE) is a toolkit for Text Mining. It was developed by NLP group at the University of Sheffield (UK). http://gate.ac.uk Information Extraction tasks:
Named Entity Recognition (NE)
Finds names, places, dates, etc. Identifies identity relations between entities in texts. Adds descriptive information to NE results (using CO). Finds relations between TE entities. Fits TE and TR results into specified event scenarios.
Template Element Construction (TE) Template Relation Construction (TR) Scenario Template Production (ST)
2005
13
GATE
Project information
Attributes
Results display
2005
* Picture is from http://nlp.shef.ac.uk
14
15
2005
16
SOM
The multi-level self-organizing map neural network algorithm was developed by Artificial Intelligence Lab at the University of Arizona. Using a 2D map display, similar topics are positioned closer according to their co-occurrence patterns; more important topics occupy larger regions.
2005
17
SOM
Example: FMD Paper Content Map (2001~2005)
Different Topics
18
Weka
Weka was developed at the University of Waikato in New Zealand. http://www.cs.waikato.ac.nz/~ml/ Tools include: Data preprocessing (e.g., Data Filters), Classification (e.g., BayesNet, KNN, C4.5 Decision Tree, Neural Networks, SVM), Regression (e.g., Linear Regression, Isotonic Regression, SVM for Regression), Clustering (e.g., Simple K-means, Expectation Maximization (EM), Farthest First), Association rules (e.g., Apriori Algorithm, Predictive Accuracy, Confirmation Guided), Feature Selection (e.g., Cfs Subset Evaluation, Information Gain, Chisquared Statistic), and Visualization (e.g., View different two-dimensional plots of the data).
19
2005
Weka
The value set of the chosen attribute and the # of input items with each value
2005
20
Visualization: NetDraw
NetDraw is a open source program written by Steve Borgatti from Analytic Technologies for visualizing both 1-mode and 2mode social network data. http://www.analytictech.com/downloadnd.htm Handle multiple relations at the same time, and can use node attributes to set colors, shapes, and sizes of nodes. Pictures can be saved in metafile, jpg, gif and bitmap formats.
Two basic kinds of layouts are implemented: a circle and an MDS/ spring embedding based on geodesic distance. You can also rotate, flip, shift, resize and zoom configurations.
2005
21
NetDraw
Different functions
The networks: nodes representing the individuals and links representing the relations
2005
22
JUNG
The Java Universal Network/Graph Framework (JUNG) is a software library for the modeling, analysis, and visualization of data that can be represented as a graph or network. It was developed by School of Information and Computer Science at the University of California, Irvine. http://jung.sourceforge.net/index.html The current distribution of JUNG includes implementations of a number of algorithms from graph theory, data mining, and social network analysis: Clustering Decomposition Optimization Random Graph Generation Statistical Analysis Calculation of Network Distances and Flows and Importance Measures (Centrality, PageRank, HITS, etc.).
23
2005
JUNG
2005
24
2005
25
Analysts Notebook, i2
Starlight, PNL
2005
26