DataWalk-Introduction-To-Open-Source-Intelligence-Tools_Spring2020
DataWalk-Introduction-To-Open-Source-Intelligence-Tools_Spring2020
Introduction To Open-Source
Intelligence Tools
Spring 2020
www.datawalk.com
www.datawalk.com 1/37
Multi-Search Engines
www.datawalk.com 2/37
Advanced Search Engines
Use free tools below to query multiple search engines simultaneously Highly recommended The most classic of advanced search engines. Allows you to
search Google resources in an advanced way without using
google.com/ separate queries. Also use specific forms i.e. site:.gov
advanced_search filetype:csv or “” to get precise results.
Highly recommended
intelx.io/tools Search Highly recommended Yandex Search is a web search engine. Yandex Search also
features “parallel” search that presents results from both main
yandex.com web index and specialized information resources, including
Highly recommended news, shopping, blogs, images and videos on a single page.
Highly recommended
Search engine for websites available on the Tor network, for
ahmia.fi sources that are commonly known as DeepWeb or darknet.
Ahmia provides hidden content not only to Tor users.
www.datawalk.com 3/37
Metasearch Engines
www.datawalk.com 4/37
Metasearch Engines
A Metasearch engines are search tools that use another Highly recommended Highly recommended
search engine's data to generate its own results. Metasearch
engines simultaneously send out queries to third party alltheinternet.com myallsearch.com
search engines and cumulate and summary results.
search.carrot2.org etools.ch
Note that in this guide we present only a sample of available tools. The
success of your searches depends on defining the purpose of your
search, the choice of tool, and your imagination!
www.datawalk.com 5/37
Data Breach / Leak
www.datawalk.com 6/37
Data Breach / Leak
Have I Been Pwned allows you to search across multiple data breaches to
see if your email address has been compromised
Highly recommended
haveibeenpwned.com
Highly recommended
dehashed.com
Pastebin dump collection - get dumps by date, domain, search by email, etc.
Highly recommended
psbdmp.ws
Specific search terms such as email addresses, domains, URLs, IPs, CIDRs, Bitcoin
addresses, IPFS hashes, etc
Highly recommended
intelx.io
www.datawalk.com 7/37
Social Media Search Engines
Various tools are available to help you find information on particular people, users,
ID numbers, trends, keywords, mutual acquaintances, visited places, and “likes” on
Facebook, Twitter, Instagram, and other social networks.
www.datawalk.com
Social Media Search Engines
Highly recommended Highly recommended Highly recommended Highly recommended
social-searcher.com wiki.kenburbary.com/social- shadowdragon.io hashatit.com
meda-monitoring-wiki
Free browser of social media. Without ShadowDragon is a cybersecurity Everywhere on social media, content is
logging in, users can search information A set of free and paid tools to monitor solutions developer that simplifies being generated at unheard of speeds.
published on Twitter, Google+, social media content and user complex multi-technology, multi- Hashtags help you navigate the ever-
Facebook, YouTube, Instagram, LinkedIn, activities. environment digital investigations with expanding internet, and HASHATIT keeps
Tumblr, Reddit, Flickr, Dailymotion and easy-to-use tools. you on top of hashtags.
Vimeo.
www.datawalk.com 9/37
Search Engines
Of People And Business
Email Addresses
Looking for information on particular people, email addresses or telephone
numbers? Want to check an email address of a person of interest on the basis
of a domain? These search engines will help you identify hidden content.
www.datawalk.com
Search Engines Of People And Business Email Addresses
Highly recommended
Highly recommended
A search engine of people and a catalog of WebMii is a next-generation search engine: E-mail address search engine -
pipl.com telephone numbers, email addresses and find all information on people and get an searches 150 million records in a
photos. Look for people by their surnames, evaluation of their visibility in the web. few seconds.
Pipl's unique identity resolution email addresses, addresses and phones
engine connects the world's personal, for free.
professional and social identity data to
give analysts and investigators an
unmatched global index of over 3 billion
trusted identity profiles.
Highly recommended Highly recommended Highly recommended
Find any email in under five seconds. A solution for finding and verifying A comprehensive search engine for people.
professional email addresses. You can find a person on the basis of their
email address.
www.datawalk.com 11/37
Search Engines Of Domains,
Websites, Emails And IP
Registry
Looking for information on particular domains or IP addresses?
Take advantage of search engines for exploration and investigation.
12/37
Search Engines Of Domains, Websites, Emails And IP Registry
www.datawalk.com 13/37
Search Engines Of Domains, Websites, Emails And IP Registry
www.datawalk.com 14/37
Deep Reconnaissance
15/37
Deep reconnaissance
Highly recommended
spiderfoot.net
Highly recommended
spiderfoot.net/hx/
www.datawalk.com 17 /20
16/37
Other OSINT Tools
www.datawalk.com 17/37
Other OSINT Tools
Highly recommended
Enter a user name and click one button to check if the name is
already in use, across dozens of social networking sites. In
namechk.com addition, check the availability of domains of this name. Forget the
painstaking task of visiting each site to check; with namechk you
can easily get a condensed result for free.
Highly recommended
Most tools free Internet archive, allows you to search, among others archival
versions of web pages (wayback machine). Thanks to this you
archive.org have access to unique archival information, e.g., bank accounts,
phone numbers, e-mail addresses, and also images and other
content that can provide context of the case under investigation.
Highly recommended
Most tools free This tool is designed to provide a quick method to search
usernames and hashtags on TikTok via a browser. All results take
osintcombine.com/tiktok- the user to the source on the TikTok.com website or are provided
quick-search as a Google search result. NO data is stored, managed or
processed by OSINT Combine.
www.datawalk.com 18/37
Other OSINT Tools
Want to find similar websites? Just enter a domain name and you
will immediately get information about services with similar
content. You can check websites visited by other users and
similarsites.com keywords they were looking for. Also, you will find out if a website
of interest is active and how much time users spend there on
average.
Highly recommended
Highly recommended
Takes a domain or IP address and does a reverse lookup to quickly
show all other domains hosted from the same server. Useful for
viewdns.info/reverseip finding phishing sites or identifying other sites on the same
shared hosting server.
Highly recommended
www.datawalk.com 19/37
Other OSINT Tools
Highly recommended
Highly recommended
www.datawalk.com 21/30
Web Scrapers
If you want to download large amounts of
data from particular web sites for further
analysis, here are some tools that can help.
www.datawalk.com 21/37
WEB SCRAPERS
Highly recommended
When used as an extension to a browser (e.g., Chrome), Webscraper.io enables you to create a map
webscraper.io of a website so that you can decide which data should be extracted. Scanned data can be exported
to a CSV file.
octoparse.com data-miner.io
Octoparse is a powerful web scraper which is easy to use. It can work with A very easy to use Chrome extension that enables you to scan data from
both static and dynamic websites by means of AJAX, JavaScript, cookies, tables and lists on separate pages to CSV or XLS files.
etc. You can download an installer and run a data download task on any
website, even if the website is paginated or requires logging-in.
www.datawalk.com 22/37
Cryptocurrency
Analysis And Lookup
www.datawalk.com 23/37
Cryptocurrency analysis and lookup
moneroblocks.info bitcoinwhoswho.com
www.datawalk.com 24/37
FTP Search
www.datawalk.com 25/37
FTP Search
Highly recommended
globalfilesearch.com
FTP
Highly recommended
searchftps.net
www.datawalk.com 27/30
26/37
Maps
Various advanced maps, satellite imagery and information.
www.datawalk.com 27/37
Maps
Highly recommended
search.earthdata.nasa.gov/search
Highly recommended
Highly recommended
digitalglobe.com/ecosystem/open-data
Highly recommended
worldview.earthdata.nasa.gov
www.datawalk.com 28/37
The Deep Web
And The Dark Web
Most people have access to only the indexed
World Wide Web (www) which is only perhaps 6%
of all content published on the Internet. The most
popular search engine for this is Google.
www.datawalk.com 29/37
World Wide Web
The Deep Web And The Dark Web
Only about from 1% to 6% of the
content on the Internet is openly
available and indexed by search
Deep Web engines such as Google.
The deep web is part of the World Wide Web whose contents are not indexed by standard web
search engines. There are many common examples of deep web sources, such as private content
on social media sites like Facebook and Twitter, as well as email messages, chat messages,
electronic bank statements, electronic health records (EHRs), etc. There are tools (e.g.,
IntelTechniques) that allow access to some of this information.
Highly recommended
Hidden Wiki Deep Web links Deep Web
hiddenwikitor.com
Over 90% of the
information on the
Highly recommended Internet is in the deep
ahmia.fi Ahmia searches hidden services on the Tor network 7.9 web (e.g., password
protected) and is not
accessible by search
Highly recommended
Discover Hidden Services and access to Tor's onion sites
Dark Web Zettabytes engines.
www.datawalk.com 30/37
Internet of Things
Search Engines for the Internet of Things
www.datawalk.com 31/37
Highly recommended
zoomeye.org
Highly recommended
Cyberspace Search Engine recording
information of devices, websites, thingful.net
services and components.
IoT
Highly recommended
shodan.io Highly recommended
iotscanner.bullguard.com
Shodan is a search engine of devices
connected to the Internet (i.e., IoT). Find
video cameras, buildings and other Check if your internet-connected
devices connected to the web. devices at home are public on Shodan.
If they are, this means they are
accessible to the public, and hackers.
www.datawalk.com 32/37
Combine your
data with OSINT
data to detect
operations and
spot targets.
www.datawalk.com 33/37
The ultimate solution: Use DataWalk to connect OSINT with your Example scenario (Video): Cryptocurrency
internal data. analysis
DataWalk is an Enterprise-class software platform that enables you Easily check bitcoin wallets, instantly track their transactions,
and gain insights via the darknet to identify bad actors and
to easily connect your internal data with OSINT and your other beneficial owners.
external data sources to quickly provide you with a remarkably
comprehensive view of all desired data. Being able to quickly link
together all of this data in an integrated view, coupled with powerful
tools for data analysis, can enable you to dramatically accelerate
your investigations with DataWalk.
Example integrations
with OSINT subscription
services available in Example scenario (Video): Human Trafficking
DataWalk.
Leveraging indicators of human trafficking from online ads
and using DataWalk risk scoring to combine indicators with
other internal and external data sources to rank the suspects.
www.datawalk.com 34/37
OSINT Cheat-Sheet - Page 1
www.datawalk.com 35/37
36/20
OSINT Cheat-Sheet - Page 2
censys.io hunch.ly
osintcombine.com/
analyzeid.com Mark your
google- analytics-id- FTP Search favorites
explorer
securitytrails.com osintcombine.com/
tiktok- quick-search globalfilesearch.com
Find a file among millions
of files located on FTP
servers.
spyonweb.com searchftps.net
blockchain.com/ search.earthdata.nas
pipl.com a.gov/search
explorer
voilanorbert.com
www.datawalk.com 37/37
DataWalk is a next-generation analytical platform which
enables you to easily combine data from many different
sources (including OSINT) for instant link analysis, geospatial
analysis, visual querying, and collaborative investigations.
The content provided here has been created on a best-effort basis, but DataWalk cannot guarantee the accuracy, completeness, reliability, usability or timeliness of this material. Please professionally check, or to have professionally checked, the suitability of
all content for its intended use. The contents and works in this document compiled by DataWalk are subject to copyright law. Copying, processing, distribution and any kind of use outside the limits of copyright law require the written consent of DataWalk. In
case the content is not created by DataWalk, the copyrights of third parties are being observed. In particular content of third parties are marked as such. However, if a user becomes aware of a copyright infringement, DataWalk asks for notification. Upon
www.datawalk.com notification of such violations, DataWalk will remove the content immediately.