Name	Name	Last commit message	Last commit date
Latest commit History 657 Commits
conda-build-recipe	conda-build-recipe
conda	conda
docs	docs
requirements	requirements
src/nlpia	src/nlpia
tests	tests
.coveragerc	.coveragerc
.gitignore	.gitignore
.pre-commit-config.yaml	.pre-commit-config.yaml
.travis.big.yml	.travis.big.yml
.travis.yml	.travis.yml
AUTHORS	AUTHORS
AUTHORS.rst	AUTHORS.rst
CHANGELOG.md	CHANGELOG.md
CONTRIBUTING.md	CONTRIBUTING.md
LICENSE.txt	LICENSE.txt
README.md	README.md
README.rst	README.rst
appveyor.yml	appveyor.yml
pytest.ini	pytest.ini
secrets.cfg.EXAMPLE_TEMPLATE	secrets.cfg.EXAMPLE_TEMPLATE
setup.cfg	setup.cfg
setup.py	setup.py
setup27.cfg	setup27.cfg
tox.ini	tox.ini

NLPIA

Community-driven code for Natural Language Processing in Action.

Description

A community-developed book about building socially responsible NLP pipelines that give back to the communities they interact with.

Getting Started

You'll need a bash shell on your machine. Git has installers that include bash shell for all three major OSes.

Once you have Git installed, launch a bash terminal. It will usually be found among your other applications with the name git-bash.

Install Anaconda3 (Python3.6)

If you're installing Anaconda3 using a GUI, be sure to check the box that updates your PATH variable. Also, at the end, the Anaconda3 installer will ask if you want to install VSCode. Microsoft's VSCode is supposed to be an OK editor for Python so feel free to use it.

Install an Editor

You can skip this step if you are happy using jupyter notebook or VSCode or the editor built into Anaconda3.

I like Sublime Text. It's a lot cleaner more mature. Plus it has more plugins written by individual developers like you.

Install Git and Bash

Linux -- already installed
MacOSX -- already installed
Windows

If you're on Linux or Mac OS, you're good to go. Just figure out how to launch a terminal and make sure you can run ipython or jupyter notebook in it. This is where you'll play around with your own NLP pipeline.

On Windows you have a bit more work to do. Supposedly Windows 10 will let you install Ubuntu with a terminal and bash. But the terminal and shell that comes with git is probably a safer bet. It's mained by a broader open source community.

Clone this repository

git clone https://github.com/totalgood/nlpia.git

Install nlpia

You have two tools you can use to install nlpia:

5.1. conda
5.2. pip

5.1. `conda`

In most cases, conda will be able to install python packages faster and more reliably than pip, because packages like python-levenshtein require you to compile a C library during installation, and Windows doesn't have an installer that will "just work."

So use conda (part of the Anaconda package that we already installed) to create an environment called nlpiaenv:

cd nlpia  # make sure you're in the nlpia directory that contains `setup.py`
conda env create -n nlpiaenv -f conda/environment.yml
conda install pip  # to get the latest version of pip
pip install -e .

Whenever you want to be able to import or run any nlpia modules, you'll need to activate this conda environment first:

source activate nlpiaenv
python -c "print(import nlpia)"

Skip to Step 4 if you have successfully created and activated an environment containing the nlpia package.

5.2. `pip`

Linux-based OSes like Ubuntu and OSX come with C++ compilers built-in, so you may be able to install the dependencies using pip instead of conda. But if you're on Windows and you want to install packages, like python-levenshtein that need compiled C++ libraries, you'll need a compiler. Fortunately Microsoft still lets you download a compiler for free, just make sure you follow the links to the Visual Studio "Build Tools" and not the entire Visual Studio package.

Once you have a compiler on your OS you can install nlpia using pip:

cd nlpia  # make sure you're in the nlpia directory that contains `setup.py`
pip install --upgrade pip
mkvirtualenv nlpiaenv
source nlpiaenv/bin/activate
pip install -r requirements-test.txt
pip install -e .
pip install -r requirements-deep.txt

The chatbots(including TTS and STT audio drivers) that come with nlpia may not be compatible with Windows due to problems install pycrypto. If you are on a Linux or Darwin(Mac OSX) system or want to try to help us debug the pycrypto problem feel free to install the chatbot requirements:

# pip install -r requirements-chat.txt
# pip install -r requirements-voice.txt

Have Fun!

Check out the code examples from the book in nlpia/nlpia/book/examples to get ideas:

cd nlpia/book/examples
ls

Contributing

Help your fellow readers by contributing to your shared code and knowledge. Here are some ideas for a few features others might find handy.

Feature 1: Glossary Compiler

Skeleton code and APIs that could be added to the https://github.com/totalgood/nlpia/blob/master/src/nlpia/transcoders.py:`transcoders.py` module.

def find_acronym(text):
    """Find parenthetical noun phrases in a sentence and return the acronym/abbreviation/term as a pair of strings.

    >>> find_acronym('Support Vector Machine (SVM) are a great tool.')
    ('SVM', 'Support Vector Machine')
    """
    return (abbreviation, noun_phrase)

def glossary_from_dict(dict, format='asciidoc'):
    """ Given a dict of word/acronym: definition compose a Glossary string in ASCIIDOC format """
    return text

def glossary_from_file(path, format='asciidoc'):
    """ Given an asciidoc file path compose a Glossary string in ASCIIDOC format """
    return text


def glossary_from_dir(path, format='asciidoc'):
    """ Given an path to a directory of asciidoc files compose a Glossary string in ASCIIDOC format """
    return text

Feature 2: Semantic Search

Use a parser to extract only natural language sentences and headings/titles from a list of lines/sentences from an asciidoc book like "Natural Language Processing in Action". Use a sentence segmenter in https://github.com/totalgood/nlpia/blob/master/src/nlpia/transcoders.py:[nlpia.transcoders] to split a book, like NLPIA, into a seequence of sentences.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

NLPIA

Description

Getting Started

5.1. `conda`

5.2. `pip`

Contributing

Feature 1: Glossary Compiler

Feature 2: Semantic Search

About

Uh oh!

Releases

Packages

Languages

License

learner-python-R/nlpia

Folders and files

Latest commit

History

Repository files navigation

NLPIA

Description

Getting Started

5.1. conda

5.2. pip

Contributing

Feature 1: Glossary Compiler

Feature 2: Semantic Search

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

5.1. `conda`

5.2. `pip`

Packages