Skip to content

cran/phonetisr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

phonetisr: A Naive IPA Tokeniser

This package is a (naive) tokeniser of phonetic transcriptions in the International Phonetic Alphabet (IPA).

With phonetisr, you can parse texts and word lists transcribed in IPA and tokenise them into phones so that you can perform quantitative analyses.

Installation

You can install the development version from GitHub with:

# install.packages("remotes")
remotes::install_github("stefanocoretta/phonetisr")

Usage

library(phonetisr)

# IPA strings to be tokenised
ipa <- c("pʰãkʰ", "tʰum̥", "ɛkʰɯ")

# List of character sequences to be considered single phones
ph <- c("", "", "", "", "")

# Tokenise strings
phonetise(ipa, multi = ph)
#> [[1]]
#> [1] "pʰ" "ã"  "kʰ"
#> 
#> [[2]]
#> [1] "tʰ" "u"  "m̥" 
#> 
#> [[3]]
#> [1] "ɛ"  "kʰ" "ɯ"

Roadmap

  • Scan for illegal (non-IPA) characters.
  • Provide a list of default multi-character phones.
  • Functions for data import/export.
  • Ignore diacritics.

About

❗ This is a read-only mirror of the CRAN R package repository. phonetisr — A Naive IPA Tokeniser. Homepage: https://github.com/stefanocoretta/phonetisrhttps://stefanocoretta.github.io/phonetisr/

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages