I am trying to use phonetic algorithms like Soundex and/or Metaphone to generate words that sound similar to a given dictionary word. Do I have to have a corpus of all dictionary words for doing that? Is there another way to generate words that sound similar to a given word without using a corpus? I am trying to do it in Python.
Is there a way to generate words that sound similar to a given dictionary word without using a corpus?
1.3k views Asked by user2832492 At
1
There are 1 answers
Related Questions in PYTHON
- How to store a date/time in sqlite (or something similar to a date)
- Instagrapi recently showing HTTPError and UnknownError
- How to Retrieve Data from an MySQL Database and Display it in a GUI?
- How to create a regular expression to partition a string that terminates in either ": 45" or ",", without the ": "
- Python Geopandas unable to convert latitude longitude to points
- Influence of Unused FFN on Model Accuracy in PyTorch
- Seeking Python Libraries for Removing Extraneous Characters and Spaces in Text
- Writes to child subprocess.Popen.stdin don't work from within process group?
- Conda has two different python binarys (python and python3) with the same version for a single environment. Why?
- Problem with add new attribute in table with BOTO3 on python
- Can't install packages in python conda environment
- Setting diagonal of a matrix to zero
- List of numbers converted to list of strings to iterate over it. But receiving TypeError messages
- Basic Python Question: Shortening If Statements
- Python and regex, can't understand why some words are left out of the match
Related Questions in SOUNDEX
- How do I figure out which word sounds most similar to a given word?
- What options are available for performance tuning a SoundEx query?
- Any implementation of Reverse Soundex in python?
- getting NULL value for string 'marketing' and 'makeing' as soundex drops vowels only as both have same soundex string value
- How do I implement fuzzy searching for a word within a field?
- The soundex function from oracle has a result different from official documentation
- How to check %match between 2 string in prestosql?
- SQL Server Soundex() and Difference() to Compare a Columns observations to Itself
- Why is my stored procedure query returning extra results?
- Using SOUNDEX function on WHERE clause in MySQL
- Fuzzy search on Oracle database
- Implementing Soundex Encoding Algorithm
- How to use Soundex() in googlecolab for python?
- c# comparing strings with a bit of leniency
- Using Soundex in wordpress or woocommerce default search Query?
Related Questions in PHONETICS
- How do I find the maximum phonetic and synctactic similarity between two strings?
- How to separate Phonetic, Word Break and Word Join keywords from list of keywords using python?
- How to extract phonetic language from the web with Python
- How to easily convert English audio files to IPA (phonetics) with time stamps on Windows?
- What phonetic notation or transliteration style Google uses for showing the transliteration of the Bangla Words in Google Translate?
- Looping through an array of float column in a Pyspark DataFrame to find which values pass through a condition
- ɔ̃ does not exist in JavaScript
- Fieldwork audio recording for acoustic analysis: stereo or mono? appropriate gain?
- illegal_argument_exception","reason":"Unknown filter type [phonetic] for [phonetic]
- Praat script to remove specific boundaries
- How can i search for phonetic word in ms sql
- Can Mandarin pronunciation be coded by ARPABET phone set?
- Phonetic Algorithms for Postgresql
- Is there a python library I could use to convert audio to phonemes?
- Parallelizing phonetic distance between all pairwise combinations of words in a document
Related Questions in METAPHONE
- What is the minemum percentage in Metaphone3 that I can use it to tell these names are matched
- Name Matching using Double Metaphone on BigQuery
- Performance for join table with string comparison
- configure double metaphone in french java apache
- Why is Java's Double Metaphone only giving four letter codes?
- variables equal with doublemetaphone on pyspark
- How to make a fulltext search
- How to decide which Encoder to use for which language in Elasticsearch "Phonetic Token filter"?
- Populate Rows in CSV with jellyfish.metaphone() value of row
- MySQL select possible regular expression question
- Difference between Metaphone 3 and Double Metaphone
- Double Metaphone algorithms for Full Names
- Soundex or Metaphone algorithm for typos in search term
- I want to use double metaphone algorithm in stored procedure does oracle have any inbuilt function for this?
- PostgreSQL: Address matching using fuzzymatch from two tables
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
If you don't use a corpus, then you will probably have to manually define a set of rules to split a word in phonetic parts and then find the list of close phonemes. This can generate similar sounding words but most won't exist. If you want to generate close sounding words that exist, then you necessarily need a corpus.
You didn't precise the goal of your task, but you may be interested in the works of Will Leben "Sounder I" (and II and III) and Jabberwocky sentences.