I found tutorials where a class-based LM is built with Brown clustering by passing just the number of classes you want, but I want to implement a class-based model where I supply the class assignments up front. I tried this: http://projects.csail.mit.edu/cgi-bin/wiki/view/SLS/SriLM. But this gives -99 for all n-grams in the LM. There is very little documentation on this. Can anyone help me out?
How to implement supervised class based language model in SRILM?
717 views Asked by Ranjeet Singh At
I've done this before but it was several years ago. Let me see if I can retrace the steps for you.
The first step is to create the file that specifies the classes. It should have three columns: first the class id, then the probability of the word given the class, and lastly the word.
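As a rough illustration of that file, here is a minimal Python sketch that turns a word-to-class mapping into lines in the three-column layout described above. The function name, the example words, and the choice of uniform probabilities when no counts are given are all assumptions made for the example, not something SRILM requires.

```python
from collections import defaultdict


def build_class_lines(word_to_class, word_counts=None):
    """Return 'CLASS probability word' lines, one per word.

    Probabilities are normalized so they sum to 1 within each class.
    If word_counts is missing (or a class has no counts at all),
    the members of that class are weighted uniformly.
    """
    members = defaultdict(list)
    for word, cls in word_to_class.items():
        members[cls].append(word)

    lines = []
    for cls in sorted(members):
        words = sorted(members[cls])
        if word_counts:
            weights = [word_counts.get(w, 0) for w in words]
        else:
            weights = [1] * len(words)
        total = sum(weights)
        if total == 0:
            # No usable counts for this class: fall back to uniform weights.
            weights = [1] * len(words)
            total = len(words)
        for w, weight in zip(words, weights):
            lines.append(f"{cls} {weight / total:.6f} {w}")
    return lines


if __name__ == "__main__":
    # Hypothetical class assignments, for illustration only.
    mapping = {"monday": "DAY", "tuesday": "DAY", "paris": "CITY"}
    for line in build_class_lines(mapping):
        print(line)
```

If you have training counts for the words, passing them as word_counts gives relative-frequency estimates instead of uniform ones.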
The next step is to replace all the words in the training data with their class ids. You can use the SRILM replace-words-with-classes script or you can write your own script to do it.
Now you train a language model using ngram-count just like you would for a regular non-class n-gram model.
For evaluation you just specify the language model and also the class file.
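If you go the write-your-own-script route, the remaining steps might look roughly like the Python sketch below. The helper names and all file names are hypothetical. The flags shown (-order, -text, -lm for ngram-count; -lm, -classes, -ppl for ngram) are the ones I remember from the SRILM man pages, so please check them against your installed version. The sketch only builds the command lines; it does not run SRILM.

```python
def replace_words_with_classes(sentence, word_to_class):
    """Replace every token that has a class assignment with its class id.

    Tokens without an assignment are left unchanged.
    """
    return " ".join(word_to_class.get(tok, tok) for tok in sentence.split())


def srilm_commands(class_text, classes_file, lm_file, test_text, order=3):
    """Build (but do not run) the training and evaluation command lines.

    All file names are placeholders supplied by the caller.
    """
    # Train on the class-replaced text, like a regular n-gram model.
    train = ["ngram-count", "-order", str(order),
             "-text", class_text, "-lm", lm_file]
    # Evaluate by passing both the language model and the class file.
    evaluate = ["ngram", "-order", str(order), "-lm", lm_file,
                "-classes", classes_file, "-ppl", test_text]
    return train, evaluate


if __name__ == "__main__":
    # Hypothetical class assignments and file names, for illustration only.
    mapping = {"monday": "DAY", "tuesday": "DAY", "paris": "CITY"}
    print(replace_words_with_classes("meet me on monday in paris", mapping))

    train_cmd, eval_cmd = srilm_commands(
        "train.classes.txt", "classes.txt", "class.lm", "test.txt")
    print(" ".join(train_cmd))
    print(" ".join(eval_cmd))
```

The command lists can be handed to subprocess.run once the SRILM binaries are on your PATH.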