Yes, the same file "johndoediary.mallet", is used for LDA and hLDA.
Post by Antonio Jesús Hernández BlancoHi Michael,
You forgot to put the "run" before
cc.mallet.topics.tui.mallet.HierarchicalLDATUI
mallet run cc.mallet.topics.tui.HierarchicalLDATUI --help
Hierarchical LDA with a fixed tree depth.
--help TRUE|FALSE
Print this command line option usage information. Give argument of
TRUE for longer documentation
Default is false
--prefix-code 'JAVA CODE'
Java code you want run before any other interpreted code. Note that
the text is interpreted without modification, so unlike some other Java
code options, you need to include any necessary 'new's when creating
objects.
Default is null
--config FILE
Read command option values from a file
Default is null
--input FILENAME
The filename from which to read the list of training instances. Use -
for stdin. The instances must be FeatureSequence or
FeatureSequenceWithBigrams, not FeatureVector
Default is null
--testing FILENAME
The filename from which to read the list of instances for held-out
likelihood calculation. Use - for stdin. The instances must be
FeatureSequence or FeatureSequenceWithBigrams, not FeatureVector
Default is null
--output-state FILENAME
The filename in which to write the Gibbs sampling state after at the
end of the iterations. By default this is null, indicating that no file
will be written.
Default is null
--random-seed INTEGER
.....
Hola Antonio,
Thanks for responding.
Can I just confirm you get the following
*C:\Mallet>bin\mallet cc.mallet.topics.tui.HierarchicalLDATUI --help*
*Mallet 2.0 commands:*
* import-dir load the contents of a directory into mallet
instances (one*
*per file)*
* import-file load a single file into mallet instances (one per
line)*
* import-svmlight load a single SVMLight format data file into mallet
instance*
*s (one per line)*
* info get information about Mallet instances*
* train-classifier train a classifier from Mallet data files*
* classify-dir classify data from a single file with a saved
classifier*
* classify-file classify the contents of a directory with a saved
classifier*
* classify-svmlight classify data from a single file in SVMLight format*
* train-topics train a topic model from Mallet data files*
* infer-topics use a trained topic model to infer topics for new
documents*
* evaluate-topics estimate the probability of new documents given a
trained mo**del*
* prune remove features based on frequency or information gain*
* split divide data into testing, training, and validation
portions*
* bulk-load for big input files, efficiently prune vocabulary
and import*
* docs*
*Include --help with any option for more information*
Because it looks similar to the general info details, so doesn't tell me
anything specific about running HDA
*C:\Mallet>bin\mallet -info*
*Mallet 2.0 commands:*
* import-dir load the contents of a directory into mallet
instances (one*
*per file)*
* import-file load a single file into mallet instances (one per
line)*
* import-svmlight load a single SVMLight format data file into mallet
instance*
*s (one per line)*
* info get information about Mallet instances*
* train-classifier train a classifier from Mallet data files*
* classify-dir classify data from a single file with a saved
classifier*
* classify-file classify the contents of a directory with a saved
classifier*
* classify-svmlight classify data from a single file in SVMLight format*
* train-topics train a topic model from Mallet data files*
* infer-topics use a trained topic model to infer topics for new
documents*
* evaluate-topics estimate the probability of new documents given a
trained mo**del*
* prune remove features based on frequency or information gain*
* split divide data into testing, training, and validation
portions*
* bulk-load for big input files, efficiently prune vocabulary
and import*
* docs*
*Include --help with any option for more information*
On Mon, 15 Feb 2016 at 15:11 Antonio Jesús Hernández Blanco <
Post by Antonio Jesús Hernández BlancoHi Michael,
LDA is by default in mallet when used train-topics option. Now, for HLDA
use cc.mallet.topics.tui.HierarchicalLDATUI, to know its options, from
command line: mallet run cc.mallet.topics.tui.HierarchicalLDATUI --help
Forgive the very basis question but on Windows how can I get a HDA topic
model using the Mallet command line i.e not writing my own Java code?
I tried the --help but there isn't a parameter I can pass to bin\mallet
topic-model to say I want LDA or HDA used
Any pointers?
Michael
_______________________________________________
--
Antonio Jesús Hernández Blanco
Doctoral Student
Computer Science
Department of Software and Computer Systems
University of Alicante
Mov. (+34) 622 64 90 50
(+593) 982 21 69 73
Tel. (+34) 965 90 00 00 (Ext. 2961)
Fax: (+34) 965 90 93 26
_______________________________________________
mallet-dev mailing list
https://mailman.cs.umass.edu/mailman/listinfo/mallet-dev
--
Antonio Jesús Hernández Blanco
Doctoral Student
Computer Science
Department of Software and Computer Systems
University of Alicante
Mov. (+34) 622 64 90 50
(+593) 982 21 69 73
Tel. (+34) 965 90 00 00 (Ext. 2961)
Fax: (+34) 965 90 93 26
Mov. (+34) 622 64 90 50
Tel. (+34) 965 90 00 00 (Ext. 2961)