在确定新文档的主题时,我们可以使gensim lda模型使用预定主题分布吗?
例如:
import gensim
texts=[['a','a','a'],['b','b','b'],['c','c','c']]
d = gensim.corpora.Dictionary(texts)
bow = [d.doc2bow(doc) for doc in texts]
import numpy as np
user_topics=np.array([[1,0],[0,1,1]])
model = gensim.models.LdaModel(corpus=bow,id2word=d,num_topics=3,random_state=1,eta=user_topics)
model.get_topics()
返回
array([[0.9728407,0.01204113,0.01511812],[0.01140388,0.9742677,0.01432837],[0.02468761,0.00788806,0.9674243 ]],dtype=float32)
model.get_topics()
是否可以返回与'eta'相同的分布?
即
[[1,1]]