<?xml version="1.0"?>
<?xml-stylesheet type="text/css" href="http://www.cslt.org/mediawiki/skins/common/feed.css?303"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="zh-cn">
		<id>http://www.cslt.org/mediawiki/index.php?action=history&amp;feed=atom&amp;title=2014-03-07</id>
		<title>2014-03-07 - 版本历史</title>
		<link rel="self" type="application/atom+xml" href="http://www.cslt.org/mediawiki/index.php?action=history&amp;feed=atom&amp;title=2014-03-07"/>
		<link rel="alternate" type="text/html" href="http://www.cslt.org/mediawiki/index.php?title=2014-03-07&amp;action=history"/>
		<updated>2026-04-04T16:08:57Z</updated>
		<subtitle>本wiki的该页面的版本历史</subtitle>
		<generator>MediaWiki 1.23.3</generator>

	<entry>
		<id>http://www.cslt.org/mediawiki/index.php?title=2014-03-07&amp;diff=9296&amp;oldid=prev</id>
		<title>Zhaomy：/* GFbank */</title>
		<link rel="alternate" type="text/html" href="http://www.cslt.org/mediawiki/index.php?title=2014-03-07&amp;diff=9296&amp;oldid=prev"/>
				<updated>2014-03-07T04:58:55Z</updated>
		
		<summary type="html">&lt;p&gt;‎&lt;span dir=&quot;auto&quot;&gt;&lt;span class=&quot;autocomment&quot;&gt;GFbank&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;table class='diff diff-contentalign-left'&gt;
				&lt;col class='diff-marker' /&gt;
				&lt;col class='diff-content' /&gt;
				&lt;col class='diff-marker' /&gt;
				&lt;col class='diff-content' /&gt;
				&lt;tr style='vertical-align: top;'&gt;
				&lt;td colspan='2' style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;←上一版本&lt;/td&gt;
				&lt;td colspan='2' style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;2014年3月7日 (五) 04:58的版本&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;第56行：&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;第56行：&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;#160;&amp;#160; &amp;#160;  clean&amp;#160; &amp;#160;  25db&amp;#160; &amp;#160; 5db&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;#160;&amp;#160; &amp;#160;  clean&amp;#160; &amp;#160;  25db&amp;#160; &amp;#160; 5db&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;gf&amp;#160;  4.22&amp;#160; &amp;#160; &amp;#160; 5.60&amp;#160; &amp;#160; 73.03&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;gf&amp;#160;  4.22&amp;#160; &amp;#160; &amp;#160; 5.60&amp;#160; &amp;#160; 73.03&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;fb &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &lt;/del&gt;5.87&amp;#160; &amp;#160; 84.12&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;fb &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;&amp;#160; 4.31&amp;#160; &amp;#160; &amp;#160; &lt;/ins&gt;5.87&amp;#160; &amp;#160; 84.12&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;lt;/pre&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;lt;/pre&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;===Engine optimization===&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;===Engine optimization===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Zhaomy</name></author>	</entry>

	<entry>
		<id>http://www.cslt.org/mediawiki/index.php?title=2014-03-07&amp;diff=9295&amp;oldid=prev</id>
		<title>Cslt：以内容“==Resoruce Building== * Current text resource has been re-arranged and listed  == AM development ==  === Sparse DNN ===  * Optimal Brain Damage(OBD).   # GA-based block...”创建新页面</title>
		<link rel="alternate" type="text/html" href="http://www.cslt.org/mediawiki/index.php?title=2014-03-07&amp;diff=9295&amp;oldid=prev"/>
				<updated>2014-03-07T02:51:26Z</updated>
		
		<summary type="html">&lt;p&gt;以内容“==Resoruce Building== * Current text resource has been re-arranged and listed  == AM development ==  === Sparse DNN ===  * Optimal Brain Damage(OBD).   # GA-based block...”创建新页面&lt;/p&gt;
&lt;p&gt;&lt;b&gt;新页面&lt;/b&gt;&lt;/p&gt;&lt;div&gt;==Resoruce Building==&lt;br /&gt;
* Current text resource has been re-arranged and listed&lt;br /&gt;
&lt;br /&gt;
== AM development ==&lt;br /&gt;
&lt;br /&gt;
=== Sparse DNN ===&lt;br /&gt;
&lt;br /&gt;
* Optimal Brain Damage(OBD). &lt;br /&gt;
&lt;br /&gt;
# GA-based block sparsity&lt;br /&gt;
&lt;br /&gt;
=== Efficient DNN training ===&lt;br /&gt;
&lt;br /&gt;
# Asymmetric window: Great improvement on training set(WER 34% to 24%), however the improvement is lost on test. Overfitting? &lt;br /&gt;
&lt;br /&gt;
===Multi GPU training===&lt;br /&gt;
* Error encountered&lt;br /&gt;
&lt;br /&gt;
===GMM - DNN co-training===&lt;br /&gt;
* Error encountered&lt;br /&gt;
&lt;br /&gt;
=== Multilanguage training===&lt;br /&gt;
&lt;br /&gt;
# Pure Chinese training reached 4.9%&lt;br /&gt;
# Chinese + English reduced to 7.9%&lt;br /&gt;
# English phone set should discriminate beginning phone and ending phone&lt;br /&gt;
# Should set up multilingual network structure which shares low layers but separate languages at high layers&lt;br /&gt;
&lt;br /&gt;
===Noise training===&lt;br /&gt;
&lt;br /&gt;
* Train with wsj database by corrupting data with various noise types&lt;br /&gt;
:* White noise + car noise training partially completed&lt;br /&gt;
:* Mixture training produces better performance for both car and white noise&lt;br /&gt;
:* Unknown noise testing is on progress&lt;br /&gt;
&lt;br /&gt;
===AMR compression re-training===&lt;br /&gt;
* WeChat uses AMR compression method, which requires adaptation for our AM&lt;br /&gt;
* Test AMR &amp;amp; non-AMR model &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
               test-wav       WAV     AMR&lt;br /&gt;
        model&lt;br /&gt;
        WAV                   4.31     26.09&lt;br /&gt;
        AMR                  13.80      6.77 &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Prepare to do adaptation &lt;br /&gt;
&lt;br /&gt;
===GFbank===&lt;br /&gt;
* Finished the first round of gfbank training &amp;amp; test&lt;br /&gt;
* The same gmm model (mfcc feature) was used to get the alignment&lt;br /&gt;
* Traing fbank &amp;amp; gfbank based on the mfcc alignment&lt;br /&gt;
* Clean training and noise test&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
     clean     25db    5db&lt;br /&gt;
gf   4.22      5.60    73.03&lt;br /&gt;
fb             5.87    84.12&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Engine optimization===&lt;br /&gt;
&lt;br /&gt;
* Investigating LOUDS FST. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Word to Vector==&lt;br /&gt;
&lt;br /&gt;
* Test a training toolkit Standford University, which can involve global information into word2vector training&lt;br /&gt;
:* C++ implementation (instead of python) for data pre-processing. Failed. Just use python.&lt;br /&gt;
&lt;br /&gt;
* Basic wordvector plus global sense&lt;br /&gt;
:* 1 MB corpus costs 5 mins,vocab size 16698&lt;br /&gt;
:* 10 MB corpus costs about 82 mins vocab size 56287&lt;br /&gt;
&lt;br /&gt;
* Improved wordvector with multi sense&lt;br /&gt;
:* Almost impossible with the toolkit&lt;br /&gt;
:* Can think of pre-training vectors and then do clusering&lt;br /&gt;
&lt;br /&gt;
* WordVecteor-based keyword extraction&lt;br /&gt;
:* Prepared 7 category totally 500+ articles&lt;br /&gt;
:* A problem in keyword identification. Fix it by using the article vector space&lt;br /&gt;
&lt;br /&gt;
* Investigating Senna toolkit from NEC. Intending to implement POS tagging based on word vectors. &lt;br /&gt;
&lt;br /&gt;
==LM development==&lt;br /&gt;
&lt;br /&gt;
===NN LM===&lt;br /&gt;
&lt;br /&gt;
* Character-based NNLM (6700 chars, 7gram), 500M data training done.&lt;br /&gt;
:* Performance lower than word-based NNLM&lt;br /&gt;
:* Prepare to run boundary-involved char NNLM&lt;br /&gt;
&lt;br /&gt;
* WordVector-based word and char NNLM training done&lt;br /&gt;
:* Google wordvecotr-based NNLM is worse than random initialized NNLM&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===3T Sogou LM===&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* Improved training&lt;br /&gt;
:* 3T LM   + Tencent 80k lM: performance worse than the original 80K LM &lt;br /&gt;
:* Need to check if it is caused by the mismatched vocabu9lary&lt;br /&gt;
:* 3T LM  + QA LM : use online1 as the EM target, performance worse than QA LM&lt;br /&gt;
:* Probably due to the incorrect EM target&lt;br /&gt;
&lt;br /&gt;
==QA Matching==&lt;br /&gt;
&lt;br /&gt;
* Working on edit FST for fuzzy matching&lt;br /&gt;
* TF/IDF score matching completed&lt;br /&gt;
&lt;br /&gt;
==Embedded development==&lt;br /&gt;
&lt;br /&gt;
* CLG embedded decoder is almost done. Online compiler is on progress.&lt;br /&gt;
* English scoring is under go&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Speech QA==&lt;br /&gt;
&lt;br /&gt;
* N-best with entity LM was analyzed&lt;br /&gt;
* Entity-class LM comparision&lt;br /&gt;
:* re-segmentation &amp;amp; re-train&lt;br /&gt;
:* SRILM class-based LM ???&lt;br /&gt;
:* Subgraph integration from Zhiyong&lt;br /&gt;
&lt;br /&gt;
* WER summary is done&lt;br /&gt;
* Prepare to compose a paper&lt;/div&gt;</summary>
		<author><name>Cslt</name></author>	</entry>

	</feed>