“2026-01-19”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
第133行: 第133行:
 
||
 
||
 
* LLM
 
* LLM
** Got swarm crosswords experiment with metric detection working
+
** Get swarm crosswords experiment with metric detection working
 
** Implement multi-process of LLM instances in experiments
 
** Implement multi-process of LLM instances in experiments
 
*** p.s.: try multi-threading, fails because a single Python interpreter only initializes one CUDA context, causing resource contention between different instances.
 
*** p.s.: try multi-threading, fails because a single Python interpreter only initializes one CUDA context, causing resource contention between different instances.

2026年1月19日 (一) 10:49的版本

People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • Recheck AI textbooks for primary and middle schools.
Lantian Li
Wenqiang Du
  • Statistics the accounts for three companies, our laboratory and AIGE.
  • Check AI textbooks
Yang Wei
Ying Shi
  • Thesis
  • Huawei multi-talker ASR project(Mandarin 2-mix test with FunASR-Nano-800M)
    • ground-truth WER 3.89%
    • separated WER 12.11%
Yue Gu
  • write my Phd thesis
  • the seminar
Lily
  • Participated in reviewing the AI handbooks (primary/middle/high school)
  • Compiled course materials for the AI Handbook (Senior Edition)
Pengqi Li
  • Writing the Experiments section.
    • Focus on hard-to-explain results by formulating hypotheses and conducting deeper analysis.
Junming Yuan
  • Aug-MT-HuBERT(failed)
    • Continued pre-training for 600K steps. there is still no improvement observed on clean-speech tasks.
      • Aug-MT-HuBERT has more substitution errors in clean ASR adaptation.
  • Incorporate a "learn-not-to-listen" mechanism into MT-HuBERT and retrain the backbone. (in progress)
    • At 425K steps, PR(PER%): 8.18%, ASR(WER%): 9.32%, SD(DER%): 5.05%
  • prepare the slides of the seminar
  • middle school AI textbook checking(picture, table)
Yu Zhang
  • GPU Util: [1]
  • LLM
    • Rewrite the graph connectivity of Swarm MMLU (the original topology does not align with our expectations).
    • Add context segmentation logic to the ECS computation, so that we can trace which preceding node a given text segment originates from.
Junhui Chen
  • LLM
    • Get swarm crosswords experiment with metric detection working
    • Implement multi-process of LLM instances in experiments
      • p.s.: try multi-threading, fails because a single Python interpreter only initializes one CUDA context, causing resource contention between different instances.
Jiaying Wang
Xiaoxue Luo
  • 2-5mix multi_head separation model for Huawei project[2]
  • prepare 2mix speech separation results for ASR test
Bochao Hu
  • read some papers
  • refactor P2S code, using ms-swift and adding n-best seq to input.
Hongcheng Zhang
  • Complete the training of short audio descriptions for AudioSet-SL in WavCaps
    • METEOR: 0.3373, ROUGE-L: 0.3121, BLEU-4 : 0.0769, CIDEr : 0.6207
Weiman Sun
  • Label audioset vedio dataset
  • Analyze the representation of MT-LLM inference and review relevant literature