top of page

Playground

Besides formal and comprehensive projects, I am also interested in solving some interesting problems. Because of the 'casual' traits of these mini projects, these all could be published on GitHub. 

WechatIMG7397.jpeg
1. Mini flow

It uses python to implement a mini tensorflow framework, which supports tensor and model difinition, backpropagation, autograd, and optimizers. This is a good source to learn fundamental deep learning knowledge.  

>> GitHub

WX20221005-220821_2x.png
2. Movie Comment Spider

In this repo, I implement a web spider framework that could get the movie comment content and rate stars from the biggest movie comment website -- Douban.com. It is a good source for Chinese NLP learners and researchers to get the corpus. 

>> GitHub

WX20221005-222034_2x.png
3. Chinese News Person’s Opinion Mining

In this repo, I implement an unsupervised learning method to mine the persons their statements in Chinese news.  It uses word embedding, heuristic search, and dependency parsing methods, could extract the exact person and his / her statements in complicated and complex sentences. 

>> GitHub

WechatIMG7599.png
4. CIKM Text Challenge

In this repo, I implement a text classification challenge for CIKM. Using textCNN and embedding methods could get the top 10% in this competition. 

>> GitHub

WechatIMG7608.png
5. An experiment of Recurrent Lab

In this repo, I implement an RNN network from scratch. Because the RNNs are difficult to be understood and analyzed. Therefore, this repo tests the different occasions and monitors the weight and bias change.

>> GitHub

WechatIMG7604.jpeg
6. Machine Translation using Sequence to Sequence

In this repo, I implement a Neural Machine Translation model. Based on the Seq2Seq and Attention.

>> GitHub

WechatIMG7610.png
7. Chinse PM2.5 Data Crawler

I implement a web spider in this repo to get PM2.5 information about different Chinese cities. It is a good resource for learners and researchers. 

>> GitHub

WechatIMG7611.png
8. Gambling Transaction Mining

It uses semi-supervised learning and unsupervised learning to find illegal transactions. 

>> GitHub

WechatIMG7606.png
9. AI Drawing Cat

By a very simple intimation method, the program could draw an input picture as an animated drawing process. 

>> GitHub

WechatIMG7613.jpeg
10. TextCNN

In this project, I implement a text CNN classifier, which is a general tool for classifying text.

>> GitHub

WechatIMG7620.jpeg
11. Paper Key Entities Extractor

In this project, using semi-supervised and embedding ways could get the important key entities and words from a paper. If you input some initial keywords, the model will give you more about the related entities in this paper.

>> GitHub

WechatIMG7616.png
12. Toxic Text Classification

In this project, using word-embedding, text CNN and GRU, I implement a model to solve the Kaggle toxic text classification problem, which became the top 10% in this challenge after some simple finetuning.

>> GitHub

WechatIMG7617.jpeg
13. Majority Voting Nets

In this project, I want to try a method to solve the weak-supervised learning problem. By building different minor networks, the network could give the final result by voting. Although this is useful, I find it is just a type of ensemble learning.

>> GitHub

WechatIMG7619.jpeg
14. Music Embedding

In this project, I use the word embedding way to embed the music elements into vectors. After getting these embeddings, we could generate music just using the Seq2Seq model or similar Language Model in NLP. 

>> GitHub

bottom of page