Chapter 63

পেপার রিপ্রোডিউস

Reproducing Papers

🔬 কেন Reproduce করবেন?

পেপার পড়া আর পেপার implement করা সম্পূর্ণ আলাদা skill। Reproduce করতে গেলে hidden assumption, hyperparameter trick, এবং engineering detail সব ধরা পড়ে। এটাই আপনাকে paper-reader থেকে researcher-এ পরিণত করে।

সঠিক পেপার বেছে নিন (প্রথম reproduction)

Code release আছে (official GitHub)।
Dataset public/downloadable।
Compute affordable (single GPU/Colab)।
4-8 পৃষ্ঠার "workshop"/"short" paper, full SOTA নয়।
উদাহরণ: simple GAN, small transformer, একটা specific ablation।

Step-by-Step Workflow

1. পেপার deeply পড়ুন

সব hyperparameter, architecture detail, training schedule, dataset split table-এ লিখুন। Appendix প্রায়ই সবচেয়ে দরকারি অংশ।

2. Environment isolate করুন

# Conda env + exact versions
conda create -n repro python=3.10
pip install torch==2.1.0 transformers==4.36 numpy==1.26

3. Data পরীক্ষা করুন

Paper-এর split (train/val/test) match হচ্ছে?
Preprocessing — tokenizer, normalization, augmentation।
Sample size + class distribution paper-এর সাথে মিলিয়ে দেখুন।

4. Model skeleton লিখুন

Architecture figure দেখে layer-by-layer implement করুন। প্রতিটা module-এর shape print করুন। Total parameter count paper-এর সাথে compare করুন (±5% তে মেলা উচিত)।

5. Training loop

# Paper-এর মতো:
# - Optimizer: AdamW, lr=3e-4, β=(0.9, 0.999), wd=0.01
# - Scheduler: cosine, warmup=1000 step
# - Batch size: 256 (gradient accumulation দিয়ে fit)
# - Seed: 42
# - Mixed precision: bf16