What is BERT
Google language model for text understanding
BERT (Bidirectional Encoder Representations from Transformers)
BERT is a pre-trained language model from Google that revolutionized natural language processing (NLP).
Key Features
| Feature | Description | |---------|-------------| | Bidirectional | Analyzes context from left and right simultaneously | | Pre-training | Trained on Wikipedia + BookCorpus (3.3B words) | | Transformer | Based on attention architecture | | Fine-tuning | Easily adaptable to specific tasks |
Pre-training Tasks
- Masked Language Model (MLM) — predicting masked words
- Next Sentence Prediction (NSP) — determining sentence relationships
BERT Applications
| Task | Example | |------|---------| | Text Classification | Sentiment analysis of reviews | | NER | Extracting names, dates, organizations | | Question Answering | Answering questions from text | | Semantic Search | Searching by meaning, not words |
Model Versions
- BERT-Base — 12 layers, 110M parameters
- BERT-Large — 24 layers, 340M parameters
- RuBERT — for Russian language
- MultiBERT — 104 languages