BERT (Bidirectional Encoder Representations from Transformers)
BERT-base vs BERT-large
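BERT comes in two main configurations: base (12 layers, hidden size 768, 12 attention heads, about 110M parameters) and large (24 layers, hidden size 1024, 16 attention heads, about 340M parameters). Base is cheaper to train and serve, while large generally scores higher on benchmarks at roughly three times the parameter count.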
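The parameter counts can be roughly reproduced from the layer count and hidden size alone. The sketch below is an illustration, not an official implementation: the names `BertSize` and `count_parameters` are hypothetical, and it assumes the 30,522-token uncased WordPiece vocabulary, 512 learned position embeddings, two segment types, a feed-forward width of four times the hidden size, and a final pooler layer.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BertSize:
    """Hypothetical container for the numbers that define a BERT variant."""
    num_layers: int
    hidden_size: int
    vocab_size: int = 30522     # assumption: uncased English WordPiece vocabulary
    max_positions: int = 512    # assumption: learned position embeddings
    type_vocab_size: int = 2    # assumption: two segment (sentence A/B) types


def count_parameters(cfg: BertSize) -> int:
    """Estimate encoder parameters (weights and biases), excluding task heads."""
    h = cfg.hidden_size
    ffn = 4 * h                 # assumption: feed-forward width is 4x hidden size
    layer_norm = 2 * h          # one scale and one shift vector

    # Token, position, and segment embedding tables plus their LayerNorm.
    embeddings = (cfg.vocab_size + cfg.max_positions + cfg.type_vocab_size) * h + layer_norm

    # Query, key, value, and output projections plus the post-attention LayerNorm.
    attention = 4 * (h * h + h) + layer_norm

    # Two dense layers (h -> ffn -> h) plus the post-feed-forward LayerNorm.
    feed_forward = (h * ffn + ffn) + (ffn * h + h) + layer_norm

    # Dense layer applied to the first token's final hidden state.
    pooler = h * h + h

    return embeddings + cfg.num_layers * (attention + feed_forward) + pooler


BERT_BASE = BertSize(num_layers=12, hidden_size=768)
BERT_LARGE = BertSize(num_layers=24, hidden_size=1024)

if __name__ == "__main__":
    for name, cfg in (("base", BERT_BASE), ("large", BERT_LARGE)):
        print(f"BERT-{name}: {count_parameters(cfg) / 1e6:.1f}M parameters")
```

Under these assumptions the estimate comes to about 109.5M for base and 335.1M for large, close to the rounded 110M and 340M figures. The number of attention heads does not appear in the formula because the heads split the hidden size between them rather than adding to it.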