Token level embeddings from BERT model on mxnet and gluonnlp
Keras implementation of BERT with pre-trained weights
Chainer implementation of "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
Implementation of BERT that could load official pre-trained models for feature extraction and prediction
BERT-NER (nert-bert) with google bert https://github.com/google-research.
TensorFlow code and pre-trained models for BERT
A BERT model for scientific text.
Code for paper Fine-tune BERT for Extractive Summarization