How to use SzegedAI/charmen-electra with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("feature-extraction", model="SzegedAI/charmen-electra", trust_remote_code=True)
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("SzegedAI/charmen-electra", trust_remote_code=True, dtype="auto")
A byte-based transformer model trained on Hungarian text. To use the model, you need a custom tokenizer, which is available at: https://github.com/szegedai/byte-offset-tokenizer.
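As a rough illustration of what byte-level input means for Hungarian text, the sketch below maps a string to its UTF-8 byte values and records which bytes belong to each character. The helper names `text_to_byte_ids` and `char_byte_spans` are hypothetical and are not the interface of the byte-offset-tokenizer repository, which may add special tokens or other processing.

```python
# Illustrative sketch only: hypothetical helpers, not the byte-offset-tokenizer API.

def text_to_byte_ids(text: str) -> list[int]:
    """Encode text as a list of UTF-8 byte values (each in 0..255)."""
    return list(text.encode("utf-8"))


def char_byte_spans(text: str) -> list[tuple[int, int]]:
    """Return the (start, end) byte span of each character in the UTF-8 encoding."""
    spans = []
    pos = 0
    for ch in text:
        width = len(ch.encode("utf-8"))  # accented Hungarian letters take 2 bytes
        spans.append((pos, pos + width))
        pos += width
    return spans


word = "árvíz"
byte_ids = text_to_byte_ids(word)
spans = char_byte_spans(word)
print(byte_ids)  # 7 byte values for 5 characters
print(spans)
```

Accented letters such as "á" and "í" occupy two bytes each, so a byte-based model sees a longer sequence than the character count suggests; this is one reason such architectures down-sample the sequence internally.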
Because the model uses a custom architecture with Gradient Boosting and down- and up-sampling, you have to enable trusted remote code when loading it:
model = AutoModel.from_pretrained("SzegedAI/charmen-electra", trust_remote_code=True)