Struct tokenizers::models::wordpiece::WordPieceTrainer
source · pub struct WordPieceTrainer { /* private fields */ }
Expand description
Trains a WordPiece
model.
Implementations§
source§impl WordPieceTrainer
impl WordPieceTrainer
pub fn min_frequency(&self) -> u64
pub fn set_min_frequency(&mut self, freq: u64)
pub fn vocab_size(&self) -> usize
pub fn set_vocab_size(&mut self, size: usize)
pub fn show_progress(&self) -> bool
pub fn set_show_progress(&mut self, show_progress: bool)
pub fn special_tokens(&self) -> &[AddedToken]
pub fn set_special_tokens(&mut self, special_tokens: Vec<AddedToken>)
pub fn limit_alphabet(&self) -> Option<usize>
pub fn set_limit_alphabet(&mut self, limit: Option<usize>)
pub fn initial_alphabet(&self) -> &HashSet<char>
pub fn set_initial_alphabet(&mut self, alphabet: HashSet<char>)
pub fn continuing_subword_prefix(&self) -> &Option<String>
pub fn set_continuing_subword_prefix(&mut self, prefix: Option<String>)
pub fn end_of_word_suffix(&self) -> &Option<String>
pub fn set_end_of_word_suffix(&mut self, suffix: Option<String>)
pub fn builder() -> WordPieceTrainerBuilder
pub fn train(&self, model: &mut WordPiece) -> Result<Vec<AddedToken>>
Trait Implementations§
source§impl Clone for WordPieceTrainer
impl Clone for WordPieceTrainer
source§fn clone(&self) -> WordPieceTrainer
fn clone(&self) -> WordPieceTrainer
Returns a copy of the value. Read more
1.0.0 · source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source
. Read moresource§impl Default for WordPieceTrainer
impl Default for WordPieceTrainer
source§fn default() -> WordPieceTrainer
fn default() -> WordPieceTrainer
Returns the “default value” for a type. Read more
source§impl<'de> Deserialize<'de> for WordPieceTrainer
impl<'de> Deserialize<'de> for WordPieceTrainer
source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
source§impl From<WordPieceTrainer> for TrainerWrapper
impl From<WordPieceTrainer> for TrainerWrapper
source§fn from(from: WordPieceTrainer) -> Self
fn from(from: WordPieceTrainer) -> Self
Converts to this type from the input type.
source§impl Serialize for WordPieceTrainer
impl Serialize for WordPieceTrainer
source§impl Trainer for WordPieceTrainer
impl Trainer for WordPieceTrainer
type Model = WordPiece
source§fn train(&self, model: &mut WordPiece) -> Result<Vec<AddedToken>>
fn train(&self, model: &mut WordPiece) -> Result<Vec<AddedToken>>
The actual training method. This will return a new trained Model as well as a list
of
special_tokens
to be added directly to the tokenizer along with the model.source§fn should_show_progress(&self) -> bool
fn should_show_progress(&self) -> bool
Whether we should show progress during the training.
Auto Trait Implementations§
impl Freeze for WordPieceTrainer
impl RefUnwindSafe for WordPieceTrainer
impl Send for WordPieceTrainer
impl Sync for WordPieceTrainer
impl Unpin for WordPieceTrainer
impl UnwindSafe for WordPieceTrainer
Blanket Implementations§
source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more