Rust char.len_utf8用法及代碼示例

本文簡要介紹rust語言中 char.len_utf8 的用法。

用法

pub const fn len_utf8(self) -> usize

返回此 char 以 UTF-8 編碼時所需的字節數。

該字節數始終介於 1 和 4 之間，包括 1 和 4。

例子

基本用法：

let len = 'A'.len_utf8();
assert_eq!(len, 1);

let len = 'ß'.len_utf8();
assert_eq!(len, 2);

let len = 'ℝ'.len_utf8();
assert_eq!(len, 3);

let len = '💣'.len_utf8();
assert_eq!(len, 4);

&str 類型保證其內容是 UTF-8，因此我們可以比較每個代碼點表示為 char 與 &str 本身的長度：

// as chars
let eastern = '東';
let capital = '京';

// both can be represented as three bytes
assert_eq!(3, eastern.len_utf8());
assert_eq!(3, capital.len_utf8());

// as a &str, these two are encoded in UTF-8
let tokyo = "東京";

let len = eastern.len_utf8() + capital.len_utf8();

// we can see that they take six bytes total...
assert_eq!(6, tokyo.len());

// ... just like the &str
assert_eq!(len, tokyo.len());

相關用法

注：本文由純淨天空篩選整理自rust-lang.org大神的英文原創作品 char.len_utf8。非經特殊聲明，原始代碼版權歸原作者所有，本譯文未經允許或授權，請勿轉載或複製。