当前位置：首页 > news >正文

Python str 字符串方法的全面、系统、分类详解

news 2026/7/7 1:44:35

前提：字符串是不可变对象

所有字符串方法不会修改原字符串，而是返回新字符串（或整数、布尔值等）。

s = "hello" s.upper() # 返回 "HELLO"，但 s 仍是 "hello"

一、大小写转换类

方法	功能	示例
`str.upper()`	转为大写	`"Hello".upper()`→`"HELLO"`
`str.lower()`	转为小写	`"HELLO".lower()`→`"hello"`
`str.capitalize()`	首字母大写，其余小写	`"hello WORLD".capitalize()`→`"Hello world"`
`str.title()`	每个单词首字母大写	`"hello world".title()`→`"Hello World"`
`str.swapcase()`	大小写互换	`"Hello".swapcase()`→`"hELLO"`
`str.casefold()`	更彻底的小写（用于无大小写比较）	`"ß".casefold()`→`"ss"`

casefold()vslower()：
lower()仅做基本小写转换
casefold()用于语言无关的大小写折叠（如德语 ß → ss），适合做相等性比较：
"Maße".casefold() == "MASSE".casefold() # True

二、查找与索引类

方法	功能	返回值	异常
`str.find(sub[, start[, end]])`	查找子串首次出现位置	索引（int）或`-1`	无
`str.rfind(sub[, start[, end]])`	从右查找子串首次出现	索引或`-1`	无
`str.index(sub[, start[, end]])`	同`find`，但未找到抛异常	索引	`ValueError`
`str.rindex(sub[, start[, end]])`	从右查找，未找到抛异常	索引	`ValueError`
`str.count(sub[, start[, end]])`	统计子串出现次数	整数 ≥0	无

推荐：优先用find()（安全），除非你明确希望未找到时崩溃。

text = "apple apple" print(text.find("apple")) # 0 print(text.rfind("apple")) # 6 print(text.count("p")) # 4

三、判断类（返回 bool）

通用判断

方法	条件
`str.startswith(prefix)`	以指定前缀开头（`prefix`可为元组）
`str.endswith(suffix)`	以指定后缀结尾（`suffix`可为元组）

url = "https://example.com" url.startswith(("http://", "https://")) # True file = "data.csv" file.endswith((".csv", ".txt")) # True

内容类型判断

方法	说明	注意事项
`str.isalpha()`	所有字符为字母（Unicode 字母）	空格、数字、符号 → False
`str.isdigit()`	所有字符为数字（含上标²、下标等）	不含小数点
`str.isnumeric()`	所有字符为数值字符（含汉字“一二三”）	范围最广
`str.isdecimal()`	所有字符为十进制数字（0-9）	最严格，仅`0123456789`
`str.isalnum()`	字母或数字（`isalpha() or isdigit()`）
`str.isspace()`	所有字符为空白（空格、\t、\n、\r 等）	空字符串 → False
`str.islower()`	至少一个字母且全为小写	无字母 → False
`str.isupper()`	至少一个字母且全为大写	无字母 → False
`str.istitle()`	每个单词首字母大写（其余小写）	`"Hello World".istitle()`→ True
`str.isidentifier()`	是否为合法 Python 标识符	`"123abc".isidentifier()`→ False

数字判断区别示例：

s1 = "123" s2 = "²" # 上标 2 s3 = "一二三" print(s1.isdecimal(), s1.isdigit(), s1.isnumeric()) # True True True print(s2.isdecimal(), s2.isdigit(), s2.isnumeric()) # False True True print(s3.isdecimal(), s3.isdigit(), s3.isnumeric()) # False False True

四、去除空白/指定字符

方法	功能	默认行为
`str.strip([chars])`	去除首尾字符	去除空白（空格、\t、\n 等）
`str.lstrip([chars])`	仅去除左侧
`str.rstrip([chars])`	仅去除右侧

chars是字符集合，不是前缀/后缀！
它会移除任意顺序的这些字符，直到遇到不在chars中的字符。

" hello ".strip() # "hello" "www.example.com".strip("wcom.") # "example"（注意：'e' 保留，因为 '.' 被移除后 'e' 不在 chars 中） "xyxxyyhelloxyx".strip("xy") # "hello"

五、分割与连接

分割

方法	功能	特点
`str.split(sep=None, maxsplit=-1)`	按分隔符分割	`sep=None`时按任意空白分割，忽略连续空白
`str.rsplit(sep=None, maxsplit=-1)`	从右开始分割	当`maxsplit`有效时与`split`不同
`str.splitlines(keepends=False)`	按行分割	自动处理`\n`,`\r\n`,`\r`等；`keepends=True`保留换行符

"a b c".split() # ['a', 'b', 'c'] "a,b,c".split(",", 1) # ['a', 'b,c'] "line1\nline2\r\nline3".splitlines() # ['line1', 'line2', 'line3']

连接

方法	功能
`str.join(iterable)`	将可迭代对象中的字符串用当前字符串连接

"-".join(["a", "b", "c"]) # "a-b-c" "".join(map(str, [1, 2, 3])) # "123"

最佳实践：拼接大量字符串时，务必使用join()，而非+循环。

六、替换与修改

方法	功能	参数
`str.replace(old, new[, count])`	替换子串	`count`：最多替换次数
`str.expandtabs(tabsize=8)`	将`\t`替换为空格	指定制表符宽度

"hello world".replace("l", "L", 2) # "heLLo world" "col1\tcol2".expandtabs(4) # "col1 col2"

七、对齐与填充

方法	功能	默认填充字符
`str.center(width[, fillchar])`	居中对齐	空格`' '`
`str.ljust(width[, fillchar])`	左对齐	空格
`str.rjust(width[, fillchar])`	右对齐	空格
`str.zfill(width)`	用`'0'`填充到指定宽度（考虑符号）	`'0'`

"42".center(10, '-') # "---42----" "42".zfill(5) # "00042" "-42".zfill(5) # "-0042"（符号位保留）

八、编码与字节转换

方法	功能	常用参数
`str.encode(encoding='utf-8', errors='strict')`	转为`bytes`	`encoding`:`'utf-8'`,`'ascii'`,`'gbk'`等 `errors`:`'strict'`（默认）,`'ignore'`,`'replace'`

"你好".encode('utf-8') # b'\xe4\xbd\xa0\xe5\xa5\xbd' "café".encode('ascii', errors='replace') # b'caf?'

注意：str类型没有.decode()方法！解码是bytes的方法：
b"hello".decode('utf-8') # "hello"

九、分区（Partitioning）

方法	功能	返回值
`str.partition(sep)`	从左分割为 (前, sep, 后)	三元组（tuple）
`str.rpartition(sep)`	从右分割	三元组

优势：即使sep不存在，也不会报错，而是返回('', '', 原字符串)或(原字符串, '', '')。

"foo@bar.com".partition("@") # ('foo', '@', 'bar.com') "no-at-sign".partition("@") # ('no-at-sign', '', '')

十、格式化相关（现代方式见 f-string）

虽然 f-string 是首选，但以下方法仍有用：

方法	用途
`str.format(args, *kwargs)`	格式化模板
`str.format_map(mapping)`	使用字典映射格式化（支持缺失键处理）

"{name} is {age} years old".format(name="Alice", age=30) "{name} lives in {country}".format_map({"name": "Bob"}) # KeyError! # 可继承 dict 实现 __missing__ 避免错误

十一、其他实用方法

方法	功能
`str.translate(table)`	按映射表批量替换字符（高效）
`str.maketrans(x[, y[, z]])`	创建`translate`用的映射表

# 删除所有元音字母 trans = str.maketrans('', '', 'aeiou') "hello world".translate(trans) # "hll wrld" # 字符替换 trans = str.maketrans('ae', 'AE') "apple".translate(trans) # "AppLE"

总结：高频使用场景速查

场景	推荐方法
拼接列表 → 字符串	`"sep".join(list)`
安全查找子串	`.find()`/`.startswith()`
清理输入	`.strip()`+`.lower()`
分割 CSV 行	`.split(",")`
按行处理文本	`.splitlines()`
数字字符串补零	`.zfill(width)`
大小写无关比较	`str1.casefold() == str2.casefold()`
批量字符替换/删除	`.translate()`+`.maketrans()`