或许你不知道的10条SQL技巧

架构师之路 · 公众号 · 架构 · 2017-07-16 12:09

正文

这几天在写索引，想到一些有意思的 TIPS ，希望大家有收获。

一、一些常见的 SQL 实践

（ 1 ）负向条件查询不能使用索引

select * from order where status != 0 and stauts != 1

not in/not exists 都不是好习惯

可以优化为 in 查询：

select * from order where status in (2,3)

（ 2 ）前导模糊查询不能使用索引

select * from order where desc like '%XX'

而非前导模糊查询则可以：

select * from order where desc like 'XX%'

（ 3 ）数据区分度不大的字段不宜使用索引

select * from user where sex =1

原因：性别只有男，女，每次过滤掉的数据很少，不宜使用索引。

经验上，能过滤 80% 数据时就可以使用索引。对于订单状态，如果状态值很少，不宜使用索引，如果状态值很多，能够过滤大量数据，则应该建立索引。

（ 4 ）在属性上进行计算不能命中索引

select * from order where YEAR(date) < = '2017'

即使 date 上建立了索引，也会全表扫描，可优化为值计算：

select * from order where date < = CURDATE()

或者：

select * from order where date < = '2017-01-01'

二、并非周知的 SQL 实践

（ 5 ）如果业务大部分是单条查询，使用 Hash 索引性能更好，例如用户中心

select * from user where uid=?
select * from user where login_name=?

原因：

B-Tree 索引的时间复杂度是 O(log(n))

Hash 索引的时间复杂度是 O(1)

（ 6 ）允许为 null 的列，查询有潜在大坑

单列索引不存 null 值，复合索引不存全为 null 的值，如果列允许为 null ，可能会得到“不符合预期”的结果集

select * from user where name != 'shenjian'

如果 name 允许为 null ，索引不存储 null 值，结果集中不会包含这些记录。

所以，请使用not null约束以及默认值。

（ 7 ）复合索引最左前缀，并不是值 SQL 语句的 where 顺序要和复合索引一致

用户中心建立了 (login_name, passwd) 的复合索引

select * from user where login_name=? and passwd=?
select * from user where passwd=? and login_name=?

都能够命中索引

select * from user where login_name=?

也能命中索引 ，满足复合索引最左前缀

select * from user where passwd=?

不能命中索引 ，不满足复合索引最左前缀

（ 8 ）使用 ENUM 而不是字符串

ENUM保存的是 TINYINT ，别在枚举中搞一些“中国”“北京”“技术部”这样的字符串，字符串空间又大，效率又低。

三、小众但有用的 SQL 实践

（ 9 ）如果明确知道只有一条结果返回，

或许你不知道的10条SQL技巧

正文

请到「今天看啥」查看全文