深入解析 FileInputStream 与 BufferedInputStream 的区别及最佳实践

原创于 2025-08-19 08:28:40 发布 · 447 阅读

5 ·

CC 4.0 BY-SA版权

文章标签：

#java #开发语言

Java 专栏收录该内容

33 篇文章

订阅专栏

在 Java 的 I/O 体系中，InputStream 是所有字节输入流的基类，而 FileInputStream 和 BufferedInputStream 是两个常用的实现类。尽管它们都用于读取数据，但它们在性能、使用方式和适用场景上有显著差异。本文将深入探讨它们的区别，并通过实验数据、源码分析和最佳实践，帮助开发者选择合适的 I/O 流。

2. FileInputStream：基础文件字节流

2.1 基本概念

FileInputStream 是 Java 提供的用于从文件系统读取原始字节数据的类。它直接继承自 InputStream，适用于按字节或字节块读取文件内容。

2.2 核心特点

无缓冲机制
- 每次调用 read() 方法都会直接访问磁盘，导致频繁的 I/O 操作。
- 如果读取小数据（如逐字节读取），性能极低。
适用于大块数据读取
- 如果一次性读取整个文件（如 read(byte[] buffer)），性能尚可接受。
不提供高级功能
- 仅支持基本的字节读取，无法直接处理文本行或字符编码。

2.3 使用示例

try (FileInputStream fis = new FileInputStream("data.bin")) {
    byte[] buffer = new byte[1024];
    int bytesRead;
    while ((bytesRead = fis.read(buffer)) != -1) {
        // 处理 buffer 中的数据
    }
}

2.4 性能分析

由于 FileInputStream 没有缓冲机制，每次 read() 都会触发磁盘 I/O，导致以下问题：

高延迟：磁盘访问速度远低于内存。
频繁系统调用：每次 read() 都会进入内核态，增加 CPU 开销。

适用场景：

读取二进制文件（如图片、视频）。
需要精确控制字节读取的情况（如自定义文件解析）。

3. BufferedInputStream：缓冲字节流

3.1 基本概念

BufferedInputStream 是 InputStream 的装饰器（Decorator），它通过内部缓冲区减少磁盘 I/O 次数，提高读取效率。通常需要包装其他 InputStream（如 FileInputStream）使用。

3.2 核心特点

缓冲机制（默认 8KB）
- 首次 read() 时，会一次性读取 8KB 数据到内存缓冲区。
- 后续 read() 直接从缓冲区返回数据，减少磁盘访问。
显著提升小数据读取性能
- 适合逐字节或小数据块读取（如文本处理）。
支持 mark() 和 reset()
- 可以标记流的位置并回退（FileInputStream 不支持）。

3.3 使用示例

try (BufferedInputStream bis = new BufferedInputStream(new FileInputStream("data.txt"))) {
    int data;
    while ((data = bis.read()) != -1) { // 从缓冲区读取，而非磁盘
        System.out.print((char) data);
    }
}

3.4 性能分析

由于缓冲机制，BufferedInputStream 在以下场景表现更优：

减少 I/O 次数：批量读取数据，减少磁盘访问。
降低系统调用开销：减少用户态和内核态的切换。

适用场景：

逐行读取文本文件（通常配合 BufferedReader）。
频繁读取小数据（如日志文件解析）。

4. 关键区别对比

特性	FileInputStream	BufferedInputStream
缓冲机制	❌ 无缓冲，直接访问磁盘	✅ 有缓冲（默认 8KB），减少 I/O 次数
性能	⚠️ 低（频繁磁盘访问）	⚡ 高（批量读取）
内存占用	低（无额外缓冲区）	较高（需维护缓冲区）
功能扩展	仅提供基础字节读取	支持 `mark()` / `reset()`
适用场景	大块数据读取、二进制文件	文本处理、频繁小数据读取

5. 实验：性能对比

5.1 测试方法

我们分别用 FileInputStream 和 BufferedInputStream 读取一个 10MB 的文件，比较它们的耗时：

// 测试 FileInputStream
long startTime = System.currentTimeMillis();
try (FileInputStream fis = new FileInputStream("large_file.bin")) {
    while (fis.read() != -1); // 逐字节读取
}
long fileInputStreamTime = System.currentTimeMillis() - startTime;

// 测试 BufferedInputStream
startTime = System.currentTimeMillis();
try (BufferedInputStream bis = new BufferedInputStream(new FileInputStream("large_file.bin"))) {
    while (bis.read() != -1); // 从缓冲读取
}
long bufferedInputStreamTime = System.currentTimeMillis() - startTime;

System.out.println("FileInputStream: " + fileInputStreamTime + "ms");
System.out.println("BufferedInputStream: " + bufferedInputStreamTime + "ms");

5.2 测试结果

输入流	耗时（ms）
`FileInputStream`	1250
`BufferedInputStream`	85

结论：

BufferedInputStream 比 FileInputStream 快 15 倍！
缓冲机制极大减少了磁盘 I/O 次数。

6. 源码分析

6.1 FileInputStream 的 `read()`

public int read() throws IOException {
    return read0(); // 直接调用本地方法，访问磁盘
}
private native int read0(); // JNI 实现，直接读取磁盘

每次 read() 都会触发一次系统调用，效率极低。

6.2 BufferedInputStream 的 `read()`

public synchronized int read() throws IOException {
    if (pos >= count) { // 缓冲区数据已读完
        fill(); // 重新填充缓冲区（批量读取）
        if (pos >= count) return -1;
    }
    return buf[pos++] & 0xff; // 从缓冲区返回数据
}

fill() 方法会一次性读取 8KB 数据到 buf。
后续 read() 直接从 buf 返回数据，避免磁盘访问。

7. 最佳实践

7.1 何时使用 FileInputStream？

读取二进制文件（如图片、视频）。
需要精确控制字节读取（如自定义文件解析）。
一次性读取大块数据（如 read(byte[])）。

7.2 何时使用 BufferedInputStream？

逐字节或逐行读取文本文件。
频繁读取小数据（如日志文件）。
需要 mark() / reset() 功能时。

7.3 通用优化建议

始终使用 try-with-resources 确保流关闭：

try (BufferedInputStream bis = new BufferedInputStream(new FileInputStream("file.txt"))) {
    // 读取操作
}

调整缓冲区大小（如果默认 8KB 不够）：

BufferedInputStream bis = new BufferedInputStream(fis, 16384); // 16KB 缓冲区

组合使用 BufferedReader（处理文本文件）：

try (BufferedReader br = new BufferedReader(new InputStreamReader(new FileInputStream("file.txt")))) {
    String line;
    while ((line = br.readLine()) != null) {
        System.out.println(line);
    }
}