OpenDataLoader PDF
Version: 2.2.0
Copyright 2025-2026 Hancom, Inc.
License: Apache License, Version 2.0
Source: https://github.com/opendataloader-project/opendataloader-pdf

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

---

Usage in kordoc:
The PDF table border detection algorithm (src/pdf/line-detector.ts) is derived
from OpenDataLoader PDF's TableBorderBuilder. The algorithm was re-implemented
in TypeScript with modifications for pdfjs-dist compatibility and Korean
document optimization.

Modifications include:
- TypeScript implementation (original: Java)
- pdfjs-dist v4/v5 operator list parsing (original: Apache PDFBox)
- Korean text gap merging within table cells
- Adjacent grid merging for split table recovery
- Coordinate clustering with configurable tolerances
