Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
antegamisou
on June 10, 2024
|
parent
|
context
|
favorite
| on:
Elsevier embeds a hash in the PDF metadata that is...
> 1. Download the PDF from Elsevier.
2. Convert pages to PNG images
3. Merge them into a new PDF
4. Run them through OCR
out_of_protocol
on June 10, 2024
[–]
Wouldn't help if marks are visible, e.g. some regular string "copy №1234" somewhere or something more complicated like uneven spacing between lines or words
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
2. Convert pages to PNG images
3. Merge them into a new PDF
4. Run them through OCR