Skip to content

Conversation

nightbloos
Copy link

Due to the reason that sometimes URLs can be to some pages/files that are a pretty big one - we want to be able to abort the reading body for those pages.

Alexandr Filioglo added 5 commits August 7, 2020 13:18
- added support for og:type
- fixed incorrect handling of relative paths
- changed to "silent" checks Content-Length in Head requests
…o 2 different functions `GetDocument` and `ParseDocument` (#2)
Due to the reason that we "re-create" new link - only by scheme, host, and path - is present risk to lose some other data from the original link.
Previously `/some/path.png?param=value`, was transformed into `http://mydomain.com/some/path.png`
Now this issue should be fixed, and the output should be `http://mydomain.com/some/path.png?param=value`
For some ULRs was found that we can't get for strange reason the `og:type` data.
One of this ULRs - was youtube links.
Was detected that in YouTube they keep metadata in body (and not in head as other normal services).
And because previously the criteria for breaking loop of procession of tokens was "we have Title + description + ogImage and we passed head" - we were not able to process all other optional meta after that we pass head.

Now we are able to control how much tokens we can process before breaking loop (or if we found required optional fields already)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants