summaryrefslogtreecommitdiffstats
path: root/test/test_utils.py
Commit message (Collapse)AuthorAgeFilesLines
...
* [utils] Improve urljoinSergey M․2016-12-171-0/+3
|
* [utils] Add convenience urljoinSergey M․2016-12-131-0/+14
|
* Update coding style after pycodestyle 2.1.0Yen Chi Hsuan2016-11-171-0/+1
| | | | | | | | | | In pycodestyle 2.1.0, E305 was introduced, which requires two blank lines after top level declarations, too. See https://github.com/PyCQA/pycodestyle/issues/400 See also #10689; thanks @stepshal for first mentioning this issue and initial patches
* [utils] Introduce base_urlSergey M․2016-11-021-0/+8
|
* [utils] Lower priority for rare date formats and add testsSergey M․2016-09-291-0/+2
|
* [utils] Use native french month namesSergey M․2016-09-141-4/+2
|
* [utils] Improve month_by_name and add testsSergey M․2016-09-141-0/+11
|
* [utils] Improve mimetype2extSergey M․2016-09-021-0/+9
|
* [utils] Recognize units with full names in parse_filenameYen Chi Hsuan2016-08-201-0/+1
| | | | Reference: https://en.wikipedia.org/wiki/Template:Quantities_of_bytes
* [utils] Correct octal/hexadecimal number detection in js_to_jsonYen Chi Hsuan2016-08-191-0/+3
|
* [utils] Recognize lowercase units in parse_filesizeSergey M․2016-08-181-0/+2
|
* [test_utils] add test for option with not str valueRemita Amine2016-08-131-0/+1
|
* [utils] Add support TV Parental Guidelines ratings in parse_age_limitSergey M․2016-08-071-0/+15
|
* [utils] Fix unified_timestamp for formats parsed by parsedate_tz()Yen Chi Hsuan2016-08-051-0/+1
|
* Merge pull request #8876 from remitamine/html5_mediaYen Chi Hsuan2016-07-101-0/+24
|\ | | | | [extractor/common] add helper method to extract html5 media entries
| * [utils] add helper function for parsing codecsremitamine2016-06-261-0/+24
| |
* | [utils] Add get_element_by_classYen Chi Hsuan2016-07-061-0/+9
| | | | | | | | For #9950
* | [test_utils] add test for smuggling a smuggled urlRemita Amine2016-07-041-0/+6
| |
* | [utils] Add urshift()Yen Chi Hsuan2016-06-261-0/+5
| | | | | | | | Used in IqiyiIE and LeIE
* | [utils] Add unified_timestampSergey M․2016-06-251-0/+21
|/
* [utils] Don't transform numbers not starting with a zeroYen Chi Hsuan2016-06-161-0/+3
| | | | Fix test_Viidea and maybe others
* [utils] Decode HTML5 entitiesYen Chi Hsuan2016-06-101-0/+2
| | | | Used in test_Vporn_1. Also related to #9270
* Added sanitization support for Hungarian letters Ő and Űbzc6p2016-06-021-2/+2
|
* [utils] Allow None in remove_{start,end}Sergey M․2016-05-191-0/+12
|
* [test_utils] PEP 8Sergey M․2016-05-141-1/+0
|
* [utils] Process non-base 10 integers in js_to_jsonSergey M․2016-05-141-0/+19
|
* [utils] js_to_json: various improvementsfelix2016-05-141-0/+12
| | | | now JS object literals like { /* " */ 0: ",]\xaa<\/p>", } will be correctly converted to JSON.
* [utils] Add Œ and œ found in French to ACCENT_CHARSYen Chi Hsuan2016-05-121-2/+2
| | | | Fixes #9463
* [utils,compat] Move struct_pack and struct_unpack to compat.pyYen Chi Hsuan2016-05-101-4/+0
|
* Instead of replacing accented characters with an underscore when sanitizing ↵Adam Thalhammer2016-05-021-1/+0
| | | | file names in restricted mode, replace them with their non-accented equivalents fixes #9347
* Instead of replacing accented characters with an underscore when sanitizing ↵Adam Thalhammer2016-05-021-2/+7
| | | | file names in restricted mode, replace them with their non-accented equivalents fixes #9347
* Merge pull request #9110 from remitamine/parse_durationSergey M2016-04-211-0/+1
|\ | | | | [utils] imporove parse_duration to handle more formats
| * [utils] imporove parse_duration to handle more formatsremitamine2016-04-211-0/+1
| |
* | [test/utils] Add test for date_from_strJaime Marquínez Ferrándiz2016-04-091-0/+8
|/
* [test/test_utils] Update for escape_url change (again)Yen Chi Hsuan2016-03-231-1/+1
|
* [test/test_utils] Update for escape_url changeYen Chi Hsuan2016-03-231-1/+1
|
* [utils] lookup_unit_table: Match word boundary instead of end of stringJaime Marquínez Ferrándiz2016-03-191-0/+2
|
* [utils] PEP 8Sergey M․2016-03-161-3/+3
|
* Merge pull request #8092 from bpfoley/twitter-thumbnailremitamine2016-03-161-0/+40
|\ | | | | [utils] Add extract_attributes for extracting html tag attributes
| * [utils] Add extract_attributes for extracting html tag attributesBrian Foley2016-03-031-0/+40
| | | | | | | | | | | | | | This is much more robust than just using regexps, and handles all the common scenarios, such as empty/no values, repeated attributes, entity decoding, mixed case names, and the different possible value quoting schemes.
* | [bbc] Generalize unit table lookup and add parse_countSergey M․2016-03-131-0/+10
| |
* | [test/test_utils] add more tests for update_url_queryremitamine2016-03-031-0/+9
| |
* | [test/test_utils] add tests for update_url_queryremitamine2016-03-031-0/+28
|/
* [utils] Multiple changes to base_n()Yen Chi Hsuan2016-02-271-0/+12
| | | | | | | | 1. Renamed to encode_base_n() 2. Allow tables longer than 62 characters 3. Raise ValueError instead of AssertionError for invalid input data 4. Return the first character in the table instead of '0' for number 0 5. Add tests
* [utils] Remove AM/PM from unified_strdate patternsSergey M․2016-02-251-0/+1
|
* [utils] Add OHDave's RSA encryption functionYen Chi Hsuan2016-02-201-0/+8
|
* [utils] Allow dot in strip_jsonpSergey M․2016-02-071-0/+4
|
* [utils] Add ability to control skipping false values in dict_getSergey M․2016-02-071-2/+11
|
* [utils] Add dict_get convenience methodSergey M․2016-02-071-0/+14
|
* [test_utils] Add tests for encode_compat_strSergey M․2015-12-201-0/+5
|