15. Changelog
This page lists what changed in each version of WP Content Crawler.
Important
Please backup your database before upgrading or downgrading the plugin
15.1. v1.9.0 - 13 December 2019
Important
This version requires at least PHP 7.2
- New
- Spinning (See: Spinning Tab). You can spin the crawled post’s content and other details using Spin Rewriter (See: Spin Rewriter API Options). Note that this is a paid service. You can select what parts of the post should be spun, as well (See: Spinnable Fields).
- It is now much easier to find CSS selectors. Visual Inspector can now find CSS
selectors in four different modes named as
unique
,similar
,similar (specific)
andcontains
. See Selection behavior for more information. - Translate with Amazon Translate (See: Amazon Translate API Options) and Yandex Translate (See: Yandex Translate API Options) APIs.
- Select what types of information should be translated (See: Translatable Fields).
- Remove all scripts, empty HTML tags, and HTML comments from all parts of the post by simply checking a few checkboxes. The settings are available under Quick Fixes Section section (See Remove empty HTML elements and comments and Remove scripts settings).
- It is now possible to reset the general settings of the plugin. Just click to
Reset General Settings
button in General Settings Page. - Save all pages of the target post as a single page (See: Save as single page?)
- Permissions to limit which user roles can view pages and update settings of the plugin (See:
Permissions). The permissions contain
wpcc
in their name. So, you can look for those in your user role editor plugin. - Detailed documentation that explains features of the plugin with images, videos, and examples and includes several quick tutorials guiding you on how to do certain things.
- Improvement
- You can save WooCommerce product attributes to WooCommerce’s Attributes page (See: Attribute Name Selectors). If a taxonomy already exists for an attribute, the values of the attributes will be added to already-existing WooCommerce product attributes.
- Makes it easy to test categories that are already-defined in site settings. You can select one of the categories in Tester Page.
- Options Box is added for Post Date Selectors setting.
- Makes it possible to enable/disable setting
srcset
attributes of saved media elements in the templates. The option is available under Post Tab of General Settings Page (See: Set srcset attributes of saved media when possible). - Save posts as “pending” or “private” (See: Post Status).
- You can disable tooltip used throughout the plugin (General Settings > Advanced > Other > Disable Tooltip). By disabling it, you can fix the problem causing the buttons of Manual Crawling Tool to not work.
- Fix
- Active tabs are not restored in General Settings and Tools pages.
- Plugin cannot get the content when there is an invalid URL in the target page’s source code.
- URLs are not resolved properly in Visual Inspector.
- The last post URL found in a category page has a featured image although it should not.
- Infinite loop occurring when the next page URL of the last page of a post points to the first page of the post.
- Translation test results are not shown when the test text is too long.
- Other
- Tested and works in WordPress 5.3, 5.2, 5.1, 5.0, 4.9, in PHP 7.4, 7.3, 7.2, in Chrome, Firefox, Opera, Safari, Edge, and with WooCommerce 3.8, 3.7, 3.6, 3.5.
- Updates third party libraries.
- Other small fixes and improvements
- Requires at least PHP 7.2, which is the oldest actively-supported version. See: Supported Versions at PHP.net.
15.2. v1.8.0 - 1 January 2019
- New
- Save WooCommerce product details much more easily using the options specifically defined for WooCommerce products. To see the options under Post tab of site settings, just select the post type as “product” either in General Settings or by defining custom general settings. The options are available for WooCommerce versions 3.3, 3.4, and the latest one, 3.5.
- Save categories.
- Save post slugs.
- Save taxonomy values.
- Save the posts into custom post categories. You can define custom post category taxonomies in general settings so that you can select the custom post categories when saving a post.
- Options Box. For the settings that have Options Box button, you can define several settings for each item found by given CSS selectors. Options Box contains find-replace, calculation, and templating options. You can also take notes. It also allows you to use JSON values in calculations and templates.
- Rename, copy, and move saved files. You can also define title, description, caption, and alt texts for the saved media files using templates in which you can use any short code. It is also possible to give random names to the saved files.
- Recent tests in Site Tester page. You can now repeat your previous tests easily.
- Caching responses of test URLs in site settings page. You can now configure the settings faster and send less number of requests to the target site.
- Replace iframe and script HTML elements with short codes by just checking a checkbox. The options are available under Templates tab.
- Quick save button in the site settings. Now you can save the site settings faster (much faster).
- You can use delimiters and modifiers for regular expressions in find-replace options.
- Sets srcset attribute values of img elements of the saved image files in the templates when different sizes of the images are available.
- Warnings. When there is an error, you will get a warning showing the details of the warning/error.
- Saves “alt” and “title” values of media items when they are saved as attachments.
- Adds “wpcc/post/settings/meta-key-defaults” filter that you can use to set default values of site settings.
- Convert character encoding to UTF8 when target page’s HTML has a different encoding. You can enable the option under General Settings > Advanced.
- You can now navigate between tabs and settings much more easily when you activate fixing tabs and content navigation under Main tab of site settings.
- Adds “find and replace in raw response HTML” option for post and category settings. Using this, you can fix HTML errors that prevent the plugin from being able to parse the HTML code.
- Adds “wpcc/bot/response-content” filter that can be used to manipulate raw response content.
- Improvement
- Manual crawling tool has been redesigned. Now, you can manually crawl multiple URLs or insert post URLs to the database so that they can be crawled later. You can also perform parallel crawling. Moreover, you can recrawl the posts directly from the manual crawling tool.
- When testing your settings in the site settings page, all manipulations defined in your settings will be applied. By this way, you can conduct more robust tests and figure out the cause of a misbehavior more easily.
- Short code buttons now contain custom short codes defined by you.
- The files that are saved when testing are now deleted from the file system after the test.
- Shows all types of saved posts, including custom post types, in the dashboard.
- Uses the HTTP user agent defined in the settings when saving media.
- Fix
- You can now enter cookies without decoding them. Just copy and paste the values retrieved from your browser.
- In “Find and replace in custom meta” option, only one replacement was applied to each meta key. Now, all replacements will be applied sequentially.
- Images having “&” symbol in their URLs are not saved properly.
- Invalid chars coming after the file’s extension in the file’s URL (such as png:s) cause the files not to be saved with the right extension.
- When there were no API keys for translation services and the translation was active, a fatal error were shown. Now, it is handled silently.
- Scroll animation does not work.
- When testing find and replace settings for custom short codes, all test data options are required although one of them is enough to perform the test.
- The plugin causes the text editor in “Edit Page” page to be double.
- Relative URLs should be resolved automatically.
- Other
- Tested and works in PHP 5.6, 7.0, 7.1, and 7.2 and in Chrome, Firefox, Opera, Safari, and Edge.
- Updates limits and API versions of Google’s and Microsoft’s translation services.
- Updates third party libraries.
- Other small fixes and improvements.
- No longer supports Internet Explorer.
15.3. v1.7.0 - 22 October 2017
- New
- Translate posts automatically by using Google Cloud Translation API or Microsoft Translator Text API.
- Randomize proxies. By checking this option, you can make the plugin randomly order the proxies you entered.
- Over 50 filters and actions are added. If you are a developer, you can now use these to extend the plugin however you like.
- Fix
- The proxies were used when there was an error getting the target page’s source code. Now, they are always used, even when testing.
- Plugin’s pages were not shown properly with PHP 7.1.
- Other
- UI and UX improvements.
15.4. v1.6.0 - 4 March 2017
- New
- Date selectors.
- Add/remove minutes to/from the post date. You can schedule post publishing by this way.
- Scheduled post delete.
- Duplicate post checking via URL, title and/or content.
- More HTML manipulation options: exchange element attributes, remove element attributes, find and replace in element attributes, manipulate HTML of an element.
- Find and replace in custom short code and custom post meta content
- Improvement
- More counts are shown in site listing.
- Save all images in the post content by checking a single checkbox.
- Reorder settings that can have multiple values.
- If the main template is empty, it will be considered as it contains [wcc-main-content] shortcode in it.
- An option to always use UTF8 encoding.
- Load general settings with a button when you are overwriting them for a site.
- Settings are grouped and reordered for better navigation.
- Auto refresh the dashboard every few seconds.
- Track CRON events and the next sites that will be processed by the CRON events in the dashboard.
- Better notifications for the required settings when performing a test.
- Auto find for next page URL, post date and post title in DEV tools.
- Remove elements using a CSS selector in DEV tools. This can be used to remove blocking elements to better select the elements you want.
- Fix
- Sometimes thumbnail images and post URLs did not match when category pages were crawled.
- When importing site settings, form validation should not be performed.
- Other
- Small bug fixes and improvements.
15.5. v1.5.1 - 7 February 2017
- New
- Dashboard. See what’s going on behind the scenes.
- Other
- Bug fixes and improvements.
15.6. v1.4.1 - 27 January 2017
- Fix
- URLs in the queue should be saved uniformly according to their categories.
15.7. v1.4.0 - 26 January 2017
- New
- Post recrawling. Recrawl posts to update them regularly.
- Proxy tester. Test if your proxies work correctly.
- Cookies. Attach cookies to every request that is made to the target site.
- Other
- Removes Lodash.
- Small bug fixes and improvements.
15.8. v1.3.0 - 14 January 2017
- New
- Visual Inspector
- Fix
- Assets are not loaded on Windows servers.
- “General settings” link on plugins page does not work.
- Plugin does not crawl all active sites when there are more than 10 active sites.
15.9. v1.2.0 - 30 August 2016
- New
- You can now use proxy.
- Set connection timeout in seconds.
- Post title and excerpt templates in which you can use custom short codes.
- Find and replace in custom short code data.
- Maximum number of categories that can be added automatically via CSS selectors to the category map increased.
- Add custom post meta without a selector.
- You can set how many times URL collection and post crawling events should run each time for a site. For instance, you can save 3 posts every minute, or run URL collection 5 times every 2 minute.
- You can collect post URLs in reverse order for each category page.
- Remove links from all short code data. This will not touch the links manually added to the templates.
- Notifications. You can now set CSS selectors whose values should not be empty for category and post pages. When an empty value is found using those selectors, you can get an email notification.
- Fix
- Downloaded file’s name does not have a proper file extension if the file on the target site is generated dynamically.
- Crawling stops if there is a request exception.
- Crawling stops if target page’s HTML could not be retrieved.