html2rss
diff --git a/‎get-involved/index.md‎
Lines changed: 3 additions & 1 deletion b/‎get-involved/index.md‎
Lines changed: 3 additions & 1 deletion
diff --git a/‎get-involved/sponsoring.md‎
Lines changed: 20 additions & 0 deletions b/‎get-involved/sponsoring.md‎
Lines changed: 20 additions & 0 deletions
diff --git a/‎ruby-gem/how-to/index.md‎
Lines changed: 3 additions & 23 deletions b/‎ruby-gem/how-to/index.md‎
Lines changed: 3 additions & 23 deletions
diff --git a/‎ruby-gem/how-to/managing-feed-configs.md‎
Lines changed: 63 additions & 0 deletions b/‎ruby-gem/how-to/managing-feed-configs.md‎
Lines changed: 63 additions & 0 deletions
diff --git a/‎ruby-gem/how-to/scraping-json.md‎
Lines changed: 94 additions & 0 deletions b/‎ruby-gem/how-to/scraping-json.md‎
Lines changed: 94 additions & 0 deletions
diff --git a/‎ruby-gem/index.md‎
Lines changed: 6 additions & 15 deletions b/‎ruby-gem/index.md‎
Lines changed: 6 additions & 15 deletions
diff --git a/‎ruby-gem/reference/auto-source.md‎
Lines changed: 12 additions & 22 deletions b/‎ruby-gem/reference/auto-source.md‎
Lines changed: 12 additions & 22 deletions
diff --git a/‎ruby-gem/reference/channel.md‎
Lines changed: 12 additions & 14 deletions b/‎ruby-gem/reference/channel.md‎
Lines changed: 12 additions & 14 deletions
@@ -1,12 +1,14 @@
 ---
 layout: default
 title: Get Involved
+nav_order: 5
 has_children: true
-nav_order: 4
 ---
 
 # Get Involved
 
+- [**Sponsoring**]({{ '/get-involved/sponsoring' | relative_url }})
+
 Engage with the `html2rss` project. Contribute and connect with the community.
 
 - [**Project Roadmap**]({{ 'https://github.com/orgs/html2rss/projects/3/views/1' }}): View current work, plans, and priorities.
 
@@ -0,0 +1,20 @@
+---
+layout: default
+title: Sponsoring
+parent: Get Involved
+nav_order: 4
+---
+
+# Sponsoring html2rss
+
+`html2rss` is an open-source project, and its development is made possible by the support of our community. If you find `html2rss` useful, please consider sponsoring the project.
+
+## Why Sponsor?
+
+- **Ensure the project's longevity:** Your sponsorship helps to ensure that the project remains actively maintained and developed.
+- **Support new features:** Your contribution will help to fund the development of new features and improvements.
+- **Show your appreciation:** Sponsoring is a great way to show your appreciation for the project and the work that goes into it.
+
+## How to Sponsor
+
+You can sponsor the project through [GitHub Sponsors](https://github.com/sponsors/gildesmarais).
@@ -1,31 +1,11 @@
 ---
 layout: default
 title: How-To Guides
-nav_order: 3
 parent: Ruby Gem
+nav_order: 2
 has_children: true
 ---
 
-# How-To Guides: Practical `html2rss` Configurations
+# How-To Guides
 
-This section provides a collection of ready-to-use `html2rss` configuration examples for various popular websites and common use cases. These examples demonstrate how to tackle different HTML structures and content types.
-
-Use these as a starting point, modify them to fit your specific needs, or get inspiration for building your own custom feeds.
-
----
-
-### How to Use an Example
-
-1.  **Copy the YAML:** Copy the entire YAML configuration block for the example you're interested in.
-2.  **Save as `.yml`:** Save the copied content into a file, e.g., `my-example.yml`.
-3.  **Generate the Feed:** Run `html2rss` from your terminal:
-    ```bash
-    html2rss feed my-example.yml > my-example.xml
-    ```
-4.  **Enjoy!** Open `my-example.xml` in your favorite RSS reader.
-
----
-
-### Contribute Your Own Examples!
-
-Have you created a useful `html2rss` configuration? We encourage you to share it with the community by contributing to the [`html2rss-configs`](https://github.com/html2rss/html2rss-configs) repository.
+This section provides practical examples and solutions for common tasks when using the `html2rss` gem.
@@ -0,0 +1,63 @@
+---
+layout: default
+title: Managing Feed Configs
+parent: How-To Guides
+grand_parent: Ruby Gem
+nav_order: 7
+---
+
+# Managing Feed Configurations with YAML
+
+For easier management, especially when using the CLI or `html2rss-web`, you can store your feed configurations in a YAML file.
+
+## Global and Feed-Specific Configurations
+
+You can define global settings that apply to all feeds, and then define individual feed configurations under the `feeds` key.
+
+```yml
+# Global settings
+headers:
+  "User-Agent": "Mozilla/5.0 (iPhone; CPU iPhone OS 10_3_1 like Mac OS X) AppleWebKit/603.1.30 (KHTML, like Gecko) Version/10.0 Mobile/14E304 Safari/602.1"
+  "Accept": "text/html"
+
+# Feed-specific settings
+feeds:
+  my-first-feed:
+    channel:
+      url: "https://example.com/blog"
+    selectors:
+      # ...
+  my-second-feed:
+    channel:
+      url: "https://example.com/news"
+    selectors:
+      # ...
+```
+
+## Building Feeds from a YAML File
+
+### Ruby
+
+```ruby
+require 'html2rss'
+
+# Build a specific feed from the YAML file
+my_feed_config = Html2rss.config_from_yaml_file('feeds.yml', 'my-first-feed')
+rss = Html2rss.feed(my_feed_config)
+puts rss
+
+# If the YAML file contains only one feed, you can omit the feed name
+single_feed_config = Html2rss.config_from_yaml_file('single.yml')
+rss = Html2rss.feed(single_feed_config)
+puts rss
+```
+
+### Command Line
+
+```sh
+# Build a specific feed
+html2rss feed feeds.yml my-first-feed
+
+# Build a feed from a single-feed YAML file
+html2rss feed single.yml
+```
@@ -0,0 +1,94 @@
+---
+layout: default
+title: Scraping JSON Responses
+parent: How-To Guides
+grand_parent: Ruby Gem
+nav_order: 6
+---
+
+# Scraping JSON Responses
+
+When a website returns a JSON response (i.e., with a `Content-Type` of `application/json`), `html2rss` converts the JSON to XML, allowing you to use CSS selectors for data extraction.
+
+> [!NOTE]
+> The JSON response must be an Array or a Hash for the conversion to work.
+
+## JSON to XML Conversion Examples
+
+### JSON Object
+
+A JSON object like this:
+
+```json
+{
+  "data": [{ "title": "Headline", "url": "https://example.com" }]
+}
+```
+
+is converted to this XML structure:
+
+```xml
+<object>
+  <data>
+    <array>
+      <object>
+        <title>Headline</title>
+        <url>https://example.com</url>
+      </object>
+    </array>
+  </data>
+</object>
+```
+
+You would use `array > object` as your `items` selector.
+
+### JSON Array
+
+A JSON array like this:
+
+```json
+[{ "title": "Headline", "url": "https://example.com" }]
+```
+
+is converted to this XML structure:
+
+```xml
+<array>
+  <object>
+    <title>Headline</title>
+    <url>https://example.com</url>
+  </object>
+</array>
+```
+
+You would use `array > object` as your `items` selector.
+
+## Configuration Examples
+
+### Ruby
+
+```ruby
+Html2rss.feed(
+  headers: {
+    Accept: 'application/json'
+  },
+  channel: {
+    url: 'http://domainname.tld/whatever.json'
+  },
+  selectors: {
+    title: { selector: 'foo' }
+  }
+)
+```
+
+### YAML
+
+```yml
+headers:
+  Accept: application/json
+channel:
+  url: "http://domainname.tld/whatever.json"
+selectors:
+  title:
+    selector: "foo"
+```
@@ -5,25 +5,16 @@ nav_order: 3
 has_children: true
 ---
 
-# The html2rss Ruby Gem ([GitHub Repo](https://github.com/html2rss/html2rss))
+# The html2rss Ruby Gem
 
-This section documents the `html2rss` Ruby gem, the core library for `html2rss-web`. This documentation targets developers using the gem directly. For an easier start, use the [web application]({{ '/web-application' | relative_url }}).
+This section provides comprehensive documentation for the `html2rss` Ruby gem.
 
 ## Getting Started
 
-Start with the [Installation guide]({{ '/ruby-gem/tutorials/installation' | relative_url }}). Then, create your [first feed]({{ '/ruby-gem/tutorials/your-first-feed' | relative_url }}).
+If you are new to `html2rss`, we recommend starting with the [tutorials]({{ '/ruby-gem/tutorials' | relative_url }}).
 
 ## Documentation Sections
 
-- **[Tutorials]({{ '/ruby-gem/tutorials' | relative_url }})**: Step-by-step guides to get you started.
-- **[How-To Guides]({{ '/ruby-gem/how-to' | relative_url }})**: Solutions to common problems and tasks.
-- **[Reference]({{ '/ruby-gem/reference' | relative_url }})**: Technical details and configuration options.
-
-## Advanced Topics
-
-- [**Handling Dynamic Content and JavaScript**]({{ '/ruby-gem/how-to/handling-dynamic-content' | relative_url }}): Process JavaScript-heavy websites.
-- [**Customizing HTTP Requests**]({{ '/ruby-gem/how-to/custom-http-requests' | relative_url }}): Send custom HTTP headers.
-- [**Dynamic Parameters in URLs and Headers**]({{ '/ruby-gem/how-to/dynamic-parameters' | relative_url }}): Use dynamic parameters in URLs and headers.
-- [**Advanced Content Extraction with Selectors**]({{ '/ruby-gem/how-to/advanced-content-extraction' | relative_url }}): Advanced content extraction.
-- [**Styling Your RSS Feed**]({{ '/ruby-gem/how-to/styling-rss-feed' | relative_url }}): Add stylesheets to RSS feeds.
-- [**Debugging Your Configuration**]({{ '/support/troubleshooting' | relative_url }}): Debug feed configurations.
+- **[Tutorials]({{ '/ruby-gem/tutorials' | relative_url }})**: Step-by-step guides to help you get started with `html2rss`.
+- **[How-To Guides]({{ '/ruby-gem/how-to' | relative_url }})**: Practical examples and solutions for common tasks.
+- **[Reference]({{ '/ruby-gem/reference' | relative_url }})**: Detailed information on configuration options.
@@ -6,37 +6,33 @@ parent: Reference
 grand_parent: Ruby Gem
 ---
 
-# `auto_source`
+# Auto Source
 
-The `auto_source` scraper is the easiest way to create a feed. It intelligently finds items on a page without requiring you to specify CSS selectors.
+The `auto_source` scraper automatically finds items on a page, so you don't have to specify CSS selectors.
 
-You can enable it in your YAML config like this:
+To enable it, add `auto_source: {}` to your configuration:
 
 ```yaml
 channel:
   url: https://example.com
 auto_source: {}
 ```
 
----
-
-## How it Works
-
-The `auto_source` scraper uses a series of strategies to find content:
+## How It Works
 
-1.  **`schema`:** It looks for structured data in the form of `<script type="json/ld">` tags. Many websites use this to provide machine-readable information about their content, often following the [Schema.org](https://schema.org/) standard.
-2.  **`semantic_html`:** It searches for semantic HTML5 tags like `<article>`, `<main>`, and `<section>`. These tags are often used to define the main content of a page.
-3.  **`html`:** As a last resort, it analyzes the entire HTML structure to find frequently occurring selectors that are likely to contain the main content.
+`auto_source` uses the following strategies to find content:
 
----
+1.  **`schema`:** Parses `<script type="json/ld">` tags containing structured data (e.g., [Schema.org](https://schema.org/)).
+2.  **`semantic_html`:** Searches for semantic HTML5 tags like `<article>`, `<main>`, and `<section>`.
+3.  **`html`:** Analyzes the HTML structure to find frequently occurring selectors that are likely to contain the main content.
 
-## Fine-Tuning `auto_source`
+## Fine-Tuning
 
-You can customize the behavior of the `auto_source` scraper to improve its accuracy.
+You can customize `auto_source` to improve its accuracy.
 
 ### Scraper Options
 
-You can enable or disable specific scrapers and adjust their settings.
+Enable or disable specific scrapers and adjust their settings:
 
 ```yaml
 auto_source:
@@ -51,19 +47,13 @@ auto_source:
       use_top_selectors: 3 # default: 5
 ```
 
-- `minimum_selector_frequency`: The minimum number of times a selector must appear to be considered a candidate for the main content.
-- `use_top_selectors`: The number of top candidate selectors to consider.
-
 ### Cleanup Options
 
-You can also clean up the results to remove unwanted items.
+Remove unwanted items from the results:
 
 ```yaml
 auto_source:
   cleanup:
     keep_different_domain: false # default: true
     min_words_title: 4 # default: 3
 ```
-
-- `keep_different_domain`: Whether to keep items that link to a different domain.
-- `min_words_title`: The minimum number of words a title must have to be included.
@@ -6,9 +6,9 @@ parent: Reference
 grand_parent: Ruby Gem
 ---
 
-# `channel`
+# Channel
 
-The `channel` key contains information about the RSS feed itself, such as its title, URL, and description.
+The `channel` configuration block defines the metadata for your RSS feed.
 
 ```yaml
 channel:
@@ -21,16 +21,14 @@ channel:
   time_zone: "Europe/Berlin"
 ```
 
----
-
-## Channel Options
+## Options
 
-| Attribute     | Required     | Type    | Default        | Remark                                                                                                                                  |
-| :------------ | :----------- | :------ | :------------- | :-------------------------------------------------------------------------------------------------------------------------------------- |
-| `url`         | **Required** | String  |                | The URL of the website to scrape.                                                                                                       |
-| `title`       | Optional     | String  | Auto-generated | The title of the RSS feed.                                                                                                              |
-| `description` | Optional     | String  | Auto-generated | Retrieved from meta description tags.                                                                                                   |
-| `author`      | Optional     | String  | Blank          | Format: `email (Name)`.                                                                                                                 |
-| `ttl`         | Optional     | Integer | Auto-generated | Time to live in minutes. `html2rss` will use the `max-age` from the response headers if available, otherwise it will default to `360`.  |
-| `language`    | Optional     | String  | Auto-generated | Determined by the `lang` attribute of the `<html>` tag.                                                                                 |
-| `time_zone`   | Optional     | String  | `'UTC'`        | The time zone to use for parsing dates. See a [list of valid time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones). |
+| Attribute     | Required     | Description                                                                                                                              |
+| :------------ | :----------- | :--------------------------------------------------------------------------------------------------------------------------------------- |
+| `url`         | **Required** | The URL of the website to scrape.                                                                                                        |
+| `title`       | Optional     | The title of the RSS feed. Defaults to the website's title.                                                                              |
+| `description` | Optional     | A description for the RSS feed. Defaults to the website's meta description.                                                              |
+| `author`      | Optional     | The author of the feed, in the format `email (Name)`.                                                                                    |
+| `ttl`         | Optional     | The "time to live" for the feed in minutes. Defaults to the `max-age` from the response headers, or `360`.                               |
+| `language`    | Optional     | The language of the feed. Defaults to the `lang` attribute of the `<html>` tag.                                                          |
+| `time_zone`   | Optional     | The time zone for parsing dates. See the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones). |