Parsing SUSI.AI Blog Feed
Our SUSI.AI web chat is improving every day.Recently we decided to add a blog page to our SUSI.AI web chat to show all the latest blogs related to SUSI.AI. These blogs are written by our developer team. We have the following feed from which we need to parse the information of blogs and display:
http://blog.fossasia.org/tag/susi-ai/feed/
This feed is in RSS (Really Simple Syndication) format. We decided to use Google feed API service to parse this RSS content, but that service is now deprecated. Then we decided to convert this RSS into JSON format and then we will parse the information.
We have used the rss2json API for converting our RSS content into JSON. We make an ajax call to this API for fetching the JSON content:
$.ajax({ url: 'https://api.rss2json.com/v1/api.json', method: 'GET', dataType: 'json', data: { 'rss_url': 'http://blog.fossasia.org/tag/susi-ai/feed/', 'api_key': api_key: '0000000000000000000000000000000000000000', // put your api key here, 'count': 50 } }).done(function (response) { if(response.status !== 'ok'){ throw response.message; } this.setState({ posts: response.items, postRendered: true}); }.bind(this));
Explanation:
- URL: This is base URL of API to which we are making calls in order to fetch JSON data.
- rss_url: This is the URL of RSS feed which needs to be converted into JSON format.
- api_key: This is the key, which can be generated after making an account on the website
- count: Count of feed items to return, the default is 20.
The converted JSON response looks like:
This can be checked here.
Now we have used cards of material-ui to show the content of this JSON response. On the success of our ajax call, we update the array(named as posts) present in the initial state of our component with response.items( an array containing information of blogs).
We map through each element in an array named posts and return the corresponding card, containing relevant information in it. We parsed the following information from JSON:
- Name of the author: posts.author
- Title of the blog: posts.title
- Link to blog on WordPress: posts.link
Publish date of the blog:
The publish date is in this format: “2017-08-05 09:05:27” (example), we need to format this date. We used following code to do that:
let date = posts.pubDate.split(' '); let d = new Date(date[0]); dateFormat(d, 'dddd, mmmm dS, yyyy') // dateFormat is an npm package
This converts “2017-08-05 09:05:27” to “Saturday, August 5th, 2017”.
Description and Featured Images of the blog:
We needed to show a short description of the blog and a featured image. There is an element in our posts array with name description that contains HTML, starting from featured image and followed by a short description. We needed to convert this HTML into simple text for using this. I used htmlToText npm package for this purpose:
let description = htmlToText.fromString(posts.description).split(‘…’);
description variable now contains simple text. A simple example :
“[https://blog.fossasia.org/wp-content/uploads/2017/08/image2-768×442.png] Our SUSI.AI Web Chat has many static pages like Overview, Devices, Team and Support.We have separate CSS files for each component. Recently, we faced a problem regarding design pattern where CSS files of one component were affecting another component. This blog is all about solving this issue and we take an example of distortion ”
Now, this text contains both link for featured image and our short description text. For getting the link for featured image, I have used regex(Regular Expression) in the following code and saved the link in the variable image and for short description I have used the split function :
let text = description[0].split(']'); let image = susi // temporary image for initialisation let regExp = /\[(.*?)\]/; let imageUrl = regExp.exec(description[0]); if(imageUrl) { image = imageUrl[1] }
After successfully parsing this information from JSON, we can have cards with details and card looks like this:
As we need all this to be rendered when the component mounts, so we put our ajax call inside componentDidMount() function of our react component.
Resources:
- Complete implementation code: https://github.com/fossasia/chat.susi.ai/blob/master/src/components/Blog/Blog.react.js
- Blog Page: http://chat.susi.ai/blog
- Regex in JavaScript: https://developer.mozilla.org/en/docs/Web/JavaScript/Guide/Regular_Expressions
- html-to-text npm package: https://www.npmjs.com/package/html-to-text
- dateFormat npm package: https://www.npmjs.com/package/dateformat
You must be logged in to post a comment.