Engineering@Fisheye Analytics
This is where engineers at Fisheye Analytics share experiences about technologies they use, discuss the systems they build and write about interesting tech stuff they find.
Friday, July 1, 2011
Search @ Fisheye : 2.5 million queries per day and rocketing!
A little more than two years ago we created our first search engine for news, called Flamingo. We later created search engines for social media too. In order to provide our clients with fresh results as they appear on the web, we index the data our web crawlers fetch into two types of indexes. One of them, the "real-time index", always contains data for the past 24 hours and powers real-time search; the other, the "monthly index", contains data for the past month. The latter can grow to hundreds of gigabytes.
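As a rough illustration of the two-index split described above, here is a minimal sketch of how a query could be routed by the age of the oldest document it needs. The function name `choose_index`, the index labels, and the 30-day approximation of "the past month" are assumptions made for this example, not details of Flamingo itself.

```python
from datetime import datetime, timedelta

# Windows taken from the description above; 30 days is an assumed
# approximation of "the past month".
REALTIME_WINDOW = timedelta(hours=24)
MONTHLY_WINDOW = timedelta(days=30)


def choose_index(oldest_wanted: datetime, now: datetime) -> str:
    """Pick the smallest index that still covers the requested time range.

    Hypothetical helper: returns "realtime" when everything requested is
    within the last 24 hours, "monthly" when it is within the last 30 days.
    """
    age = now - oldest_wanted
    if age <= REALTIME_WINDOW:
        return "realtime"
    if age <= MONTHLY_WINDOW:
        return "monthly"
    raise ValueError("requested range is older than the monthly index covers")


if __name__ == "__main__":
    now = datetime(2011, 7, 1, 12, 0)
    print(choose_index(now - timedelta(hours=3), now))
    print(choose_index(now - timedelta(days=7), now))
```

Preferring the smaller index whenever it covers the request keeps fresh-result queries away from the index that can reach hundreds of gigabytes.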
Our web crawlers create these search indexes distributed across a cluster of machines. I will save writing about how our web crawlers work for another post.
Monday, March 21, 2011
Sharing some of our Numbers
We are growing at Fisheye Analytics. We are seeing more data and providing more intelligence to our clients. Some of our numbers are shared below.
1 million News + 1.5 million Blog posts + 21 million Tweets + 2 million Facebook public posts and more are extracted and analyzed every day
25 TB of data extracted every month from the web (news & social media). For comparison, this could be roughly equal to the size of all 4 billion tweets Twitter receives over a month currently! (*)
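The arithmetic behind these figures can be checked quickly. The sketch below assumes decimal terabytes (10^12 bytes) and uses only the numbers quoted above; the variable names are illustrative.

```python
# Daily item counts quoted above (the "and more" sources are not included).
news_per_day = 1_000_000
blogs_per_day = 1_500_000
tweets_per_day = 21_000_000
facebook_posts_per_day = 2_000_000

items_per_day = (
    news_per_day + blogs_per_day + tweets_per_day + facebook_posts_per_day
)

# Assumption: 1 TB = 10**12 bytes (decimal terabytes).
BYTES_PER_TB = 10**12
monthly_bytes = 25 * BYTES_PER_TB
twitter_tweets_per_month = 4 * 10**9

# Average size per tweet implied by the 25 TB comparison.
bytes_per_tweet = monthly_bytes / twitter_tweets_per_month

print(items_per_day)    # 25500000
print(bytes_per_tweet)  # 6250.0
```

So the listed sources add up to at least 25.5 million items a day, and the comparison implies an average of roughly 6.25 KB per tweet; what that per-tweet figure includes is not stated here.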
Tuesday, February 1, 2011
We are HIRING
2010 was a year of big achievements for the engineering team at Fisheye. Last year, we scaled our technology so that it could serve 10 times more clients than we could in 2009. We also developed a new tool, Media Lens, which revolutionized the way clients monitor the media. It is built entirely on top of an open-source search server instead of the traditional approach of reading from a database. This year, we are expanding our tech team to build exciting tools that make a difference in the media industry.
Friday, January 21, 2011
What a day looks like in our server farms
Transcript of the presentation:
How Stuff Works - Technology @ Fisheye Analytics: What a day looks like in our server farm
by Ashwin Reddy Gayam
Thursday, December 30, 2010
An experiment that worked
When the traditional way of doing analytics with a DBMS didn't offer good performance for our large loads, we looked to Sphinx, a full-text search engine, to do analytics, something Sphinx is not intended for. We experimented. We married MySQL with Sphinx to get great analytics performance while crunching hundreds of millions of client data records. It was an unusual choice for serious production use, but it succeeded. The best part is that all the software used is free and open source. Keep reading for the detailed story.
Wednesday, September 22, 2010
Technologies used at Fisheye Analytics
What could be a better way to start a "Hello World" post in this new engineering blog than writing about the various technologies we use at Fisheye Analytics? Most of the technologies and tools we use are free and open source. I will give a short listing of some of the important ones here.
