Big data - Spark

Top 5 big data projects (open source)

February 13, 2013, 11:00 am

There is no shortage of these, top lists that is. Here is one that at least presents some interesting options: Siliconangle wrote about it too. An honorable mention went to the interesting Google Spark, a global...

Introducing Spark

June 3, 2014, 7:10 pm

MapReduce was developed mainly for batch-oriented jobs and it was optimized for throughput rather than latency. The inherent high latency in MapReduce makes it very unattractive for use cases where we...

So what makes Spark Lightning Fast?

June 11, 2014, 1:53 am

Apache Spark claims to offer lightning-fast cluster computing [source]. That makes one wonder how it is able to overcome the latency issues in MapReduce. In my previous blog post I gave a brief introduction...

A closer look at Spark

June 17, 2014, 1:50 am

In our previous posts we gave a brief introduction to Spark. Today we are going to take a closer look at the Spark technology stack. Spark is 100% compatible with any Hadoop data storage system...

DataStax and Databricks unite

June 27, 2014, 12:44 am

Realizing the importance of in-memory processing for transaction processing in a distributed DBMS, DataStax has decided to partner with Databricks [1]. This partnership is also a strong indicator of...

Spark Streaming - part 1

June 29, 2014, 11:25 pm

Real-time big data analytics is becoming more important with every passing day. It enables us to make the right decisions at the right time. Social networking sites like Twitter...

Spark Streaming - part 2

June 30, 2014, 11:09 pm

In my previous post, we discussed the challenges in existing streaming systems and the motivation for Spark Streaming. As mentioned there, the biggest challenge was inefficient fault...

Databricks raised $33 million with Spark

July 1, 2014, 8:21 am

Interest and faith in the Apache Spark project, and in the importance of in-memory analytics as the next phase of big data, once again got investors moving. US-based Databricks announced...

Spark Streaming - part 3

July 3, 2014, 1:51 am

The most important feature of Spark Streaming is its robust fault recovery and efficient straggler handling. Today we will see how this is actually achieved in Spark Streaming. The robust fault recovery...

Spark

July 16, 2014, 9:52 pm

We have written a lot, both here and on the technical blog, about the Apache project Spark, which makes it easy to harness the speed of in-memory analytics. This time, however, it is about another...

Hadoop keeps getting faster

September 24, 2014, 9:05 am

Hadoop's development has been staggering, and the forecasts even more so. The market is predicted to grow rapidly, as much as 25-fold by 2020, and it is increasingly difficult to find a big data...

A speed race with data

October 11, 2014, 7:16 pm

Apache Spark set a world record. Databricks broke the world record that Yahoo had set with Hadoop for sorting 100 terabytes of data. The previous record, with a 2,100-machine Hadoop cluster...
