Search This Blog

Hovertrx

Add URL

Translate

Saturday, November 10, 2018

Great list of resources: data science, visualization, machine learning, big data Posted by Vincent Granville

Fantastic resource created by Andrea Motosi. I've only included the 5 categories that are the most relevant to our audience, though it has 31 categories total, including a few on distributed systems and Hadoop. Click here to view the 31 categories. You might also want to check our our our internal resources (the first section below).
Data Science Central - Resources
Machine Learning
  • Apache Mahout: machine learning library for Hadoop
  • Ayasdi Core: tool for topological data analysis
  • brain: Neural networks in JavaScript
  • Cloudera Oryx: real-time large-scale machine learning
  • Concurrent Pattern: machine learning library for Cascading
  • convnetjs: Deep Learning in Javascript. Train Convolutional Neural Networks (or ordinary ones) in your browser
  • Decider: Flexible and Extensible Machine Learning in Ruby
  • etcML: text classification with machine learning
  • Etsy Conjecture: scalable Machine Learning in Scalding
  • Google Sibyl: System for Large Scale Machine Learning at Google
  • H2O: statistical, machine learning and math runtime for Hadoop
  • IBM Watson: cognitive computing system
  • MLbase: distributed machine learning libraries for the BDAS stack
  • MLPNeuralNet: Fast multilayer perceptron neural network library for iOS and Mac OS X
  • nupic: Numenta Platform for Intelligent Computing: a brain-inspired machine intelligence platform, and biologically accurate neural network based on cortical learning algorithms
  • PredictionIO: machine learning server buit on Hadoop, Mahout and Cascading
  • scikit-learn: scikit-learn: machine learning in Python
  • Spark MLlib: a Spark implementation of some common machine learning (ML) functionality
  • Sparkling Water: combine H2OÕs Machine Learning capabilities with the power of the Spark platform
  • Vahara: Machine learning and natural language processing with Apache Pig
  • Viv: global platform that enables developers to plug into and create an intelligent, conversational interface to anything
  • Vowpal Wabbit: learning system sponsored by Microsoft and Yahoo!
  • WEKA: suite of machine learning software
  • Wit: Natural Language for the Internet of Things
  • Wolfram Alpha: computational knowledge engine
Visualization
  • Arbor: graph visualization library using web workers and jQuery
  • CartoDB: open-source or freemium hosting for geospatial databases with powerful front-end editing capabilities and a robust API
  • Chart.js: open source HTML5 Charts visualizations
  • Crossfilter: avaScript library for exploring large multivariate datasets in the browser. Works well with dc.js and d3.js
  • Cubism: JavaScript library for time series visualization
  • Cytoscape: JavaScript library for visualizing complex networks
  • D3: javaScript library for manipulating documents
  • DC.js: Dimensional charting built to work natively with crossfilter rendered using d3.js. Excellent for connecting charts/additional metadata to hover events in D3
  • Envisionjs: dynamic HTML5 visualization
  • Freeboard: pen source real-time dashboard builder for IOT and other web mashups
  • Gephi: An award-winning open-source platform for visualizing and manipulating large graphs and network connections
  • Google Charts: simple charting API
  • Grafana: graphite dashboard frontend, editor and graph composer
  • Graphite: scalable Realtime Graphing
  • Highcharts: simple and flexible charting API
  • IPython: provides a rich architecture for interactive computing
  • Keylines: toolkit for visualizing the networks in your data
  • Matplotlib: plotting with Python
  • NVD3: chart components for d3.js
  • Peity: Progressive SVG bar, line and pie charts
  • Plot.ly: Easy-to-use web service that allows for rapid creation of complex charts, from heatmaps to histograms. Upload data to create and style charts with Plotly’s online spreadsheet. Fork others’ plots.
  • Recline: simple but powerful library for building data applications in pure Javascript and HTML
  • Redash: open-source platform to query and visualize data
  • Sigma.js: JavaScript library dedicated to graph drawing
  • Vega: a visualization grammar
Graph Databases
  • Apache Giraph: implementation of Pregel, based on Hadoop
  • Apache Spark Bagel: implementation of Pregel, part of Spark
  • ArangoDB: multi model distribuited database
  • Facebook TAO: TAO is the distributed data store that is widely used at facebook to store and serve the social graph
  • Faunus: Hadoop-based graph analytics engine for analyzing graphs represented across a multi-machine compute cluster
  • Google Cayley: open-source graph database
  • Google Pregel: graph processing framework
  • GraphLab PowerGraph: a core C++ GraphLab API and a collection of high-performance machine learning and data mining toolkits built on top of the GraphLab API
  • GraphX: resilient Distributed Graph System on Spark
  • Gremlin: graph traversal Language
  • InfiniteGraph: distributed graph database
  • Infovore: RDF-centric Map/Reduce framework
  • Intel GraphBuilder: tools to construct large-scale graphs on top of Hadoop
  • MapGraph: Massively Parallel Graph processing on GPUs
  • Neo4j: graph database writting entirely in Java
  • OrientDB: document and graph database
  • Phoebus: framework for large scale graph processing
  • Sparksee: scalable high-performance graph database
  • Titan: distributed graph database, built over Cassandra
  • Twitter FlockDB: distribuited graph database
NewSQL
  • Actian Ingres: commercially supported, open-source SQL relational database management system
  • BayesDB: statistic oriented SQL database
  • Cockroach: Scalable, Geo-Replicated, Transactional Datastore
  • Datomic: distributed database designed to enable scalable, flexible and intelligent applications
  • FoundationDB: distributed database, inspired by F1
  • Google F1: distributed SQL database built on Spanner
  • Google Spanner: globally distributed semi-relational database
  • H-Store: is an experimental main-memory, parallel database management system that is optimized for on-line transaction processing (OLTP) applications
  • HandlerSocket: NoSQL plugin for MySQL/MariaDB
  • IBM DB2: object-relational database management system
  • InfiniSQL: infinity scalable RDBMS
  • MemSQL: in memory SQL database witho optimized columnar storage on flash
  • NuoDB: SQL/ACID compliant distributed database
  • Oracle Database: object-relational database management system
  • Oracle TimesTen in-Memory Database: in-memory, relational database management system with persistence and recoverability
  • Pivotal GemFire XD: Low-latency, in-memory, distributed SQL data store. Provides SQL interface to in-memory table data, persistable in HDFS
  • SAP HANA: is an in-memory, column-oriented, relational database management system
  • SenseiDB: distributed, realtime, semi-structured database
  • Sky: database used for flexible, high performance analysis of behavioral data
  • SymmetricDS: open source software for both file and database synchronization
  • Teradata Database: complete relational database management system
  • VoltDB: in-memory NewSQL database
Other

Friday, November 9, 2018

Free Work Space For Veterans At WeWork helps entreprenuership


By Erika Prafder

At Wework on 57th Street Brett D'Alessandro (left) and Alexa Modero (right) run Backpacks for Life. 

Upon his return from deployment in Afghanistan as a sergeant in the Marines, Brett D’Alessandro had a tough time re-integrating.
“I went through hell, as a lot of vets do,” he says.
With medical help and that of organizations such as the Starlight Military Program, Veteran Racing Team and Operation Rebound, D’Alessandro recovered, but on the way to an appointment, he spotted a homeless vet.
“I went to my motel and pulled together a backpack filled with hats, gloves and warm clothing layers, and brought it to him,” he says.
Days later, “I saw him with a little boy, his son, who thanked me for the pack and said he didn’t have to go to school with books in hand. I thought about other vets suffering out there. There are so many resources, but a big disconnect between vets navigating them and finding them,” says D’Alessandro.
Inspired to help, he and his girlfriend, Alexa Modero, created Backpacks for Life, a 501c3 nonprofit organization that collects toiletries and basic supplies in backpacks and distributes them to needy vets with the help of Stand Down events, veteran service organizations, first responders and police officers. They also coach and mentor homeless and at-risk veterans.
Since launching his nonprofit in 2014 and leaving the military in 2016, the charity has given out 5,500 packs. To create revenue, they have developed personalized packs equipped with a locking cable and reflective material.
Then, in August, they were accepted to the Veterans in Residence program at WeWork. This nationwide effort was launched in 2016 by the global network of workspaces and Bunker Labs, a nonprofit that helps vets and their families start and grow their businesses.
Through the partnership, WeWork builds specially designed quarters with dedicated lounges and meeting rooms, providing vets with six months of free access to workspaces, facilities and services of WeWork’s community, which is currently over 300,000 strong globally.
With their Bunker Labs affiliation, each local group meets weekly with a team leader to work on business plans and to support one another in growth efforts. Citywide networking events with other veteran entrepreneurs are also hosted. Today, there are about 150 Veterans in Residence across 14 US cities.
“We host about 60,000 events per year, including fireside chats with famous veteran chief marketing officers or CEOs,” says Padden Murphy, WeWork’s global head of policy and impact.

Since every entrepreneur’s journey is different, “we’re trying to leverage and activate member companies and members to help folks build a product, a team, help with product launches and form strategic partnerships. If you’re a Vet in Residence, your story is our story.”
If, after their six-month program, vets need more time, “we are experimenting with extending for another six months,” says Murphy. “Vets make amazing entrepreneurs and business owners. They have ideas to help their communities. The number of vet-owned businesses is impressive and large — about 9.1 percent here. About 5 million people are employed by them and their payroll is about $200 billion. They are a core part and natural drivers of our economy.”
Nancy Preston, an Army veteran and West Point graduate, can attest to that. After leaving the military in 2009, Preston worked in the corporate world for eight years before deciding to start Milk Money Kitchens Inc., a consulting and coaching firm for food entrepreneurs.
“Cooks just want to cook,” she says. “Many have no idea what their numbers are. I wanted to help them start and run their businesses, optimize efficiency and make them more profitable.”
Preston joined Veterans in Residence in August, and the benefits are invaluable.
“The relationships I’ve built with other companies and their networks expanded mine by five times. I’ve generated good leads. Through lunch-and-learns and happy hours, the program organizers have put me in front of real-estate investors and legal firms, helping me to structure my company better. There’s been measurable progress,” she says.
Another advantage is the legitimacy the workspace offers, says Preston.
“Before, I was conducting business meetings in restaurants and coffee shops. Being able to meet clients at any of WeWork’s offices has earned me new revenue.”
Veterans can apply for the program at WeWork.com.

Followers