ai – blog.satyr.nl

paraphrase-id-tensorflow

2020-02-24 satyr.nl

GitHub – nelson-liu/paraphrase-id-tensorflow

Various models and code (Manhattan LSTM, Siamese LSTM + Matching Layer, BiMPM) for the paraphrase identification task, specifically with the Quora Question Pairs dataset.

Programming

MLBox

2019-06-04 satyr.nl

Home – Welcome to MLBox’s official documentation — MLBox Documentation

MLBox is a powerful Automated Machine Learning python library. It provides the following features:

Fast reading and distributed data preprocessing/cleaning/formatting.
Highly robust feature selection and leak detection.
Accurate hyper-parameter optimization in high-dimensional space.
State-of-the art predictive models for classification and regression (Deep Learning, Stacking, LightGBM,…).
Prediction with models interpretation.

Programming

ryanjay0/miles-deep: Deep Learning Porn Video Classifier/Editor with Caffe

2019-02-17 satyr.nl

ryanjay0/miles-deep: Deep Learning Porn Video Classifier/Editor with Caffe

Using a deep convolutional neural network with residual connections, Miles Deep quickly classifies each second of a pornographic video into 6 categories based on sexual act with 95% accuracy. Then it uses that classification to automatically edit the video. It can remove all the scenes not containing sexual contact, or edit out just a specific act.

Programming

xgboost-node

2019-02-12 satyr.nl

xgboost-node

XGBoost-Node is the first port of XGBoost to run existing XGBoost model with Node.js.

XGBoost is a library from DMLC. It is designed and optimized for boosted trees. The underlying algorithm of XGBoost is an extension of the classic gbm algorithm. With multi-threads and regularization, XGBoost is able to utilize more computational power and get a more accurate prediction.

Programming

SmoothGrad

2018-10-28 satyr.nl

SmoothGrad

When a machine learning model makes a prediction, often times we would like to determine which features of the input (pixels, for images) were important for the prediction. If the model makes a misprediction, we might want to know which features contributed to the misclassification. We can visualize the feature importance mask as a grayscale image with the same dimensions as the original image with brightness corresponding to importance of the pixel.

Programming

Auto-Keras

2018-10-16 satyr.nl

Auto-Keras

Auto-Keras is an open source software library for automated machine learning (AutoML). It is developed by DATA Lab at Texas A&M University and community contributors. The ultimate goal of AutoML is to provide easily accessible deep learning tools to domain experts with limited data science or machine learning background. Auto-Keras provides functions to automatically search for architecture and hyperparameters of deep learning models.

Programming

Large Collection of Neural Nets, Numpy, Pandas, Matplotlib, Scikit and ML Cheat Sheets – Data Science Central

2018-10-15 satyr.nl

Large Collection of Neural Nets, Numpy, Pandas, Matplotlib, Scikit and ML Cheat Sheets – Data Science Central

This collection covers much more than the topics listed in the title. It also features Azure, Python, Tensorflow, data visualization, and many other cheat sheets.

Programming

deep-ttf: Survival analsyis and time-to-failure predictive modeling using Weibull distributions and Recurrent Neural Networks in Keras

2018-10-01 satyr.nl

GitHub – gm-spacagna/deep-ttf: Survival analsyis and time-to-failure predictive modeling using Weibull distributions and Recurrent Neural Networks in Keras

A tutorial on how to implement an algorithm for predictive maintenance using survival analysis theory and gated Recurrent Neural Networks in Keras.

Programming

Machine Learning Crash Course

2018-07-19 satyr.nl

Machine Learning Crash Course Â |Â Google Developers

Machine Learning Crash Course features a series of lessons with video lectures, real-world case studies, and hands-on practice exercises.

Cunning Linguist

conceptnet-numberbatch

2018-07-08 satyr.nl

GitHub – commonsense/conceptnet-numberbatch

ConceptNet Numberbatch consists of state-of-the-art semantic vectors (also known as word embeddings) that can be used directly as a representation of word meanings or as a starting point for further machine learning.

Gadgets and Machines

Use TensorFlow AI on Raspberry Pi – The MagPi MagazineThe MagPi Magazine

2018-07-06 satyr.nl

Use TensorFlow AI on Raspberry Pi – The MagPi MagazineThe MagPi Magazine

Although Raspberry Pi isn’t officially supported by Google, there are example models included for the Raspberry Pi and it can be fun (if a bit hacky) to get TensorFlow up and running on a Pi. And there are lots of interesting community projects around that put TensorFlow to good use.

Programming

Learn How to Code and Deploy Machine Learning Models on Structured Streaming

2018-07-01 satyr.nl

Learn How to Code and Deploy Machine Learning Models on Structured Streaming

how to deploy machine learning models on streaming data and covers all 3 necessary areas of a successful production application: infrastructure, technology and monitoring.

Programming

PaddlePaddle – PArallel Distributed Deep LEarning

2016-09-06 satyr.nl

PaddlePaddle – PArallel Distributed Deep LEarning

PaddlePaddle (PArallel Distributed Deep LEarning) is an easy-to-use, efficient, flexible and scalable deep learning platform, which is originally developed by Baidu scientists and engineers for the purpose of applying deep learning to many products at Baidu.

Programming

cranium for Node.js – Pincer.io

2016-07-212020-12-26 satyr.nl

cranium for Node.js – Pincer.io

The amount of data you need to build a good classifier increases with the number of features you have, so out of memory errors become a problem when dealing with thousands of features. For example, Weka fails to perform logistic regression with more than a couple thousand features on a 5mb dataset. Cranium never assumes that your instances can fit in memory, so you can use it on terabytes of data.

Cranium works with node streams, so you have a lot of flexibility with your input. Using streams sacrifies speed for memory efficiency — Cranium uses a constant amount of memory that is typically below 100mb. The speed penalty is significant: Cranium runs about 500x slower than LearnKit. If your dataset can fit in memory, Cranium is probably not right for you.

Programming

dnn: Deep learning library for node.js

2016-04-04 satyr.nl

GitHub – junku901/dnn: Deep learning library for node.js. (Includes Logistic-Regression, MLP, RBM, DBN, CRBM, CDBN)

Deep learning library for node.js. (Includes Logistic-Regression, MLP, RBM, DBN, CRBM, CDBN)

Programming

PaddlePaddle – PArallel Distributed Deep LEarning

2015-07-242020-12-26 satyr.nl

PaddlePaddle – PArallel Distributed Deep LEarning

PaddlePaddle (PArallel Distributed Deep LEarning) is an easy-to-use, efficient, flexible and scalable deep learning platform, which is originally developed by Baidu scientists and engineers for the purpose of applying deep learning to many products at Baidu.

Tag: ai