giangzuzana’s gists

giangzuzana / parallel_merge_sort.py

Created April 17, 2022 23:53 — forked from stephenmcd/parallel_merge_sort.py

Parallel Merge Sort

	import math
	import multiprocessing
	import random
	import sys
	import time


	def merge(*args):
	    # Support explicit left/right args, as well as a two-item
	    # tuple which works more cleanly with multiprocessing.
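The preview stops before the body of `merge`. As a minimal sketch of what the comment describes, and not necessarily the gist's actual code, the function below accepts either two sorted lists or a single two-item tuple and merges them into one sorted list. The unpacking line and the loop are assumptions made for illustration.

```python
def merge(*args):
    # Accept merge(left, right) or merge((left, right)). The tuple form
    # suits Pool.map, which passes exactly one argument per task.
    left, right = args[0] if len(args) == 1 else args
    merged = []
    i = j = 0
    # Repeatedly take the smaller head element of the two sorted inputs.
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    # One input is exhausted; the rest of the other is already sorted.
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged
```

Both `merge([1, 3, 5], [2, 4, 6])` and `merge(([1, 3, 5], [2, 4, 6]))` go through the same code path after unpacking, so the function can be called directly or mapped over a list of pairs.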

giangzuzana / sentiment_classification.py

Created October 30, 2020 23:58 — forked from bonzanini/sentiment_classification.py

Sentiment analysis with scikit-learn

	# You need to install scikit-learn:
	# sudo pip install scikit-learn
	#
	# Dataset: Polarity dataset v2.0
	# http://www.cs.cornell.edu/people/pabo/movie-review-data/
	#
	# Full discussion:
	# https://marcobonzanini.wordpress.com/2015/01/19/sentiment-analysis-with-python-and-scikit-learn

giangzuzana / mutual_info.py

Created November 5, 2018 19:54 — forked from GaelVaroquaux/mutual_info.py

Estimating entropy and mutual information with scikit-learn

	'''
	Non-parametric computation of entropy and mutual-information

	Adapted by G Varoquaux for code created by R Brette, itself
	from several papers (see in the code).

	These computations rely on nearest-neighbor statistics
	'''
	import numpy as np

giangzuzana / cuda-setup.md

Created July 16, 2018 07:50 — forked from soareschen/cuda-setup.md

CUDA setup on Ubuntu 16.04 and LXD

This gist explains the steps required to install CUDA on Ubuntu 16.04 as well as enabling it inside LXD containers.

The setup assumes GTX 10 series hardware, tested with my GTX 1070.

Driver Installation

On 64-bit systems, install 32-bit OpenGL libraries first so that the driver will install

giangzuzana / simple-pinterest-crawler-example.py

Created March 21, 2017 21:04 — forked from rightson/simple-pinterest-crawler-example.py

	# -*- coding: utf-8 -*-

	import pycurl
	from BeautifulSoup import BeautifulSoup


	class BruteFrocePinterestCrawler:
	    def __init__(self):
	        self.content = ''
	        self.url = ''

giangzuzana / imdb-sentiment-vw.sh

Last active August 29, 2015 14:19 — forked from jwf-zz/imdb-sentiment-vw.sh

	#!/bin/bash

	# Requires vw (https://github.com/JohnLangford/vowpal_wabbit/wiki/),
	# the IMDB dataset (http://ai.stanford.edu/~amaas/data/sentiment/aclImdb_v1.tar.gz),
	# and the perf utility from http://osmot.cs.cornell.edu/kddcup/software.html.

	cat aclImdb/train/labeledBow.feat | \
	  sed -n 's/^\([7-9]\|10\)\s/&/p' | \
	  sed -e "s/^\([7-9]\|10\)\s//" | \
	  awk '{ print "1 '"'"'pos_" (NR-1) " |features " $0}' > train.vw

giangzuzana / gist:ab77a1576e3343ca65f5

Last active August 29, 2015 14:16 — forked from amundo/gist:288282

	#!/usr/bin/env python
	# -*- coding: utf-8 -*-
	# see http://www.fileslip.net/news/2010/02/04/language-id-project-the-basic-algorithm/

	from math import sqrt

	you = {'pennies': 1, 'nickels': 2, 'dimes': 3, 'quarters': 4 }
	me = {'pennies': 0, 'nickels': 3, 'dimes': 1, 'quarters': 1 }
	abby = {'pennies': 2, 'nickels': 1, 'dimes': 0, 'quarters': 3 }
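The preview ends before showing what is computed from these dictionaries. Given the `sqrt` import, one plausible use is comparing them as count vectors with cosine similarity. The sketch below rests on that assumption, and `cosine_similarity` is a hypothetical name that does not come from the gist.

```python
from math import sqrt

you = {'pennies': 1, 'nickels': 2, 'dimes': 3, 'quarters': 4}
me = {'pennies': 0, 'nickels': 3, 'dimes': 1, 'quarters': 1}
abby = {'pennies': 2, 'nickels': 1, 'dimes': 0, 'quarters': 3}


def cosine_similarity(a, b):
    # Hypothetical helper: treat each dict as a sparse vector of counts.
    keys = set(a) | set(b)
    dot = sum(a.get(k, 0) * b.get(k, 0) for k in keys)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        # Similarity is undefined for an all-zero vector; return 0.0 here.
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == '__main__':
    print(cosine_similarity(you, me))
    print(cosine_similarity(you, abby))
```

With these counts, `you` scores closer to `abby` than to `me`, because the dot product with `abby` is dominated by the shared quarters.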