proszkinasenne2's comments

proszkinasenne2 | 5 months ago | on: Django 6.0 alpha 1 released

Based on the current state of affairs [1] you may get more structure around tasks in the app (or django packages running tasks) which is probably a nice thing. Other than that you will still need to have a backend implementation [2] and the easiest path there is (that I can think of) a Celery wrapper =)

[1] https://github.com/django/django/blob/main/django/tasks/base... [2] https://github.com/django/django/blob/main/django/tasks/back...

proszkinasenne2 | 5 months ago | on: Django 6.0 alpha 1 released

Seems like... not really.

> Django’s new Tasks framework makes it easy to define and enqueue such work. It does not provide a worker mechanism to run Tasks. The actual execution must be handled by infrastructure outside Django, such as a separate process or service.

You'll likely still need an actual backend, task runner, cache implementation with a backend etc.

proszkinasenne2 | 4 years ago | on: Ask HN: Can web scraping be the basis of a viable business model?

Sure, it can be! Also, as some people have already pointed out, this is often a gray area where people go beyond violating ToS. Some good examples are privacy violations (scraping personal data), credentials stuffing etc.

Recently, there is a boom of "anti-bot" services. These are essentially SaaS businesses that "protect" websites from being scraped by automated software. As you onboard the first customer who wants to extract data from a bot-protected website, you are going to run into an unlimited waterfall of stupid troubles. Your bots will be blocked, will consume excessive amount of data, kill your CPU/GPU performance.

I have shared some highlights on how to bypass these recently on HN [1], but it is sadly only the tip of the iceberg. On the other hand, since the post has been featured on HN I have been reached by more than 50 companies and individuals whose business operating model is based solely on data extraction/automated scraping. These are (in my opinion) successful companies, and two out of these are part of YC.

[1] https://news.ycombinator.com/item?id=29060272

proszkinasenne2 | 5 years ago | on: Bromite: A Privacy-Enhanced Chromium Fork for Android

Anything based on Chromium is vulnerable to all specialised fingerprinting techniques such as this one https://niespodd.github.io/persistent-tracking-shader-cache/ and many others that I listed here https://github.com/niespodd/browser-fingerprinting

Some parts of Chromium seem to be intentionally exposing fingerprinting surfaces and, because its changing quickly with new features and addons, keeping up with patches like Bromite does is incredibly challenging task

proszkinasenne2 | 5 years ago | on: Ask HN: Why do people use Puppeteer for webscraping instead of SaaS services?

That's true when it's a one-time job: pull the data and disappear. I also see how this is the case for most freelancers on Fiverr or Freelancer. This is the tool they know, so they use it. However I imagine there is a number of companies that strongly rely on continous data scraping - let it be for price comparison - and I've seen one heavily using Puppeteer

proszkinasenne2 | 6 years ago | on: Ask HN: A tool/SaaS to manage employee access/accounts in one place

"Comprehensive and up to date overview on the access information" by pulling data from the services I use and (optionally) managing the permissions for me.

Rippling [1] calls it "provisioning/de-provisioning" and that's exactly what I am looking for (maybe with periodical checks to make sure we remove contractors permissions once their job is done) with the exception that I don't want to set up a new HR system just to get this feature.

[1] - https://www.rippling.com/provisioning-de-provisioning/

proszkinasenne2 | 6 years ago | on: Ask HN: A tool/SaaS to manage employee access/accounts in one place

Comprehensive and up to date overview on the access information.

Github is in charge of a tech team lead, Slack is someone else. Once an employee leaves, I want to make sure access is limited accordingly. Sending a bunch of emails and coordinating, then double-checking is the worst nightmare. If the # of services you use is 5 it's all good. If you use >5 you have to check each individually every employee sign-off.

We once had an ex-employee receiving Github updates two years after he left us.

proszkinasenne2 | 7 years ago | on: Show HN: Quickly find quality remote developers from Eastern Europe

How big is your database of developers/agencies today? How many successful hire leads have you generated?

The idea is great and I myself have seen too many people in DACH regions overpaying for local developers because they don’t have trust to “unreliable remote eastern European freelancers”.

It would be awesome if you provide some case studies on how you helped some big names (apart from saying that Google trusts your guys).

proszkinasenne2 | 7 years ago | on: Show HN: Coffee Chat – Trade your expertise

Very cool! Keeping fingers crossed.

Speaking to people with domain expertise willing to use their time to help you is an invaluable experience. A short 10-15 minutes talk can save hours or days or researching, experiments, proving yourself wrong.

Good luck!

PS. Tried to reach you at [email protected], but got:

554 5.7.1 <[email protected]>: Recipient address rejected: this address does not exist

page 1