proszkinasenne2's comments

proszkinasenne2 | 5 months ago | on: Django 6.0 alpha 1 released

Based on the current state of affairs [1] you may get more structure around tasks in the app (or django packages running tasks) which is probably a nice thing. Other than that you will still need to have a backend implementation [2] and the easiest path there is (that I can think of) a Celery wrapper =)

[1] https://github.com/django/django/blob/main/django/tasks/base... [2] https://github.com/django/django/blob/main/django/tasks/back...

proszkinasenne2 | 5 months ago | on: Django 6.0 alpha 1 released

Seems like... not really.

> Django’s new Tasks framework makes it easy to define and enqueue such work. It does not provide a worker mechanism to run Tasks. The actual execution must be handled by infrastructure outside Django, such as a separate process or service.

You'll likely still need an actual backend, task runner, cache implementation with a backend etc.

proszkinasenne2 | 8 months ago | on: Watch videos in your preferred language

"Was this helpful?" > "No" > "Why can't this bullshit can't be disabled globally?" > "Submit"

proszkinasenne2 | 1 year ago | on: Humans now share the web equally with bots

Imperva with it's core product around blocking bots publishing statistics on how crowded platforms are with bots. ¯\_(ツ)_/¯

proszkinasenne2 | 3 years ago | on: Chromium based browsers leak user local IP via WebRTC foundation attribute

In Chrome/Chromium there is a WebRTC Network Limiter [1] extension that let you set "Use only my default public IP address" policy and render the method I presented ineffective.

[1] https://chrome.google.com/webstore/detail/webrtc-network-lim...

proszkinasenne2 | 3 years ago | on: Chromium based browsers leak user local IP via WebRTC foundation attribute

It's both security and privacy issue. Whonix wiki explains the latter in more detail https://www.whonix.org/wiki/Data_Collection_Techniques#:~:te...

proszkinasenne2 | 4 years ago | on: Ask HN: Can web scraping be the basis of a viable business model?

Sure, it can be! Also, as some people have already pointed out, this is often a gray area where people go beyond violating ToS. Some good examples are privacy violations (scraping personal data), credentials stuffing etc.

Recently, there is a boom of "anti-bot" services. These are essentially SaaS businesses that "protect" websites from being scraped by automated software. As you onboard the first customer who wants to extract data from a bot-protected website, you are going to run into an unlimited waterfall of stupid troubles. Your bots will be blocked, will consume excessive amount of data, kill your CPU/GPU performance.

I have shared some highlights on how to bypass these recently on HN [1], but it is sadly only the tip of the iceberg. On the other hand, since the post has been featured on HN I have been reached by more than 50 companies and individuals whose business operating model is based solely on data extraction/automated scraping. These are (in my opinion) successful companies, and two out of these are part of YC.

[1] https://news.ycombinator.com/item?id=29060272

proszkinasenne2 | 5 years ago | on: Ask HN: How will Google track users once 3rd party cookies are blocked?

- Trust Token API to verify whether you are a bot or not - Federated Learning of Cohorts (FLoC)

https://www.google.com/amp/s/blog.google/products/ads-commer...

As well as there is plenty of techniques allowing device fingerprinting that Google (might) use.

https://github.com/niespodd/browser-fingerprinting

proszkinasenne2 | 5 years ago | on: Bromite: A Privacy-Enhanced Chromium Fork for Android

Anything based on Chromium is vulnerable to all specialised fingerprinting techniques such as this one https://niespodd.github.io/persistent-tracking-shader-cache/ and many others that I listed here https://github.com/niespodd/browser-fingerprinting

Some parts of Chromium seem to be intentionally exposing fingerprinting surfaces and, because its changing quickly with new features and addons, keeping up with patches like Bromite does is incredibly challenging task

proszkinasenne2 | 5 years ago | on: Ask HN: Why do people use Puppeteer for webscraping instead of SaaS services?

I thought about it too but when you consider cost of running headless Puppeteer (lets say on AWS) and the cost of a good proxy that is charged per GB its often as expensive (if not more) as some of these SaaS-es. This is the case especially for websites with some heavyweight JS/CSS/img assets.

proszkinasenne2 | 5 years ago | on: Ask HN: Why do people use Puppeteer for webscraping instead of SaaS services?

That's true when it's a one-time job: pull the data and disappear. I also see how this is the case for most freelancers on Fiverr or Freelancer. This is the tool they know, so they use it. However I imagine there is a number of companies that strongly rely on continous data scraping - let it be for price comparison - and I've seen one heavily using Puppeteer

proszkinasenne2 | 5 years ago | on: Show HN: Cloakbits - Headless web scraping with bypass for anti-bot WAFs

Agreed. I have changed the text, so that it is less confusing. Thank you for pointing this out!

proszkinasenne2 | 5 years ago | on: Show HN: Cloakbits - Headless web scraping with bypass for anti-bot WAFs

@jjgreen I am genuinely interested what are the existing solutions and how people deal with the problem. This is why it's "Ask HN". If there is none and someone would be interested in using our tool, why creating two topics?

proszkinasenne2 | 6 years ago | on: Launch HN: Motion (YC W20) – defense against online distractions and addictions

@qiyuxuan96 Hi there. I am developing a cross-browser extension development/deployment SaaS. Would you be interested in hearing more?

We make life easier with things like: - versioning, - packaging extensions for different extension galleries, - collecting payments, - gathering analytics (extension views, installations etc.).

Ping me at [email protected] if interested.

proszkinasenne2 | 6 years ago | on: uBlock Origin 1.20

Is there a good short answer for why uBlock is using constructs like this:

if ( self.browser instanceof Object ) { self.chrome = self.browser; } else { self.browser = self.chrome; }

Instead of using a polyfill like that one https://github.com/mozilla/webextension-polyfill?

proszkinasenne2 | 6 years ago | on: Ask HN: A tool/SaaS to manage employee access/accounts in one place

"Comprehensive and up to date overview on the access information" by pulling data from the services I use and (optionally) managing the permissions for me.

Rippling [1] calls it "provisioning/de-provisioning" and that's exactly what I am looking for (maybe with periodical checks to make sure we remove contractors permissions once their job is done) with the exception that I don't want to set up a new HR system just to get this feature.

[1] - https://www.rippling.com/provisioning-de-provisioning/

proszkinasenne2 | 6 years ago | on: Ask HN: A tool/SaaS to manage employee access/accounts in one place

Any ideas how much is Rippling for something around 30 PCs? Only the provisioning/de-provisioning. They sell payroll + Core HR within a package with the provis./de-provis., but we are outside US plus already have an HR tool.

proszkinasenne2 | 6 years ago | on: Ask HN: A tool/SaaS to manage employee access/accounts in one place

Comprehensive and up to date overview on the access information.

Github is in charge of a tech team lead, Slack is someone else. Once an employee leaves, I want to make sure access is limited accordingly. Sending a bunch of emails and coordinating, then double-checking is the worst nightmare. If the # of services you use is 5 it's all good. If you use >5 you have to check each individually every employee sign-off.

We once had an ex-employee receiving Github updates two years after he left us.

proszkinasenne2 | 7 years ago | on: Show HN: Quickly find quality remote developers from Eastern Europe

How big is your database of developers/agencies today? How many successful hire leads have you generated?

The idea is great and I myself have seen too many people in DACH regions overpaying for local developers because they don’t have trust to “unreliable remote eastern European freelancers”.

It would be awesome if you provide some case studies on how you helped some big names (apart from saying that Google trusts your guys).

proszkinasenne2 | 7 years ago | on: Show HN: Coffee Chat – Trade your expertise

Very cool! Keeping fingers crossed.

Speaking to people with domain expertise willing to use their time to help you is an invaluable experience. A short 10-15 minutes talk can save hours or days or researching, experiments, proving yourself wrong.

Good luck!

PS. Tried to reach you at [email protected], but got:

554 5.7.1 <[email protected]>: Recipient address rejected: this address does not exist