siganakis | 3 years ago | on: The Ethereum merge is done
siganakis's comments
siganakis | 4 years ago | on: When they warn of rare disorders, these prenatal tests are usually wrong
It was absolute hell. The key problem here is the waiting and uncertainty. You have the NIPT at 10 weeks, but you can’t have the amniocentesis until several weeks later. When that came back fine, there were questions about whether it was a “mosaic”, meaning only a small proportion of cells are affected. We were only really in the clear after the 20-week ultrasound.
That’s a lot of weeks to be consumed by wondering whether to terminate the pregnancy or wait it out for more information. I have a master's in bioinformatics (in genomics!) and my knowledge of stats and the science was next to useless in the face of these decisions.
I know of couples who simply couldn’t deal with this uncertainty and chose to terminate on the basis of this test alone.
Fortunately for us, our child was fine and is a perfectly healthy 18-month-old now, but I wouldn’t do the rare trisomy test again.
siganakis | 4 years ago | on: What the Heck is a Data Mesh?
The "data mesh" is essentially the collection of these independent "data-products".
We already see management problems with self-service analytics tools like PowerBI, Tableau & Looker. It's too easy for people to create dashboards / reports that are subtly wrong and which cause confusion. There is a balance between empowering people to build data products and centralised control. Too much empowerment of people who don't understand the right way to do something leads to a horrible mess of contradictory data. Not enough, and people can't effectively do their jobs. Governance and process are the key to finding the balance and enforcing it.
The issue with the data-mesh is that there isn't really any great tooling to support the management or development of data products, or a data-mesh generally. I am sure this will change over time as vendors start building hype around it.
siganakis | 4 years ago | on: What the Heck is a Data Mesh?
If marketing, finance and sales are dependent on a centralised data team for every new thing, the data team quickly becomes the bottleneck, stifling innovation and frustrating teams. Incorporating the principles of a Data Mesh enables those teams to manage their own data, according to well-defined governance standards that enable interoperability.
The reality is that different teams are already managing their own data (via Excel spreadsheets, web-apps, etc). If we can apply a bit more rigor to how these datasets are managed (e.g. so they can be shared, integrated, secured, etc), then the whole organisation benefits.
siganakis | 6 years ago | on: Cloud AI Platform Pipelines
If Google were to kill it, you could easily run it on any other hosted Kubernetes service.
I haven't used Cloud AI Platform Pipelines, but I have spent a lot of time working with Kubeflow Pipelines and it's pretty great!
[1] https://github.com/kubeflow/pipelines
[2] https://www.kubeflow.org/docs/aws/ (Deploy to AWS)
[3] https://www.kubeflow.org/docs/azure/ (Deploy to Azure)
siganakis | 7 years ago | on: Gitlab.com performance degradation: Postgresql split – Only one DB node active
I love the product, but hate that I can't rely on it.
siganakis | 10 years ago | on: Against Method
[0]: https://en.wikipedia.org/wiki/Thomas_Kuhn
[1]: https://en.wikipedia.org/wiki/The_Structure_of_Scientific_Re...
siganakis | 10 years ago | on: Kerf: a columnar tick database for Linux, OS X, BSD, iOS, Android
I'd like to look into it further, but I can't find any information about licensing. Given that there is no source code in the repo, it appears that this isn't an open source project.
siganakis | 11 years ago | on: SpringRole – Everyone is a Recruiter
Is it possible to view / check / correct any information about myself if I am a "passive user" on your platform?
Many EU countries and Australia have privacy laws around these basic rights.
siganakis | 11 years ago | on: Ask HN: What is a good salary in X city?
siganakis | 11 years ago | on: Ask HN: Who is using the .NET stack for their startup?
We have our own retry logic, which also logs the issue so we are aware of how frequently errors occur while a command / transaction is being executed.
This is using SQL Azure with the "Business" tier, so it will be interesting to see how the new (much more highly priced) Standard and Premium tiers go.
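A minimal sketch of what retry logic of this kind might look like, written in Python rather than .NET. The function name `run_with_retry`, the backoff values and the choice of `ConnectionError` as the retryable error are all illustrative assumptions, not the code described above.

```python
import logging
import time

logger = logging.getLogger("db_retry")


def run_with_retry(operation, attempts=3, base_delay=0.5,
                   retry_on=(ConnectionError,), sleep=time.sleep):
    """Run operation(), retrying on transient errors with exponential backoff.

    Every failure is logged, so the frequency of errors stays visible
    even when a later attempt succeeds.
    """
    for attempt in range(1, attempts + 1):
        try:
            return operation()
        except retry_on as exc:
            logger.warning("attempt %d/%d failed: %s", attempt, attempts, exc)
            if attempt == attempts:
                raise  # out of attempts: surface the error to the caller
            # Wait base_delay, then 2x, 4x, ... before the next attempt.
            sleep(base_delay * 2 ** (attempt - 1))
```

The `sleep` parameter is only there so the waiting can be swapped out (for example in tests); a command or transaction would be passed in as `operation`.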
siganakis | 11 years ago | on: Ask HN: Who is using the .NET stack for their startup?
Azure is a bit hit and miss. It's brilliant for getting something up and running quickly (using websites / SQL Server), but is a little flaky at scale.
Key problems include connection issues with SQL Server, connection issues with their hosted Redis service, and the pricing of SQL Server when using advanced features like geo-replication.
All in all though, it's a pretty good development experience once you get your head around the fact that in the cloud, services fail and there is nothing you can do about it except plan for it.
Oh, and the BizSpark program they have gives you $100 worth of free hosting on Azure, which is always nice.
siganakis | 11 years ago | on: Ask HN: What DB to use for huge time series?
Coming from that background, C and especially C# must seem extremely verbose.
For example (from Wikipedia):
In K, finding the prime numbers from 1 to R is done with [0]:
(!R)@&{&/x!/:2_!x}'!R
And APL [1]:
(~R∊R∘.×R)/R←1↓⍳R
It's truly awesome stuff.
[0]: http://en.wikipedia.org/wiki/K_(programming_language)
[1]: http://en.wikipedia.org/wiki/APL_(programming_language)
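For comparison, here is a rough Python transliteration of the APL one-liner: take the numbers 2..R, build every pairwise product, and keep the numbers that never show up as a product. The function name `primes_up_to` is an assumption made for illustration.

```python
def primes_up_to(r):
    """Primes in 2..r, mirroring the APL expression step by step."""
    # R <- 1 drop iota R : the integers 2..r
    candidates = list(range(2, r + 1))
    # R outer-product-times R : every product of two candidates
    products = {a * b for a in candidates for b in candidates}
    # (not R member-of products) compress R : keep non-products
    return [n for n in candidates if n not in products]


print(primes_up_to(20))  # [2, 3, 5, 7, 11, 13, 17, 19]
```

Like the APL version, this does quadratic work in R, so it illustrates the notation rather than a practical sieve.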
siganakis | 11 years ago | on: The laws of shitty dashboards
The dirty little secret of the business intelligence / dashboard industry is that no one logs into them.
A daily email helps with this problem, as people tend to read emails, even if it's only a glance.
siganakis | 11 years ago | on: Building New SQL (2013) [pdf]
The issue is that composability is often tied to actually moving data around in the database, which has terrible performance. That is, you can compose a query out of multiple queries that dump partial data sets into temp tables.
Views get you part of the way there, but they are designed to be long-lived and are visible to all database users until they are dropped. This means it's dangerous to change them or clean them up, as it's not always clear who is using them.
Ephemeral or temporary views that are session/connection based, or even loadable as modules, would be useful to me.
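Some engines already offer part of this. As a hedged illustration, SQLite's `CREATE TEMP VIEW` creates a view that exists only for the connection that defined it. The sketch below uses Python's standard `sqlite3` module; the `events` table, the `clicks` view and the function name are made-up examples, and other databases may behave differently.

```python
import os
import sqlite3
import tempfile


def temp_view_demo():
    """Show that a TEMP view is scoped to the connection that created it."""
    with tempfile.TemporaryDirectory() as workdir:
        path = os.path.join(workdir, "demo.db")

        a = sqlite3.connect(path)
        a.execute("CREATE TABLE events (kind TEXT)")
        a.executemany("INSERT INTO events VALUES (?)",
                      [("click",), ("click",), ("view",)])
        a.commit()

        # Visible only to connection `a`; dropped when `a` closes.
        a.execute("CREATE TEMP VIEW clicks AS "
                  "SELECT * FROM events WHERE kind = 'click'")
        seen_by_a = a.execute("SELECT COUNT(*) FROM clicks").fetchone()[0]

        # A second connection to the same file cannot see the view.
        b = sqlite3.connect(path)
        try:
            b.execute("SELECT COUNT(*) FROM clicks")
            visible_to_b = True
        except sqlite3.OperationalError:
            visible_to_b = False

        a.close()
        b.close()
        return seen_by_a, visible_to_b
```

Because nothing is shared between sessions, such a view can be changed or dropped without worrying about who else depends on it.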
siganakis | 11 years ago | on: Building New SQL (2013) [pdf]
I would prefer a syntax layer that can be compiled / transformed back to SQL but that does basic things like having a query start with the tables, then joins, then groupings then the final projection.
Also a less cumbersome way to use the "WITH" statement to form named sub-queries.
Perhaps something like:
SELECT
  COUNT(*) AS columns,
  column_type,
  table_name
FROM (
  SELECT c.id,
    c.type AS column_type,
    t.name AS table_name
  FROM tables t
  INNER JOIN columns c
    ON t.id = c.table_id
  WHERE t.system = false
) a
GROUP BY column_type, table_name
HAVING COUNT(*) > 1
ORDER BY columns DESC
Being re-written as:
# Use ":=" to replace WITH for named ephemeral views
# Replace "WHERE" with "?", "SELECT" with "|>" at the end
non_system := tables
  ? system=false
  |> name:table_name, id:table_id
# Replace INNER JOIN with "*="
non_system_columns := non_system.table_id *= columns.table_id
  |> c.id, c.type:column_type
# GROUP BY columns are automatically generated by non-aggregated columns
column_types := non_system_columns
  |> COUNT(*):columns DESC, column_type, table_name
So the final query may look like:
non_system := tables ? system=false
  |> name:table_name, id:table_id
non_system_columns := non_system.table_id *= columns.table_id
  |> c.id, c.type:column_type
non_system_columns
  |> COUNT(*):columns DESC, column_type, table_name
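To make the target of such a translation concrete, here is one plausible SQL form using WITH for the named sub-queries, run against SQLite through Python's `sqlite3` module. This is a sketch under assumptions: the sample rows are invented, the count alias is renamed to `column_count` to avoid clashing with the `columns` table, `system` is stored as 0/1, and the HAVING filter from the original SQL is kept.

```python
import sqlite3

QUERY = """
WITH non_system AS (
    SELECT name AS table_name, id AS table_id
    FROM tables
    WHERE system = 0
),
non_system_columns AS (
    SELECT c.id, c.type AS column_type, ns.table_name
    FROM non_system ns
    INNER JOIN columns c ON ns.table_id = c.table_id
)
SELECT COUNT(*) AS column_count, column_type, table_name
FROM non_system_columns
GROUP BY column_type, table_name
HAVING COUNT(*) > 1
ORDER BY column_count DESC
"""


def run_example():
    """Build a tiny in-memory catalogue and run the WITH-based query."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE tables (id INTEGER, name TEXT, system INTEGER)")
    db.execute("CREATE TABLE columns (id INTEGER, table_id INTEGER, type TEXT)")
    db.executemany("INSERT INTO tables VALUES (?, ?, ?)",
                   [(1, "users", 0), (2, "orders", 0), (3, "sys_meta", 1)])
    db.executemany("INSERT INTO columns VALUES (?, ?, ?)",
                   [(1, 1, "text"), (2, 1, "text"), (3, 1, "int"),
                    (4, 2, "int"), (5, 2, "int"), (6, 2, "int"),
                    (7, 3, "text"), (8, 3, "text")])
    rows = db.execute(QUERY).fetchall()
    db.close()
    return rows


print(run_example())  # [(3, 'int', 'orders'), (2, 'text', 'users')]
```

Each `:=` definition maps onto one named sub-query, and the GROUP BY list is exactly the non-aggregated columns of the final projection.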
Any thoughts on this?
siganakis | 11 years ago | on: Show HN: Domino, a PaaS for data science
This means that your data (in files) needs to be in the working directory, and is versioned alongside your code. Sounds pretty cool, but I am not sure how it would scale for large / constantly changing data sets.
siganakis | 11 years ago | on: Are Pixels Productivity? A study 24 years in the making
Unfortunately, my carpentry skills are lacking, and the Dell Single Monitor Arm [1] is NOT suitable for mounting 2 monitors on top of each other (not enough height).
[1]: http://accessories.us.dell.com/sna/productdetail.aspx?c=us&l...
siganakis | 11 years ago | on: Are Pixels Productivity? A study 24 years in the making
The article was just supposed to be a bit of fun about my personal experience with different monitor configurations. I thought that this might be interesting to the HN crowd since so many of us spend so much time in front of screens.
siganakis | 11 years ago | on: Are Pixels Productivity? A study 24 years in the making
Also, HN seems to have dropped the "∝" symbol from the title!
https://www.afr.com/companies/financial-services/anz-the-fir...