About pointers on Modern C++

Posted on 2017-04-03 Edited on 2025-08-10 In C++

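C++11 and later did nothing to strengthen raw pointers, but nothing to reduce their power either.
Modern C++ discourages using raw pointers directly and offers a set of tools as replacements.
Let's look at what is wrong with raw pointers and what works better.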

The problem of raw pointer

The biggest problem with raw pointers is that their semantics are too weak.
Take the following two declarations as an example:

int* produce();
void consume(int *);

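Does the pointer returned by produce need to be freed? There is no way to know other than reading the documentation or the source code, so it is easy to misuse.
The same question applies to consume: should the function free the pointer it receives? Suppose consume frees the memory, but the caller passed in something that was not heap-allocated (a stack array, for example); the program then crashes.
These two examples show that a function declaration does not tell you how its pointers should be handled.
I won't go into other problems such as memory leaks. Below are some of my thoughts on raw pointers and Modern C++.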

Reference

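References are nothing new; they have been around since C++98, but they are still one effective way to reduce pointer issues.
I won't go into the differences between references and pointers. The two declarations above can be expressed with references: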

int produce();
void consume(int&);

Compared with the original version, it is now clear who is responsible for freeing the memory: nobody has to.

std::vector

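If the original function needs to return an array rather than a single element, a pointer in the declaration cannot express that well either.
With vector, the declaration itself says that a vector is returned.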

std::vector<int> produce();
void consume(std::vector<int>&);

std::string_view

Suppose what we are handling is a char array:

char* produce();
void consume(char *);

std::string expresses the semantics better:

std::string produce();
void consume(const std::string &);

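Frequent memory allocation and deallocation carries a noticeable cost.
Suppose the argument you feed to consume is a const char pointer. The following happens:

– A temporary std::string object is constructed implicitly
– The std::string(const char*) constructor is called
– consume runs
– The object is destroyed afterwards

A similar problem shows up in produce: sometimes the return value does not need to be a separate object.
In the C++98 days we usually did this:

std::string& produce();

But it does no good if the caller is careless:

std::string& obj = produce(); // (O)
std::string obj = produce(); // (X)

The latter still creates an object and calls the copy constructor.
That is why C++17 added string_view to the standard library; similar implementations had already appeared in the major libraries.

std::string_view produce();
void consume(std::string_view);

This makes misuse even less likely.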
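As a minimal sketch of what the string_view signatures buy, assuming C++17: the names count_spaces and greeting below are made up for illustration and stand in for consume and produce. The consumer accepts a string literal or a std::string without constructing a temporary std::string, and the producer returns a view into storage that outlives the call.

```cpp
#include <cstddef>
#include <string>
#include <string_view>

// Hypothetical consumer: counts spaces without owning or copying the text.
std::size_t count_spaces(std::string_view text)
{
	std::size_t n = 0;
	for (char c : text)
		if (c == ' ') ++n;
	return n;
}

// Hypothetical producer: the view refers to a string literal, which has
// static storage duration, so the returned view never dangles.
std::string_view greeting()
{
	return "hello modern c++";
}
```

A string_view does not own its characters, so the producer side is only safe when the viewed storage outlives the view; returning a view into a local std::string would dangle.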

Smart Pointers

You have probably written code like this:

void doSomething() {
	int *arr = new int[100];
	if (cond1) {
		delete [] arr;
		return;
	}
	// Do something
	if (cond2) {
	}
	// Do anotherthing
	return; // arr is never released on this path
}

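Every return path has to be checked to make sure the memory is released correctly. After enough rounds of refactoring, the odds that one path gets forgotten go up. Letting the compiler help at this point saves a lot of trouble.

void doSomething() {
	std::unique_ptr<int[]> arr(new int[100]); // int[] so that delete[] is used
	if (cond1) {
		return;
	}
	// Do something
	if (cond2) {
	}
	// Do anotherthing
	return;
}

Smart pointers also include shared_ptr and weak_ptr, which I won't go into here.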
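
To make the automatic cleanup observable, here is a small sketch assuming C++17. Tracked and process are hypothetical names introduced only for this illustration; Tracked counts live instances so that a test can confirm every return path releases the array.

```cpp
#include <memory>

// Hypothetical helper type: counts live instances so cleanup can be checked.
struct Tracked {
	static inline int alive = 0;
	Tracked() { ++alive; }
	~Tracked() { --alive; }
};

// Returns early on one path; the array is released on every path.
bool process(bool bail_out)
{
	// unique_ptr<Tracked[]> calls delete[] when it goes out of scope.
	auto arr = std::make_unique<Tracked[]>(4);
	if (bail_out)
		return false; // no manual delete[] needed
	return true;
}
```

After either call, Tracked::alive is back to zero, which is exactly the bookkeeping the raw-pointer version forces the programmer to do by hand.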

std::optional

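This is another pointer-related problem, though a different one from those above.
Suppose we have the following requirement:

– Search an array
– If an element matching the condition is found, return it to the caller
– The caller modifies the matching element

The code might look like this: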

int* findArray(int *arr, int size, int v)
{
	for (int i = 0; i < size; i++)
		if (arr[i] == v) return arr + i;
	return NULL;
}
int *p = findArray(arr, size, v);
if (p) 
	*p = anotherValue;

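References cannot solve this problem, because a reference cannot be null, so the older approach still has to fall back to the pointer solution.
With std::optional, however, the semantics improve.
The example above can be written as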

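// std::optional<int&> is not valid in C++17, so std::reference_wrapper<int>
// (from <functional>) carries the reference inside the optional.
std::optional<std::reference_wrapper<int>> findArray(std::vector<int> &arr, int v)
{
	for (size_t i = 0; i < arr.size(); i++)
		if (arr[i] == v) return std::ref(arr[i]);
	return std::nullopt;
}
std::optional<std::reference_wrapper<int>> p = findArray(arr, v);
if (p)
	p->get() = anotherValue;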
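
Another way to express "maybe found" without any pointer or reference wrapper is to return the position. The sketch below assumes C++17; find_index and replace_first are hypothetical names used only for illustration.

```cpp
#include <cstddef>
#include <optional>
#include <vector>

// Returns the index of the first element equal to v, or an empty optional.
std::optional<std::size_t> find_index(const std::vector<int>& arr, int v)
{
	for (std::size_t i = 0; i < arr.size(); ++i)
		if (arr[i] == v) return i;
	return std::nullopt;
}

// Replaces the first occurrence of `from` with `to`; reports whether anything changed.
bool replace_first(std::vector<int>& arr, int from, int to)
{
	if (auto idx = find_index(arr, from)) {
		arr[*idx] = to;
		return true;
	}
	return false;
}
```

The caller cannot forget the "not found" case as easily, because the index has to be unwrapped from the optional before it can be used.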

Conclusion

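Raw pointers are powerful and can do anything, but taking some protective measures to keep things from getting out of control is perfectly reasonable.
A few Modern C++ techniques prevent a lot of mistakes. If optimization is truly needed, it is not too late to strip them out and fall back to raw pointers then.
First make sure it does no harm, then worry about how well it works.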

Introduction to std::error_code

Posted on 2017-03-12 Edited on 2025-08-10 In C++

I was hooked on 大唐雙龍傳 for a while and slacked off, so here are some notes on a concept I recently learned.

How to deal with errors in C

The simple and direct approach: define the errors, then use the return code as the error code.

enum errors
{
	SUCCESS = 0,
	NOTFOUND,
};
int openFile(const char *filename, int *pfd) {
	int fd = open(filename, O_RDONLY);
	if (fd == -1) return NOTFOUND;
	*pfd = fd;
	return SUCCESS;
}

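There is a small problem here: what happens when two different components use the same error code? Suppose LibA and LibB both define NOTFOUND.
The usual workaround is renaming, with something like LibA_NOTFOUND and LibB_NOTFOUND.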

How to deal with errors before C++11

With exceptions, of course.

int openFile(const char *filename) {
	int fd = open(filename, O_RDONLY);
	if (fd == -1)
		throw std::runtime_error("File not found"); // requires <stdexcept>
	return fd;
}

This approach has drawbacks too: the runtime overhead has to be considered, and some coding styles (Google's, for example) discourage exceptions.

std::error_code in C++11

C++11 brought in the error_code concept from Boost.

int openFile(const char *filename, std::error_code &ec) {
	ec.clear(); // report success unless open() fails
	int fd = open(filename, O_RDONLY);
	if (fd == -1)
		ec = std::error_code(errno, std::system_category());
	return fd;
}
std::error_code ec;
int fd = openFile(filePath, ec);
if (ec) {
	std::cout << "Category: " << ec.category().name() << '\n'
	          << "Value: " << ec.value() << '\n'
	          << "Message: " << ec.message() << '\n';
}

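This is similar to the C approach, with one difference: it introduces the concept of a category.
You can define your own category and your own error codes, which solves the conflict problem.
And if needed, the error can still be thrown as an exception.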

int fd = openFile(filePath, ec);
if (ec) throw std::system_error(ec);

The exception is thrown only when you need it.
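
A minimal sketch of that idea, using std::system_error as the exception type because it carries the error_code with it. The helper names checked and recover are hypothetical and exist only for this illustration.

```cpp
#include <system_error>

// Hypothetical helper: turns a filled-in error_code into an exception
// at the point where the caller decides it wants one.
int checked(int fd, const std::error_code& ec)
{
	if (ec)
		throw std::system_error(ec, "openFile");
	return fd;
}

// Recovers the original error_code from the exception; returns a
// default-constructed (success) code when nothing was thrown.
std::error_code recover(int fd, const std::error_code& ec)
{
	try {
		checked(fd, ec);
	} catch (const std::system_error& e) {
		return e.code();
	}
	return {};
}
```

The code that detects the error stays exception-free, and the category and value survive the trip through the exception unchanged.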

std::error_condition

To handle errors conditionally, the error_condition concept was introduced as well.

int fd = openFile(filePath, ec);
std::error_condition cond1(1, std::system_category());
std::error_condition cond2(2, std::system_category());
if (ec == cond1) {
}
else if (ec == cond2) {
}
else {
}
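
Instead of spelling out numeric values, the comparison can also be written against the portable std::errc enumerators, which convert implicitly to std::error_condition. The sketch below is an illustration under that assumption; from_errno and classify are hypothetical names, and std::generic_category() is used here (rather than system_category as above) because errno values map onto std::errc directly in that category.

```cpp
#include <cerrno>
#include <string>
#include <system_error>

// Hypothetical helper: wraps a raw errno value in an error_code.
std::error_code from_errno(int err)
{
	return std::error_code(err, std::generic_category());
}

// Branches on portable conditions instead of raw numbers.
std::string classify(const std::error_code& ec)
{
	if (!ec) return "ok";
	if (ec == std::errc::no_such_file_or_directory) return "not found";
	if (ec == std::errc::permission_denied) return "permission denied";
	return "other";
}
```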

Customize your category and error_code

I found a way to define a custom error code and category on Stack Overflow, recorded here.

#include <cassert>
#include <iostream>
#include <string>
#include <system_error>
using namespace std;

namespace mylib
{
	namespace errc {

		enum my_error
		{
			failed = 0
		};

		inline const char* error_message(int c)
		{
			static const char* err_msg[] =
			{
				"Failed",
			};

			assert(c < sizeof(err_msg) / sizeof(err_msg[0]));
			return err_msg[c];
		}

		class my_error_category : public std::error_category
		{
		public:

			my_error_category()
			{ }

			std::string message(int c) const
			{
				return error_message(c);
			}

			const char* name() const noexcept { return "My Error Category"; }

			const static error_category& get()
			{
				const static my_error_category category_const;
				return category_const;
			}
		};

		inline std::error_code make_error_code(my_error e)
		{
			return std::error_code(static_cast<int>(e), my_error_category::get());
		}

	} // end namespace errc
} // end namespace mylib

namespace std {

	template<>
	struct is_error_code_enum<mylib::errc::my_error>
		: std::true_type
	{ };

} // end namespace std

int main()
{
	std::error_code ec1 = mylib::errc::make_error_code(mylib::errc::failed); // works
	std::error_code ec2 = mylib::errc::failed; // works
	bool result = (ec2 == mylib::errc::failed); // works

	std::cout << ec1 << std::endl;
}

Kancloud document format converter

Posted on 2017-02-20 Edited on 2025-08-10 In Go

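Background

看云 (Kancloud) is a very nice document-sharing platform that offers mobi, epub, and pdf formats for e-readers. But the hardest part of life is always the "but": the files can be read on E-INK, yet because of the CSS layout the result on E-INK looks far too different from what you see on a screen.
I simply don't want to stare at a screen all day, so I had to help myself and write a converter.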

Golang practice

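My main languages are C/C++; I had read up on Go syntax before but never actually written any.
This time I made a point of practicing, and it was not hard.
The only library used is goquery.
The tool downloads the pages, strips the unnecessary parts, and then uses pandoc to convert the HTML to Markdown.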

Github Repo

– go_epub

C++ lambda interact with C's Callback function

Posted on 2017-02-16 Edited on 2025-08-10 In C , C++

Starting with the simplest case

A lambda that captures no state can be converted directly to a plain C function pointer.

typedef void(*callback)();
void testFunc(callback cb)
{
	cb();
}
testFunc([] {
	cout << "Lambda function" << endl;
});

The stateful case

A callback usually also takes a void * parameter, which you are free to use for your own data.

typedef void(*callback)(void *);
void testFunc(callback cb, void *user_data)
{
	cb(user_data);
}
	int v = 123;
	testFunc([](void *user_data) {
		int *v = static_cast<int *>(user_data);
		cout << "Lambda function " << *v<< endl;
	}, &v);

When more context is needed, you have to define your own structure.

// local variables
int x = 0;
float y = 1;

// locally defined uncopyable, unmovable type
struct MtEverest
{
	MtEverest() = default;
	MtEverest(const MtEverest& that) = delete; // no copy
	MtEverest(const MtEverest&& that) = delete; // no move
} mt_everest;

// create "user-data" payload  
auto payload = std::tie(x, y, mt_everest);
testFunc([](void *user_data) {
	auto& payload_tup = *reinterpret_cast<decltype(payload)*>(user_data);
	auto& xx = std::get<0>(payload_tup);
	auto& yy = std::get<1>(payload_tup);
	auto& me = std::get<2>(payload_tup);
}, &payload);
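
The packing and unpacking can also be hidden behind a small template, so that a capturing lambda itself becomes the user data. This is a sketch under the assumption that the C API invokes the callback synchronously; call_c_api, call_with_lambda, and sum_via_callback are hypothetical names for illustration only.

```cpp
typedef void (*callback)(void*);

// Stand-in for a C API that accepts a callback plus an opaque pointer.
void call_c_api(callback cb, void* user_data)
{
	cb(user_data);
}

// Passes the address of any callable as user_data; the captureless lambda
// converts to a plain function pointer and forwards to the real callable.
template <typename F>
void call_with_lambda(F& f)
{
	call_c_api([](void* p) { (*static_cast<F*>(p))(); }, &f);
}

int sum_via_callback(int a, int b)
{
	int result = 0;
	auto add = [&] { result = a + b; }; // captures state freely
	call_with_lambda(add);
	return result;
}
```

The callable must stay alive for as long as the C side may invoke the callback, which is why the sketch takes it by lvalue reference.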

A cleverer solution

When the C API gives you no void * to work with, or the packing/unpacking code gets tiresome,
consider the following solution.

template <typename Lambda>
static auto lambdacb(Lambda &&l)
{
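	// One slot per Lambda type and per thread, visible to the captureless lambda below.
	thread_local auto* p = &l; // initial assignment, allows using auto
	p = &l; // refreshed on every call; valid only while the passed-in lambda is alive
	return []() { return (*p)(); };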
}

typedef void(*callback)();
void testFunc(callback cb)
{
	cb();
}
// local variables
int x = 0;
float y = 1;

testFunc(lambdacb([&x, &y] {
	cout << x << " " << y << endl;
}));

Reference

— Lambda Magic
— Lambda Callbacks
— Technical Debt

How to deal with adjacent element in container

Posted on 2017-01-22 Edited on 2025-08-10 In C++

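Reading this post prompted some thoughts. It is one of the common problems in practice. To keep the explanation simple, I reduce the original problem to this: count the adjacent pairs in which the later element is larger than the earlier one.

The original approach

This used to be the only way I knew.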

int calc(const std::vector<int> &v)
{
	int count = 0;
	for (size_t i = 0; i + 1 < v.size(); i++) // i + 1 avoids unsigned underflow when v is empty
		if (v[i + 1] > v[i]) count++;
	return count;
}

It is tied to vector, which is really not very clever.

Better Solution

Following the approach from the blog post mentioned above:

template <typename T>
int calc(const T& v)
{
	int count = 0;
	for (auto it1 = v.cbegin(), it2 = v.cend(); it1 != v.cend(); it2 = it1, ++it1)
		if (it2 != v.end())
			if (*it1 > *it2) count++;
	return count;
}

A bit better: it is no longer limited to vector, but you still have to think about how it1 and it2 relate to each other.
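
For comparison, the same count can be written with only the standard library by pairing each element with its successor through std::inner_product. This is a swapped-in technique for illustration, not the approach from the referenced post; count_rising and rising_in_samples are hypothetical names, and std::prev means the container needs at least bidirectional iterators.

```cpp
#include <functional>
#include <iterator>
#include <list>
#include <numeric>
#include <vector>

template <typename T>
int count_rising(const T& v)
{
	if (std::begin(v) == std::end(v)) return 0;
	// Walk [first, last - 1) against [first + 1, last): less(a, b) yields
	// true when the later element is larger, and plus sums those up.
	return std::inner_product(std::begin(v), std::prev(std::end(v)),
	                          std::next(std::begin(v)), 0,
	                          std::plus<>(), std::less<>());
}

// Usage: the same template works for a vector and a list.
int rising_in_samples()
{
	std::vector<int> a{1, 2, 3};  // 1<2, 2<3
	std::list<int> b{1, 3, 2, 4}; // 1<3, 2<4
	return count_rising(a) + count_rising(b);
}
```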

Range-V3 Solution

Range-v3, which may become the next generation of the STL, works like a pipeline in functional programming and hides the iterators entirely.

template <typename T>
int calc(const T& v)
{
	using namespace ranges;
	auto larger = [](auto front, auto back) { return front > back; };
	return distance(v | view::adjacent_remove_if(larger)) - 1;
}

Range-V3 Solution Ver 2, Sliding Window

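Range-v3 recently added a sliding window view, which is more general than the approach above. I still don't know how to turn each window range into a tuple, though, so I had to write it like this.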

template <typename T>
int calc(const T& v)
{
        using namespace ranges;
        return count_if(v | view::sliding(2), [](const auto &v) {
                auto begin = ranges::begin(v);
                auto front = *begin++;
                auto back = *begin++;
                return front < back;
        });
}

Conclusion

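The higher the level of abstraction, the harder debugging naturally becomes. The compilation errors that gcc or clang spit out are a real headache, especially when Range-v3 is involved.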

Reference

— range-v3
— Super expressive code by Raising Levels of Abstraction
— Ranges: the STL to the Next Level

Use std::variant to represent composite design pattern

Posted on 2017-01-17 Edited on 2025-08-10 In C++

Some notes on something I worked out recently. First, let's look at how the traditional Composite design pattern is done.

#include <vector>
#include <memory>
using namespace std;

class Component
{
public:
    virtual ~Component() = default;
};

class Leaf : public Component
{
};

class Composite : public Component
{   
    vector<unique_ptr<Component>> children;
public:
    void add(Component *ele)
    {   
        children.push_back(std::unique_ptr<Component>(ele));
	}
};

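This approach is not bad, but a few things could be improved
– Every subtype must inherit from Component, even when there is no is-a relationship at all
– Cloning an object requires an additional Prototype pattern
For example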

class Component
{
public:
    virtual ~Component() = default;
    virtual Component* clone() = 0;
};

class Leaf : public Component
{
public:
    Component* clone() override { return new Leaf(); }
};

class Composite : public Component
{
    vector<unique_ptr<Component>> children;
public:
    void add(Component *ele)
    {
        children.push_back(std::unique_ptr<Component>(ele));
    }
    Component* clone() override {
        Composite* composite = new Composite();
        for (const auto &child : children)
                composite->children.push_back(
                        std::unique_ptr<Component>(child->clone()));
        return composite;
    }
};

Doing this for every class is ugly and error-prone.

Solution based on std::variant

std::variant solves the problems above cleanly.

class Leaf;
class Composite;
using CompositeVar = variant<Leaf, Composite>;
class Leaf final
{   
public:
    Leaf() = default;
    Leaf(const Leaf&) = default;
};

class Composite final
{   
    vector<CompositeVar> children;
public:
    Composite() = default;
    Composite(const Composite&) = default;
    void add(const CompositeVar &ele)
    {   
        children.push_back(ele);
    }
};

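This is not a universal solution either. It fits when the set of types in CompositeVar is finite and known up front.
Otherwise every newly added type forces a recompile, which is less flexible than the old version.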
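
To show how such a tree is walked and why cloning needs no extra pattern, here is a small sketch assuming C++17. It mirrors the classes above under different, hypothetical names (Node, LeafCounter, count_leaves, items) so that it is self-contained.

```cpp
#include <cstddef>
#include <variant>
#include <vector>

struct Leaf {};
class Composite;
using Node = std::variant<Leaf, Composite>;

class Composite {
	std::vector<Node> children;
public:
	void add(const Node& n) { children.push_back(n); }
	const std::vector<Node>& items() const { return children; }
};

// Visitor that recurses through the tree and counts the leaves.
struct LeafCounter {
	std::size_t operator()(const Leaf&) const { return 1; }
	std::size_t operator()(const Composite& c) const
	{
		std::size_t total = 0;
		for (const Node& child : c.items())
			total += std::visit(*this, child);
		return total;
	}
};

std::size_t count_leaves(const Node& n)
{
	return std::visit(LeafCounter{}, n);
}
```

Copying a Node copies the whole subtree through the compiler-generated copy constructors, which is the job the clone() functions did by hand in the inheritance version.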

Introduction to Emscripten, asm.js, WebAssembly

Posted on 2017-01-15 Edited on 2025-08-10

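Installing the Emscripten SDK

First install the following packages

$ apt-get install wget python git g++ cmake nodejs

Download the Portable SDK from the website (http://kripken.github.io/emscripten-site/docs/getting_started/downloads.html), then build and install it. The installation steps are as follows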

$ tar zxvf emsdk-portable.tar.gz
$ cd emsdk_portable
$ ./emsdk update
$ ./emsdk install clang-tag-e1.34.1-64bit
$ ./emsdk install emscripten-tag-1.34.1-64bit
$ ./emsdk activate
$ source ./emsdk_env.sh

Edit ~/.emscripten

NODE_JS = 'nodejs'

Write a program to test it

#include <stdio.h>
int main( )
{
        printf("Hello World!\n");
}

Compile it to JavaScript

$ ./emcc hello.c -o hello.out.js

Run it with nodejs

$ nodejs hello.out.js

Asm.js

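Asm.js is a subset of JavaScript that restricts the language's features so that the code can be optimized for the underlying hardware. Just add -s ASM_JS=1 when compiling.

$ ./emcc -s ASM_JS=1 hello.c -o hello.out.js

See the reference if more detail is needed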

WebAssembly

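A recent arrival: in 2016 the browser vendors finally shipped preview versions, and Emscripten can even emit WebAssembly directly.
See WebAssembly for details.
Another way is to use a custom build of clang to emit WebAssembly.

Other reference links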
— A Look at Web Assembly and Molecular Analysis
— Webassembly initial steps tutorial or how to start with wasm

Range based's erase on C++

Posted on 2016-12-05 Edited on 2025-08-10 In C++

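After reading the Eraser iterators post, not writing something about it would be a disservice to the author.
It spells out every pain point of erase.
To erase elements while looping over a range, there are only the following two approaches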

for (auto it = array.begin(); it != array.end();)
{
	if (*it == value) // value: the element to erase (delete is a keyword and cannot be a name)
	{
		it = array.erase(it);
	}
	else
	{
		it++;
	}
}

Or

array.erase(std::remove_if(array.begin(), array.end(),
	[](auto& item)
{
	bool deleting = condition;
	return deleting;
}
), array.end());

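The first is intuitive, but compared with the range-based loop that C++11 promotes it is a step backwards.
The second mentions array four times, so there is always the risk of a typo.
The brilliant author proposes a third approach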

for (auto& item : iter::eraser(array))
{
	if (*item == value) // Access item via deref
		item.mark_for_erase();  // Item is marked for deletion, but is still valid until end of loop iteration
}

A very elegant solution!
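
Without a library like the one in that post, the typo risk of the second approach can at least be contained by wrapping the erase-remove idiom once, so the container is named a single time at the call site. erase_where is a hypothetical helper name for this sketch; C++20 ships the same idea as std::erase_if.

```cpp
#include <algorithm>
#include <vector>

// Removes every element for which pred returns true.
// Intended for sequence containers such as vector, deque, and string.
template <typename Container, typename Pred>
void erase_where(Container& c, Pred pred)
{
	c.erase(std::remove_if(c.begin(), c.end(), pred), c.end());
}
```

A call then reads erase_where(array, [](int x) { return x % 2 == 0; }); and the begin/end pair can no longer be mistyped.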

Variant in C++ 17

Posted on 2016-11-25 Edited on 2025-08-10 In C++

Starting with Boost Variant

#include <boost/variant.hpp>
boost::variant<int, std::string> v;
v = "Hello World!";

boost::get

boost::get must be given the correct type, otherwise it throws an exception

std::cout << boost::get<std::string>(v) << std::endl; // Hello World!
std::cout << boost::get<int>(v) << std::endl; // terminate called after throwing an instance of 'boost::bad_get'

Use RTTI

void var_print(const boost::variant<int, std::string> &v)
{
        if (v.type() == typeid(int)) {
                std::cout << boost::get<int>(v) << std::endl;
        } else if (v.type() == typeid(std::string)) {
                std::cout << boost::get<std::string>(v) << std::endl;
        }
}

Every new type requires changing this code, and it also hurts performance.

Visitor Pattern

class var_visitor : public boost::static_visitor<void>
{
public:
        void operator()(int i) const {
                std::cout << i << std::endl;
        }
        void operator()(const std::string& str) const {
                std::cout << str << std::endl;
        }
        // the default case:
        template <typename T> void operator()(T const &) const {
                std::cout << "FALLBACK: " << __PRETTY_FUNCTION__ << "\n";
        }
};
boost::apply_visitor(var_visitor(), v);

C++17's visit, however, is more powerful and more elegant to use

class var_visitor
{
public:
	void operator()(int i) const {
		std::cout << i << std::endl;
	}
	void operator()(const std::string& str) const {
		std::cout << str << std::endl;
	}
	// the default case:
	template <typename T> void operator()(T const &) const {
		std::cout << "FALLBACK: " << __PRETTY_FUNCTION__ << "\n";
	}
};
std::visit(var_visitor(), v);
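
One more common shape, shown here only as a sketch assuming C++17: a single generic lambda that branches on the alternative's type with if constexpr. The function name describe is hypothetical, and it returns a string instead of printing so the result can be checked.

```cpp
#include <string>
#include <type_traits>
#include <variant>

std::string describe(const std::variant<int, std::string>& v)
{
	return std::visit([](const auto& value) -> std::string {
		// Recover the alternative's type and pick a branch at compile time.
		using T = std::decay_t<decltype(value)>;
		if constexpr (std::is_same_v<T, int>)
			return "int: " + std::to_string(value);
		else
			return "string: " + value;
	}, v);
}
```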

I'll stop here as far as other ways of visiting go and look into them when I have time. Next, a look at what variant can be used for.

Stack-based run-time polymorphism

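Traditional heap-based polymorphism works like this
– Allocate a block of memory
– Construct the object in that memory
– Call the virtual function through the vtbl
– Return the memory

So the code that usually gets written looks like this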

struct Base {
        virtual ~Base() = default;
        virtual void func() const = 0;
};
struct Der1 : public Base {
        void func() const override { cout << "Der1" << endl; }
};
struct Der2 : public Base {
        void func() const override { cout << "Der2" << endl; }
};

unique_ptr<Base> create(int v)
{
        if (v)
                return make_unique<Der1>();
        else
                return make_unique<Der2>();
}

auto test = [](const Base &obj) {
     obj.func();
};
auto obj = create(0);
test(*obj);

The stack-based approach skips the first and last steps and is therefore faster. If devirtualization is possible, even the vtbl becomes unnecessary.

using derv = variant<Der1, Der2>;
derv create(int v)
{
        if (v)  
                return Der1();
        else
                return Der2();
}

template <typename BaseType, typename ... Types>
BaseType& cast_to_base(variant<Types ...>& v)
{
    return visit([](BaseType& arg) -> BaseType& { return arg; }, v);
}

auto test = [](const Base &obj) {
     obj.func();
};
derv obj = create(0);
test(cast_to_base<Base>(obj));

Reference

– variant in C++14
– 浅谈boost.variant的几种访问方式
– 让boost.variant支持lambda表达式访问
– default visitor function for boost::variant
– visiting variants using lambdas - part 1
– visiting variants using lambdas - part 2
– Polymorphism Polymorphism

One_of implementation in variadic template

Posted on 2016-11-06 Edited on 2025-08-10 In C++11 C++17

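I have to say, this really is showing off, but it is genuinely useful.
The original code is tedious and error-prone

if (thing.x == 1 || thing.x == 2 || thing.x == 3)
	dosomething();

It can be rewritten like this
C++11 version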

template<typename U, typename ... T>
bool one_of(U&& u, T && ... t)
{
	bool match = false;
	(void)std::initializer_list<bool>{ (match = match || u == t)... };
	return match;
}
	if (one_of(thing.x, 1, 2, 3))
		dosomething();

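The key is the variadic template pack expansion: it builds an initializer_list whose elements are the value of match after each comparison.
Printing the initializer_list makes the progression easier to see, so here is a small variation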

template<typename U, typename ... T>
bool one_of(U&& u, T && ... t)
{
	bool match = false;
	auto list = std::initializer_list<bool>{ (match = match || u == t)... };
	for (auto v : list)
		std::cout << v << std::endl;
	return match;
}

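But our initializer_list is never actually used, so it is simply cast to void and the compiler is left to optimize it away.
The C++17 version can be written even more simply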

template<typename U, typename ... T>
bool one_of(U&& u, T && ... t)
{
  return ( (u == t) || ...  );
}

Reference

– Parameter pack
– C++17 Fold Expressions
– A Data Point for MSVC vs Clang Code Generation
